Task 15902591

Name	hadcm3n_n1ka_1880_40_008403072_0
Workunit	8553928
Created	23 Jul 2013, 10:56:41 UTC
Sent	23 Jul 2013, 14:04:59 UTC
Report deadline	22 Oct 2013, 21:32:10 UTC
Received	14 Aug 2013, 15:48:01 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1143361
Run time	8 days 16 hours 38 min 55 sec
CPU time	6 days 21 hours 11 min 8 sec
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	3.17 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
13:49:33 (4472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:33:10 (8128): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:46:45 (3800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:57:47 (5924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
13:04:11 (5740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4824, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4824, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
14 Aug 2013 15:58:11	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	388,800	591,887	1.5223
14 Aug 2013 15:58:11	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	362,880	551,925	1.5210
14 Aug 2013 15:58:11	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	336,960	513,644	1.5243
30 Jul 2013 09:48:37	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	311,040	471,955	1.5173
30 Jul 2013 09:48:36	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	285,120	432,399	1.5166
30 Jul 2013 09:48:36	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	259,200	392,554	1.5145
30 Jul 2013 09:48:36	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	233,280	352,749	1.5121
30 Jul 2013 09:48:36	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	207,360	313,950	1.5140
30 Jul 2013 09:48:36	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	181,440	275,413	1.5179
30 Jul 2013 09:48:36	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	155,520	237,019	1.5240
26 Jul 2013 09:46:03	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	129,600	198,692	1.5331
25 Jul 2013 20:01:05	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	103,680	160,031	1.5435
25 Jul 2013 04:23:07	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	77,760	120,720	1.5525
24 Jul 2013 14:38:22	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	51,840	80,540	1.5536
24 Jul 2013 02:20:56	1143361	15902591	hadcm3n_n1ka_1880_40_008403072_0	25,920	40,462	1.5610
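
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached. That reading is inferred from the numbers in the table, not from any documented definition. A minimal sketch that checks it against three rows copied from the table, and converts the "CPU time" field of the task summary to seconds for comparison with the last trickle (the function names are hypothetical, chosen for illustration):

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds, reported sec/TS).
TRICKLES = [
    (25_920, 40_462, 1.5610),
    (207_360, 313_950, 1.5140),
    (388_800, 591_887, 1.5223),
]


def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Assumed definition of the Average column: cumulative CPU time / timestep."""
    return cpu_seconds / timestep


def duration_to_seconds(days: int, hours: int, minutes: int, seconds: int) -> int:
    """Convert a 'D days H hours M min S sec' duration to whole seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = seconds_per_timestep(cpu_seconds, timestep)
        print(f"timestep {timestep:>7,}: computed {computed:.4f}, reported {reported:.4f}")

    # "CPU time 6 days 21 hours 11 min 8 sec" from the task summary.
    total_cpu = duration_to_seconds(6, 21, 11, 8)
    last_trickle_cpu = TRICKLES[-1][1]
    print(f"CPU seconds at exit: {total_cpu:,}")
    print(f"CPU seconds after the last trickle: {total_cpu - last_trickle_cpu:,}")
```

Under this assumption the computed values agree with the reported column to four decimal places, and the total CPU time is slightly larger than the CPU time in the last trickle, as expected for a task that kept running briefly before it failed.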