climateprediction.net home page
Task 13464441

Task 13464441

Name hadcm3n_o5i2_1940_40_007443791_3
Workunit 7641294
Created 6 Oct 2011, 18:14:59 UTC
Sent 6 Oct 2011, 18:15:27 UTC
Report deadline 6 Jan 2012, 1:42:38 UTC
Received 25 Oct 2011, 17:18:38 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1122757
Run time 12 days 21 hours 46 min 19 sec
CPU time 12 days 17 hours 8 min 4 sec
Validate state Invalid
Credit 4,043.52
Device peak FLOPS 1.70 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
22:53:01 (3628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:53:06 (3628): No heartbeat from core client for 30 sec - exiting
22:53:07 (3628): No heartbeat from core client for 30 sec - exiting
22:53:09 (3628): No heartbeat from core client for 30 sec - exiting
22:53:10 (3628): No heartbeat from core client for 30 sec - exiting
22:53:11 (3628): No heartbeat from core client for 30 sec - exiting
22:53:12 (3628): No heartbeat from core client for 30 sec - exiting
22:53:13 (3628): No heartbeat from core client for 30 sec - exiting
22:53:14 (3628): No heartbeat from core client for 30 sec - exiting
22:53:15 (3628): No heartbeat from core client for 30 sec - exiting
22:53:16 (3628): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
22:55:13 (1392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
23:15:10 (6112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:15:14 (6112): No heartbeat from core client for 30 sec - exiting
23:15:15 (6112): No heartbeat from core client for 30 sec - exiting
23:15:16 (6112): No heartbeat from core client for 30 sec - exiting
23:18:38 (5628): No heartbeat from core client for 30 sec - exiting
23:18:39 (5628): No heartbeat from core client for 30 sec - exiting
23:18:40 (5628): No heartbeat from core client for 30 sec - exiting
23:18:41 (5628): No heartbeat from core client for 30 sec - exiting
23:18:42 (5628): No heartbeat from core client for 30 sec - exiting
23:18:43 (5628): No heartbeat from core client for 30 sec - exiting
23:18:44 (5628): No heartbeat from core client for 30 sec - exiting
23:18:45 (5628): No heartbeat from core client for 30 sec - exiting
23:18:46 (5628): No heartbeat from core client for 30 sec - exiting
23:18:47 (5628): No heartbeat from core client for 30 sec - exiting
23:18:48 (5628): No heartbeat from core client for 30 sec - exiting
23:18:50 (5628): No heartbeat from core client for 30 sec - exiting
23:18:51 (5628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:18:52 (5628): No heartbeat from core client for 30 sec - exiting
23:18:53 (5628): No heartbeat from core client for 30 sec - exiting
23:18:54 (5628): No heartbeat from core client for 30 sec - exiting
20:22:00 (5652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:51:42 (3756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:51:58 (3756): No heartbeat from core client for 30 sec - exiting
05:51:59 (3756): No heartbeat from core client for 30 sec - exiting
05:52:00 (3756): No heartbeat from core client for 30 sec - exiting
05:52:01 (3756): No heartbeat from core client for 30 sec - exiting
05:52:02 (3756): No heartbeat from core client for 30 sec - exiting
05:52:03 (3756): No heartbeat from core client for 30 sec - exiting
05:52:04 (3756): No heartbeat from core client for 30 sec - exiting
05:52:06 (3756): No heartbeat from core client for 30 sec - exiting
05:52:07 (3756): No heartbeat from core client for 30 sec - exiting
05:52:08 (3756): No heartbeat from core client for 30 sec - exiting
05:52:09 (3756): No heartbeat from core client for 30 sec - exiting
05:52:10 (3756): No heartbeat from core client for 30 sec - exiting
05:52:11 (3756): No heartbeat from core client for 30 sec - exiting
05:52:12 (3756): No heartbeat from core client for 30 sec - exiting
05:52:13 (3756): No heartbeat from core client for 30 sec - exiting
05:52:14 (3756): No heartbeat from core client for 30 sec - exiting
05:52:15 (3756): No heartbeat from core client for 30 sec - exiting
05:52:16 (3756): No heartbeat from core client for 30 sec - exiting
05:52:18 (3756): No heartbeat from core client for 30 sec - exiting
05:52:19 (3756): No heartbeat from core client for 30 sec - exiting
05:52:20 (3756): No heartbeat from core client for 30 sec - exiting
05:52:21 (3756): No heartbeat from core client for 30 sec - exiting
05:52:22 (3756): No heartbeat from core client for 30 sec - exiting
05:52:23 (3756): No heartbeat from core client for 30 sec - exiting
05:52:24 (3756): No heartbeat from core client for 30 sec - exiting
05:52:25 (3756): No heartbeat from core client for 30 sec - exiting
05:52:26 (3756): No heartbeat from core client for 30 sec - exiting
05:52:27 (3756): No heartbeat from core client for 30 sec - exiting
05:52:28 (3756): No heartbeat from core client for 30 sec - exiting
05:52:30 (3756): No heartbeat from core client for 30 sec - exiting
05:52:31 (3756): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
02:34:35 (1872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:34:47 (1872): No heartbeat from core client for 30 sec - exiting
02:34:48 (1872): No heartbeat from core client for 30 sec - exiting
02:34:49 (1872): No heartbeat from core client for 30 sec - exiting
02:34:50 (1872): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:50:27 (4592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:50:41 (4592): No heartbeat from core client for 30 sec - exiting
01:50:43 (4592): No heartbeat from core client for 30 sec - exiting
01:50:44 (4592): No heartbeat from core client for 30 sec - exiting
01:50:45 (4592): No heartbeat from core client for 30 sec - exiting
01:50:46 (4592): No heartbeat from core client for 30 sec - exiting
01:50:47 (4592): No heartbeat from core client for 30 sec - exiting
01:50:48 (4592): No heartbeat from core client for 30 sec - exiting
01:50:49 (4592): No heartbeat from core client for 30 sec - exiting
07:42:47 (4548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:16:11 (5000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:16:41 (5000): No heartbeat from core client for 30 sec - exiting
14:16:42 (5000): No heartbeat from core client for 30 sec - exiting
14:16:43 (5000): No heartbeat from core client for 30 sec - exiting
14:16:44 (5000): No heartbeat from core client for 30 sec - exiting
14:16:45 (5000): No heartbeat from core client for 30 sec - exiting
14:16:46 (5000): No heartbeat from core client for 30 sec - exiting
14:16:47 (5000): No heartbeat from core client for 30 sec - exiting
14:16:48 (5000): No heartbeat from core client for 30 sec - exiting
14:16:49 (5000): No heartbeat from core client for 30 sec - exiting
14:16:50 (5000): No heartbeat from core client for 30 sec - exiting
14:16:51 (5000): No heartbeat from core client for 30 sec - exiting
14:16:53 (5000): No heartbeat from core client for 30 sec - exiting
14:16:54 (5000): No heartbeat from core client for 30 sec - exiting
14:16:55 (5000): No heartbeat from core client for 30 sec - exiting
14:16:56 (5000): No heartbeat from core client for 30 sec - exiting
14:16:57 (5000): No heartbeat from core client for 30 sec - exiting
14:16:58 (5000): No heartbeat from core client for 30 sec - exiting
14:16:59 (5000): No heartbeat from core client for 30 sec - exiting
14:17:00 (5000): No heartbeat from core client for 30 sec - exiting
14:17:01 (5000): No heartbeat from core client for 30 sec - exiting
14:17:02 (5000): No heartbeat from core client for 30 sec - exiting
14:17:04 (5000): No heartbeat from core client for 30 sec - exiting
14:17:05 (5000): No heartbeat from core client for 30 sec - exiting
14:17:06 (5000): No heartbeat from core client for 30 sec - exiting
14:17:07 (5000): No heartbeat from core client for 30 sec - exiting
14:17:08 (5000): No heartbeat from core client for 30 sec - exiting
14:17:09 (5000): No heartbeat from core client for 30 sec - exiting
14:17:10 (5000): No heartbeat from core client for 30 sec - exiting
14:17:11 (5000): No heartbeat from core client for 30 sec - exiting
14:17:12 (5000): No heartbeat from core client for 30 sec - exiting
14:17:13 (5000): No heartbeat from core client for 30 sec - exiting
14:29:47 (3084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:14:39 (4064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Oct 2011 14:55:28 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 336,960 1,036,930 3.0773
31 Oct 2011 14:55:28 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 311,040 956,498 3.0752
31 Oct 2011 14:55:27 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 285,120 876,439 3.0739
31 Oct 2011 14:55:27 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 259,200 796,898 3.0745
31 Oct 2011 14:55:27 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 233,280 717,297 3.0748
31 Oct 2011 14:55:27 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 207,360 637,377 3.0738
18 Oct 2011 18:50:31 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 181,440 557,610 3.0732
17 Oct 2011 20:05:58 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 155,520 477,292 3.0690
16 Oct 2011 21:35:29 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 129,600 397,299 3.0656
15 Oct 2011 22:53:12 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 103,680 317,361 3.0610
15 Oct 2011 00:09:59 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 77,760 236,472 3.0410
14 Oct 2011 02:03:20 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 51,840 156,550 3.0199
13 Oct 2011 03:59:19 1122757 13464441 hadcm3n_o5i2_1940_40_007443791_3 25,920 76,739 2.9606


©2024 climateprediction.net