climateprediction.net home page
Task 13420727

Task 13420727

Name hadcm3n_u2zq_1980_40_007459118_3
Workunit 7656621
Created 25 Sep 2011, 14:45:37 UTC
Sent 25 Sep 2011, 15:04:23 UTC
Report deadline 25 Dec 2011, 22:31:34 UTC
Received 4 Oct 2011, 15:36:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1021113
Run time 2 days 13 hours 55 min 11 sec
CPU time 2 days 10 hours 59 min 7 sec
Validate state Invalid
Credit 1,244.16
Device peak FLOPS 2.53 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6196, iMonCtr=1
Model crash detected, will try to restart...
15:29:15 (5048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6868, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1
Model crash detected, will try to restart...
07:15:48 (7012): No heartbeat from core client for 30 sec - exiting
07:15:49 (7012): No heartbeat from core client for 30 sec - exiting
07:15:50 (7012): No heartbeat from core client for 30 sec - exiting
07:15:51 (7012): No heartbeat from core client for 30 sec - exiting
07:15:52 (7012): No heartbeat from core client for 30 sec - exiting
07:15:53 (7012): No heartbeat from core client for 30 sec - exiting
07:15:54 (7012): No heartbeat from core client for 30 sec - exiting
07:15:55 (7012): No heartbeat from core client for 30 sec - exiting
07:15:56 (7012): No heartbeat from core client for 30 sec - exiting
07:15:57 (7012): No heartbeat from core client for 30 sec - exiting
07:15:58 (7012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1240, iMonCtr=1
Model crash detected, will try to restart...
08:36:43 (5164): No heartbeat from core client for 30 sec - exiting
08:36:44 (5164): No heartbeat from core client for 30 sec - exiting
08:36:45 (5164): No heartbeat from core client for 30 sec - exiting
08:36:46 (5164): No heartbeat from core client for 30 sec - exiting
08:36:47 (5164): No heartbeat from core client for 30 sec - exiting
08:36:48 (5164): No heartbeat from core client for 30 sec - exiting
08:36:49 (5164): No heartbeat from core client for 30 sec - exiting
08:36:50 (5164): No heartbeat from core client for 30 sec - exiting
08:36:51 (5164): No heartbeat from core client for 30 sec - exiting
08:36:53 (5164): No heartbeat from core client for 30 sec - exiting
08:36:54 (5164): No heartbeat from core client for 30 sec - exiting
08:36:55 (5164): No heartbeat from core client for 30 sec - exiting
08:36:56 (5164): No heartbeat from core client for 30 sec - exiting
08:36:57 (5164): No heartbeat from core client for 30 sec - exiting
08:36:58 (5164): No heartbeat from core client for 30 sec - exiting
08:36:59 (5164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6876, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2664, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2664, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6020, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Oct 2011 17:10:04 1021113 13420727 hadcm3n_u2zq_1980_40_007459118_3 103,680 205,505 1.9821
01 Oct 2011 12:37:08 1021113 13420727 hadcm3n_u2zq_1980_40_007459118_3 77,760 155,035 1.9938
30 Sep 2011 11:48:56 1021113 13420727 hadcm3n_u2zq_1980_40_007459118_3 51,840 103,251 1.9917
28 Sep 2011 18:19:57 1021113 13420727 hadcm3n_u2zq_1980_40_007459118_3 25,920 51,596 1.9906


©2024 climateprediction.net