Task 15775091

Name hadcm3n_4l8i_1980_40_008365059_1
Workunit 8515918
Created 11 May 2013, 0:21:34 UTC
Sent 11 May 2013, 0:26:04 UTC
Report deadline 10 Aug 2013, 7:53:15 UTC
Received 15 May 2013, 20:32:31 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1168327
Run time 4 days 13 hours 28 min 5 sec
CPU time 3 days 21 hours 8 min 51 sec
Validate state Invalid
Credit 1,866.24
Device peak FLOPS 2.31 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:15:08 (7156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:15:09 (7156): No heartbeat from core client for 30 sec - exiting
14:15:10 (7156): No heartbeat from core client for 30 sec - exiting
14:15:11 (7156): No heartbeat from core client for 30 sec - exiting
14:15:12 (7156): No heartbeat from core client for 30 sec - exiting
14:15:13 (7156): No heartbeat from core client for 30 sec - exiting
14:15:14 (7156): No heartbeat from core client for 30 sec - exiting
14:15:15 (7156): No heartbeat from core client for 30 sec - exiting
14:15:16 (7156): No heartbeat from core client for 30 sec - exiting
14:15:17 (7156): No heartbeat from core client for 30 sec - exiting
14:15:18 (7156): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6896, iMonCtr=1
Model crash detected, will try to restart...
14:17:42 (6896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8772, iMonCtr=1
Model crash detected, will try to restart...
14:19:49 (8772): No heartbeat from core client for 30 sec - exiting
14:19:50 (8772): No heartbeat from core client for 30 sec - exiting
14:19:51 (8772): No heartbeat from core client for 30 sec - exiting
14:19:52 (8772): No heartbeat from core client for 30 sec - exiting
14:19:53 (8772): No heartbeat from core client for 30 sec - exiting
14:19:54 (8772): No heartbeat from core client for 30 sec - exiting
14:19:55 (8772): No heartbeat from core client for 30 sec - exiting
14:19:56 (8772): No heartbeat from core client for 30 sec - exiting
14:19:57 (8772): No heartbeat from core client for 30 sec - exiting
14:19:58 (8772): No heartbeat from core client for 30 sec - exiting
14:19:59 (8772): No heartbeat from core client for 30 sec - exiting
14:20:00 (8772): No heartbeat from core client for 30 sec - exiting
14:20:01 (8772): No heartbeat from core client for 30 sec - exiting
14:20:02 (8772): No heartbeat from core client for 30 sec - exiting
14:20:03 (8772): No heartbeat from core client for 30 sec - exiting
14:20:04 (8772): No heartbeat from core client for 30 sec - exiting
14:20:05 (8772): No heartbeat from core client for 30 sec - exiting
14:20:06 (8772): No heartbeat from core client for 30 sec - exiting
14:20:07 (8772): No heartbeat from core client for 30 sec - exiting
14:20:08 (8772): No heartbeat from core client for 30 sec - exiting
14:20:09 (8772): No heartbeat from core client for 30 sec - exiting
14:20:10 (8772): No heartbeat from core client for 30 sec - exiting
14:20:11 (8772): No heartbeat from core client for 30 sec - exiting
14:20:12 (8772): No heartbeat from core client for 30 sec - exiting
14:20:13 (8772): No heartbeat from core client for 30 sec - exiting
Sorry, too many model crashes! :-(
15:21:31 (10880): No heartbeat from core client for 30 sec - exiting
15:21:32 (10880): No heartbeat from core client for 30 sec - exiting
15:21:33 (10880): No heartbeat from core client for 30 sec - exiting
15:21:34 (10880): No heartbeat from core client for 30 sec - exiting
15:21:35 (10880): No heartbeat from core client for 30 sec - exiting
15:21:36 (10880): No heartbeat from core client for 30 sec - exiting
15:21:37 (10880): No heartbeat from core client for 30 sec - exiting
15:21:38 (10880): No heartbeat from core client for 30 sec - exiting
15:21:39 (10880): No heartbeat from core client for 30 sec - exiting
15:21:40 (10880): No heartbeat from core client for 30 sec - exiting
15:21:41 (10880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7992, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7992, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 May 2013 02:43:28 1168327 15775091 hadcm3n_4l8i_1980_40_008365059_1 155,520 289,432 1.8611
14 May 2013 10:29:22 1168327 15775091 hadcm3n_4l8i_1980_40_008365059_1 129,600 241,513 1.8635
13 May 2013 17:38:51 1168327 15775091 hadcm3n_4l8i_1980_40_008365059_1 103,680 193,784 1.8691
13 May 2013 00:54:35 1168327 15775091 hadcm3n_4l8i_1980_40_008365059_1 77,760 145,348 1.8692
12 May 2013 09:11:14 1168327 15775091 hadcm3n_4l8i_1980_40_008365059_1 51,840 96,843 1.8681
11 May 2013 18:17:07 1168327 15775091 hadcm3n_4l8i_1980_40_008365059_1 25,920 48,509 1.8715
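
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The following is a minimal sketch that recomputes it from the rows above; the helper names `parse_count` and `avg_sec_per_timestep` are hypothetical and are not part of BOINC or the CPDN software.

```python
def parse_count(text: str) -> int:
    """Parse an integer printed with thousands separators, e.g. '155,520'."""
    return int(text.replace(",", ""))


def avg_sec_per_timestep(cpu_time: str, timestep: str) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps (assumed formula)."""
    return round(parse_count(cpu_time) / parse_count(timestep), 4)


# (Timestep, CPU Time (sec)) pairs copied from the trickle table, newest first.
trickles = [
    ("155,520", "289,432"),
    ("129,600", "241,513"),
    ("103,680", "193,784"),
    ("77,760", "145,348"),
    ("51,840", "96,843"),
    ("25,920", "48,509"),
]

for timestep, cpu_time in trickles:
    print(timestep, cpu_time, avg_sec_per_timestep(cpu_time, timestep))
```

Under that assumption the recomputed values agree with the reported column, which suggests the model ran at a steady rate of roughly 1.86 to 1.87 CPU seconds per timestep until the task failed.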

