Task 15683830

Name hadcm3n_3btq_1940_40_008265352_2
Workunit 8420476
Created 26 Mar 2013, 14:24:49 UTC
Sent 26 Mar 2013, 14:24:58 UTC
Report deadline 25 Jun 2013, 21:52:09 UTC
Received 20 Apr 2013, 22:04:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1091586
Run time 22 days 5 hours 10 min 47 sec
CPU time 19 days 7 hours 20 min 55 sec
Validate state Invalid
Credit 7,464.96
Device peak FLOPS 1.77 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
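
The exit status above is shown both as a signed integer (-226) and as a hexadecimal value (0xFFFFFF1E). The two are the same number, with the hex form being the 32-bit two's-complement representation. A minimal Python sketch of that conversion follows; the function name is hypothetical and is not part of BOINC or the CPDN application.

```python
def as_unsigned_32bit_hex(code: int) -> str:
    """Render a signed exit code as its 32-bit two's-complement hex form."""
    # Masking with 0xFFFFFFFF maps a negative value onto its unsigned 32-bit equivalent.
    return f"0x{code & 0xFFFFFFFF:08X}"


def as_signed_32bit(value: int) -> int:
    """Inverse mapping: interpret an unsigned 32-bit value as a signed integer."""
    return value - (1 << 32) if value & 0x80000000 else value


if __name__ == "__main__":
    print(as_unsigned_32bit_hex(-226))
    print(as_signed_32bit(0xFFFFFF1E))
```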
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10564, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11892, iMonCtr=1
Model crash detected, will try to restart...
19:11:01 (10272): No heartbeat from core client for 30 sec - exiting
19:11:02 (10272): No heartbeat from core client for 30 sec - exiting
19:11:04 (10272): No heartbeat from core client for 30 sec - exiting
19:11:05 (10272): No heartbeat from core client for 30 sec - exiting
19:11:06 (10272): No heartbeat from core client for 30 sec - exiting
19:11:07 (10272): No heartbeat from core client for 30 sec - exiting
19:11:08 (10272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:35:37 (11164): No heartbeat from core client for 30 sec - exiting
22:35:38 (11164): No heartbeat from core client for 30 sec - exiting
22:35:40 (11164): No heartbeat from core client for 30 sec - exiting
22:35:41 (11164): No heartbeat from core client for 30 sec - exiting
22:35:42 (11164): No heartbeat from core client for 30 sec - exiting
22:35:43 (11164): No heartbeat from core client for 30 sec - exiting
22:35:44 (11164): No heartbeat from core client for 30 sec - exiting
22:35:45 (11164): No heartbeat from core client for 30 sec - exiting
22:35:46 (11164): No heartbeat from core client for 30 sec - exiting
22:35:47 (11164): No heartbeat from core client for 30 sec - exiting
22:35:48 (11164): No heartbeat from core client for 30 sec - exiting
22:35:49 (11164): No heartbeat from core client for 30 sec - exiting
22:35:50 (11164): No heartbeat from core client for 30 sec - exiting
22:35:51 (11164): No heartbeat from core client for 30 sec - exiting
22:35:52 (11164): No heartbeat from core client for 30 sec - exiting
22:35:53 (11164): No heartbeat from core client for 30 sec - exiting
22:35:54 (11164): No heartbeat from core client for 30 sec - exiting
22:35:55 (11164): No heartbeat from core client for 30 sec - exiting
22:35:56 (11164): No heartbeat from core client for 30 sec - exiting
22:35:57 (11164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11936, iMonCtr=1
Model crash detected, will try to restart...
14:12:50 (3580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:21:52 (11804): No heartbeat from core client for 30 sec - exiting
14:21:53 (11804): No heartbeat from core client for 30 sec - exiting
14:21:54 (11804): No heartbeat from core client for 30 sec - exiting
14:21:55 (11804): No heartbeat from core client for 30 sec - exiting
14:21:56 (11804): No heartbeat from core client for 30 sec - exiting
14:21:57 (11804): No heartbeat from core client for 30 sec - exiting
14:21:58 (11804): No heartbeat from core client for 30 sec - exiting
14:21:59 (11804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11600, iMonCtr=1
Model crash detected, will try to restart...
17:26:29 (11520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:26:30 (11520): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
06:32:17 (12824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:32:19 (12824): No heartbeat from core client for 30 sec - exiting
06:32:20 (12824): No heartbeat from core client for 30 sec - exiting
06:32:21 (12824): No heartbeat from core client for 30 sec - exiting
06:32:22 (12824): No heartbeat from core client for 30 sec - exiting
06:32:23 (12824): No heartbeat from core client for 30 sec - exiting
06:32:24 (12824): No heartbeat from core client for 30 sec - exiting
06:32:25 (12824): No heartbeat from core client for 30 sec - exiting
06:32:26 (12824): No heartbeat from core client for 30 sec - exiting
06:32:27 (12824): No heartbeat from core client for 30 sec - exiting
06:32:28 (12824): No heartbeat from core client for 30 sec - exiting
22:53:22 (12036): No heartbeat from core client for 30 sec - exiting
22:53:23 (12036): No heartbeat from core client for 30 sec - exiting
22:53:25 (12036): No heartbeat from core client for 30 sec - exiting
22:53:26 (12036): No heartbeat from core client for 30 sec - exiting
22:53:27 (12036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12228, iMonCtr=1
Model crash detected, will try to restart...
20:37:59 (9364): No heartbeat from core client for 30 sec - exiting
20:38:00 (9364): No heartbeat from core client for 30 sec - exiting
20:38:01 (9364): No heartbeat from core client for 30 sec - exiting
20:38:02 (9364): No heartbeat from core client for 30 sec - exiting
20:38:03 (9364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:27:37 (11904): No heartbeat from core client for 30 sec - exiting
17:27:38 (11904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:19:07 (11936): No heartbeat from core client for 30 sec - exiting
12:19:10 (11936): No heartbeat from core client for 30 sec - exiting
12:19:11 (11936): No heartbeat from core client for 30 sec - exiting
12:19:12 (11936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
20 Apr 2013 08:43:55  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  622,080   1,636,772       2.6311
19 Apr 2013 05:12:30  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  596,160   1,564,860       2.6249
17 Apr 2013 14:20:27  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  570,240   1,490,745       2.6142
16 Apr 2013 16:56:23  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  544,320   1,424,022       2.6161
16 Apr 2013 02:52:43  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  518,400   1,374,279       2.6510
15 Apr 2013 06:54:45  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  492,480   1,310,631       2.6613
14 Apr 2013 10:37:12  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  466,560   1,245,724       2.6700
13 Apr 2013 18:43:52  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  440,640   1,190,029       2.7007
12 Apr 2013 20:33:12  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  414,720   1,121,400       2.7040
11 Apr 2013 19:53:14  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  388,800   1,045,607       2.6893
10 Apr 2013 19:06:56  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  362,880   970,013         2.6731
09 Apr 2013 00:06:50  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  336,960   895,409         2.6573
07 Apr 2013 20:00:58  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  311,040   820,379         2.6375
06 Apr 2013 17:20:01  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  285,120   743,150         2.6064
05 Apr 2013 18:32:11  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  259,200   672,331         2.5939
05 Apr 2013 02:22:11  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  233,280   615,444         2.6382
04 Apr 2013 10:06:10  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  207,360   557,893         2.6905
03 Apr 2013 16:21:07  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  181,440   497,683         2.7430
02 Apr 2013 08:58:41  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  155,520   431,395         2.7739
31 Mar 2013 19:32:50  1091586  15683830   hadcm3n_3btq_1940_40_008265352_2  129,600   357,579         2.7591
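
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The Python sketch below checks that reading against three rows copied from the table. The function name and the rounding rule are assumptions made for illustration, not something the page states.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by cumulative model timesteps."""
    return cpu_time_sec / timestep


# (timestep, CPU time in seconds, reported average) copied from the table above.
ROWS = [
    (622_080, 1_636_772, 2.6311),
    (596_160, 1_564_860, 2.6249),
    (129_600, 357_579, 2.7591),
]

if __name__ == "__main__":
    for timestep, cpu_time, reported in ROWS:
        computed = seconds_per_timestep(cpu_time, timestep)
        print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```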

