climateprediction.net home page
Task 15481347

Task 15481347

Name hadcm3n_zlgb_1920_40_008256623_0
Workunit 8411747
Created 16 Dec 2012, 22:00:14 UTC
Sent 16 Dec 2012, 22:01:56 UTC
Report deadline 18 Mar 2013, 5:29:07 UTC
Received 20 Jan 2013, 19:27:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 943847
Run time 30 days 15 hours 7 min 9 sec
CPU time 24 days 13 hours 33 min 10 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 2.23 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Ocean Restart file copy failed on zlgbko.dac6440
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:46:38 (752): No heartbeat from core client for 30 sec - exiting
20:46:39 (752): No heartbeat from core client for 30 sec - exiting
20:46:40 (752): No heartbeat from core client for 30 sec - exiting
20:46:41 (752): No heartbeat from core client for 30 sec - exiting
20:46:42 (752): No heartbeat from core client for 30 sec - exiting
20:46:43 (752): No heartbeat from core client for 30 sec - exiting
20:46:44 (752): No heartbeat from core client for 30 sec - exiting
20:46:45 (752): No heartbeat from core client for 30 sec - exiting
20:46:46 (752): No heartbeat from core client for 30 sec - exiting
20:46:47 (752): No heartbeat from core client for 30 sec - exiting
20:46:48 (752): No heartbeat from core client for 30 sec - exiting
20:46:49 (752): No heartbeat from core client for 30 sec - exiting
20:46:50 (752): No heartbeat from core client for 30 sec - exiting
20:46:51 (752): No heartbeat from core client for 30 sec - exiting
20:46:52 (752): No heartbeat from core client for 30 sec - exiting
20:46:53 (752): No heartbeat from core client for 30 sec - exiting
20:46:54 (752): No heartbeat from core client for 30 sec - exiting
20:46:55 (752): No heartbeat from core client for 30 sec - exiting
20:46:56 (752): No heartbeat from core client for 30 sec - exiting
20:46:57 (752): No heartbeat from core client for 30 sec - exiting
20:46:58 (752): No heartbeat from core client for 30 sec - exiting
20:46:59 (752): No heartbeat from core client for 30 sec - exiting
20:47:00 (752): No heartbeat from core client for 30 sec - exiting
20:47:01 (752): No heartbeat from core client for 30 sec - exiting
20:47:02 (752): No heartbeat from core client for 30 sec - exiting
20:47:03 (752): No heartbeat from core client for 30 sec - exiting
20:47:04 (752): No heartbeat from core client for 30 sec - exiting
20:47:05 (752): No heartbeat from core client for 30 sec - exiting
20:47:06 (752): No heartbeat from core client for 30 sec - exiting
20:47:07 (752): No heartbeat from core client for 30 sec - exiting
20:47:08 (752): No heartbeat from core client for 30 sec - exiting
20:47:09 (752): No heartbeat from core client for 30 sec - exiting
20:47:10 (752): No heartbeat from core client for 30 sec - exiting
20:47:11 (752): No heartbeat from core client for 30 sec - exiting
20:47:12 (752): No heartbeat from core client for 30 sec - exiting
20:47:13 (752): No heartbeat from core client for 30 sec - exiting
20:47:14 (752): No heartbeat from core client for 30 sec - exiting
20:47:15 (752): No heartbeat from core client for 30 sec - exiting
20:47:16 (752): No heartbeat from core client for 30 sec - exiting
20:47:17 (752): No heartbeat from core client for 30 sec - exiting
20:47:18 (752): No heartbeat from core client for 30 sec - exiting
20:47:19 (752): No heartbeat from core client for 30 sec - exiting
20:47:20 (752): No heartbeat from core client for 30 sec - exiting
20:47:21 (752): No heartbeat from core client for 30 sec - exiting
20:47:22 (752): No heartbeat from core client for 30 sec - exiting
20:47:23 (752): No heartbeat from core client for 30 sec - exiting
20:47:24 (752): No heartbeat from core client for 30 sec - exiting
20:47:25 (752): No heartbeat from core client for 30 sec - exiting
20:47:26 (752): No heartbeat from core client for 30 sec - exiting
20:47:27 (752): No heartbeat from core client for 30 sec - exiting
20:47:28 (752): No heartbeat from core client for 30 sec - exiting
20:47:29 (752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:50:09 (5464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:31:34 (4084): No heartbeat from core client for 30 sec - exiting
09:31:36 (4084): No heartbeat from core client for 30 sec - exiting
09:31:37 (4084): No heartbeat from core client for 30 sec - exiting
09:31:38 (4084): No heartbeat from core client for 30 sec - exiting
09:31:39 (4084): No heartbeat from core client for 30 sec - exiting
09:31:40 (4084): No heartbeat from core client for 30 sec - exiting
09:31:41 (4084): No heartbeat from core client for 30 sec - exiting
09:31:42 (4084): No heartbeat from core client for 30 sec - exiting
09:31:43 (4084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zlgbko.pje1c10
Error converting file to netcdf: dataout/zlgbko.pie1c10
Error converting file to netcdf: dataout/zlgbko.pfe1c10
Error converting file to netcdf: dataout/zlgbka.phe1c10
Error converting file to netcdf: dataout/zlgbka.pge1c10
Error converting file to netcdf: dataout/zlgbka.pee1c10
Error converting file to netcdf: dataout/zlgbka.pde1c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:55:53 (1988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:39:13 (5612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:44:19 (5612): No heartbeat from core client for 30 sec - exiting
18:42:35 (6088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Ocean Restart file copy failed on zlgbko.daf0560
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:58:44 (2236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:11:26 (5308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77893541 read attempt to address 0x40A6C301

Engaging BOINC Windows Runtime Debugger...



Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77893541 read attempt to address 0x40A6C301

Engaging BOINC Windows Runtime Debugger...

Cannot serialize file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zlgb_1920_40_008256623/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Jan 2013 10:49:59 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 1,036,800 2,121,647 2.0463
19 Jan 2013 14:50:33 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 1,010,880 2,063,997 2.0418
18 Jan 2013 17:32:23 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 984,960 2,004,522 2.0351
17 Jan 2013 11:20:04 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 959,040 1,945,954 2.0291
16 Jan 2013 02:16:07 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 933,120 1,887,971 2.0233
14 Jan 2013 22:46:05 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 907,200 1,830,341 2.0176
14 Jan 2013 01:37:00 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 881,280 1,772,613 2.0114
12 Jan 2013 22:52:33 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 855,360 1,715,512 2.0056
12 Jan 2013 02:53:12 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 829,440 1,660,271 2.0017
11 Jan 2013 09:44:27 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 803,520 1,606,498 1.9993
10 Jan 2013 13:35:21 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 777,600 1,552,555 1.9966
08 Jan 2013 18:49:41 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 751,680 1,501,690 1.9978
08 Jan 2013 05:21:19 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 725,760 1,453,268 2.0024
07 Jan 2013 16:21:55 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 699,840 1,409,987 2.0147
05 Jan 2013 10:56:16 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 648,000 1,313,482 2.0270
04 Jan 2013 16:59:52 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 622,080 1,262,428 2.0294
03 Jan 2013 22:07:33 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 596,160 1,211,272 2.0318
03 Jan 2013 01:19:17 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 570,240 1,159,448 2.0333
02 Jan 2013 05:49:28 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 544,320 1,107,271 2.0342
01 Jan 2013 08:51:22 943847 15481347 hadcm3n_zlgb_1920_40_008256623_0 518,400 1,055,903 2.0368


©2024 climateprediction.net