Task 11014363

Name hadsm3dhet2_jowd_006595263_5
Workunit 6798636
Created 15 Mar 2010, 12:01:03 UTC
Sent 4 Oct 2010, 14:57:21 UTC
Report deadline 16 Sep 2011, 20:17:21 UTC
Received 29 Mar 2011, 0:31:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED
Computer ID 1070025
Run time 103 days 18 hours 55 min 9 sec
CPU time 101 days 1 hours 38 min 52 sec
Validate state Invalid
Credit 5,557.63
Device peak FLOPS 2.74 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
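The exit status above is given both as a signed decimal (-177) and as a hexadecimal value (0xFFFFFF4F). As a small sketch, assuming the hexadecimal form is the 32-bit two's-complement representation of the signed value, the two can be checked against each other. The function names here are illustrative and not part of BOINC.

```python
def to_hex32(status: int) -> str:
    """Format a signed 32-bit exit status as an unsigned hex string."""
    # Masking with 0xFFFFFFFF maps negative values to their
    # two's-complement unsigned equivalent.
    return f"0x{status & 0xFFFFFFFF:08X}"


def from_hex32(text: str) -> int:
    """Interpret a 32-bit hex string as a signed integer."""
    value = int(text, 16)
    # If the sign bit is set, subtract 2**32 to recover the negative value.
    return value - (1 << 32) if value & 0x80000000 else value


print(to_hex32(-177))
print(from_hex32("0xFFFFFF4F"))
```

Under that assumption, both directions agree with the pair of values shown in the Exit status field.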
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	09:43:42 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:17 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
MainError:	07:57:18 PM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5716, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7528, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Abort request from BOINC...
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
17 Nov 2010 12:23:10  1070025  11014363   hadsm3dhet2_jowd_006595263_5    86,416       2,249,154            3.7182
02 Nov 2010 04:07:51  1070025  11014363   hadsm3dhet2_jowd_006595263_5    75,614       1,511,119            2.5435
20 Oct 2010 03:19:09  1070025  11014363   hadsm3dhet2_jowd_006595263_5    64,812         816,187            1.3992
19 Oct 2010 21:27:21  1070025  11014363   hadsm3dhet2_jowd_006595263_5    54,010         800,726            1.3986
19 Oct 2010 15:26:21  1070025  11014363   hadsm3dhet2_jowd_006595263_5    43,208         785,385            1.3982
19 Oct 2010 11:13:44  1070025  11014363   hadsm3dhet2_jowd_006595263_5    32,406         770,187            1.3980
19 Oct 2010 06:19:51  1070025  11014363   hadsm3dhet2_jowd_006595263_5    21,604         755,063            1.3980
19 Oct 2010 03:47:46  1070025  11014363   hadsm3dhet2_jowd_006595263_5    10,802         739,914            1.3979
18 Oct 2010 19:59:25  1070025  11014363   hadsm3dhet2_jowd_006595263_5   259,248         724,921            1.3981
18 Oct 2010 12:45:00  1070025  11014363   hadsm3dhet2_jowd_006595263_5   248,446         709,751            1.3980
18 Oct 2010 05:22:30  1070025  11014363   hadsm3dhet2_jowd_006595263_5   237,644         694,616            1.3979
17 Oct 2010 15:55:15  1070025  11014363   hadsm3dhet2_jowd_006595263_5   226,842         679,455            1.3978
17 Oct 2010 11:42:30  1070025  11014363   hadsm3dhet2_jowd_006595263_5   216,040         664,404            1.3979
16 Oct 2010 20:17:32  1070025  11014363   hadsm3dhet2_jowd_006595263_5   205,238         649,487            1.3983
16 Oct 2010 13:58:45  1070025  11014363   hadsm3dhet2_jowd_006595263_5   194,436         634,469            1.3985
16 Oct 2010 09:37:31  1070025  11014363   hadsm3dhet2_jowd_006595263_5   183,634         619,638            1.3991
16 Oct 2010 04:09:23  1070025  11014363   hadsm3dhet2_jowd_006595263_5   172,832         604,468            1.3990
15 Oct 2010 23:50:13  1070025  11014363   hadsm3dhet2_jowd_006595263_5   162,030         589,119            1.3984
15 Oct 2010 15:04:25  1070025  11014363   hadsm3dhet2_jowd_006595263_5   151,228         573,741            1.3977
15 Oct 2010 10:44:31  1070025  11014363   hadsm3dhet2_jowd_006595263_5   140,426         558,282            1.3968
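The Average (sec/TS) figures in the trickle table appear consistent with cumulative CPU time divided by the total number of timesteps completed so far, assuming the Timestep counter restarts after 259,248 steps. This is an inference from the numbers in the table, not documented behaviour, and both the constant and the function name below are hypothetical. A minimal sketch of that reading:

```python
# Assumed: the Timestep counter wraps after this many steps
# (the largest Timestep value that appears in the table).
STEPS_PER_PHASE = 259_248


def average_sec_per_ts(cpu_time_sec: int, timestep: int, completed_phases: int) -> float:
    """Cumulative CPU seconds divided by all timesteps run so far."""
    total_steps = completed_phases * STEPS_PER_PHASE + timestep
    return cpu_time_sec / total_steps


# Row sent 15 Oct 2010 10:44:31, assuming one phase was already complete.
print(round(average_sec_per_ts(558_282, 140_426, 1), 4))
# Row sent 17 Nov 2010 12:23:10, assuming two phases were already complete.
print(round(average_sec_per_ts(2_249_154, 86_416, 2), 4))
```

If this reading is right, the jump from roughly 1.40 to 3.72 sec/TS between 20 Oct and 17 Nov reflects CPU time accumulating much faster than timesteps over that period.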