climateprediction.net home page
Task 18419162

Task 18419162

Name hadam3p_anz_f72h_2012_1_009778526_2
Workunit 9834490
Created 7 May 2015, 19:28:16 UTC
Sent 7 May 2015, 19:36:06 UTC
Report deadline 19 Apr 2016, 0:56:06 UTC
Received 26 May 2015, 8:07:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1296485
Run time 10 days 17 hours 42 min 41 sec
CPU time 10 days 5 hours 0 min 3 sec
Validate state Invalid
Credit 3,490.64
Device peak FLOPS 2.55 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
00:53:54 (15644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:36:26 (8924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:36:27 (8924): No heartbeat from core client for 30 sec - exiting
11:36:28 (8924): No heartbeat from core client for 30 sec - exiting
11:36:29 (8924): No heartbeat from core client for 30 sec - exiting
11:36:30 (8924): No heartbeat from core client for 30 sec - exiting
11:36:31 (8924): No heartbeat from core client for 30 sec - exiting
11:36:32 (8924): No heartbeat from core client for 30 sec - exiting
11:36:33 (8924): No heartbeat from core client for 30 sec - exiting
11:36:34 (8924): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5020, selfPID=6088, iMonCtr=1
Model crash detected, will try to restart...
09:05:46 (5764): No heartbeat from core client for 30 sec - exiting
09:05:47 (5764): No heartbeat from core client for 30 sec - exiting
09:05:48 (5764): No heartbeat from core client for 30 sec - exiting
09:05:49 (5764): No heartbeat from core client for 30 sec - exiting
09:05:50 (5764): No heartbeat from core client for 30 sec - exiting
09:05:51 (5764): No heartbeat from core client for 30 sec - exiting
09:05:52 (5764): No heartbeat from core client for 30 sec - exiting
09:05:53 (5764): No heartbeat from core client for 30 sec - exiting
09:05:54 (5764): No heartbeat from core client for 30 sec - exiting
09:05:55 (5764): No heartbeat from core client for 30 sec - exiting
09:05:56 (5764): No heartbeat from core client for 30 sec - exiting
09:05:57 (5764): No heartbeat from core client for 30 sec - exiting
09:05:58 (5764): No heartbeat from core client for 30 sec - exiting
09:05:59 (5764): No heartbeat from core client for 30 sec - exiting
09:06:00 (5764): No heartbeat from core client for 30 sec - exiting
09:06:01 (5764): No heartbeat from core client for 30 sec - exiting
09:06:02 (5764): No heartbeat from core client for 30 sec - exiting
09:06:03 (5764): No heartbeat from core client for 30 sec - exiting
09:06:04 (5764): No heartbeat from core client for 30 sec - exiting
09:06:05 (5764): No heartbeat from core client for 30 sec - exiting
09:06:06 (5764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8100, selfPID=7656, iMonCtr=1
Model crash detected, will try to restart...
09:06:15 (6728): No heartbeat from core client for 30 sec - exiting
09:06:16 (6728): No heartbeat from core client for 30 sec - exiting
09:06:17 (6728): No heartbeat from core client for 30 sec - exiting
09:06:18 (6728): No heartbeat from core client for 30 sec - exiting
09:06:19 (6728): No heartbeat from core client for 30 sec - exiting
09:06:20 (6728): No heartbeat from core client for 30 sec - exiting
09:06:21 (6728): No heartbeat from core client for 30 sec - exiting
09:06:22 (6728): No heartbeat from core client for 30 sec - exiting
09:06:23 (6728): No heartbeat from core client for 30 sec - exiting
09:06:24 (6728): No heartbeat from core client for 30 sec - exiting
09:06:25 (6728): No heartbeat from core client for 30 sec - exiting
09:06:26 (6728): No heartbeat from core client for 30 sec - exiting
09:06:27 (6728): No heartbeat from core client for 30 sec - exiting
09:06:28 (6728): No heartbeat from core client for 30 sec - exiting
09:06:29 (6728): No heartbeat from core client for 30 sec - exiting
09:06:30 (6728): No heartbeat from core client for 30 sec - exiting
09:06:31 (6728): No heartbeat from core client for 30 sec - exiting
09:06:32 (6728): No heartbeat from core client for 30 sec - exiting
09:06:33 (6728): No heartbeat from core client for 30 sec - exiting
09:06:34 (6728): No heartbeat from core client for 30 sec - exiting
09:06:35 (6728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6920, selfPID=6920, iMonCtr=2
09:20:18 (6744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:06:05 (7072): No heartbeat from core client for 30 sec - exiting
09:06:06 (7072): No heartbeat from core client for 30 sec - exiting
09:06:07 (7072): No heartbeat from core client for 30 sec - exiting
09:06:08 (7072): No heartbeat from core client for 30 sec - exiting
09:06:09 (7072): No heartbeat from core client for 30 sec - exiting
09:06:10 (7072): No heartbeat from core client for 30 sec - exiting
09:06:11 (7072): No heartbeat from core client for 30 sec - exiting
09:06:12 (7072): No heartbeat from core client for 30 sec - exiting
09:06:13 (7072): No heartbeat from core client for 30 sec - exiting
09:06:14 (7072): No heartbeat from core client for 30 sec - exiting
09:06:15 (7072): No heartbeat from core client for 30 sec - exiting
09:06:16 (7072): No heartbeat from core client for 30 sec - exiting
09:06:17 (7072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4136, selfPID=4136, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4916, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7596, selfPID=4044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6912, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7444, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7548, selfPID=4372, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6984, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3868, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6936, selfPID=4580, iMonCtr=1
Model crash detected, will try to restart...
00:52:02 (4104): No heartbeat from core client for 30 sec - exiting
00:52:03 (4104): No heartbeat from core client for 30 sec - exiting
00:52:04 (4104): No heartbeat from core client for 30 sec - exiting
00:52:05 (4104): No heartbeat from core client for 30 sec - exiting
00:52:06 (4104): No heartbeat from core client for 30 sec - exiting
00:52:07 (4104): No heartbeat from core client for 30 sec - exiting
00:52:08 (4104): No heartbeat from core client for 30 sec - exiting
00:52:09 (4104): No heartbeat from core client for 30 sec - exiting
00:52:10 (4104): No heartbeat from core client for 30 sec - exiting
00:52:11 (4104): No heartbeat from core client for 30 sec - exiting
00:52:12 (4104): No heartbeat from core client for 30 sec - exiting
00:52:13 (4104): No heartbeat from core client for 30 sec - exiting
00:52:14 (4104): No heartbeat from core client for 30 sec - exiting
00:52:15 (4104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7708, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7956, iMonCtr=2
C09:06:24 (5152): No heartbeat from core client for 30 sec - exiting
09:06:25 (5152): No heartbeat from core client for 30 sec - exiting
09:06:26 (5152): No heartbeat from core client for 30 sec - exiting
09:06:27 (5152): No heartbeat from core client for 30 sec - exiting
09:06:28 (5152): No heartbeat from core client for 30 sec - exiting
09:06:29 (5152): No heartbeat from core client for 30 sec - exiting
09:06:30 (5152): No heartbeat from core client for 30 sec - exiting
09:06:31 (5152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1216, selfPID=1216, iMonCtr=2
09:08:01 (6332): No heartbeat from core client for 30 sec - exiting
09:08:02 (6332): No heartbeat from core client for 30 sec - exiting
09:08:03 (6332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7204, iMonCtr=2
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7492, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5924, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=2
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=4744, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_f72h_2012_1_009778526_2_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_f72h_2012_1_009778526_2_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_f72h_2012_1_009778526_2_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_f72h_2012_1_009778526_2_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_f72h_2012_1_009778526_2_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 May 2015 12:29:22 1296485 18419162 hadam3p_anz_f72h_2012_1_009778526_2 80,939 796,567 9.8416
22 May 2015 11:17:56 1296485 18419162 hadam3p_anz_f72h_2012_1_009778526_2 69,419 681,928 9.8234
19 May 2015 21:07:02 1296485 18419162 hadam3p_anz_f72h_2012_1_009778526_2 57,899 569,488 9.8359
18 May 2015 12:14:47 1296485 18419162 hadam3p_anz_f72h_2012_1_009778526_2 46,379 455,946 9.8309
14 May 2015 22:29:33 1296485 18419162 hadam3p_anz_f72h_2012_1_009778526_2 34,859 341,587 9.7991
12 May 2015 16:51:16 1296485 18419162 hadam3p_anz_f72h_2012_1_009778526_2 23,339 230,227 9.8645
10 May 2015 12:29:48 1296485 18419162 hadam3p_anz_f72h_2012_1_009778526_2 11,819 117,202 9.9164


©2024 climateprediction.net