Name | hadam3p_pnw_h1y2_2012_1_009207237_0 |
Workunit | 9332859 |
Created | 21 Nov 2014, 17:40:13 UTC |
Sent | 21 Nov 2014, 22:55:00 UTC |
Report deadline | 4 Nov 2015, 4:15:00 UTC |
Received | 8 Jan 2015, 18:06:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1279759 |
Run time | 1 days 3 hours 5 min 27 sec |
CPU time | 1 days 2 hours 8 min |
Validate state | Invalid |
Credit | 507.13 |
Device peak FLOPS | 2.35 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v7.22 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> 21:15:16 (2504): No heartbeat from client for 30 sec - exiting 21:15:16 (2504): timer handler: client dead, exiting 21:15:17 (2504): No heartbeat from client for 30 sec - exiting 21:15:17 (2504): timer handler: client dead, exiting 21:15:18 (2504): No heartbeat from client for 30 sec - exiting 21:15:18 (2504): timer handler: client dead, exiting 21:15:19 (2504): No heartbeat from client for 30 sec - exiting 21:15:19 (2504): timer handler: client dead, exiting CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1592, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4084, selfPID=2252, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3208, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2664, selfPID=1084, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2140, selfPID=3624, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=916, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=768, selfPID=2696, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1344, selfPID=3272, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:41:27 (3924): No heartbeat from client for 30 sec - exiting 18:41:27 (3924): timer handler: client dead, exiting 18:41:28 (3924): No heartbeat from client for 30 sec - exiting 18:41:28 (3924): timer handler: client dead, exiting 18:41:29 (3924): No heartbeat from client for 30 sec - exiting 18:41:29 (3924): timer handler: client dead, exiting 18:41:30 (3924): No heartbeat from client for 30 sec - exiting 18:41:30 (3924): timer handler: client dead, exiting 18:41:32 (3924): No heartbeat from client for 30 sec - exiting 18:41:32 (3924): timer handler: client dead, exiting 18:41:33 (3924): No heartbeat from client for 30 sec - exiting 18:41:33 (3924): timer handler: client dead, exiting 18:41:34 (3924): No heartbeat from client for 30 sec - exiting 18:41:34 (3924): timer handler: client dead, exiting 18:41:35 (3924): No heartbeat from client for 30 sec - exiting 18:41:35 (3924): timer handler: client dead, exiting 18:41:36 (3924): No heartbeat from client for 30 sec - exiting 18:41:36 (3924): timer handler: client dead, exiting 18:41:37 (3924): No heartbeat from client for 30 sec - exiting 18:41:37 (3924): timer handler: client dead, exiting 18:41:38 (3924): No heartbeat from client for 30 sec - exiting 18:41:38 (3924): timer handler: client dead, exiting 18:41:39 (3924): No heartbeat from client for 30 sec - exiting 18:41:39 (3924): timer handler: client dead, exiting 18:41:40 (3924): No heartbeat from client for 30 sec - exiting 18:41:40 (3924): timer handler: client dead, exiting 18:41:41 (3924): No heartbeat from client for 30 sec - exiting 18:41:41 (3924): timer handler: client dead, exiting 18:41:42 (3924): No heartbeat from client for 30 sec - exiting 18:41:42 (3924): timer handler: client dead, exiting 18:41:44 (3924): No heartbeat from client for 30 sec - exiting 18:41:44 (3924): timer handler: client dead, exiting 18:41:45 (3924): No heartbeat from client for 30 sec - exiting 18:41:45 (3924): timer handler: client dead, exiting 18:41:46 (3924): No heartbeat from client for 30 sec - exiting 18:41:46 (3924): timer handler: client dead, exiting 18:41:47 (3924): No heartbeat from client for 30 sec - exiting 18:41:47 (3924): timer handler: client dead, exiting 18:41:48 (3924): No heartbeat from client for 30 sec - exiting 18:41:48 (3924): timer handler: client dead, exiting 18:41:49 (3924): No heartbeat from client for 30 sec - exiting 18:41:49 (3924): timer handler: client dead, exiting 18:41:50 (3924): No heartbeat from client for 30 sec - exiting 18:41:50 (3924): timer handler: client dead, exiting 18:41:52 (3924): No heartbeat from client for 30 sec - exiting 18:41:52 (3924): timer handler: client dead, exiting 18:41:53 (3924): No heartbeat from client for 30 sec - exiting 18:41:53 (3924): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 13:24:50 (3732): No heartbeat from client for 30 sec - exiting 13:24:50 (3732): timer handler: client dead, exiting 13:24:51 (3732): No heartbeat from client for 30 sec - exiting 13:24:51 (3732): timer handler: client dead, exiting 13:24:52 (3732): No heartbeat from client for 30 sec - exiting 13:24:52 (3732): timer handler: client dead, exiting 13:24:54 (3732): No heartbeat from client for 30 sec - exiting 13:24:54 (3732): timer handler: client dead, exiting 13:24:55 (3732): No heartbeat from client for 30 sec - exiting 13:24:55 (3732): timer handler: client dead, exiting 13:24:56 (3732): No heartbeat from client for 30 sec - exiting 13:24:56 (3732): timer handler: client dead, exiting 13:24:57 (3732): No heartbeat from client for 30 sec - exiting 13:24:57 (3732): timer handler: client dead, exiting 13:24:58 (3732): No heartbeat from client for 30 sec - exiting 13:24:58 (3732): timer handler: client dead, exiting 13:24:59 (3732): No heartbeat from client for 30 sec - exiting 13:24:59 (3732): timer handler: client dead, exiting 13:25:00 (3732): No heartbeat from client for 30 sec - exiting 13:25:00 (3732): timer handler: client dead, exiting 13:25:01 (3732): No heartbeat from client for 30 sec - exiting 13:25:01 (3732): timer handler: client dead, exiting 13:25:02 (3732): No heartbeat from client for 30 sec - exiting 13:25:02 (3732): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=976, selfPID=976, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3072, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3924, selfPID=2488, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:51:55 (2940): No heartbeat from client for 30 sec - exiting 17:51:56 (2940): timer handler: client dead, exiting 17:51:57 (2940): No heartbeat from client for 30 sec - exiting 17:51:57 (2940): timer handler: client dead, exiting 17:51:58 (2940): No heartbeat from client for 30 sec - exiting 17:51:58 (2940): timer handler: client dead, exiting 17:51:59 (2940): No heartbeat from client for 30 sec - exiting 17:51:59 (2940): timer handler: client dead, exiting 17:52:00 (2940): No heartbeat from client for 30 sec - exiting 17:52:00 (2940): timer handler: client dead, exiting 17:52:01 (2940): No heartbeat from client for 30 sec - exiting 17:52:01 (2940): timer handler: client dead, exiting 17:52:02 (2940): No heartbeat from client for 30 sec - exiting 17:52:02 (2940): timer handler: client dead, exiting 17:52:03 (2940): No heartbeat from client for 30 sec - exiting 17:52:03 (2940): timer handler: client dead, exiting 17:52:04 (2940): No heartbeat from client for 30 sec - exiting 17:52:04 (2940): timer handler: client dead, exiting 17:52:05 (2940): No heartbeat from client for 30 sec - exiting 17:52:05 (2940): timer handler: client dead, exiting 17:52:06 (2940): No heartbeat from client for 30 sec - exiting 17:52:06 (2940): timer handler: client dead, exiting 17:52:08 (2940): No heartbeat from client for 30 sec - exiting 17:52:08 (2940): timer handler: client dead, exiting 17:52:09 (2940): No heartbeat from client for 30 sec - exiting 17:52:09 (2940): timer handler: client dead, exiting 17:52:10 (2940): No heartbeat from client for 30 sec - exiting 17:52:10 (2940): timer handler: client dead, exiting 17:52:11 (2940): No heartbeat from client for 30 sec - exiting 17:52:11 (2940): timer handler: client dead, exiting 17:52:12 (2940): No heartbeat from client for 30 sec - exiting 17:52:12 (2940): timer handler: client dead, exiting 17:52:13 (2940): No heartbeat from client for 30 sec - exiting 17:52:13 (2940): timer handler: client dead, exiting 17:52:14 (2940): No heartbeat from client for 30 sec - exiting 17:52:14 (2940): timer handler: client dead, exiting 17:52:15 (2940): No heartbeat from client for 30 sec - exiting 17:52:15 (2940): timer handler: client dead, exiting 17:52:16 (2940): No heartbeat from client for 30 sec - exiting 17:52:16 (2940): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:59:29 (3984): No heartbeat from client for 30 sec - exiting 19:59:29 (3984): timer handler: client dead, exiting 19:59:30 (3984): No heartbeat from client for 30 sec - exiting 19:59:30 (3984): timer handler: client dead, exiting 19:59:31 (3984): No heartbeat from client for 30 sec - exiting 19:59:31 (3984): timer handler: client dead, exiting 19:59:32 (3984): No heartbeat from client for 30 sec - exiting 19:59:32 (3984): timer handler: client dead, exiting 19:59:33 (3984): No heartbeat from client for 30 sec - exiting 19:59:33 (3984): timer handler: client dead, exiting 19:59:35 (3984): No heartbeat from client for 30 sec - exiting 19:59:35 (3984): timer handler: client dead, exiting 19:59:36 (3984): No heartbeat from client for 30 sec - exiting 19:59:36 (3984): timer handler: client dead, exiting 19:59:37 (3984): No heartbeat from client for 30 sec - exiting 19:59:37 (3984): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=196, selfPID=196, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=196, selfPID=3460, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 20:01:08 (3460): called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_h1y2_2012_1_009207237_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Dec 2014 18:08:09 | 1279759 | 17463937 | hadam3p_pnw_h1y2_2012_1_009207237_0 | 23,339 | 72,206 | 3.0938 |
04 Dec 2014 19:26:43 | 1279759 | 17463937 | hadam3p_pnw_h1y2_2012_1_009207237_0 | 11,819 | 37,435 | 3.1674 |
©2024 climateprediction.net