Name | hadam3p_pnw_8rvq_2003_1_007709090_2 |
Workunit | 7864198 |
Created | 7 Feb 2012, 0:22:01 UTC |
Sent | 7 Feb 2012, 0:42:55 UTC |
Report deadline | 19 Jan 2013, 6:02:55 UTC |
Received | 13 Feb 2012, 5:26:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED |
Computer ID | 1097837 |
Run time | 5 days 8 hours 56 min 49 sec |
CPU time | 4 days 14 hours 57 min 44 sec |
Validate state | Invalid |
Credit | 1,003.35 |
Device peak FLOPS | 1.42 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <message> Maximum memory exceeded </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 08:49:09 (3964): No heartbeat from core client for 30 sec - exiting 08:49:10 (3964): No heartbeat from core client for 30 sec - exiting 08:49:11 (3964): No heartbeat from core client for 30 sec - exiting 08:49:13 (3964): No heartbeat from core client for 30 sec - exiting 08:49:14 (3964): No heartbeat from core client for 30 sec - exiting 08:49:15 (3964): No heartbeat from core client for 30 sec - exiting 08:49:16 (3964): No heartbeat from core client for 30 sec - exiting 08:49:17 (3964): No heartbeat from core client for 30 sec - exiting 08:49:18 (3964): No heartbeat from core client for 30 sec - exiting 08:49:19 (3964): No heartbeat from core client for 30 sec - exiting 08:49:20 (3964): No heartbeat from core client for 30 sec - exiting 08:49:21 (3964): No heartbeat from core client for 30 sec - exiting 08:49:22 (3964): No heartbeat from core client for 30 sec - exiting 08:49:23 (3964): No heartbeat from core client for 30 sec - exiting 08:49:24 (3964): No heartbeat from core client for 30 sec - exiting 08:49:25 (3964): No heartbeat from core client for 30 sec - exiting 08:49:26 (3964): No heartbeat from core client for 30 sec - exiting 08:49:27 (3964): No heartbeat from core client for 30 sec - exiting 08:49:28 (3964): No heartbeat from core client for 30 sec - exiting 08:49:29 (3964): No heartbeat from core client for 30 sec - exiting 08:49:30 (3964): No heartbeat from core client for 30 sec - exiting 08:49:31 (3964): No heartbeat from core client for 30 sec - exiting 08:49:32 (3964): No heartbeat from core client for 30 sec - exiting 08:49:33 (3964): No heartbeat from core client for 30 sec - exiting 08:49:34 (3964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:40:44 (360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:45 (360): No heartbeat from core client for 30 sec - exiting 09:40:46 (360): No heartbeat from core client for 30 sec - exiting 13:35:40 (3412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:35:41 (3412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:34:32 (5088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:34:33 (5088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 00:33:29 (6676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:33:31 (6676): No heartbeat from core client for 30 sec - exiting 05:32:27 (5384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:32:28 (5384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 10:31:24 (9860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:30:16 (2728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:30:17 (2728): No heartbeat from core client for 30 sec - exiting 21:29:12 (7496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:29:13 (7496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:27:55 (6632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:27:57 (6632): No heartbeat from core client for 30 sec - exiting 01:27:58 (6632): No heartbeat from core client for 30 sec - exiting 01:27:59 (6632): No heartbeat from core client for 30 sec - exiting 03:26:52 (9888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:26:53 (9888): No heartbeat from core client for 30 sec - exiting 05:25:51 (6020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:25:52 (6020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:24:45 (6936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:24:46 (6936): No heartbeat from core client for 30 sec - exiting 16:23:36 (7368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:23:37 (7368): No heartbeat from core client for 30 sec - exiting 21:58:03 (9004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:58:04 (9004): No heartbeat from core client for 30 sec - exiting 21:58:06 (9004): No heartbeat from core client for 30 sec - exiting 21:58:07 (9004): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9684, selfPID=9684, iMonCtr=2 21:58:08 (9004): No heartbeat from core client for 30 sec - exiting 21:58:09 (9004): No heartbeat from core client for 30 sec - exiting 21:58:10 (9004): No heartbeat from core client for 30 sec - exiting 21:58:11 (9004): No heartbeat from core client for 30 sec - exiting 21:58:12 (9004): No heartbeat from core client for 30 sec - exiting 21:58:13 (9004): No heartbeat from core client for 30 sec - exiting 21:58:14 (9004): No heartbeat from core client for 30 sec - exiting 21:58:15 (9004): No heartbeat from core client for 30 sec - exiting 22:22:25 (3616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:22 (9496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:23 (9496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9240, selfPID=9240, iMonCtr=2 13:19:07 (5252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5388, selfPID=5388, iMonCtr=2 18:18:03 (5912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:17:03 (9740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:17:04 (9740): No heartbeat from core client for 30 sec - exiting 04:16:01 (2912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:14:59 (6880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 04:13:50 (9788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:13:52 (9788):09:12:48 (5508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4624, selfPID=4624, iMonCtr=2 09:12:49 (5508): No heartbeat from core client for 30 sec - exiting 14:11:43 (9876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:11:44 (9876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7024, selfPID=7024, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 18:10:36 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:10:37 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 21:09:04 (7656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:09:05 (7656): No heartbeat from core client for 30 sec - exiting 23:08:00 (7596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:08:01 (7596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Abort request from BOINC... Regional yearly means requires 12 input files got 4 Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Feb 2012 23:53:46 | 1097837 | 14072158 | hadam3p_pnw_8rvq_2003_1_007709090_2 | 46,176 | 320,458 | 6.9399 |
10 Feb 2012 23:33:56 | 1097837 | 14072158 | hadam3p_pnw_8rvq_2003_1_007709090_2 | 34,656 | 241,401 | 6.9656 |
09 Feb 2012 19:57:54 | 1097837 | 14072158 | hadam3p_pnw_8rvq_2003_1_007709090_2 | 23,136 | 157,353 | 6.8012 |
08 Feb 2012 17:02:19 | 1097837 | 14072158 | hadam3p_pnw_8rvq_2003_1_007709090_2 | 11,652 | 79,682 | 6.8385 |
08 Feb 2012 17:02:19 | 1097837 | 14072158 | hadam3p_pnw_8rvq_2003_1_007709090_2 | 11,634 | 78,649 | 6.7603 |
08 Feb 2012 15:56:08 | 1097837 | 14072158 | hadam3p_pnw_8rvq_2003_1_007709090_2 | 11,616 | 77,680 | 6.6873 |
©2024 climateprediction.net