Name | hadcm3n_3mk9_1980_40_008319896_0 |
Workunit | 8471031 |
Created | 24 Feb 2013, 11:44:04 UTC |
Sent | 24 Feb 2013, 11:44:32 UTC |
Report deadline | 26 May 2013, 19:11:43 UTC |
Received | 10 Apr 2013, 22:19:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1228270 |
Run time | 5 days 8 hours 13 min 58 sec |
CPU time | 4 days 18 hours 16 min 50 sec |
Validate state | Invalid |
Credit | 6,842.88 |
Device peak FLOPS | 3.51 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognise the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:34:42 (7196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:35:44 (6284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:52:18 (784): No heartbeat from core client for 30 sec - exiting 10:52:19 (784): No heartbeat from core client for 30 sec - exiting 10:52:20 (784): No heartbeat from core client for 30 sec - exiting 10:52:21 (784): No heartbeat from core client for 30 sec - exiting 10:52:23 (784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:54:08 (5316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:54:55 (1332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:12:36 (1228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:42:05 (6284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 08:20:41 (7060): No heartbeat from core client for 30 sec - exiting 08:20:43 (7060): No heartbeat from core client for 30 sec - exiting 08:20:44 (7060): No heartbeat from core client for 30 sec - exiting 08:20:45 (7060): No heartbeat from core client for 30 sec - exiting 08:20:46 (7060): No heartbeat from core client for 30 sec - exiting 08:20:47 (7060): No heartbeat from core client for 30 sec - exiting 08:20:48 (7060): No heartbeat from core client for 30 sec - exiting 08:20:49 (7060): No heartbeat from core client for 30 sec - exiting 08:20:50 (7060): No heartbeat from core client for 30 sec - exiting 08:20:51 (7060): No heartbeat from core client for 30 sec - exiting 08:20:53 (7060): No heartbeat from core client for 30 sec - exiting 08:20:54 (7060): No heartbeat from core client for 30 sec - exiting 08:20:55 (7060): No heartbeat from core client for 30 sec - exiting 08:20:56 (7060): No heartbeat from core client for 30 sec - exiting 08:20:57 (7060): No heartbeat from core client for 30 sec - exiting 08:20:58 (7060): No heartbeat from core client for 30 sec - exiting 08:20:59 (7060): No heartbeat from core client for 30 sec - exiting 08:21:00 (7060): No heartbeat from core client for 30 sec - exiting 08:21:01 (7060): No heartbeat from core client for 30 sec - exiting 08:21:02 (7060): No heartbeat from core client for 30 sec - exiting 08:21:03 (7060): No heartbeat from core client for 30 sec - exiting 08:21:05 (7060): No heartbeat from core client for 30 sec - exiting 08:21:06 (7060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:11:50 (2372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:11:51 (2372): No heartbeat from core client for 30 sec - exiting 12:11:52 (2372): No heartbeat from core client for 30 sec - exiting 12:12:36 (768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:20:25 (7988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:20:26 (7988): No heartbeat from core client for 30 sec - exiting 11:20:27 (7988): No heartbeat from core client for 30 sec - exiting 11:21:05 (7848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:29:04 (8104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:37:25 (6880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:37:26 (6880): No heartbeat from core client for 30 sec - exiting 07:37:27 (6880): No heartbeat from core client for 30 sec - exiting 07:37:28 (6880): No heartbeat from core client for 30 sec - exiting 07:55:53 (6440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:57:02 (3464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:57:51 (7116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:58:35 (7048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:59:24 (1064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:00:53 (3984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:01:48 (2700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:44:29 (5016): No heartbeat from core client for 30 sec - exiting 12:44:30 (5016): No heartbeat from core client for 30 sec - exiting 12:44:31 (5016): No heartbeat from core client for 30 sec - exiting 12:44:32 (5016): No heartbeat from core client for 30 sec - exiting 12:44:33 (5016): No heartbeat from core client for 30 sec - exiting 12:44:35 (5016): No heartbeat from core client for 30 sec - exiting 12:44:36 (5016): No heartbeat from core client for 30 sec - exiting 12:44:37 (5016): No heartbeat from core client for 30 sec - exiting 12:44:38 (5016): No heartbeat from core client for 30 sec - exiting 12:44:39 (5016): No heartbeat from core client for 30 sec - exiting 12:44:40 (5016): No heartbeat from core client for 30 sec - exiting 12:44:41 (5016): No heartbeat from core client for 30 sec - exiting 12:44:42 (5016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:44:43 (5016): No heartbeat from core client for 30 sec - exiting 12:44:44 (5016): No heartbeat from core client for 30 sec - exiting 12:45:18 (8060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:45:19 (8060): No heartbeat from core client for 30 sec - exiting 12:46:56 (7720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:46:57 (7720): No heartbeat from core client for 30 sec - exiting 12:46:58 (7720): No heartbeat from core client for 30 sec - exiting 12:46:59 (7720): No heartbeat from core client for 30 sec - exiting 12:47:00 (7720): No heartbeat from core client for 30 sec - exiting 12:47:01 (7720): No heartbeat from core client for 30 sec - exiting 13:43:30 (5392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:55:38 (8864): No heartbeat from core client for 30 sec - exiting 20:55:39 (8864): No heartbeat from core client for 30 sec - exiting 20:55:40 (8864): No heartbeat from core client for 30 sec - exiting 20:55:41 (8864): No heartbeat from core client for 30 sec - exiting 20:55:42 (8864): No heartbeat from core client for 30 sec - exiting 20:55:43 (8864): No heartbeat from core client for 30 sec - exiting 20:55:49 (8864): No heartbeat from core client for 30 sec - exiting 20:55:50 (8864): No heartbeat from core client for 30 sec - exiting 20:55:51 (8864): No heartbeat from core client for 30 sec - exiting 20:55:52 (8864): No heartbeat from core client for 30 sec - exiting 20:55:53 (8864): No heartbeat from core client for 30 sec - exiting 20:55:54 (8864): No heartbeat from core client for 30 sec - exiting 20:55:55 (8864): No heartbeat from core client for 30 sec - exiting 20:55:56 (8864): No heartbeat from core client for 30 sec - exiting 20:55:57 (8864): No heartbeat from core client for 30 sec - exiting 20:55:58 (8864): No heartbeat from core client for 30 sec - exiting 20:55:59 (8864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:57:04 (4728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:03:27 (8376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:04:07 (8284): No heartbeat from core client for 30 sec - exiting 21:04:08 (8284): No heartbeat from core client for 30 sec - exiting 21:04:09 (8284): No heartbeat from core client for 30 sec - exiting 21:04:10 (8284): No heartbeat from core client for 30 sec - exiting 21:04:11 (8284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 07:22:37 (6280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:24:09 (2084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:08 (6528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:43 (3932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:44 (3932): No heartbeat from core client for 30 sec - exiting 07:53:08 (7136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:53:09 (7136): No heartbeat from core client for 30 sec - exiting 07:53:10 (7136): No heartbeat from core client for 30 sec - exiting 07:53:11 (7136): No heartbeat from core client for 30 sec - exiting 07:53:12 (7136): No heartbeat from core client for 30 sec - exiting 07:53:13 (7136): No heartbeat from core client for 30 sec - exiting 07:53:14 (7136): No heartbeat from core client for 30 sec - exiting Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7044, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7044, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7044, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7044, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7044, iMonCtr=1 Model crash detected, will try to restart... 07:49:39 (5244): No heartbeat from core client for 30 sec - exiting 07:49:41 (5244): No heartbeat from core client for 30 sec - exiting 07:49:42 (5244): No heartbeat from core client for 30 sec - exiting 07:49:43 (5244): No heartbeat from core client for 30 sec - exiting 07:49:44 (5244): No heartbeat from core client for 30 sec - exiting 07:49:45 (5244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:14:32 (4180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4768, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
06 Apr 2013 20:12:41 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 570,240 | 406,772 | 0.7133 |
06 Apr 2013 15:25:38 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 544,320 | 388,857 | 0.7144 |
27 Mar 2013 23:15:12 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 518,400 | 370,039 | 0.7138 |
25 Mar 2013 07:57:02 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 492,480 | 352,055 | 0.7149 |
23 Mar 2013 13:30:01 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 466,560 | 333,623 | 0.7151 |
20 Mar 2013 20:23:38 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 440,640 | 314,911 | 0.7147 |
17 Mar 2013 21:09:33 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 414,720 | 296,917 | 0.7159 |
15 Mar 2013 20:57:56 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 388,800 | 277,870 | 0.7147 |
15 Mar 2013 13:25:06 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 362,880 | 259,269 | 0.7145 |
14 Mar 2013 09:45:07 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 336,960 | 240,092 | 0.7125 |
11 Mar 2013 16:05:48 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 311,040 | 221,333 | 0.7116 |
08 Mar 2013 13:10:01 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 285,120 | 202,893 | 0.7116 |
04 Mar 2013 13:09:39 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 259,200 | 184,276 | 0.7109 |
03 Mar 2013 13:25:10 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 233,280 | 165,991 | 0.7116 |
01 Mar 2013 21:17:33 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 207,360 | 147,709 | 0.7123 |
28 Feb 2013 09:07:50 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 181,440 | 129,016 | 0.7111 |
26 Feb 2013 20:11:35 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 155,520 | 110,199 | 0.7086 |
26 Feb 2013 09:40:36 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 129,600 | 91,344 | 0.7048 |
25 Feb 2013 18:18:44 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 103,680 | 72,546 | 0.6997 |
25 Feb 2013 12:26:00 | 1228270 | 15634427 | hadcm3n_3mk9_1980_40_008319896_0 | 77,760 | 53,861 | 0.6927 |
©2024 climateprediction.net