Name | hadcm3n_zfak_1880_40_008251369_2 |
Workunit | 8406493 |
Created | 25 Jan 2013, 14:52:38 UTC |
Sent | 25 Jan 2013, 14:52:40 UTC |
Report deadline | 26 Apr 2013, 22:19:51 UTC |
Received | 22 Feb 2013, 18:21:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1143361 |
Run time | 18 days 2 hours 17 min 37 sec |
CPU time | 14 days 2 hours 17 min 6 sec |
Validate state | Invalid |
Credit | 9,020.16 |
Device peak FLOPS | 3.15 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 18:01:54 (468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:42:20 (3520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:35:04 (3488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:00:35 (4564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:51:57 (2328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:15:18 (4124): No heartbeat from core client for 30 sec - exiting 09:15:19 (4124): No heartbeat from core client for 30 sec - exiting 09:15:20 (4124): No heartbeat from core client for 30 sec - exiting 09:15:21 (4124): No heartbeat from core client for 30 sec - exiting 09:15:22 (4124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:08:19 (4224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:08:58 (6408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:05:50 (2112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:45 (3036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:45:26 (4624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:46:07 (6424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:17:34 (7704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:18:23 (7836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:20:40 (7700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:21:17 (7732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:52:58 (5632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:53:53 (5996): No heartbeat from core client for 30 sec - exiting 03:53:54 (5996): No heartbeat from core client for 30 sec - exiting 03:53:55 (5996): No heartbeat from core client for 30 sec - exiting 03:53:56 (5996): No heartbeat from core client for 30 sec - exiting 03:53:57 (5996): No heartbeat from core client for 30 sec - exiting 03:53:58 (5996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/zfakko.pja4c10 Error converting file to netcdf: dataout/zfakko.pia4c10 Error converting file to netcdf: dataout/zfakko.pfa4c10 Error converting file to netcdf: dataout/zfakka.pha4c10 Error converting file to netcdf: dataout/zfakka.pga4c10 Error converting file to netcdf: dataout/zfakka.pea4c10 Error converting file to netcdf: dataout/zfakka.pda4c10 03:54:35 (5628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/zfakko.pja4c10 Error converting file to netcdf: dataout/zfakko.pia4c10 Error converting file to netcdf: dataout/zfakko.pfa4c10 Error converting file to netcdf: dataout/zfakka.pha4c10 Error converting file to netcdf: dataout/zfakka.pga4c10 Error converting file to netcdf: dataout/zfakka.pea4c10 Error converting file to netcdf: dataout/zfakka.pda4c10 CPDN Monitor - Quit request from BOINC... 14:15:56 (6436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:20:29 (820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:20:30 (820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 02:59:15 (4632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 18:37:22 (336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 12:19:39 (6056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:57:52 (4160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:58:50 (6716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:57:51 (8128): No heartbeat from core client for 30 sec - exiting 23:57:52 (8128): No heartbeat from core client for 30 sec - exiting 23:57:53 (8128): No heartbeat from core client for 30 sec - exiting 23:57:54 (8128): No heartbeat from core client for 30 sec - exiting 23:57:55 (8128): No heartbeat from core client for 30 sec - exiting 23:57:56 (8128): No heartbeat from core client for 30 sec - exiting 23:57:57 (8128): No heartbeat from core client for 30 sec - exiting 23:57:58 (8128): No heartbeat from core client for 30 sec - exiting 23:57:59 (8128): No heartbeat from core client for 30 sec - exiting 23:58:00 (8128): No heartbeat from core client for 30 sec - exiting 23:58:01 (8128): No heartbeat from core client for 30 sec - exiting 23:58:02 (8128): No heartbeat from core client for 30 sec - exiting 23:58:03 (8128): No heartbeat from core client for 30 sec - exiting 23:58:04 (8128): No heartbeat from core client for 30 sec - exiting 23:58:05 (8128): No heartbeat from core client for 30 sec - exiting 23:58:06 (8128): No heartbeat from core client for 30 sec - exiting 23:58:07 (8128): No heartbeat from core client for 30 sec - exiting 23:58:08 (8128): No heartbeat from core client for 30 sec - exiting 23:58:09 (8128): No heartbeat from core client for 30 sec - exiting 23:58:10 (8128): No heartbeat from core client for 30 sec - exiting 23:58:11 (8128): No heartbeat from core client for 30 sec - exiting 23:58:12 (8128): No heartbeat from core client for 30 sec - exiting 23:58:13 (8128): No heartbeat from core client for 30 sec - exiting 23:58:14 (8128): No heartbeat from core client for 30 sec - exiting 23:58:15 (8128): No heartbeat from core client for 30 sec - exiting 23:58:16 (8128): No heartbeat from core client for 30 sec - exiting 23:58:17 (8128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:58:55 (7268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:14:39 (7092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: C I/O Error feof - Unit 62 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 01:15:17 (7352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: C I/O Error feof - Unit 62 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: C I/O Error feof - Unit 62 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: C I/O Error feof - Unit 62 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: C I/O Error feof - Unit 62 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: C I/O Error feof - Unit 62 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: C I/O Error feof - Unit 62 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 Feb 2013 00:00:05 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 751,680 | 1,174,551 | 1.5626 |
21 Feb 2013 09:19:47 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 725,760 | 1,134,309 | 1.5629 |
20 Feb 2013 17:36:18 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 699,840 | 1,093,693 | 1.5628 |
20 Feb 2013 03:47:04 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 673,920 | 1,053,505 | 1.5632 |
19 Feb 2013 14:20:40 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 648,000 | 1,013,384 | 1.5639 |
18 Feb 2013 19:58:07 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 622,080 | 973,776 | 1.5654 |
18 Feb 2013 05:32:38 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 596,160 | 933,051 | 1.5651 |
17 Feb 2013 14:35:26 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 570,240 | 891,944 | 1.5642 |
16 Feb 2013 23:25:04 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 544,320 | 850,828 | 1.5631 |
16 Feb 2013 08:28:53 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 518,400 | 810,094 | 1.5627 |
15 Feb 2013 16:25:51 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 492,480 | 770,230 | 1.5640 |
15 Feb 2013 01:10:33 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 466,560 | 729,254 | 1.5630 |
14 Feb 2013 11:56:22 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 440,640 | 688,834 | 1.5633 |
06 Feb 2013 09:38:19 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 414,720 | 647,930 | 1.5623 |
05 Feb 2013 18:56:26 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 388,800 | 607,563 | 1.5627 |
05 Feb 2013 03:50:53 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 362,880 | 566,435 | 1.5609 |
04 Feb 2013 13:40:25 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 336,960 | 525,786 | 1.5604 |
01 Feb 2013 21:17:10 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 311,040 | 484,631 | 1.5581 |
01 Feb 2013 07:46:38 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 285,120 | 444,046 | 1.5574 |
31 Jan 2013 17:11:09 | 1143361 | 15557353 | hadcm3n_zfak_1880_40_008251369_2 | 259,200 | 402,868 | 1.5543 |
©2024 climateprediction.net