Name | hadcm3n_o25e_2140_40_008269913_3 |
Workunit | 8425037 |
Created | 8 Apr 2013, 18:47:29 UTC |
Sent | 8 Apr 2013, 18:47:32 UTC |
Report deadline | 9 Jul 2013, 2:14:43 UTC |
Received | 22 Apr 2013, 23:54:45 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1304823 |
Run time | 13 days 13 hours 15 min 44 sec |
CPU time | 11 days 14 hours 12 min 49 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 3.35 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 12:26:50 (2963): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:28:45 (3449): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:31:02 (3476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:34:06 (3504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:34:46 (3527): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:36:50 (3543): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:38:42 (3569): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:40:39 (3592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:42:26 (3615): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:44:43 (3638): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:46:02 (3661): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:47:24 (3684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:50:06 (3707): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:52:29 (3730): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:55:01 (3754): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:19 (3780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:59:06 (3803): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:01:04 (3826): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:03:21 (3849): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:05:33 (3872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:08:10 (3898): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:10:28 (3922): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:12:56 (3945): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:15:24 (3968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:17:25 (3994): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:19:27 (4020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:21:34 (4043): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:24:02 (4066): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:26:09 (4090): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:28:41 (4116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:30:43 (4139): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:32:50 (4162): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:34:43 (4185): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:36:35 (4208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:12:38 (4235): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:13:49 (18861): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:14:30 (18879): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:16:48 (18900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:19:25 (18928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:21:53 (18949): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:24:35 (18970): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:27:03 (18991): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:28:50 (19016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:30:57 (19037): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:33:35 (19058): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:35:31 (19079): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:38:14 (19103): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:11 (19124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:42:23 (19145): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 09:33:23 AM No files match the supplied pattern. MainError: 09:33:23 AM No files match the supplied pattern. MainError: 08:30:01 PM No files match the supplied pattern. MainError: 08:30:01 PM No files match the supplied pattern. MainError: 07:26:29 AM No files match the supplied pattern. MainError: 07:26:29 AM No files match the supplied pattern. MainError: 06:22:46 PM No files match the supplied pattern. MainError: 06:22:46 PM No files match the supplied pattern. MainError: 05:18:37 AM No files match the supplied pattern. MainError: 05:18:37 AM No files match the supplied pattern. MainError: 04:14:37 PM No files match the supplied pattern. MainError: 04:14:37 PM No files match the supplied pattern. MainError: 03:10:57 AM No files match the supplied pattern. MainError: 03:10:57 AM No files match the supplied pattern. MainError: 02:07:34 PM No files match the supplied pattern. MainError: 02:07:34 PM No files match the supplied pattern. MainError: 01:04:23 AM No files match the supplied pattern. MainError: 01:04:23 AM No files match the supplied pattern. MainError: 12:01:11 AM No files match the supplied pattern. MainError: 12:01:11 AM No files match the supplied pattern. Error converting file to netcdf: dataout/o25eka.ph11c10 Error converting file to netcdf: dataout/o25eka.pg11c10 Error converting file to netcdf: dataout/o25eka.pe11c10 MainError: 10:36:52 PM No files match the supplied pattern. MainError: 10:36:52 PM No files match the supplied pattern. BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 Apr 2013 22:55:44 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 777,600 | 1,155,841 | 1.4864 |
22 Apr 2013 12:04:58 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 751,680 | 1,118,095 | 1.4875 |
22 Apr 2013 01:06:11 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 725,760 | 1,079,068 | 1.4868 |
21 Apr 2013 14:10:16 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 699,840 | 1,040,053 | 1.4861 |
21 Apr 2013 03:15:29 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 673,920 | 1,001,081 | 1.4855 |
20 Apr 2013 16:15:58 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 648,000 | 962,119 | 1.4848 |
20 Apr 2013 05:22:48 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 622,080 | 923,190 | 1.4840 |
19 Apr 2013 18:35:13 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 596,160 | 884,233 | 1.4832 |
19 Apr 2013 07:28:12 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 570,240 | 845,235 | 1.4822 |
18 Apr 2013 21:24:36 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 544,320 | 806,228 | 1.4812 |
18 Apr 2013 10:21:52 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 518,400 | 767,213 | 1.4800 |
17 Apr 2013 23:21:22 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 492,480 | 728,280 | 1.4788 |
17 Apr 2013 12:18:08 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 466,560 | 689,306 | 1.4774 |
17 Apr 2013 01:14:21 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 440,640 | 650,341 | 1.4759 |
16 Apr 2013 14:15:38 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 414,720 | 611,364 | 1.4742 |
16 Apr 2013 03:12:51 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 388,800 | 572,420 | 1.4723 |
15 Apr 2013 16:12:02 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 362,880 | 533,519 | 1.4702 |
15 Apr 2013 05:09:13 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 336,960 | 494,623 | 1.4679 |
14 Apr 2013 18:58:47 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 311,040 | 455,666 | 1.4650 |
14 Apr 2013 08:21:32 | 1244944 | 15717578 | hadcm3n_o25e_2140_40_008269913_3 | 285,120 | 418,240 | 1.4669 |
©2024 climateprediction.net