Task 15909653

Name hadcm3n_o5pb_2140_40_008269734_4
Workunit 8424858
Created 29 Jul 2013, 12:41:49 UTC
Sent 29 Jul 2013, 12:54:43 UTC
Report deadline 28 Oct 2013, 20:21:54 UTC
Received 15 Sep 2013, 3:41:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1310137
Run time 15 days 13 hours 7 min 21 sec
CPU time 12 days 4 hours 9 min 8 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.94 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
07:31:07 (11613): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:35:41 (14227): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:38:24 (14268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:41:03 (14306): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:44:09 (14426): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
17:14:40 (2410): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:14:41 (2410): No heartbeat from core client for 30 sec - exiting
17:14:42 (2410): No heartbeat from core client for 30 sec - exiting
17:14:43 (2410): No heartbeat from core client for 30 sec - exiting
17:14:44 (2410): No heartbeat from core client for 30 sec - exiting
17:14:45 (2410): No heartbeat from core client for 30 sec - exiting
17:14:46 (2410): No heartbeat from core client for 30 sec - exiting
17:14:47 (2410): No heartbeat from core client for 30 sec - exiting
17:14:48 (2410): No heartbeat from core client for 30 sec - exiting
17:14:49 (2410): No heartbeat from core client for 30 sec - exiting
17:14:50 (2410): No heartbeat from core client for 30 sec - exiting
17:14:51 (2410): No heartbeat from core client for 30 sec - exiting
17:14:52 (2410): No heartbeat from core client for 30 sec - exiting
17:14:53 (2410): No heartbeat from core client for 30 sec - exiting
17:14:54 (2410): No heartbeat from core client for 30 sec - exiting
17:14:55 (2410): No heartbeat from core client for 30 sec - exiting
17:14:56 (2410): No heartbeat from core client for 30 sec - exiting
17:14:57 (2410): No heartbeat from core client for 30 sec - exiting
17:14:58 (2410): No heartbeat from core client for 30 sec - exiting
17:14:59 (2410): No heartbeat from core client for 30 sec - exiting
17:15:00 (2410): No heartbeat from core client for 30 sec - exiting
17:15:01 (2410): No heartbeat from core client for 30 sec - exiting
17:15:02 (2410): No heartbeat from core client for 30 sec - exiting
17:15:03 (2410): No heartbeat from core client for 30 sec - exiting
17:15:04 (2410): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
03:00:20 (1683): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:00:22 (1683): No heartbeat from core client for 30 sec - exiting
03:00:23 (1683): No heartbeat from core client for 30 sec - exiting
03:00:24 (1683): No heartbeat from core client for 30 sec - exiting
03:00:25 (1683): No heartbeat from core client for 30 sec - exiting
03:00:26 (1683): No heartbeat from core client for 30 sec - exiting
03:00:27 (1683): No heartbeat from core client for 30 sec - exiting
03:00:28 (1683): No heartbeat from core client for 30 sec - exiting
03:00:29 (1683): No heartbeat from core client for 30 sec - exiting
03:00:30 (1683): No heartbeat from core client for 30 sec - exiting
03:00:31 (1683): No heartbeat from core client for 30 sec - exiting
03:00:32 (1683): No heartbeat from core client for 30 sec - exiting
03:00:33 (1683): No heartbeat from core client for 30 sec - exiting
03:00:34 (1683): No heartbeat from core client for 30 sec - exiting
03:00:35 (1683): No heartbeat from core client for 30 sec - exiting
03:00:36 (1683): No heartbeat from core client for 30 sec - exiting
03:00:37 (1683): No heartbeat from core client for 30 sec - exiting
03:00:38 (1683): No heartbeat from core client for 30 sec - exiting
03:00:39 (1683): No heartbeat from core client for 30 sec - exiting
03:00:40 (1683): No heartbeat from core client for 30 sec - exiting
03:00:41 (1683): No heartbeat from core client for 30 sec - exiting
03:03:49 (17148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:03:50 (17148): No heartbeat from core client for 30 sec - exiting
03:03:51 (17148): No heartbeat from core client for 30 sec - exiting
03:03:52 (17148): No heartbeat from core client for 30 sec - exiting
03:03:53 (17148): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
01:02:32 (2758): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	07:16:34 AM	No files match the supplied pattern.
MainError:	07:16:34 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
MainError:	07:43:22 PM	No files match the supplied pattern.
MainError:	07:43:22 PM	No files match the supplied pattern.
MainError:	07:58:32 AM	No files match the supplied pattern.
MainError:	07:58:32 AM	No files match the supplied pattern.
MainError:	08:09:43 PM	No files match the supplied pattern.
MainError:	08:09:43 PM	No files match the supplied pattern.
MainError:	07:37:10 AM	No files match the supplied pattern.
MainError:	07:37:10 AM	No files match the supplied pattern.
MainError:	06:46:49 PM	No files match the supplied pattern.
MainError:	06:46:49 PM	No files match the supplied pattern.
MainError:	05:57:42 AM	No files match the supplied pattern.
MainError:	05:57:42 AM	No files match the supplied pattern.
MainError:	05:06:57 PM	No files match the supplied pattern.
MainError:	05:06:57 PM	No files match the supplied pattern.
MainError:	04:17:20 AM	No files match the supplied pattern.
MainError:	04:17:20 AM	No files match the supplied pattern.
MainError:	03:29:43 PM	No files match the supplied pattern.
MainError:	03:29:43 PM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o5pbka.ph11c10
Error converting file to netcdf: dataout/o5pbka.pg11c10
Error converting file to netcdf: dataout/o5pbka.pe11c10
MainError:	02:40:40 AM	No files match the supplied pattern.
MainError:	02:40:40 AM	No files match the supplied pattern.

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: Numerical result out of range
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
15 Sep 2013 02:42:22  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  777,600   1,321,156       1.6990
14 Sep 2013 15:34:42  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  751,680   1,281,041       1.7042
14 Sep 2013 04:21:01  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  725,760   1,240,833       1.7097
13 Sep 2013 17:23:00  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  699,840   1,200,745       1.7157
13 Sep 2013 06:20:29  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  673,920   1,160,726       1.7223
12 Sep 2013 19:17:54  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  648,000   1,120,610       1.7293
12 Sep 2013 08:09:17  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  622,080   1,080,568       1.7370
11 Sep 2013 20:29:32  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  596,160   1,039,462       1.7436
11 Sep 2013 08:21:02  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  570,240   995,756         1.7462
10 Sep 2013 20:12:27  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  544,320   951,810         1.7486
10 Sep 2013 07:18:52  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  518,400   908,453         1.7524
09 Sep 2013 19:08:35  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  492,480   864,644         1.7557
09 Sep 2013 06:52:37  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  466,560   820,841         1.7593
08 Sep 2013 18:19:40  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  440,640   776,009         1.7611
08 Sep 2013 05:35:59  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  414,720   730,624         1.7617
07 Sep 2013 17:31:32  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  388,800   686,719         1.7663
07 Sep 2013 05:28:12  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  362,880   642,979         1.7719
06 Sep 2013 17:19:58  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  336,960   599,143         1.7781
06 Sep 2013 03:04:59  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  311,040   555,442         1.7858
06 Sep 2013 03:04:59  1286052  15909653   hadcm3n_o5pb_2140_40_008269734_4  285,120   511,713         1.7947
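
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This reading is inferred from the figures in the table, not from project documentation. The sketch below checks it against three rows copied from the table; the function name avg_sec_per_timestep is made up for illustration.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
trickles = [
    (777_600, 1_321_156, 1.6990),
    (751_680, 1_281_041, 1.7042),
    (285_120, 511_713, 1.7947),
]


def avg_sec_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in trickles:
    computed = avg_sec_per_timestep(cpu_seconds, timestep)
    print(f"timestep={timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

For the sampled rows the computed value agrees with the reported one. Consecutive rows are also spaced 25,920 timesteps apart (for example 777,600 - 751,680).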