climateprediction.net home page
Task 15523197

Task 15523197

Name hadcm3n_zksn_1920_40_008256443_1
Workunit 8411567
Created 4 Jan 2013, 20:22:50 UTC
Sent 4 Jan 2013, 20:23:01 UTC
Report deadline 6 Apr 2013, 3:50:12 UTC
Received 7 Feb 2013, 18:21:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1225550
Run time 16 days 19 hours 25 min 32 sec
CPU time 12 days 13 hours 50 min
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 1.88 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.29</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:32:33 (12333): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:44:27 (12059): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
11:44:29 (12059): No heartbeat from core client for 30 sec - exiting
11:44:30 (12059): No heartbeat from core client for 30 sec - exiting
11:44:31 (12059): No heartbeat from core client for 30 sec - exiting
11:44:32 (12059): No heartbeat from core client for 30 sec - exiting
11:44:33 (12059): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
18:01:38 (3196): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:24:22 (22692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:24:23 (22692): No heartbeat from core client for 30 sec - exiting
21:24:24 (22692): No heartbeat from core client for 30 sec - exiting
21:24:25 (22692): No heartbeat from core client for 30 sec - exiting
21:24:26 (22692): No heartbeat from core client for 30 sec - exiting
21:24:27 (22692): No heartbeat from core client for 30 sec - exiting
21:24:28 (22692): No heartbeat from core client for 30 sec - exiting
21:24:29 (22692): No heartbeat from core client for 30 sec - exiting
21:24:30 (22692): No heartbeat from core client for 30 sec - exiting
21:24:31 (22692): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF
Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /var/lib/boinc/projects/climateprediction.net/hadcm3n_zksn_1920_40_008256443/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
hadcm3n_um_6.07_i  0848EB7D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0848D975  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0845F3CF  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F90D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F257  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  08450E2C  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0844E937  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0822D68F  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0818A767  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0818D749  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  08391957  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0838F8B7  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0839BDF8  Unknown               Unknown  Unknown
libc.so.6          B7534943  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3809, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Feb 2013 14:55:36 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 466,560 1,084,961 2.3254
02 Feb 2013 21:03:32 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 440,640 986,319 2.2384
01 Feb 2013 05:06:03 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 414,720 930,450 2.2436
27 Jan 2013 22:52:35 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 388,800 874,133 2.2483
26 Jan 2013 02:48:58 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 362,880 816,768 2.2508
25 Jan 2013 05:09:42 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 336,960 759,183 2.2530
24 Jan 2013 01:44:52 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 311,040 699,929 2.2503
19 Jan 2013 14:40:31 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 285,120 641,876 2.2512
18 Jan 2013 17:22:17 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 259,200 585,692 2.2596
17 Jan 2013 11:30:07 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 233,280 527,922 2.2630
16 Jan 2013 10:58:20 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 207,360 468,044 2.2572
15 Jan 2013 06:53:05 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 181,440 411,136 2.2660
14 Jan 2013 03:22:25 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 155,520 354,345 2.2785
12 Jan 2013 14:10:21 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 129,600 298,242 2.3013
09 Jan 2013 13:05:11 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 103,680 233,314 2.2503
08 Jan 2013 16:05:57 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 77,760 172,738 2.2214
07 Jan 2013 21:59:20 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 51,840 113,850 2.1962
07 Jan 2013 02:09:36 1225550 15523197 hadcm3n_zksn_1920_40_008256443_1 25,920 56,443 2.1776


©2024 climateprediction.net