Name | hadcm3n_3mgg_1940_40_008260560_0 |
Workunit | 8415684 |
Created | 20 Dec 2012, 18:44:47 UTC |
Sent | 20 Dec 2012, 18:44:54 UTC |
Report deadline | 22 Mar 2013, 2:12:05 UTC |
Received | 16 Jan 2013, 4:43:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1118054 |
Run time | 13 days 21 hours 54 min 37 sec |
CPU time | 13 days 14 hours 44 min 29 sec |
Validate state | Invalid |
Credit | 8,087.04 |
Device peak FLOPS | 2.71 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:34:35 (13985): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:34:36 (13985): No heartbeat from core client for 30 sec - exiting 18:13:36 (23418): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:13:39 (23418): No heartbeat from core client for 30 sec - exiting 20:13:48 (21576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:15:43 (25637): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:16:54 (25727): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:20:03 (25805): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:21:57 (25883): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:16:49 (25960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:09:53 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:17:44 (14711): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 12:27:48 (7582): No heartbeat from core client for 30 sec - exiting 12:27:49 (7582): No heartbeat from core client for 30 sec - exiting 12:27:50 (7582): No heartbeat from core client for 30 sec - exiting 12:27:51 (7582): No heartbeat from core client for 30 sec - exiting 12:27:52 (7582): No heartbeat from core client for 30 sec - exiting 12:27:53 (7582): No heartbeat from core client for 30 sec - exiting 12:27:54 (7582): No heartbeat from core client for 30 sec - exiting 12:27:55 (7582): No heartbeat from core client for 30 sec - exiting 12:27:56 (7582): No heartbeat from core client for 30 sec - exiting 12:27:57 (7582): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:16:55 (14535): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:08:12 (5902): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:37:05 (29298): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... SIGABRT: abort called Stack trace (9 frames): /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7703400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7703430] /lib/libc.so.6(gsignal+0x4f)[0xf753d31f] /lib/libc.so.6(abort+0x143)[0xf753ec03] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75283d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21406, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf777e400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf777e430] /lib/libc.so.6(gsignal+0x4f)[0xf75b831f] /lib/libc.so.6(abort+0x143)[0xf75b9c03] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75a33d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21406, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf770b400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf770b430] /lib/libc.so.6(gsignal+0x4f)[0xf754531f] /lib/libc.so.6(abort+0x143)[0xf7546c03] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75303d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21406, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7771400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7771430] /lib/libc.so.6(gsignal+0x4f)[0xf75ab31f] /lib/libc.so.6(abort+0x143)[0xf75acc03] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75963d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21406, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf775c400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf775c430] /lib/libc.so.6(gsignal+0x4f)[0xf759631f] /lib/libc.so.6(abort+0x143)[0xf7597c03] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75813d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21406, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf76e3400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf76e3430] /lib/libc.so.6(gsignal+0x4f)[0xf751d31f] /lib/libc.so.6(abort+0x143)[0xf751ec03] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /opt/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75083d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21406, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Jan 2013 08:43:28 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 673,920 | 1,133,989 | 1.6827 |
14 Jan 2013 16:41:13 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 648,000 | 1,090,096 | 1.6822 |
13 Jan 2013 21:03:58 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 622,080 | 1,046,115 | 1.6816 |
12 Jan 2013 00:22:44 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 596,160 | 1,002,787 | 1.6821 |
03 Jan 2013 12:30:10 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 570,240 | 959,618 | 1.6828 |
03 Jan 2013 00:09:01 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 544,320 | 916,183 | 1.6832 |
02 Jan 2013 11:31:21 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 518,400 | 871,831 | 1.6818 |
01 Jan 2013 23:12:50 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 492,480 | 828,556 | 1.6824 |
01 Jan 2013 10:56:45 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 466,560 | 785,199 | 1.6830 |
31 Dec 2012 22:23:59 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 440,640 | 741,618 | 1.6830 |
31 Dec 2012 10:08:43 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 414,720 | 698,346 | 1.6839 |
30 Dec 2012 21:45:43 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 388,800 | 655,084 | 1.6849 |
30 Dec 2012 09:34:35 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 362,880 | 612,013 | 1.6865 |
29 Dec 2012 21:10:41 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 336,960 | 568,633 | 1.6875 |
29 Dec 2012 08:50:22 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 311,040 | 525,185 | 1.6885 |
28 Dec 2012 20:56:32 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 285,120 | 481,817 | 1.6899 |
28 Dec 2012 08:14:17 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 259,200 | 438,407 | 1.6914 |
27 Dec 2012 19:33:37 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 233,280 | 395,502 | 1.6954 |
27 Dec 2012 06:23:54 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 207,360 | 351,609 | 1.6956 |
26 Dec 2012 18:01:19 | 1118054 | 15488554 | hadcm3n_3mgg_1940_40_008260560_0 | 181,440 | 308,227 | 1.6988 |
©2024 climateprediction.net