Name | famous_v2j9_1999_200_006689051_0 |
Workunit | 6892304 |
Created | 26 Aug 2010, 15:49:17 UTC |
Sent | 4 Sep 2010, 11:03:43 UTC |
Report deadline | 4 Dec 2010, 18:30:54 UTC |
Received | 9 Oct 2010, 23:19:55 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1066710 |
Run time | 8 days 11 hours 19 min 55 sec |
CPU time | 8 days 3 hours 37 min 18 sec |
Validate state | Invalid |
Credit | 5,064.67 |
Device peak FLOPS | 2.66 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> Het apparaat herkent de opdracht niet. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:36:01 (3500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:45:56 (1628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:42:59 (3620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 06:27:35 (1916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 12:11:22 (4488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:58:02 (4524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2968, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:16:37 (5068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3028, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... 08:03:16 (4444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:31:13 (196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/famous_v2j9_1999_200_006689051/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/famous_v2j9_1999_200_006689051/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/famous_v2j9_1999_200_006689051/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/famous_v2j9_1999_200_006689051/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/famous_v2j9_1999_200_006689051/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/famous_v2j9_1999_200_006689051/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy Sorry, too many model crashes! :-( 08:05:45 (4756): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Oct 2010 23:24:39 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,535,066 | 704,239 | 0.4588 |
09 Oct 2010 23:24:39 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,525,706 | 699,907 | 0.4587 |
09 Oct 2010 23:24:39 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,516,346 | 695,584 | 0.4587 |
09 Oct 2010 23:24:38 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,506,986 | 691,273 | 0.4587 |
09 Oct 2010 23:24:38 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,497,626 | 687,009 | 0.4587 |
09 Oct 2010 23:24:38 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,488,266 | 682,754 | 0.4588 |
09 Oct 2010 23:24:38 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,478,906 | 678,506 | 0.4588 |
09 Oct 2010 23:24:38 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,469,546 | 674,245 | 0.4588 |
06 Oct 2010 13:56:03 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,460,186 | 669,975 | 0.4588 |
06 Oct 2010 11:12:59 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,450,826 | 665,786 | 0.4589 |
06 Oct 2010 06:38:33 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,441,466 | 661,514 | 0.4589 |
05 Oct 2010 22:48:35 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,432,106 | 657,242 | 0.4589 |
05 Oct 2010 21:21:20 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,422,746 | 652,794 | 0.4588 |
05 Oct 2010 20:09:06 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,413,386 | 648,494 | 0.4588 |
05 Oct 2010 18:56:01 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,404,026 | 644,206 | 0.4588 |
05 Oct 2010 17:41:18 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,394,666 | 639,939 | 0.4588 |
05 Oct 2010 16:29:14 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,385,306 | 635,682 | 0.4589 |
05 Oct 2010 06:29:30 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,375,946 | 631,431 | 0.4589 |
04 Oct 2010 23:15:17 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,366,586 | 627,171 | 0.4589 |
04 Oct 2010 21:58:05 | 1066710 | 11705024 | famous_v2j9_1999_200_006689051_0 | 1,357,226 | 622,908 | 0.4590 |
©2024 climateprediction.net