Task 12498658

Name	famous_wspw_999_200_007123979_0
Workunit	7322339
Created	16 Jan 2011, 17:02:38 UTC
Sent	16 Jan 2011, 18:19:59 UTC
Report deadline	18 Apr 2011, 1:47:10 UTC
Received	25 Jul 2011, 23:42:59 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1116694
Run time	10 days 19 hours 51 min 42 sec
CPU time	9 days 18 hours 29 min 5 sec
Validate state	Invalid
Credit	3,520.59
Device peak FLOPS	1.42 GFLOPS
Application version	UK Met Office FAMOUS v6.11 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
06:47:50 (4032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1588, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
13:27:42 (3632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:27:25 (3832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3156, iMonCtr=1
Model crash detected, will try to restart...
C18:55:23 (3740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2196, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1428, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1532, iMonCtr=1
Model crash detected, will try to restart...
C23:15:05 (3592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3908, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_wspw_999_200_007123979/dataout/atmos_restart.day after 11 attempts
Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_wspw_999_200_007123979/dataout/atmos_restart.day after 11 attempts
Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_wspw_999_200_007123979/dataout/atmos_restart.day after 11 attempts
Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_wspw_999_200_007123979/dataout/atmos_restart.day after 11 attempts
Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_wspw_999_200_007123979/dataout/atmos_restart.day after 11 attempts
Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_wspw_999_200_007123979/dataout/atmos_restart.day after 11 attempts
Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy
Sorry, too many model crashes! :-(
18:40:57 (3260): called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
04 Jul 2011 19:11:13	1116694	12498658	famous_wspw_999_200_007123979_0	1,067,066	842,088	0.7892
27 Jun 2011 03:52:26	1116694	12498658	famous_wspw_999_200_007123979_0	1,057,706	834,888	0.7893
27 Jun 2011 01:21:48	1116694	12498658	famous_wspw_999_200_007123979_0	1,048,346	827,818	0.7896
27 Jun 2011 01:21:48	1116694	12498658	famous_wspw_999_200_007123979_0	1,038,986	820,794	0.7900
26 Jun 2011 20:39:59	1116694	12498658	famous_wspw_999_200_007123979_0	1,029,626	813,739	0.7903
20 Jun 2011 06:04:02	1116694	12498658	famous_wspw_999_200_007123979_0	1,020,266	806,141	0.7901
20 Jun 2011 04:38:33	1116694	12498658	famous_wspw_999_200_007123979_0	1,010,906	798,333	0.7897
20 Jun 2011 04:38:33	1116694	12498658	famous_wspw_999_200_007123979_0	1,001,546	790,917	0.7897
20 Jun 2011 04:38:33	1116694	12498658	famous_wspw_999_200_007123979_0	992,186	783,416	0.7896
16 Jun 2011 04:09:30	1116694	12498658	famous_wspw_999_200_007123979_0	982,826	775,704	0.7893
14 Jun 2011 21:18:51	1116694	12498658	famous_wspw_999_200_007123979_0	973,466	768,523	0.7895
13 Jun 2011 08:29:01	1116694	12498658	famous_wspw_999_200_007123979_0	964,106	761,246	0.7896
13 Jun 2011 05:48:10	1116694	12498658	famous_wspw_999_200_007123979_0	954,746	754,231	0.7900
11 Jun 2011 04:53:34	1116694	12498658	famous_wspw_999_200_007123979_0	945,386	747,172	0.7903
08 Jun 2011 04:06:31	1116694	12498658	famous_wspw_999_200_007123979_0	936,026	739,837	0.7904
08 Jun 2011 02:55:36	1116694	12498658	famous_wspw_999_200_007123979_0	926,666	732,721	0.7907
07 Jun 2011 23:37:58	1116694	12498658	famous_wspw_999_200_007123979_0	917,306	725,478	0.7909
07 Jun 2011 03:55:47	1116694	12498658	famous_wspw_999_200_007123979_0	907,946	717,983	0.7908
06 Jun 2011 04:43:19	1116694	12498658	famous_wspw_999_200_007123979_0	898,586	710,246	0.7904
05 Jun 2011 07:15:16	1116694	12498658	famous_wspw_999_200_007123979_0	889,226	702,820	0.7904
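
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. That relationship is an assumption inferred from the rows, not something the page states. A minimal Python sketch, using three rows copied from the table and hypothetical helper names, checks it:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
TRICKLES = [
    (1_067_066, 842_088, 0.7892),
    (1_057_706, 834_888, 0.7893),
    (889_226, 702_820, 0.7904),
]


def sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Assumed formula: cumulative CPU seconds per timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


def matches_reported(rows=TRICKLES) -> bool:
    """True if the assumed formula reproduces every reported average."""
    return all(sec_per_timestep(ts, cpu) == avg for ts, cpu, avg in rows)


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        print(ts, sec_per_timestep(ts, cpu), avg)
    print("formula matches table:", matches_reported())
```

If the assumption holds for the sampled rows, the same check can be extended to the full table to flag any trickle whose reported average disagrees with its CPU time and timestep.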