MORE DOWNLOAD ERRORS
Author | Message |
---|---|
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,063,325 RAC: 928 |
We seem to have a download problem again. Today I tried to download 4 WU's. Two hadcm3n WU's downloaded fine. The problem is that it also tried to download 2 hadam3p that failed. The failed units are hadam3p_c7r7_1987_1_008018169-1 and hadam3p_c7c4_1984_1_008018167_1. Both show as download failed and there are no files stuck in the transfer tab. Error messages follow:
11/25/2013 6:22:36 PM | climateprediction.net | Scheduler request completed: got 2 new tasks
11/25/2013 6:22:38 PM | climateprediction.net | Started download of hadam3p_pnw_c7r4_1984_1_008018167.zip
11/25/2013 6:22:38 PM | climateprediction.net | Started download of atmos_c7r4_1984_1_008018167_0.gz
11/25/2013 6:22:39 PM | climateprediction.net | Giving up on download of hadam3p_pnw_c7r4_1984_1_008018167.zip: permanent HTTP error
11/25/2013 6:22:39 PM | climateprediction.net | Giving up on download of atmos_c7r4_1984_1_008018167_0.gz: permanent HTTP error
11/25/2013 6:22:39 PM | climateprediction.net | Started download of pnw_c7r4_1984_1_008018167_0.gz
11/25/2013 6:22:39 PM | climateprediction.net | Started download of ic19611020_10_N96.gz
11/25/2013 6:22:40 PM | climateprediction.net | Giving up on download of pnw_c7r4_1984_1_008018167_0.gz: permanent HTTP error
11/25/2013 6:22:40 PM | climateprediction.net | Started download of hadam3p_pnw_c7r7_1987_1_008018169.zip
11/25/2013 6:22:41 PM | climateprediction.net | Giving up on download of hadam3p_pnw_c7r7_1987_1_008018169.zip: permanent HTTP error
11/25/2013 6:22:41 PM | climateprediction.net | Started download of atmos_c7r7_1987_1_008018169_0.gz
11/25/2013 6:22:42 PM | climateprediction.net | Finished download of ic19611020_10_N96.gz
11/25/2013 6:22:42 PM | climateprediction.net | Giving up on download of atmos_c7r7_1987_1_008018169_0.gz: permanent HTTP error
11/25/2013 6:22:42 PM | climateprediction.net | Started download of pnw_c7r7_1987_1_008018169_0.gz
11/25/2013 6:22:42 PM | climateprediction.net | Started download of HadISST_SST_N96_1986_12_1989_01f.gz
11/25/2013 6:22:43 PM | climateprediction.net | Giving up on download of pnw_c7r7_1987_1_008018169_0.gz: permanent HTTP error
11/25/2013 6:22:43 PM | climateprediction.net | Started download of HadISST_SI_N96_1986_12_1989_01f.gz
11/25/2013 6:22:48 PM | climateprediction.net | Finished download of HadISST_SI_N96_1986_12_1989_01f.gz
11/25/2013 6:22:48 PM | climateprediction.net | Started download of so2dms_N96_1986_12_1989_02.gz
11/25/2013 6:22:52 PM | climateprediction.net | Finished download of HadISST_SST_N96_1986_12_1989_01f.gz
11/25/2013 6:22:55 PM | climateprediction.net | Finished download of so2dms_N96_1986_12_1989_02.gz
It appears to be the Permanent HTTP error again. |
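For anyone who wants to pull the failed files out of a long event log rather than reading it by eye, here is a rough Python sketch. It only assumes that failed transfers show up as "Giving up on download of <file>: <reason>" lines, as in the log above; the function name `failed_downloads` and the `sample_log` list are made up for illustration and are not part of BOINC.

```python
import re
from collections import defaultdict

# Matches the "Giving up on download of <file>: <reason>" lines seen in the
# event log quoted in this thread. The pattern is an assumption based on
# those lines only, not on any official BOINC log specification.
GIVE_UP = re.compile(r"Giving up on download of (?P<file>\S+): (?P<reason>.+?)\s*$")


def failed_downloads(log_lines):
    """Group abandoned downloads by the reason the client reported."""
    failures = defaultdict(list)
    for line in log_lines:
        match = GIVE_UP.search(line)
        if match:
            failures[match.group("reason")].append(match.group("file"))
    return dict(failures)


# A few lines copied from the log above, used as sample input.
sample_log = [
    "11/25/2013 6:22:36 PM | climateprediction.net | Scheduler request completed: got 2 new tasks",
    "11/25/2013 6:22:38 PM | climateprediction.net | Started download of hadam3p_pnw_c7r4_1984_1_008018167.zip",
    "11/25/2013 6:22:39 PM | climateprediction.net | Giving up on download of hadam3p_pnw_c7r4_1984_1_008018167.zip: permanent HTTP error",
    "11/25/2013 6:22:39 PM | climateprediction.net | Giving up on download of atmos_c7r4_1984_1_008018167_0.gz: permanent HTTP error",
    "11/25/2013 6:22:42 PM | climateprediction.net | Finished download of ic19611020_10_N96.gz",
]

for reason, files in failed_downloads(sample_log).items():
    print(reason, "->", ", ".join(files))
```

Because the pattern keys on the message text and not on the timestamp, it should also cope with the other date formats that appear in logs from different clients.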
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
It's a brand-new batch, and from what people are reporting it doesn't look like it is a good batch. Andy will need to take a look tomorrow. I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 22 Mar 06 Posts: 144 Credit: 24,695,428 RAC: 0 |
Just to let you know all my PNW downloads have failed - all 11 of them, including the 3 that are currently shown as In Progress. Comp ID 1290283. |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
"Just to let you know all my PNW downloads have failed - all 11 of them, including the 3 that are currently shown as In Progress. Comp ID 1290283." Yeah, it's a bad batch. |
Send message Joined: 17 Aug 04 Posts: 289 Credit: 44,103,664 RAC: 0 |
Also reporting that all my PNW downloads have failed too. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The recent batch was a mass re-issue from June 2012 by the BOINC software, so Abort anything that hasn't failed by itself. |
Send message Joined: 9 Sep 04 Posts: 228 Credit: 30,269,058 RAC: 2,247 |
???? Should we also abort the hadcm3n ? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Ah. No information available about them. I have one, and I'm letting it run. |
Send message Joined: 22 Mar 06 Posts: 144 Credit: 24,695,428 RAC: 0 |
I thought the recent hadcm3n models (I have 7) were just normal downloads, but since Bonsai911's post, I do notice that some of them have exactly the same sent times as the PNW models. However, looking at the recently sent hadcm3n work unit IDs, all of the tasks had 100% failures in the past, so are possibly part of a normal reissue. Like Les, I'm letting them run. Perhaps a wrong flag got set somewhere that resulted in the PNW models being released as well as reissued hadcm3n models? |
Send message Joined: 5 Aug 04 Posts: 1056 Credit: 16,521,771 RAC: 1,278 |
Me too. (I am not sure whether the following two items are the same work unit. I think they are, but if not, they behave similarly.)
8170815
26-Nov-2013 19:00:03 [climateprediction.net] Requesting new tasks
26-Nov-2013 19:00:08 [climateprediction.net] Scheduler request completed: got 1 new tasks
26-Nov-2013 19:00:10 [climateprediction.net] Started download of hadam3p_pnw_c6xt_1969_1_008024812.zip
26-Nov-2013 19:00:10 [climateprediction.net] Started download of atmos_c6xt_1969_1_008024812_0.gz
26-Nov-2013 19:00:12 [climateprediction.net] Giving up on download of hadam3p_pnw_c6xt_1969_1_008024812.zip: file not found
26-Nov-2013 19:00:12 [climateprediction.net] Giving up on download of atmos_c6xt_1969_1_008024812_0.gz: file not found
26-Nov-2013 19:00:12 [climateprediction.net] Started download of pnw_c6xt_1969_1_008024812_0.gz
26-Nov-2013 19:00:12 [climateprediction.net] Started download of HadISST_SST_N96_1968_12_1971_01f.gz
26-Nov-2013 19:00:13 [climateprediction.net] Giving up on download of pnw_c6xt_1969_1_008024812_0.gz: file not found
26-Nov-2013 19:00:13 [climateprediction.net] Started download of HadISST_SI_N96_1968_12_1971_01f.gz
26-Nov-2013 19:00:15 [climateprediction.net] Finished download of HadISST_SI_N96_1968_12_1971_01f.gz
26-Nov-2013 19:00:15 [climateprediction.net] Started download of so2dms_N96_1968_12_1971_02.gz
26-Nov-2013 19:00:18 [climateprediction.net] Finished download of so2dms_N96_1968_12_1971_02.gz
26-Nov-2013 19:00:20 [climateprediction.net] Finished download of HadISST_SST_N96_1968_12_1971_01f.gz |
Send message Joined: 31 Aug 04 Posts: 391 Credit: 219,888,554 RAC: 1,481,373 |
Looking at the logs here for the last 2 days -- seems like a batch of hadcm3n/RAPID/Rapit hit the server and got taken up by crunchers, no problems at all. Strange but good to be running a range of models, some starting in 1880 and some in 2060. The regional batch - yeah, problems. The crew will likely figure out and fix the regional model problem soonish. Meanwhile - keep on crunching. |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
As far as I can see all of the download failures are for reissued tasks from workunits created on 23rd July 2012. Every one I've looked at has one task which appears to be unsent (status unknown), for example WU 8179645. I suspect it's another instance of BOINC automatically generating new tasks 12500 hours after creation of a workunit which is still in the database and hasn't been completed or reached its error limit. This can only happen on projects like CPDN which don't remove results and workunits from the database. The download files were probably deleted from the server when that batch of work was "completed" (or weren't transferred over when the server was rebuilt). "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,063,325 RAC: 928 |
Looks like we are back to failed downloads on hadam3p_eu's. I have 2 of them sitting in the BOINC Manager. No files stuck in the transfer tab. It's the permanent HTTP error again. I see that server status shows zero so these are probably reissues of old WU's (not another complete bad batch) as both end in "2". Please tell them to kick the box. Error messages below:
12/19/2013 4:22:38 PM | climateprediction.net | Started download of hadam3p_eu_9ckn_1964_1_008052151.zip
12/19/2013 4:22:38 PM | climateprediction.net | Started download of atmos_9ckn_1964_1_008052151_0.gz
12/19/2013 4:22:41 PM | climateprediction.net | Giving up on download of hadam3p_eu_9ckn_1964_1_008052151.zip: permanent HTTP error
12/19/2013 4:22:41 PM | climateprediction.net | Giving up on download of atmos_9ckn_1964_1_008052151_0.gz: permanent HTTP error
12/19/2013 4:22:41 PM | climateprediction.net | Started download of eu_9ckn_1964_1_008052151_0.gz
12/19/2013 4:22:41 PM | climateprediction.net | Started download of ic19610106_10_N96.gz
12/19/2013 4:22:42 PM | climateprediction.net | Giving up on download of eu_9ckn_1964_1_008052151_0.gz: permanent HTTP error
12/19/2013 4:22:42 PM | climateprediction.net | Started download of HadISST_SST_N96_1963_12_1966_01f.gz
12/19/2013 4:22:44 PM | climateprediction.net | Finished download of ic19610106_10_N96.gz
12/19/2013 4:22:44 PM | climateprediction.net | Started download of HadISST_SI_N96_1963_12_1966_01f.gz
12/19/2013 4:22:47 PM | climateprediction.net | Finished download of HadISST_SI_N96_1963_12_1966_01f.gz
12/19/2013 4:22:47 PM | climateprediction.net | Started download of so2dms_N96_1963_12_1966_02.gz
12/19/2013 4:22:52 PM | climateprediction.net | Finished download of so2dms_N96_1963_12_1966_02.gz
12/19/2013 4:22:52 PM | climateprediction.net | Started download of oxi.addfa.gz
12/19/2013 4:22:54 PM | climateprediction.net | Finished download of HadISST_SST_N96_1963_12_1966_01f.gz
12/19/2013 4:22:54 PM | climateprediction.net | Started download of o3_A2_1959_2010_N96_f.anc.gz
12/19/2013 4:23:09 PM | climateprediction.net | Finished download of o3_A2_1959_2010_N96_f.anc.gz
12/19/2013 4:23:09 PM | climateprediction.net | Started download of hadam3p_eu_9kzp_1982_1_008053360.zip
12/19/2013 4:23:11 PM | climateprediction.net | Giving up on download of hadam3p_eu_9kzp_1982_1_008053360.zip: permanent HTTP error
12/19/2013 4:23:11 PM | climateprediction.net | Started download of atmos_9kzp_1982_1_008053360_0.gz
12/19/2013 4:23:12 PM | climateprediction.net | Giving up on download of atmos_9kzp_1982_1_008053360_0.gz: permanent HTTP error
12/19/2013 4:23:12 PM | climateprediction.net | Started download of eu_9kzp_1982_1_008053360_0.gz
12/19/2013 4:23:13 PM | climateprediction.net | Giving up on download of eu_9kzp_1982_1_008053360_0.gz: permanent HTTP error
12/19/2013 4:23:13 PM | climateprediction.net | Started download of ic19610123_10_N96.gz
12/19/2013 4:23:16 PM | climateprediction.net | Finished download of ic19610123_10_N96.gz
12/19/2013 4:23:16 PM | climateprediction.net | Started download of HadISST_SST_N96_1981_12_1984_01f.gz
12/19/2013 4:23:26 PM | climateprediction.net | Finished download of HadISST_SST_N96_1981_12_1984_01f.gz
12/19/2013 4:23:26 PM | climateprediction.net | Started download of HadISST_SI_N96_1981_12_1984_01f.gz
12/19/2013 4:23:29 PM | climateprediction.net | Finished download of HadISST_SI_N96_1981_12_1984_01f.gz
12/19/2013 4:23:29 PM | climateprediction.net | Started download of so2dms_N96_1981_12_1984_02.gz
12/19/2013 4:23:34 PM | climateprediction.net | Finished download of so2dms_N96_1981_12_1984_02.gz
12/19/2013 4:24:05 PM | climateprediction.net | Finished download of oxi.addfa.gz |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,187,965 RAC: 6,888 |
"Looks like we are back to failed downloads on hadam3p_eu's. I have 2 of them sitting in the BOINC Manager. No files stuck in the transfer tab. It's the permanent HTTP error again. I see that server status shows zero so these are probably reissues of old WU's (not another complete bad batch) as both end in "2"." Indeed, hunting through your collection of computers finds 9ckn at WU 8207265. |
Send message Joined: 22 Feb 06 Posts: 484 Credit: 29,602,471 RAC: 2,231 |
Looks like I have a similar problem:
20/12/2013 06:15:36 | climateprediction.net | Scheduler request completed: got 1 new tasks
20/12/2013 06:15:38 | climateprediction.net | Started download of hadam3p_eu_a166_1984_1_008055647.zip
20/12/2013 06:15:38 | climateprediction.net | Started download of atmos_a166_1984_1_008055647_0.gz
20/12/2013 06:15:40 | climateprediction.net | Giving up on download of hadam3p_eu_a166_1984_1_008055647.zip: permanent HTTP error
20/12/2013 06:15:40 | climateprediction.net | Giving up on download of atmos_a166_1984_1_008055647_0.gz: permanent HTTP error
20/12/2013 06:15:40 | climateprediction.net | Started download of eu_a166_1984_1_008055647_0.gz
20/12/2013 06:15:40 | climateprediction.net | Started download of HadISST_SST_N96_1983_12_1986_01f.gz
20/12/2013 06:15:42 | climateprediction.net | Giving up on download of eu_a166_1984_1_008055647_0.gz: permanent HTTP error
20/12/2013 06:15:42 | climateprediction.net | Started download of HadISST_SI_N96_1983_12_1986_01f.gz
20/12/2013 06:15:43 | climateprediction.net | Finished download of HadISST_SST_N96_1983_12_1986_01f.gz
20/12/2013 06:15:43 | climateprediction.net | Finished download of HadISST_SI_N96_1983_12_1986_01f.gz
20/12/2013 06:15:43 | climateprediction.net | Started download of so2dms_N96_1983_12_1986_02.gz
20/12/2013 06:15:45 | climateprediction.net | Finished download of so2dms_N96_1983_12_1986_02.gz |
Send message Joined: 16 Jan 10 Posts: 1081 Credit: 6,982,827 RAC: 3,789 |
[chavk wrote:] That hadam3p_eu_a166_1984_1_008055647 work unit was generated on 17 July 2012. At some point when the bulk of the models in that batch had completed, the project removed the download files, not realising that the work units would reappear later. A better policy might have been to remove only those work units that had completed and leave us to slowly mop up the rest - or, indeed, to stop the server reviving these phantoms. In any event, don't worry: there is nothing you can do to stop these errors. It's a project error. |
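As a rough cross-check of the 12500-hour theory suggested earlier in the thread, the arithmetic can be sketched in a few lines of Python. The interval is only that poster's suspicion, not a confirmed BOINC setting, and the time of day the work unit was created is not known, so midnight is assumed here. The name `expected_reissue` is made up for this example.

```python
from datetime import datetime, timedelta

# Interval suggested earlier in this thread; treated as an assumption here,
# not as a documented BOINC server setting.
REISSUE_DELAY_HOURS = 12500


def expected_reissue(created):
    """Return the moment a work unit created at `created` passes the delay."""
    return created + timedelta(hours=REISSUE_DELAY_HOURS)


# Creation date quoted above for the a166 work unit.
# The time of day is unknown, so midnight is assumed.
created = datetime(2012, 7, 17)
print("created:        ", created)
print("12500 h later:  ", expected_reissue(created))
```

With a midnight start this lands on the evening of 19 December 2013, close to when the failed hadam3p_eu downloads were reported above, though the unknown creation time means it can only be taken as approximate.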
Send message Joined: 22 Feb 06 Posts: 484 Credit: 29,602,471 RAC: 2,231 |
Have got this error on 5 downloads, all HADAM3P_eu:
Task 16151138, work ID 8211626
Task 16150408, work ID 8210761
Task 16149592, work ID 8209709
Task 16149590, work ID 8209707
Task 16148812, work ID 8206403
Thanks Ian, I'll ignore them. |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,187,965 RAC: 6,888 |
[chavk wrote:] It's curious that, for both that a166, and the 9ckn I found for Jim, it looks as if replication_0 was never issued in the first place. |
Send message Joined: 9 Sep 04 Posts: 228 Credit: 30,269,058 RAC: 2,247 |
First time appearance: 22.12.2013 16:53:48 | climateprediction.net | Scheduler request failed: HTTP gateway timeout |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
The real reason for the download failure was: WU download error: couldn't get input files and the explanation for this is in Iain Inglis's post four above this. The task must have tried and tried to get all its files before the download timed out. The numbering of tasks in this batch of workunits appears chaotic, or perhaps it's the order in which they are sent out that's chaotic. In several workunits that I've looked at, the _2 task was sent out first. They are all part of the defective batch created on 16 and 17 July 2012. Cpdn news |
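For anyone comparing task numbers across a batch, a small Python sketch of splitting a task name into its work unit name and the trailing _0, _1, _2 number may help. It relies on the usual BOINC convention that a task name is the work unit name plus an underscore and a number. The helper `split_task_name` is hypothetical, and the example task name is put together from a file name and the "_2" suffix mentioned in this thread, not copied from the server.

```python
def split_task_name(task_name):
    """Split 'workunit_N' into (workunit, N), where N is the task number.

    Note: a bare work unit name would be mis-split, because its last
    field is numeric too, so only pass full task names.
    """
    workunit, _, number = task_name.rpartition("_")
    if not workunit or not number.isdigit():
        raise ValueError("not a task name of the form <workunit>_<N>: " + task_name)
    return workunit, int(number)


# Illustrative task name assembled from this thread (assumed, see above).
workunit, number = split_task_name("hadam3p_eu_9ckn_1964_1_008052151_2")
print(workunit, number)
```

A task number of 2 or higher on a work unit whose earlier tasks never ran is the pattern described above for this batch.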
©2024 climateprediction.net