Message boards :
Number crunching :
Download errors on UK Met Office HadAM4 at N216 resolution v8.52 tasks
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,318,120 RAC: 11,354 |
Just downloaded four new tasks - 2xN144 and 2xN216. All downloads completed successfully. |
Send message Joined: 15 May 09 Posts: 4347 Credit: 16,541,921 RAC: 6,087 |
Just downloaded four new tasks - 2xN144 and 2xN216. All downloads completed successfully. Good news, I had a completed task upload before the website said the servers were back up and my couple of glances at the tasks running and ready to send were not enough to confirm that new tasks were being downloaded. |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,318,120 RAC: 11,354 |
I'm back again. 21/10/2021 09:04:24 | climateprediction.net | Started download of hadam4h_h1jm_201302_4_920_012117220.zipFour more tasks, issued 21 Oct 2021, 4:56:37 UTC to host 1498009, won't be coming home. |
Send message Joined: 10 May 20 Posts: 50 Credit: 3,368,088 RAC: 1,015 |
Same stuff repeating on my end on a Linux VM (Host 1519938). Tasks are stuck in the "Download: retry in xx:xx" loop. According to the event log some files started to download okay, but got suddenly stuck with the message "Temporarily failed download of …" & "Backing off xx:xx on download of …". |
Send message Joined: 15 May 09 Posts: 4347 Credit: 16,541,921 RAC: 6,087 |
And I am getting, Thu 21 Oct 2021 11:08:04 BST | climateprediction.net | Project requested delay of 3636 seconds Thu 21 Oct 2021 11:08:09 BST | cpdnboinc_dev | Sending scheduler request: To fetch work. Thu 21 Oct 2021 11:08:09 BST | cpdnboinc_dev | Requesting new tasks for CPU Thu 21 Oct 2021 11:08:10 BST | | Project communication failed: attempting access to reference site Thu 21 Oct 2021 11:08:10 BST | cpdnboinc_dev | Scheduler request failed: Couldn't connect to server Thu 21 Oct 2021 11:08:11 BST | | Internet access OK - project servers may be temporarily down.E Email sent. |
Send message Joined: 5 Aug 04 Posts: 1063 Credit: 16,546,621 RAC: 2,321 |
It seems to be working for me. I got four tasks last evening, one at a time, an hour apart. This is the last of them: Wed 20 Oct 2021 09:04:04 PM EDT | climateprediction.net | Sending scheduler request: To fetch work. Wed 20 Oct 2021 09:04:04 PM EDT | climateprediction.net | Requesting new tasks for CPU Wed 20 Oct 2021 09:04:06 PM EDT | climateprediction.net | Scheduler request completed: got 1 new tasks Wed 20 Oct 2021 09:04:06 PM EDT | climateprediction.net | Project requested delay of 3636 seconds Wed 20 Oct 2021 09:04:08 PM EDT | climateprediction.net | Started download of hadam4h_h0c7_200602_4_920_012115657.zip Wed 20 Oct 2021 09:04:08 PM EDT | climateprediction.net | Started download of h0c7_920_atmos.gz Wed 20 Oct 2021 09:04:11 PM EDT | climateprediction.net | Finished download of hadam4h_h0c7_200602_4_920_012115657.zip Wed 20 Oct 2021 09:04:11 PM EDT | climateprediction.net | Started download of ic_N216_2003_03_000052_f.nc.gz Wed 20 Oct 2021 09:04:18 PM EDT | climateprediction.net | Finished download of ic_N216_2003_03_000052_f.nc.gz Wed 20 Oct 2021 09:04:39 PM EDT | climateprediction.net | Finished download of h0c7_920_atmos.gz Wed 20 Oct 2021 09:04:40 PM EDT | climateprediction.net | Starting task hadam4h_h0c7_200602_4_920_012115657_0 So they must have fixed it between our two observations. |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,318,120 RAC: 11,354 |
So they must have fixed it between our two observations.Other way round. Your times are before ours. So they must have broken it between our two observations, or some unconnected external event must have broken it. |
Send message Joined: 15 May 09 Posts: 4347 Credit: 16,541,921 RAC: 6,087 |
Other way round. Your times are before ours. And still broken as of 50 minutes ago. Backoff will let me check again in 10. |
Send message Joined: 5 Aug 04 Posts: 1063 Credit: 16,546,621 RAC: 2,321 |
Other way round. Your times are before ours. My mental arithmetic is not what it used to be. |
Send message Joined: 7 Jun 17 Posts: 23 Credit: 44,434,789 RAC: 2,600,991 |
Same stuff repeating on my end on a Linux VM (Host 1519938). Tasks are stuck in the "Download: retry in xx:xx" loop. According to the event log some files started to download okay, but got suddenly stuck with the message "Temporarily failed download of …" & "Backing off xx:xx on download of …". Me too. Same issue on two hosts (ID: 1522999; ID: 1523002), although other hosts have received work units after the first one reported problems. Do I need to do anything, or does this get resolved server-side? Many thanks |
Send message Joined: 10 May 20 Posts: 50 Credit: 3,368,088 RAC: 1,015 |
Just retried now. Downloads finally work smoothly again at least on my host. |
Send message Joined: 15 May 09 Posts: 4347 Credit: 16,541,921 RAC: 6,087 |
Just retried now. Downloads finally work smoothly again at least on my host. I got project has no tasks available but there are another two batches on the way shortly. |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,318,120 RAC: 11,354 |
So they do. Forced a work fetch, and got four new tasks - all downloaded properly. They might have told us. |
Send message Joined: 15 May 09 Posts: 4347 Credit: 16,541,921 RAC: 6,087 |
So they do. Forced a work fetch, and got four new tasks - all downloaded properly. Strange, before heading to work this morning, I tried again and got the no work available message again but while out another task has downloaded so all seems to be working again. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
They may have re-issued the ones that failed downloading. |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,318,120 RAC: 11,354 |
Looking at the four I got last night, they were all created on 20 October, but my download late on 21 October was the first time they'd been issued. So they probably blithely went on deploying work, unaware that it couldn't be downloaded, Left hand, right hand. host 1498009 |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
They were emailed about it, but there's been no reply, so no idea what's going on there. :( |
©2024 climateprediction.net