Upload failures

Nuadormrac
Joined: 14 Oct 05
Posts: 44
Credit: 2,036,693
RAC: 0
Message 60507 - Posted: 30 Jun 2019, 12:23:15 UTC

6/30/2019 8:16:08 AM | climateprediction.net | Started upload of wah2_sam50_n1xd_198812_24_822_011878852_0_r1995804256_11.zip
6/30/2019 8:16:08 AM | climateprediction.net | Started upload of wah2_sam50_n5id_201412_24_822_011882296_0_r330904530_8.zip
6/30/2019 8:16:11 AM | climateprediction.net | Temporarily failed upload of wah2_sam50_n1xd_198812_24_822_011878852_0_r1995804256_11.zip: connect() failed
6/30/2019 8:16:11 AM | climateprediction.net | Backing off 00:07:35 on upload of wah2_sam50_n1xd_198812_24_822_011878852_0_r1995804256_11.zip
6/30/2019 8:16:11 AM | climateprediction.net | Temporarily failed upload of wah2_sam50_n5id_201412_24_822_011882296_0_r330904530_8.zip: connect() failed
6/30/2019 8:16:11 AM | climateprediction.net | Backing off 00:06:05 on upload of wah2_sam50_n5id_201412_24_822_011882296_0_r330904530_8.zip
6/30/2019 8:16:11 AM | climateprediction.net | Started upload of wah2_sam50_n5ro_201612_25_822_011883181_2_r705226955_22.zip
6/30/2019 8:16:11 AM | climateprediction.net | Started upload of wah2_sam50_n1xd_198812_24_822_011878852_0_r1995804256_17.zip
6/30/2019 8:16:12 AM | | Project communication failed: attempting access to reference site
6/30/2019 8:16:13 AM | climateprediction.net | Temporarily failed upload of wah2_sam50_n5ro_201612_25_822_011883181_2_r705226955_22.zip: connect() failed
6/30/2019 8:16:13 AM | climateprediction.net | Backing off 00:38:46 on upload of wah2_sam50_n5ro_201612_25_822_011883181_2_r705226955_22.zip
6/30/2019 8:16:13 AM | climateprediction.net | Temporarily failed upload of wah2_sam50_n1xd_198812_24_822_011878852_0_r1995804256_17.zip: connect() failed
6/30/2019 8:16:13 AM | climateprediction.net | Backing off 00:33:42 on upload of wah2_sam50_n1xd_198812_24_822_011878852_0_r1995804256_17.zip
6/30/2019 8:16:14 AM | | Internet access OK - project servers may be temporarily down.
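
For anyone who wants to see at a glance which files are stuck and for how long, a log excerpt like the one above can be summarized with a short script. This is only a minimal sketch, assuming the event log has been saved as plain text in the `timestamp | project | message` layout shown here. The function name `summarize_backoffs` is hypothetical and not part of BOINC.

```python
import re
from datetime import timedelta

# Matches the message part of lines such as:
#   "... | climateprediction.net | Backing off 00:07:35 on upload of <file>"
BACKOFF_RE = re.compile(r"Backing off (\d+):(\d{2}):(\d{2}) on upload of (\S+)")


def summarize_backoffs(log_text):
    """Return {filename: timedelta} holding the most recent backoff per file.

    Hypothetical helper: it only understands the line layout shown in
    this thread, not every message the client can emit.
    """
    backoffs = {}
    for line in log_text.splitlines():
        # The message is the last "|"-separated field of the line.
        message = line.rsplit("|", 1)[-1].strip()
        match = BACKOFF_RE.match(message)
        if match:
            hours, minutes, seconds, filename = match.groups()
            backoffs[filename] = timedelta(
                hours=int(hours), minutes=int(minutes), seconds=int(seconds)
            )
    return backoffs
```

Passing the pasted log text to `summarize_backoffs` gives one entry per file, so a long event log collapses into a short list of pending retries.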


I've got version 7.14.2, which is surely newer than the version 6.12.xx mentioned above, so this doesn't sound like that particular problem. This has been going on for days now...

mmonnin
Joined: 28 May 17
Posts: 49
Credit: 11,877,392
RAC: 5,582
Message 60511 - Posted: 30 Jun 2019, 17:49:26 UTC

The client version has nothing to do with a full disk on a project server.

Jim1348
Joined: 15 Jan 06
Posts: 637
Credit: 26,744,749
RAC: 7
Message 60512 - Posted: 30 Jun 2019, 20:18:22 UTC - in response to Message 60511.  

6 TB disk drives are selling for $200 in the U.S. Someone should pass the plate and take up a collection.

Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 60513 - Posted: 30 Jun 2019, 21:27:59 UTC

Depends on where the data center is.
Might even be in the USA.

Jean-David Beyer
Joined: 5 Aug 04
Posts: 718
Credit: 10,133,158
RAC: 6,135
Message 60514 - Posted: 30 Jun 2019, 21:42:20 UTC - in response to Message 60512.  

"6 TB disk drives are selling for $200 in the U.S. Someone should pass the plate and take up a collection."

Is the server tax-deductible in the USA? Is that the only problem? If so, what is their mailing address and how should the check be made out?

Alan K
Joined: 22 Feb 06
Posts: 451
Credit: 19,706,234
RAC: 2,509
Message 60515 - Posted: 30 Jun 2019, 22:01:18 UTC - in response to Message 60513.  

Apparently in Swindon UK.

Alan K
Joined: 22 Feb 06
Posts: 451
Credit: 19,706,234
RAC: 2,509
Message 60516 - Posted: 30 Jun 2019, 22:02:59 UTC - in response to Message 60513.  

"Depends on where the data center is.
Might even be in the USA."


Apparently in Swindon, UK.

Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 60517 - Posted: 30 Jun 2019, 22:13:46 UTC

Oh, well, that explains it.

Museum of Computing

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2129
Credit: 58,267,435
RAC: 8,229
Message 60518 - Posted: 1 Jul 2019, 1:42:12 UTC - in response to Message 60507.  

"I've got version 7.14.2, surely that's newer than version 6.12.xx mentioned above, so doesn't sound like that particular problem. This has been going on for days now..."

My reply that mentioned the BOINC version was in response to Chris L's and Dave's wondering whether there is still a 14-day limit on upload tries before the task aborts, and whether CPDN now requires newer versions of BOINC that may not have such limits.

"The client version has nothing to do with a full disk on a project server."

Absolutely.

crashtech
Joined: 1 Jun 17
Posts: 13
Credit: 11,107,619
RAC: 0
Message 60520 - Posted: 1 Jul 2019, 2:42:58 UTC

To be honest I don't have any issue with the outage, but it's slightly annoying to have to search the user forum to see if uploads can be enabled again. A push notification through BOINC once things are made right would be welcome.

https://boinc.berkeley.edu/trac/wiki/Notifications

Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 3581
Credit: 10,761,686
RAC: 5,528
Message 60521 - Posted: 1 Jul 2019, 5:30:04 UTC
Last modified: 1 Jul 2019, 7:13:09 UTC

https://en.wikipedia.org/wiki/Museum_of_Computing


Interesting to look at some of the machines at a rival museum here in Cambridge:

http://www.computinghistory.org.uk/sec/193/Computers/

I go there from time to time when fed up with the limitations of my hardware to remind myself what it used to be like! I would be giving away my age if I said how many of them I remember.

Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 60522 - Posted: 1 Jul 2019, 6:19:03 UTC - in response to Message 60521.  

Some memories there all right.

Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 3581
Credit: 10,761,686
RAC: 5,528
Message 60523 - Posted: 1 Jul 2019, 7:17:34 UTC

Just a thought.

As I remember, there used to be a maximum number of retries for zip uploads, and as far as I know there isn't a mechanism to stop some retrying while others are allowed. I'm not sure whether this limit still applies or what it is.

With this in mind, I am restricting internet access to times when there are at least two or three uploads that I expect to work, though the real danger of passing any limit probably comes from repeatedly pressing the try again button.

Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 60524 - Posted: 1 Jul 2019, 7:38:24 UTC

It's still very early in the UK, but with the levels of cabin fever so high, I've just sent an email to see if anything happened over the weekend.

David Wallom
Volunteer moderator
Project administrator
Joined: 26 Oct 11
Posts: 9
Credit: 3,275,889
RAC: 0
Message 60525 - Posted: 1 Jul 2019, 8:54:56 UTC - in response to Message 60524.  

Hello All,

Apologies for the continued unavailability of the jasmin-upload system, which we have been clearing out over the weekend. We have cleared 5 TB of space since Thursday, so we will be re-enabling uploads imminently.

Over this week we are going to reconfigure the data transfer from the upload server to the project storage so that we can take advantage of new capability within the JASMIN system to speed up these transfers in future.

Regards

David

David Wallom
Volunteer moderator
Project administrator
Joined: 26 Oct 11
Posts: 9
Credit: 3,275,889
RAC: 0
Message 60526 - Posted: 1 Jul 2019, 8:58:02 UTC - in response to Message 60525.  

There are now 140+ parallel uploads onto the system.

David

Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 3581
Credit: 10,761,686
RAC: 5,528
Message 60527 - Posted: 1 Jul 2019, 9:28:33 UTC - in response to Message 60526.  

"There are now 140+ parallel uploads onto the system."


So if you have internet access disabled, keep it that way until some of those who never read the boards, and whose machines will be uploading now, have finished. Eventually, the pressure on the server taking these uploads will ease off. If you try now, the connection is highly likely to time out because of the number of computers trying to upload!
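
The same idea is behind the "Backing off" lines in the logs: when many machines retry at once, spreading the retries out gives the server room to recover. Below is a generic sketch of randomized exponential backoff to illustrate the principle. It is not BOINC's actual algorithm, and the function name `next_backoff` and the `base` and `cap` values are assumptions chosen purely for illustration.

```python
import random


def next_backoff(failures, base=60.0, cap=4 * 3600.0, rng=random):
    """Seconds to wait after `failures` consecutive failed attempts.

    Illustrative only: the ceiling doubles with every failure up to
    `cap`, and the actual delay is drawn at random below that ceiling
    so that many clients do not all retry at the same moment.
    """
    ceiling = min(cap, base * (2 ** failures))
    return rng.uniform(0, ceiling)
```

With these assumed defaults, a first failure waits up to one minute, and after eight or more failures the wait can be anywhere up to four hours.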

[P3D] Crashtest
Joined: 2 Apr 05
Posts: 16
Credit: 16,862,994
RAC: 0
Message 60530 - Posted: 1 Jul 2019, 13:35:08 UTC

Well I have more than 3000 files waiting for upload ... but I only get:

01.07.2019 15:32:48 | climateprediction.net | Started upload of wah2_cam25_a09h_200405_18_691_011369890_0_r1209528118_10.zip
01.07.2019 15:32:48 | climateprediction.net | Started upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_22.zip
01.07.2019 15:33:11 | | Project communication failed: attempting access to reference site
01.07.2019 15:33:11 | climateprediction.net | Temporarily failed upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_22.zip: connect() failed
01.07.2019 15:33:11 | climateprediction.net | Backing off 04:53:16 on upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_22.zip
01.07.2019 15:33:11 | climateprediction.net | Started upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_23.zip
01.07.2019 15:33:12 | climateprediction.net | Temporarily failed upload of wah2_cam25_a09h_200405_18_691_011369890_0_r1209528118_10.zip: transient HTTP error
01.07.2019 15:33:12 | climateprediction.net | Backing off 04:41:38 on upload of wah2_cam25_a09h_200405_18_691_011369890_0_r1209528118_10.zip
01.07.2019 15:33:12 | climateprediction.net | Started upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_24.zip
01.07.2019 15:33:13 | | Internet access OK - project servers may be temporarily down.
01.07.2019 15:33:33 | | Project communication failed: attempting access to reference site
01.07.2019 15:33:33 | climateprediction.net | Temporarily failed upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_24.zip: transient HTTP error
01.07.2019 15:33:33 | climateprediction.net | Backing off 05:08:51 on upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_24.zip
01.07.2019 15:33:34 | | Internet access OK - project servers may be temporarily down.
01.07.2019 15:34:23 | | Project communication failed: attempting access to reference site
01.07.2019 15:34:23 | climateprediction.net | Temporarily failed upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_23.zip: transient HTTP error
01.07.2019 15:34:23 | climateprediction.net | Backing off 03:50:59 on upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_23.zip
01.07.2019 15:34:24 | | Internet access OK - project servers may be temporarily down.
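
With thousands of files queued, it can help to tally why the uploads fail rather than read every line. The following is a rough sketch that counts the reason given after each "Temporarily failed upload" message, assuming the log text uses the wording shown above. The function name `count_failure_reasons` is hypothetical and not part of BOINC.

```python
import re
from collections import Counter

# Captures the file name and the reason from messages such as:
#   "Temporarily failed upload of <file>: transient HTTP error"
FAILED_RE = re.compile(r"Temporarily failed upload of (\S+): (.+)$")


def count_failure_reasons(log_text):
    """Return a Counter mapping each failure reason to how often it occurs."""
    reasons = Counter()
    for line in log_text.splitlines():
        match = FAILED_RE.search(line)
        if match:
            reasons[match.group(2).strip()] += 1
    return reasons
```

A tally dominated by `connect() failed` would suggest the server is not accepting connections at all, while `transient HTTP error` means a connection was attempted but the transfer did not complete. That reading is an interpretation of the message text, not something confirmed by the project.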

Jim1348
Joined: 15 Jan 06
Posts: 637
Credit: 26,744,749
RAC: 7
Message 60531 - Posted: 1 Jul 2019, 14:19:26 UTC - in response to Message 60530.  

They are starting for me. I have three uploads going at 300+ Kbps.

[P3D] Crashtest
Joined: 2 Apr 05
Posts: 16
Credit: 16,862,994
RAC: 0
Message 60532 - Posted: 1 Jul 2019, 15:53:09 UTC - in response to Message 60531.  

One system reboot later (incl. flushdns):

01.07.2019 17:49:28 | climateprediction.net | Started upload of wah2_cam25_a09h_200405_18_691_011369890_0_r1209528118_10.zip
01.07.2019 17:49:28 | climateprediction.net | Started upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_17.zip
01.07.2019 17:49:52 | | Project communication failed: attempting access to reference site
01.07.2019 17:49:52 | climateprediction.net | Temporarily failed upload of wah2_cam25_a09h_200405_18_691_011369890_0_r1209528118_10.zip: transient HTTP error
01.07.2019 17:49:52 | climateprediction.net | Backing off 04:22:50 on upload of wah2_cam25_a09h_200405_18_691_011369890_0_r1209528118_10.zip
01.07.2019 17:49:52 | climateprediction.net | Started upload of wah2_sam50_n1zc_199012_24_822_011878923_0_r472347994_18.zip
01.07.2019 17:49:54 | | Internet access OK - project servers may be temporarily down.


So I'm sure the servers are too small to handle our little output.
©2022 climateprediction.net