climateprediction.net (CPDN) home page
Thread 'transient HTTP error'

Thread 'transient HTTP error'

Message boards : Number crunching : transient HTTP error
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 59094 - Posted: 26 Nov 2018, 10:27:31 UTC

Someone has kicked something.

My hadcm3s zip is now uploading. With my bored band speed, I suspect that even were it not for the problems with space for data at Oxford, a really fast computer would be producing data more quickly than I could upload it if running all new global models!
........
........
........


And finally upload finished!
ID: 59094 · Report as offensive     Reply Quote
WB8ILI

Send message
Joined: 1 Sep 04
Posts: 161
Credit: 81,522,141
RAC: 1,164
Message 59191 - Posted: 17 Dec 2018, 12:45:10 UTC

I have been stuck uploading some zips for batch 691 for several days. Transient HTTP error.
ID: 59191 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 59192 - Posted: 17 Dec 2018, 13:55:42 UTC - in response to Message 59191.  

I wonder if they go somewhere else, I don't have any of that batch but the four different batch numbers I do have tasks for all seem to be uploading their zips normally.
ID: 59192 · Report as offensive     Reply Quote
Profilegeophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 59193 - Posted: 17 Dec 2018, 16:28:25 UTC

Ian has/had batch 691 upload problems as well. Andy just changed a timeout value on the server, but I don't know if that will solve the problem.
ID: 59193 · Report as offensive     Reply Quote
WB8ILI

Send message
Joined: 1 Sep 04
Posts: 161
Credit: 81,522,141
RAC: 1,164
Message 59195 - Posted: 17 Dec 2018, 17:31:21 UTC

The detail for batch 691 uploads is -


12/17/2018 12:27:16 PM | climateprediction.net | Started upload of wah2_cam25_a09n_200405_18_691_011369896_0_r1612571459_7.zip
12/17/2018 12:27:16 PM | climateprediction.net | [file_xfer] URL: http://upload6.cpdn.org/cgi-bin/file_upload_handler
12/17/2018 12:27:18 PM | | Internet access OK - project servers may be temporarily down.
ID: 59195 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 59197 - Posted: 17 Dec 2018, 22:11:07 UTC - in response to Message 59191.  
Last modified: 17 Dec 2018, 22:14:53 UTC

I have been stuck uploading some zips for batch 691 for several days. Transient HTTP error.

As George mentioned, I also have stuck uploads for a task in this batch. The task has completed and everything has been successfully sent to upload6 except for:

  • wah2_cam25_a0fm_200405_18_691_011370111_2_r66429025_13.zip is stuck at 54.19%, first failed at 13:07:05 on 15th, has been retried 24 times and is getting up to 54.41% on each attempt.

  • wah2_cam25_a0fm_200405_18_691_011370111_2_r66429025_16.zip is stuck at 8.56%, first failed at 05:58:34 on 16th, has been retried 15 times and is getting up to 8.68% on each attempt.

In both cases the BOINC event log shows that the file was "locked by file_upload_handler" for 2 hours after the first attempt and gives no indication why subsequent attempts have been failing.

Wireshark shows that retries are successfully negotiating the restart offset and data is being sent before timing out. The same restart point is negotiated on every retry, indicating that none of the retransmitted data is being received.


"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 59197 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 59198 - Posted: 18 Dec 2018, 20:52:19 UTC

My first one of these cam25's had 3 zips that refused to upload.
As back then no one knew if they were even needed anymore, I let the model run, then made a copy of the 3 zips and aborted them to let the model finish.
But I've now got a second one, which has just uploaded it's zip 15, and there are no problems so far.

I've been having similar problems with a few sas50's but they eventually get through.

It looks like those two research places need better control of their servers.
Possibly a lot of people are putting files and programs onto the one server, and it can't handle it.
ID: 59198 · Report as offensive     Reply Quote
ProfileSteve Dodd

Send message
Joined: 28 Oct 11
Posts: 15
Credit: 9,958,339
RAC: 6,770
Message 59284 - Posted: 2 Jan 2019, 9:07:43 UTC - in response to Message 59198.  

I'm getting these messages for all of my wah2 zip files:

1/2/2019 1:01:59 AM | climateprediction.net | Temporarily failed upload of wah2_safr50_n0nt_198812_14_781_011715489_1_r171020809_7.zip: connect() failed
1/2/2019 1:01:59 AM | climateprediction.net | Backing off 00:06:25 on upload of wah2_safr50_n0nt_198812_14_781_011715489_1_r171020809_7.zip

There are 5 of them queued to transfer all with the same type error.
ID: 59284 · Report as offensive     Reply Quote
bernard_ivo

Send message
Joined: 18 Jul 13
Posts: 438
Credit: 25,633,761
RAC: 2,021
Message 59285 - Posted: 2 Jan 2019, 9:55:05 UTC - in response to Message 59284.  

I also get Transient HTTP error for the safr WUs, it should go to a UK server starting 192.171..
ID: 59285 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 59286 - Posted: 2 Jan 2019, 12:32:08 UTC - in response to Message 59285.  

Getting problems on both sas50 and safr50s. Email sent to project.
ID: 59286 · Report as offensive     Reply Quote
ProfileAlan K

Send message
Joined: 22 Feb 06
Posts: 491
Credit: 31,151,719
RAC: 15,407
Message 59289 - Posted: 2 Jan 2019, 23:09:24 UTC - in response to Message 59286.  

Might get some response but it was only their first day back after Xmas - possibly.
ID: 59289 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 59290 - Posted: 3 Jan 2019, 8:06:29 UTC - in response to Message 59289.  

Might get some response but it was only their first day back after Xmas - possibly.


Hilary term doesn't start until 13th so I am not holding my breath, though some things were sorted after last term finished. I just don't know who starts work before the new term and who is on a strict academic time table.
ID: 59290 · Report as offensive     Reply Quote
ProfileAlan K

Send message
Joined: 22 Feb 06
Posts: 491
Credit: 31,151,719
RAC: 15,407
Message 59291 - Posted: 3 Jan 2019, 12:47:21 UTC - in response to Message 59290.  

I would expect support staff to be back as of yesterday - but there again.... If the project aren't checking email.
ID: 59291 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 59292 - Posted: 3 Jan 2019, 15:05:25 UTC - in response to Message 59291.  

All of the safr50 779, 780 and 781 zips on one machine cleared out within the last 20 minutes, but the 777 on another machine remain stuck.
ID: 59292 · Report as offensive     Reply Quote
Rayburner

Send message
Joined: 17 Jan 05
Posts: 10
Credit: 23,525,643
RAC: 0
Message 59293 - Posted: 3 Jan 2019, 17:57:02 UTC

I have one 691 stuck

wah2_cam25_a0de_200405_18_691_011370031
ID: 59293 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 59294 - Posted: 3 Jan 2019, 19:55:18 UTC - in response to Message 59292.  

My 777 are now uploading, with four zips remaining. They are 92 MB each. I know that has been commented on before, and it is no problem for my ISP. But it helps explain why the servers are struggling.
ID: 59294 · Report as offensive     Reply Quote
ProfileSteve Dodd

Send message
Joined: 28 Oct 11
Posts: 15
Credit: 9,958,339
RAC: 6,770
Message 59295 - Posted: 4 Jan 2019, 2:12:13 UTC

All my backlog has uploaded. Now to see how long it takes to get credit :)
ID: 59295 · Report as offensive     Reply Quote
Rayburner

Send message
Joined: 17 Jan 05
Posts: 10
Credit: 23,525,643
RAC: 0
Message 59296 - Posted: 4 Jan 2019, 11:28:56 UTC - in response to Message 59293.  

mine are still sitting there.

from the logs it ssems to be a spefically upload6.cpdn.org which is having an issue.

04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Info: Found bundle for host upload6.cpdn.org: 0x355d500 [can pipeline]
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Info: Re-using existing connection! (#227) with host upload6.cpdn.org
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Info: Connected to upload6.cpdn.org (158.97.9.11) port 80 (#227)
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: Host: upload6.cpdn.org
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.14.2)
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: Accept: */*
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: Accept-Encoding: deflate, gzip
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: Content-Type: application/x-www-form-urlencoded
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: Accept-Language: de_DE
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: Content-Length: 70022930
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: Expect: 100-continue
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server:
04.01.2019 12:22:26 | climateprediction.net | [http] [ID#130] Sent header to server: ain=.google.com; HttpOnly
04.01.2019 12:22:27 | climateprediction.net | [http] [ID#130] Received header from server: HTTP/1.1 100 Continue
04.01.2019 12:22:48 | climateprediction.net | [http] [ID#130] Info: Recv failure: Connection was reset
04.01.2019 12:22:48 | climateprediction.net | [http] [ID#130] Info: Closing connection 227
04.01.2019 12:22:48 | climateprediction.net | [http] HTTP error: Failure when receiving data from the peer
04.01.2019 12:22:48 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
04.01.2019 12:22:48 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
04.01.2019 12:22:48 | climateprediction.net | Temporarily failed upload of wah2_cam25_a0de_200405_18_691_011370031_0_r1337013121_7.zip: transient HTTP error
04.01.2019 12:22:48 | climateprediction.net | Backing off 04:46:21 on upload of wah2_cam25_a0de_200405_18_691_011370031_0_r1337013121_7.zip
ID: 59296 · Report as offensive     Reply Quote
ProfileIain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,860,147
RAC: 4,891
Message 59297 - Posted: 4 Jan 2019, 14:30:19 UTC - in response to Message 59296.  

[Rayburner wrote:]
mine are still sitting there.

from the logs it ssems to be a spefically upload6.cpdn.org which is having an issue.

...

I believe that there is a particular problem with old CAM25 models, such as the batch #691 example you've got (issued on 13 December 2017). However, I can't immediately find a solution, other than waiting or aborting.
ID: 59297 · Report as offensive     Reply Quote
Rayburner

Send message
Joined: 17 Jan 05
Posts: 10
Credit: 23,525,643
RAC: 0
Message 59298 - Posted: 4 Jan 2019, 16:41:49 UTC - in response to Message 59297.  

I will just wait … :-)
ID: 59298 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : Number crunching : transient HTTP error

©2024 cpdn.org