climateprediction.net home page
Posts by Thyme Lawn

21) Message boards : Number crunching : Project communication failed: attempting access to reference site (Message 54803)
Posted 20 Sep 2016 by Profile Thyme Lawn
Post:
Thanks Vitalii, that definitely appears to be a server problem (there are no [http_xfer] messages at the point where the file content should be being transmitted to the server). I'll notify the project team.
22) Message boards : Number crunching : Project communication failed: attempting access to reference site (Message 54799)
Posted 20 Sep 2016 by Profile Thyme Lawn
Post:
That contains this line:

19.09.2016 22:50:06 | | [http] [ID#0] Received header from server: HTTP/1.1 302 Found

That's not where the problem lies, Les. The 302 is the response from google.com to BOINC's internet connectivity check, telling it to redirect the request to google.com.ua.

The relevant lines from Vitalii's output are:

19.09.2016 22:49:38 | climateprediction.net | [http] [ID#9] Info: Connected to upload6.cpdn.org (158.97.9.11) port 80 (#35)
19.09.2016 22:49:38 | climateprediction.net | [http] [ID#9] Sent header to server: Content-Length: 4540690
19.09.2016 22:49:38 | climateprediction.net | [http] [ID#9] Sent header to server: Expect: 100-continue
19.09.2016 22:49:39 | climateprediction.net | [http] [ID#9] Received header from server: HTTP/1.1 100 Continue
19.09.2016 22:50:05 | climateprediction.net | [http] [ID#9] Info: Recv failure: Connection was reset
19.09.2016 22:50:05 | climateprediction.net | [http] [ID#9] Info: Closing connection 35
19.09.2016 22:50:05 | climateprediction.net | [http] HTTP error: Failure when receiving data from the peer


The "HTTP/1.1 100 Continue" response indicates that an HTTP/1.1 connection to the upload server has been made successfully, that the file offset to start uploading from has been determined, and that BOINC has started sending 4,540,690 bytes of data (if the file being sent is larger than that, some data has already been transferred successfully to upload6). The next line shows that the connection was reset 26 seconds later because of a receive failure. My guess is that this points to a problem on that upload server (upload6.cpdn.org, 158.97.9.11).
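For anyone who wants to check the timing for themselves, here is a minimal Python sketch that measures the gap between the "100 Continue" response and the reset. It assumes the dd.mm.yyyy timestamp layout shown in the lines above; the regular expression and the function names are my own and are not part of BOINC.

```python
import re
from datetime import datetime

# Three of the event-log lines quoted above (dd.mm.yyyy timestamps).
LOG = """\
19.09.2016 22:49:38 | climateprediction.net | [http] [ID#9] Sent header to server: Content-Length: 4540690
19.09.2016 22:49:39 | climateprediction.net | [http] [ID#9] Received header from server: HTTP/1.1 100 Continue
19.09.2016 22:50:05 | climateprediction.net | [http] [ID#9] Info: Recv failure: Connection was reset
"""

# timestamp | project | [http] [ID#n] message text
LINE_RE = re.compile(r"^(\S+ \S+) \| ([^|]*) \| \[http\] \[ID#(\d+)\] (.*)$")


def parse(line):
    """Split one [http] log line into (time, project, request id, text)."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    stamp, project, req_id, text = m.groups()
    when = datetime.strptime(stamp, "%d.%m.%Y %H:%M:%S")
    return when, project.strip(), int(req_id), text


def seconds_from_continue_to_reset(log):
    """Return the seconds between '100 Continue' and the receive failure."""
    continued = None
    for line in log.splitlines():
        parsed = parse(line)
        if parsed is None:
            continue
        when, _project, _req_id, text = parsed
        if text.endswith("HTTP/1.1 100 Continue"):
            continued = when
        elif "Recv failure" in text and continued is not None:
            return (when - continued).total_seconds()
    return None


delay = seconds_from_continue_to_reset(LOG)
print(delay)
```

Logs written with a different regional date format (e.g. 19/09/2016) would need the strptime pattern changed to match.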

Enabling "http_xfer_debug" will directly indicate whether any data has been transferred. I didn't suggest it earlier because it generates a message for each section of data sent to the server, which very quickly mounts up (in my case it's about 180 messages per second for an upload). The messages don't identify the project and the ID# is the only way to tie them to the original request (i.e. enabling http_xfer_debug with http_debug disabled is meaningless). The following messages are selected from my previous post with a couple of http_xfer_debug messages included and lots of lines removed. The first POST sequence determines how many bytes the server has already received and the second transfers the file from that offset:

19/09/2016 11:11:32 | climateprediction.net | Started upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627)
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 311
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: We are completely uploaded and fine
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info: Connection #627 to host upload2.cpdn.org left intact
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info: Re-using existing connection! (#627) with host upload2.cpdn.org
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info: Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627)
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 23198108
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Expect: 100-continue
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 100 Continue
19/09/2016 11:11:35 | | [http_xfer] [ID#9825] HTTP: wrote 2956 bytes
19/09/2016 11:11:35 | | [http_xfer] [ID#9825] HTTP: wrote 2441 bytes
19/09/2016 11:18:04 | climateprediction.net | [http] [ID#9825] Info: We are completely uploaded and fine
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK
19/09/2016 11:18:07 | climateprediction.net | Finished upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip
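Because the [http_xfer] lines carry only the ID#, totalling them per request is the easiest way to see whether data is flowing. The following is a rough Python sketch of that idea; the regular expression and function name are my own assumptions based on the message layout shown above, and the sample contains only the two [http_xfer] lines quoted here, not the full upload.

```python
import re
from collections import defaultdict

# Matches lines such as: "| | [http_xfer] [ID#9825] HTTP: wrote 2956 bytes"
XFER_RE = re.compile(r"\[http_xfer\] \[ID#(\d+)\] HTTP: wrote (\d+) bytes")


def bytes_written_by_request(lines):
    """Sum the 'wrote N bytes' messages for each request ID."""
    totals = defaultdict(int)
    for line in lines:
        m = XFER_RE.search(line)
        if m:
            totals[int(m.group(1))] += int(m.group(2))
    return dict(totals)


sample = [
    "19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 100 Continue",
    "19/09/2016 11:11:35 | | [http_xfer] [ID#9825] HTTP: wrote 2956 bytes",
    "19/09/2016 11:11:35 | | [http_xfer] [ID#9825] HTTP: wrote 2441 bytes",
]

totals = bytes_written_by_request(sample)
print(totals)
```

An ID# that shows a "100 Continue" line but never appears in the totals would match the failure pattern described above.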
23) Message boards : Number crunching : Project communication failed: attempting access to reference site (Message 54794)
Posted 19 Sep 2016 by Profile Thyme Lawn
Post:
In BOINC Manager you can enable debugging of HTTP conversations by pressing <Ctrl>+<Shift>+F, ticking the "http_debug" checkbox and clicking OK (to disable the debugging, repeat that process, this time unticking the "http_debug" checkbox).
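As far as I know, the same flags can also be set in the log_flags section of cc_config.xml in the BOINC data directory. Here is a small Python sketch that builds that fragment; the helper name is hypothetical, it only prints the XML rather than writing any file, and where the file lives on your system is not something the sketch assumes.

```python
import xml.etree.ElementTree as ET


def build_cc_config(log_flags):
    """Return cc_config XML text with one element per log flag (1 = on, 0 = off)."""
    root = ET.Element("cc_config")
    flags_el = ET.SubElement(root, "log_flags")
    for name, enabled in log_flags.items():
        ET.SubElement(flags_el, name).text = "1" if enabled else "0"
    return ET.tostring(root, encoding="unicode")


# http_debug on, the much noisier http_xfer_debug left off.
xml_text = build_cc_config({"http_debug": True, "http_xfer_debug": False})
print(xml_text)
```

If you do edit the file by hand, BOINC needs to be told to re-read its config files (or be restarted) before the change takes effect.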

A successful file transfer will result in an event log sequence similar to the following:

19/09/2016 11:11:32 | climateprediction.net | Started upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info:    Trying 129.67.195.140...
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info:  Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627)
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Host: upload2.cpdn.org
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.6.22)
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept: */*
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Encoding: deflate, gzip
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Type: application/x-www-form-urlencoded
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Language: en_GB
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 311
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Sent header to server:
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info:  We are completely uploaded and fine
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Date: Mon, 19 Sep 2016 10:10:26 GMT
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Server: Apache/2.4.6 (CentOS)
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Content-Length: 93
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server: Content-Type: text/plain; charset=UTF-8
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Received header from server:
19/09/2016 11:11:33 | climateprediction.net | [http] [ID#9825] Info:  Connection #627 to host upload2.cpdn.org left intact
19/09/2016 11:11:34 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle set
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info:  Found bundle for host upload2.cpdn.org: 0x1c365e04cb0
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info:  Re-using existing connection! (#627) with host upload2.cpdn.org
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Info:  Connected to upload2.cpdn.org (129.67.195.140) port 80 (#627)
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Host: upload2.cpdn.org
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.6.22)
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept: */*
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Encoding: deflate, gzip
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Type: application/x-www-form-urlencoded
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Accept-Language: en_GB
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Content-Length: 23198108
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server: Expect: 100-continue
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Sent header to server:
19/09/2016 11:11:35 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 100 Continue
19/09/2016 11:18:04 | climateprediction.net | [http] [ID#9825] Info:  We are completely uploaded and fine
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: HTTP/1.1 200 OK
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Date: Mon, 19 Sep 2016 10:10:28 GMT
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Server: Apache/2.4.6 (CentOS)
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Transfer-Encoding: chunked
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server: Content-Type: text/plain; charset=UTF-8
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Received header from server:
19/09/2016 11:18:06 | climateprediction.net | [http] [ID#9825] Info:  Connection #627 to host upload2.cpdn.org left intact
19/09/2016 11:18:07 | climateprediction.net | Finished upload of wah2_sas50_ky74_200912_13_448_010718036_0_r921421034_1.zip
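The timestamps in that sequence also give a rough upload rate. A minimal Python sketch, assuming the transfer runs from the "100 Continue" line to the second "We are completely uploaded and fine" line, and treating the Content-Length of the second POST as the amount sent (it covers the whole request body, so the figure is approximate):

```python
from datetime import datetime

FMT = "%d/%m/%Y %H:%M:%S"

# Values copied from the log above.
started = datetime.strptime("19/09/2016 11:11:35", FMT)   # 100 Continue received
finished = datetime.strptime("19/09/2016 11:18:04", FMT)  # completely uploaded
content_length = 23198108                                  # bytes in the second POST

elapsed = (finished - started).total_seconds()
rate = content_length / elapsed  # bytes per second

print(f"{elapsed:.0f} s, {rate / 1024:.1f} KiB/s")
```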


The upload messages will be preceded by those for the associated trickle:

19/09/2016 11:11:30 | climateprediction.net | Sending scheduler request: To send trickle-up message.
19/09/2016 11:11:30 | climateprediction.net | Not requesting tasks: don't need (not highest priority project)
19/09/2016 11:11:30 | climateprediction.net | [http] HTTP_OP::init_post(): http://climateapps2.oerc.ox.ac.uk/cpdnboinc_cgi/cgi
19/09/2016 11:11:30 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle set
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Info:    Trying 129.67.195.185...
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Info:  Connected to climateapps2.oerc.ox.ac.uk (129.67.195.185) port 80 (#626)
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: POST /cpdnboinc_cgi/cgi HTTP/1.1
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Host: climateapps2.oerc.ox.ac.uk
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.6.22)
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Accept: */*
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Accept-Encoding: deflate, gzip
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Content-Type: application/x-www-form-urlencoded
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Accept-Language: en_GB
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Content-Length: 37921
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server: Expect: 100-continue
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Sent header to server:
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Received header from server: HTTP/1.1 100 Continue
19/09/2016 11:11:31 | climateprediction.net | [http] [ID#1] Info:  We are completely uploaded and fine
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: HTTP/1.1 200 OK
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Date: Mon, 19 Sep 2016 10:11:33 GMT
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Server: Apache
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Connection: close
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Transfer-Encoding: chunked
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server: Content-Type: text/xml
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Received header from server:
19/09/2016 11:11:32 | climateprediction.net | [http] [ID#1] Info:  Closing connection 626
19/09/2016 11:11:32 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle 'C:\Program Files\BOINC\ca-bundle.crt'
19/09/2016 11:11:32 | climateprediction.net | [http] HTTP_OP::libcurl_exec(): ca-bundle set
19/09/2016 11:11:32 | climateprediction.net | Scheduler request completed
24) Message boards : Number crunching : News and Announcements 2 (Message 53885)
Posted 4 Apr 2016 by Profile Thyme Lawn
Post:
Archived credits have been restored for all users. These aren't included in the total at the moment but should be on completion of the credit script Andy has started.
25) Message boards : Number crunching : Total Credit (Message 53884)
Posted 4 Apr 2016 by Profile Thyme Lawn
Post:
Not sure if I should be posting this as I don't want to raise false hopes but credits are currently being re-calculated and should include archived credit.

Many thanks to Andy for his work on this and to the moderators who have fielded most of the queries/flack on this.

That will with luck leave more time for looking at the posting new topics problem which I am told looks as if it is due to an upgrade from PHP5.4 To PHP.5

I do realise that some like myself may be old enough to remember a different meaning of upgrade which had something to do with an improvement!

Archived credits now appear on user pages but won't be included in the total until the credit script finishes running. My account page currently shows that I'll be getting back more than 5.5 million when it's done:
Computing and credit

Total credit 5,582,914
Recent average credit 2,199.89
Archived results (not viewable) 559
Archived credit (included in total) 5,557,035
26) Message boards : Number crunching : Total Credit (Message 53866)
Posted 1 Apr 2016 by Profile Thyme Lawn
Post:
The outstanding missing credit is from tasks which were archived in 2010. The account page for all users has "Archived credit (included in total)" set to 0. The project team are aware that archived credits haven't been carried forward to the new server and are investigating.
27) Message boards : Number crunching : News and Announcements 2 (Message 53052)
Posted 8 Dec 2015 by Profile Thyme Lawn
Post:
Project will be offline from 1000 UTC on Wednesday 9th December for at least 24 hours

News from the project team:
We will be taking the project offline tomorrow (Wednesday 9th December) from 10am (UK time) in order to take a snapshot of the database. This is part of the process of the re-configuration of a slave database machine. Once this snapshot process has completed we will bring the project back online again, we anticipate that this process will take a minimum of 24 hours to complete. We apologise in advance for any inconvenience.
28) Message boards : Number crunching : CPDN process is not running?? (Message 53049)
Posted 7 Dec 2015 by Profile Thyme Lawn
Post:
Windows Task Manager is not very detailed, so maybe I should look for a better tool to find out how much my memory load really is.

I use Sysinternals Process Explorer.

The Sysinternals suite started off as a third party freeware product. Microsoft bought it up 9 years ago and at least one of the original developers is still involved in its development.
29) Message boards : Number crunching : Late November batch of Windows work (Message 53042)
Posted 7 Dec 2015 by Profile Thyme Lawn
Post:
Two of my WAH2 models from the 29 November batch have completed. Some others have failed early on: at least one of those has made some progress on another computer, which makes me wonder whether they don't like being run with too many in parallel (my habit is to run 25% CPUs, except when getting new work when I put CPUs back to 100% - the crashes all occurred during the 100% period).

The memory load for WAH2 seems to be much higher than was the case for previous applications. My wah2_eu25 tasks have a total working set size of around 460MB and I've changed the project resource shares on my Q6600 (which only has 2GB of RAM) to prevent it from running more than one of these tasks.
30) Message boards : Number crunching : Download failed (Message 53014)
Posted 3 Dec 2015 by Profile Thyme Lawn
Post:
The problem has been traced to a now-corrected "permission denied" error when transferring the affected workunits to the download server.
31) Message boards : Number crunching : Download failed (Message 53011)
Posted 2 Dec 2015 by Profile Thyme Lawn
Post:
It's definitely not your problem ChrisD. Clicking the '+' on the Stderr line of one of your failed tasks (e.g. hadam3p_pnw_xg5r_200712_12_010210835_1) shows that there was a permanent HTTP error on one of the input files (i.e. it wasn't available at the URL the server told BOINC to fetch it from). I've passed that up to the project team.
32) Message boards : Number crunching : Late November batch of Windows work (Message 53010)
Posted 2 Dec 2015 by Profile Thyme Lawn
Post:
How often do these tasks checkpoint? Looking at the task running now, it seems it's been over 50 minutes of CPU time since the last checkpoint.

All CPDN models checkpoint at fixed points in the calculation. For these models it's at the end of each model day, with trickles and uploads being made every 30 model days.

My i5 has a checkpoint interval of just under 50 minutes and for the Q6600 it's around 70 minutes.
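Since a trickle and upload happen every 30 model days and a checkpoint happens once per model day, the CPU time between trickles follows directly from the checkpoint interval. A quick Python sketch of that arithmetic, using the two checkpoint intervals quoted above (rounded, so the results are approximate):

```python
MODEL_DAYS_PER_TRICKLE = 30  # one checkpoint per model day, trickle every 30 days


def hours_between_trickles(checkpoint_minutes):
    """CPU hours between trickles for a given checkpoint (model day) interval."""
    return checkpoint_minutes * MODEL_DAYS_PER_TRICKLE / 60


faster_host = hours_between_trickles(50)  # "just under 50 minutes" per model day
q6600 = hours_between_trickles(70)        # "around 70 minutes" per model day

print(faster_host, q6600)
```

So roughly a day of CPU time per trickle on the faster machine and about a day and a half on the Q6600.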
33) Message boards : Number crunching : Total Credit (Message 52998)
Posted 30 Nov 2015 by Profile Thyme Lawn
Post:
Credits are only granted when the daily script is run, and it appears to be working fine to me. The last of the 5 HadAM3P EU tasks downloaded to my i5 a couple of weeks ago started running a couple of days ago and has been receiving credits for its trickles (hadam3p_eu_fb17_2014_0_010389523_0).

Looking at the task list for your computer it completed 2 tasks on 23rd November and has had no CPDN work (other than a couple of probable ghost tasks from June and July) until 4 WAH2 tasks were downloaded at 19:07:08 UTC yesterday. The first trickle from those tasks was returned at 15:33:40 UTC today and should be credited overnight.
34) Message boards : Number crunching : No work for Windows? (Message 52990)
Posted 30 Nov 2015 by Profile Thyme Lawn
Post:
The application "WAH2 Region Independent (Weather At Home 2)" has now been added to the project preferences page.
35) Message boards : Number crunching : News and Announcements 2 (Message 52989)
Posted 30 Nov 2015 by Profile Thyme Lawn
Post:
Although there is a new batch of WAH2 work for Windows systems you'll only be able to receive a task if you have the "If no work for selected applications is available, accept work from other applications?" option enabled in your project preferences.

I've posted a technical explanation here for those who are interested.

The application "WAH2 Region Independent (Weather At Home 2)" has now been added to the project preferences page.
36) Message boards : Number crunching : No work for Windows? (Message 52974)
Posted 29 Nov 2015 by Profile Thyme Lawn
Post:
I also run BOINC as a Windows service. I've always accepted work from non-selected applications and haven't been sent any HadCM3S tasks since 27th August despite being starved of work from the project for much of that time. The server status page says there are currently 10,513 unsent WAH2 tasks, so it's extremely unlikely that you'll be sent the occasional HadCM3S resend rather than one of the WAH2 ones. Given the odds, Les's suggestion of temporarily enabling the work from other applications option seems reasonable to me.
37) Message boards : Number crunching : News and Announcements 2 (Message 52969)
Posted 29 Nov 2015 by Profile Thyme Lawn
Post:
Although there is a new batch of WAH2 work for Windows systems you'll only be able to receive a task if you have the "If no work for selected applications is available, accept work from other applications?" option enabled in your project preferences.

I've posted a technical explanation here for those who are interested.
38) Message boards : Number crunching : No work for Windows? (Message 52968)
Posted 29 Nov 2015 by Profile Thyme Lawn
Post:
Server Status: 6,592 WAH2 tasks ready to send.

Reply:

No work is available for Weather At Home 2 (wah2)

Please Check Your project preferences.......


???

ChrisD

I have "If no work for selected applications is available, accept work from other applications?" checked in my climateprediction.net specific preferences and I downloaded 4 tasks. Maybe check that option in your preferences.

You definitely need to be accepting work from other applications for now because the new work is running short application name "wah2_ri" instead of "wah2". They both have the long application name "WAH2 (Weather At Home 2)" which only appears once in the list of selectable applications. Selecting that only allows work to be fetched for "wah2" (which doesn't have any work available).

Confused? Join the club ...
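If it helps, the mismatch can be pictured with a toy model in Python. This is only an illustration of the behaviour described above, not the scheduler's actual code; the function name is made up and the task count is simply the "ready to send" figure from the quoted server status.

```python
def apps_that_can_send_work(selected, accept_other, unsent):
    """Toy model: which short application names could supply a task."""
    with_work = {app for app, count in unsent.items() if count > 0}
    preferred = with_work & selected
    if preferred:
        return sorted(preferred)
    # Nothing available for the selected applications: fall back only if allowed.
    return sorted(with_work) if accept_other else []


# Ticking the single "WAH2 (Weather At Home 2)" entry only selects "wah2".
selected = {"wah2"}
# The new batch was issued under "wah2_ri"; "wah2" itself has no work.
unsent = {"wah2": 0, "wah2_ri": 6592}

strict = apps_that_can_send_work(selected, accept_other=False, unsent=unsent)
relaxed = apps_that_can_send_work(selected, accept_other=True, unsent=unsent)
print(strict, relaxed)
```

With the option off nothing qualifies, which matches the "No work is available" reply; with it on, "wah2_ri" becomes eligible.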
39) Message boards : Number crunching : News and Announcements 2 (Message 52767)
Posted 30 Oct 2015 by Profile Thyme Lawn
Post:
The CPDN project will be taken offline for a scheduled downtime starting at 1000 UTC on Monday the 2nd November.

This is in order to perform firmware updates to the controllers of the storage array on which the CPDN servers reside. The firmware updates will correct an issue with the controllers that was a cause of the recent unscheduled downtime.

This work should not affect the climateprediction.net pages of the website, however the climateapps2.oerc portion of the website will be taken offline.

Will in OeRC support will be performing this work on the controllers of the storage array. At present we don't have an expected length of time that this firmware update will take, however we will endeavour to restore the project as soon as the updates have completed.
40) Message boards : Number crunching : CPDN SITE STILL UNRESPONIVE (Message 52760)
Posted 29 Oct 2015 by Profile Thyme Lawn
Post:
WAH2 PNW task uploads for beta are sent to http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler which is working.

All of the other beta WAH2 region uploads are sent to http://cpdnbeta.oerc.ox.ac.uk/beta_cgi/file_upload_handler and are failing.

BOINC treats uploads on the project level rather than the upload URL level, so pending WAH2 PNW uploads might never be attempted if you have uploads for other regions. You can force the PNW uploads to be done by selecting the first one and clicking the "Retry Now" button, but you'll probably have to repeat this a few times because BOINC starts with the first file which isn't backed off, tries 3 files and goes into a project backoff if they all fail.
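Here is a toy Python model of one upload pass as described above, to show why a PNW file queued behind failing uploads never gets its turn. It is a simplification for illustration only, not BOINC's real logic; the class, function and file names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Upload:
    name: str
    server_up: bool          # is this file's upload server reachable?
    backed_off: bool = False


def upload_pass(queue, batch=3):
    """Try the first `batch` files that aren't backed off.

    Returns (names attempted, whether the project goes into backoff).
    """
    candidates = [u for u in queue if not u.backed_off][:batch]
    attempted = []
    for upload in candidates:
        attempted.append(upload.name)
        if not upload.server_up:
            upload.backed_off = True  # failed transfer backs off this file
    all_failed = bool(candidates) and all(not u.server_up for u in candidates)
    return attempted, all_failed


queue = [
    Upload("eu_1.zip", server_up=False),
    Upload("eu_2.zip", server_up=False),
    Upload("eu_3.zip", server_up=False),
    Upload("pnw_1.zip", server_up=True),  # would succeed if it were tried
]

attempted, project_backoff = upload_pass(queue)
print(attempted, project_backoff)
```

In this model the three failing files trigger the project backoff before the PNW file is reached, which is why it has to be retried by hand.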


©2024 climateprediction.net