climateprediction.net home page
Posts by geophi

Posts by geophi

1) Questions and Answers : Unix/Linux : *** Running 32bit CPDN from 64bit Linux - Discussion *** (Message 70636)
Posted 10 Mar 2024 by Profile geophi
Post:
There aren't any Linux tasks available to download right now if that is your issue. There really haven't been any since Feb 2023 and those were 64 bit oifs tasks. So given when your computers were active, you wouldn't have been able to download any. There should be some hadam4 type tasks (32 bit) for linux coming out this spring? We are testing some now. I'm not sure exactly when they would be coming out. That would give you your chance to test to make sure the 32bit libraries are installed on your linux PC.
2) Message boards : Number crunching : EAS batches 1001-4 (Message 70582)
Posted 1 Mar 2024 by Profile geophi
Post:
A resend I have picked up has this.

couldn't start app: Task file wah2_8.29_windows_intelx86.exe: file missing</message>
Not the virus scanner issue as it got going and has produced 7 zips gaining 407,756.40 in credit.

Task ID:1549001 The computer in question seems to be trashing everything with this though often not till several zips have been sent. Issue is there both with region independent and the older app versions.

Dave, it looks like the 8.24 crashes were either signal 11 or they didn't have a stderr. For the region independent crashes, if 8.29 doesn't exist, how does it start the task, or the next RI task it crashes, etc.? Very strange.
3) Message boards : Number crunching : Uploads not working (Message 70449)
Posted 19 Feb 2024 by Profile geophi
Post:
If you have a cc_config.xml in your boinc data directory, what value do you have for:

<max_file_xfers_per_project>N</max_file_xfers_per_project>

The default is 2, but it can be changed. If it is something larger than 2, perhaps bring it back to 2 and see if there is any relief. It wouldn't seem like it should matter, but it's something to try.

I've had several uploads go up today, and they've all maxed out my upload limit on my ISP plan (0.5 MB/sec).
4) Message boards : Number crunching : Batch 1005 WAH2 NZ region (Message 70445)
Posted 19 Feb 2024 by Profile geophi
Post:
We had errors like this before when a bunch of upload files couldn't be uploaded for a long time, and built up in the directory. It's not actually exceeding the total boinc disk space allocated, it's exceeding the the rsc_disk_bound value set for that work unit in client_state.xml

<rsc_disk_bound> </rsc_disk_bound>
Is the maximum amount of disk space your application should take up while running any given task. Includes all input, temporary and output files. Is set in bytes.

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=15160&postid=108374

We've seen this before, especially when a bunch of upload files can't be uploaded because of server problems. Maybe someone with better memory and/or a better understanding of boinc could expound on this.
5) Message boards : Number crunching : Batch 1005 WAH2 NZ region (Message 70444)
Posted 19 Feb 2024 by Profile geophi
Post:
I still have problems uploading to upload 11. I have two WUs one at 22 zip and these can't get through. I also had two more WUs failing with
<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
Disk usage limit exceeded</message>
<stderr_txt>
CPDN Monitor - Abort request from BOINC...
22:22:29 (9908): called boinc_finish(10)

</stderr_txt>
]]>


And I have almost 200 GB allocated to BOINC so there is plenty of disk space

Bernard. Just out of curiosity, how much disk space is the entire boinc data directory using?
6) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70410)
Posted 16 Feb 2024 by Profile geophi
Post:
Thank you. I am using McAfee and I can exclude files, but not directories. I excluded the .exe causing the problem, but would hate to add each new one (even an update) each time.

Yeah, wow! This thread says it all about McAfee I guess.

https://forums.mcafee.com/t5/VirusScan/How-to-exclude-folders-from-real-time-scan/td-p/658116
7) Message boards : Number crunching : couldn't start app: CreateProcess() failed. Check your antivirus. (Message 70408)
Posted 16 Feb 2024 by Profile geophi
Post:
I know how to do this in Linux, but I am ignorant about Windows 10 that is running on my other machine.
So how do I exclude folders from anti-virus in Windows 10>


If using Windows Defender,

https://www.howtogeek.com/671233/how-to-add-exclusions-in-windows-defender-on-windows-10/

If using some other AV solution, google search exclusions for that AV.
8) Message boards : Cafe CPDN : BOINC options Select Columns. (Message 70333)
Posted 8 Feb 2024 by Profile geophi
Post:
Dave. What boinc version did this pop up in?
9) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69940)
Posted 18 Oct 2023 by Profile geophi
Post:
Should our computers "automagically" connect to Jasmin now, or will that take some time?

The reason I ask is that mine is still looking at, what I believe to be the Korean serve "upload7.cpnd.org" on ip address 141.223.16.156, port 80.

Wouldn't this quote from Glenn explain why some may see the changeover faster than others? italics mine

"We've had confirmation that the security policy on the http port at the S.Korea site is blocking some connections to the upload server due to the high number of attempts. Not unsurprisingly the site does not want to open up the port, so CPDN is going to switch the upload address to the UK JASMIN site (the upload URL is just an alias and can be pointed to other machines). This should happen later today and then it'll take a day or so for the change to propagate through the nameservers. "
10) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69736)
Posted 10 Oct 2023 by Profile geophi
Post:
@Glenn

I should have said

using the Advanced search link at the top of the forum,

that is how you would get to the search I was talking about.
11) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69723)
Posted 9 Oct 2023 by Profile geophi
Post:
Looking back on the message board for

drive not specified

with Search limits set to no limit, Iain Inglis had some ideas about that error message. It goes back farther than the hadam4 models, and may have been on Windows tasks instead. My memory isn't performing too well today.
12) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69722)
Posted 9 Oct 2023 by Profile geophi
Post:
Funnily enough, I was looking over the hard fail workunits last couple of days and I've seen multiple tasks failing with that kind of error 'invalid device/drive, device not found'. I am starting to wonder if it's task related rather than just host specifc. But I'd need to trawl through the logs of all the fails after the batch to see how prevalent it is to be sure. I've never seen that kind of error with previous batches.

These have happened occasionally in the past for me with, I believe, the hadam4/h model series. The best I can figure is it happens when lots of disk writes are occurring with multiple models, like when all the models are essentially in sync with each other and saving files, or finishing the model at the same time. I haven't had one for a long time though. When I'm running one or two models at a time, I've never seen it.
13) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69664)
Posted 5 Oct 2023 by Profile geophi
Post:
Firstly, I am thinking that having threads by batch numbers might help keeping relevant posts together in one place.

With five currently running on my Ryzen7 using WINE. I am estimating about 7 days computing time for these.

While there have been no "hard fails" in this batch so far (where all 3 tasks in a work unit fail), and there is no way to view the number of individual task failures, it looks like Signal 11 failures are dominating at this point. The task on my Ryzen running Windows natively failed at the usual point with a signal 11 (segmentation fault) during the first model day. Tasks running under Wine appear to be progressing nicely.
14) Message boards : Number crunching : Ghost work units? (Message 69417)
Posted 28 Jul 2023 by Profile geophi
Post:
Back in earlier days, the cpdn server sometimes had trouble keeping up, especially when it was running the weekly credit script. So,occasionally, if tasks were reported during that credit run, the completion status was not logged/stored correctly.

For example, there were 4 tasks (marked Abandoned when I detached) that were sent to one of my computers on May 22 2020 that sent in all 4 trickles,

https://www.cpdn.org/results.php?hostid=1492829&offset=160&show_names=0&state=0&appid=33

and reported to the server on May 27 that the tasks were completed and were a success. However, the server did not record the completion report and so those tasks were no longer on my PC, but were "In progress" according to the task status on the server. When sent to my computer, these tasks had a deadline of May 4 2021. On June 4th 2020 I detached that computer from climateprediction which is when the boinc server marked the stats as "Abandoned".

Looking at one of those work units

https://www.cpdn.org/workunit.php?wuid=12017682

you can see the next task from it was sent out on June 4 2020 to a computer that completed that task. So it did not wait until the deadline, the next task from that work unit was sent back out immediately after abandonment.

If you do an advanced search on the number crunching forum going back with a search limit of "no limit" and keyword detach, or abandoned, you find some replies by WaterOakley, who is a sharp boinc user, recommending the same method for tasks that are listed by the server as in progress for a PC, but are not in the boinc manager task list for the PC.
15) Message boards : Number crunching : Ghost work units? (Message 69415)
Posted 28 Jul 2023 by Profile geophi
Post:
Noticed that I have some ghost/phantom tasks (i.e. tasks showing up in the server as "in progress") but nothing on my PC.

I guess I'll just have to let them expire in about 12 months time.

Let me know if there is a way to recover or if not recycle these tasks back to other volunteers. Let me know whom I can pm the list of ghost tasks to be recycled, if needed. These are the tasks that start with wah2_eas25*, so likely to end up with errors from what I've seen on a few of them.

Cheers.

If you detach the PC associated with these tasks, their status will go to "Abandoned" and the next task from that work unit will be ready to send out (assuming yours wasn't the last task in that work unit). You can then reattach.

Edit...Do this after you have no cpdn tasks currently running.
16) Message boards : Number crunching : New work discussion - 2 (Message 69072)
Posted 2 Jul 2023 by Profile geophi
Post:
Did you reload config file with"read local prefs file" in "Options" drop-down?

Yep.
17) Message boards : Number crunching : New work discussion - 2 (Message 69070)
Posted 1 Jul 2023 by Profile geophi
Post:
I made the 2 changes to cc_config and tried the upload again.

7/1/2023 3:20:29 PM | climateprediction.net | Started upload of wah2_eas25_a1hb_199711_25_994_012217357_2_r1373083460_restart.zip
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:  Connection 1 seems to be dead
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:  Closing connection 1
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:  schannel: shutting down SSL/TLS connection with dev.cpdn.org port 443
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:  schannel: ApplyControlToken failure: SEC_E_UNSUPPORTED_FUNCTION (0x80090302)
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:    Trying 141.223.16.156:80...
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:  Connected to upload7.cpdn.org (141.223.16.156) port 80 (#3)
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: Host: upload7.cpdn.org
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.22.2)
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: Accept: */*
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: Accept-Encoding: deflate, gzip
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: Accept-Language: en_US
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: Content-Length: 318
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server: Content-Type: application/x-www-form-urlencoded
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Sent header to server:
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:  We are completely uploaded and fine
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: HTTP/1.1 200 OK
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: Date: Sat, 01 Jul 2023 20:33:53 GMT
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: Server: Apache/2.2.3 (CentOS)
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: Transfer-Encoding: chunked
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: Content-Type: text/plain; charset=UTF-8
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server:
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: 63
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: <data_server_reply>
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server:     <status>0</status>
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server:     <file_size>8495740</file_size>
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: </data_server_reply>
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server:
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server: 0
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Received header from server:
7/1/2023 3:20:30 PM | climateprediction.net | [http] [ID#5] Info:  Connection #3 to host upload7.cpdn.org left intact
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Info:  Found bundle for host: 0x8cd3a0 [serially]
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Info:  Re-using existing connection #3 with host upload7.cpdn.org
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: Host: upload7.cpdn.org
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.22.2)
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: Accept: */*
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: Accept-Encoding: deflate, gzip
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: Accept-Language: en_US
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: Content-Length: 126698478
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: Content-Type: application/x-www-form-urlencoded
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server: Expect: 100-continue
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Sent header to server:
7/1/2023 3:20:31 PM | climateprediction.net | [http] [ID#5] Received header from server: HTTP/1.1 100 Continue
7/1/2023 3:25:37 PM | climateprediction.net | [http] [ID#5] Info:  Operation too slow. Less than 10 bytes/sec transferred the last 300 seconds
7/1/2023 3:25:37 PM | climateprediction.net | [http] [ID#5] Info:  Closing connection 3
7/1/2023 3:25:37 PM | climateprediction.net | [http] HTTP error: Timeout was reached
7/1/2023 3:25:37 PM | climateprediction.net | Temporarily failed upload of wah2_eas25_a1hb_199711_25_994_012217357_2_r1373083460_restart.zip: transient HTTP error
7/1/2023 3:25:37 PM | climateprediction.net | Backing off 04:56:14 on upload of wah2_eas25_a1hb_199711_25_994_012217357_2_r1373083460_restart.zip
18) Message boards : Number crunching : New work discussion - 2 (Message 69069)
Posted 1 Jul 2023 by Profile geophi
Post:
@Richard

The received header from server: 8495740 size is likely the the number of bytes the server thinks is uploaded so far, where it is stuck at now.
In boinc manager it is stuck at 8.16 MB of the 128.98 MB upload file (6.33%)

The “last bytes transferred” number of 8561276 converts to 8.16 MB so the client thinks it has transferred more than the server has recorded??

Yes, the boinc executable is running in wine on an Ubuntu linux host.

We’ve seen this before where a single or a few uploads get stuck and rebooting the server, or manually killing the server side process associated with that file will allow them to upload. But I thought we had very different messages in the logs when that occurred in the past.
19) Message boards : Number crunching : New work discussion - 2 (Message 69061)
Posted 30 Jun 2023 by Profile geophi
Post:
I also have a stuck file upload, this one at 6.33%. It's been stuck for a few days and below is what I get when I enable http_debug and click on Retry Now for that file in the boinc manager transfers tab. I've had no problems with all the other uploads so far.

6/30/2023 1:45:17 PM | climateprediction.net | Started upload of wah2_eas25_a1hb_199711_25_994_012217357_2_r1373083460_restart.zip
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:  Too old connection (364 seconds idle), disconnect it
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:  Connection 38 seems to be dead
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:  Closing connection 38
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:  schannel: shutting down SSL/TLS connection with www.google.com port 443
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:  schannel: ApplyControlToken failure: SEC_E_UNSUPPORTED_FUNCTION (0x80090302)
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:    Trying 141.223.16.156:80...
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:  Connected to upload7.cpdn.org (141.223.16.156) port 80 (#39)
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: Host: upload7.cpdn.org
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.22.2)
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: Accept: */*
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: Accept-Encoding: deflate, gzip
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: Accept-Language: en_US
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: Content-Length: 318
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server: Content-Type: application/x-www-form-urlencoded
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Sent header to server:
6/30/2023 1:45:18 PM | climateprediction.net | [http] [ID#21] Info:  We are completely uploaded and fine
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: HTTP/1.1 200 OK
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: Date: Fri, 30 Jun 2023 18:58:41 GMT
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: Server: Apache/2.2.3 (CentOS)
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: Transfer-Encoding: chunked
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: Content-Type: text/plain; charset=UTF-8
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server:
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: 63
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: <data_server_reply>
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server:     <status>0</status>
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server:     <file_size>8495740</file_size>
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server: </data_server_reply>
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Received header from server:
6/30/2023 1:45:19 PM | climateprediction.net | [http] [ID#21] Info:  Connection #39 to host upload7.cpdn.org left intact
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Info:  Found bundle for host: 0x9e8d80 [serially]
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Info:  Re-using existing connection #39 with host upload7.cpdn.org
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.1
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: Host: upload7.cpdn.org
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.22.2)
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: Accept: */*
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: Accept-Encoding: deflate, gzip
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: Accept-Language: en_US
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: Content-Length: 126698478
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: Content-Type: application/x-www-form-urlencoded
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server: Expect: 100-continue
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Sent header to server:
6/30/2023 1:45:20 PM | climateprediction.net | [http] [ID#21] Received header from server: HTTP/1.1 100 Continue
6/30/2023 1:50:27 PM | climateprediction.net | [http] [ID#21] Info:  Operation too slow. Less than 10 bytes/sec transferred the last 300 seconds
6/30/2023 1:50:27 PM | climateprediction.net | [http] [ID#21] Info:  Closing connection 39
6/30/2023 1:50:27 PM | climateprediction.net | [http] HTTP error: Timeout was reached
6/30/2023 1:50:27 PM | climateprediction.net | Temporarily failed upload of wah2_eas25_a1hb_199711_25_994_012217357_2_r1373083460_restart.zip: transient HTTP error
6/30/2023 1:50:27 PM | climateprediction.net | Backing off 03:34:02 on upload of wah2_eas25_a1hb_199711_25_994_012217357_2_r1373083460_restart.zip
6/30/2023 1:50:28 PM |  | Project communication failed: attempting access to reference site
6/30/2023 1:50:28 PM |  | [http] HTTP_OP::init_get(): https://www.google.com/
6/30/2023 1:50:28 PM |  | [http] [ID#0] Info:    Trying 142.251.40.68:443...
6/30/2023 1:50:28 PM |  | [http] [ID#0] Info:  Connected to www.google.com (142.251.40.68) port 443 (#40)
6/30/2023 1:50:28 PM |  | [http] [ID#0] Info:  schannel: disabled automatic use of client certificate
6/30/2023 1:50:28 PM |  | [http] [ID#0] Info:  using HTTP/1.x
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: GET / HTTP/1.1
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: Host: www.google.com
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.22.2)
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: Accept: */*
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: Accept-Encoding: deflate, gzip
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: Accept-Language: en_US
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: roject_name>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:     <name>wah2_eas25_a1hb_199711_25_994_012217357_2_r1373083460_restart.zip</name>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:     <nbytes>135193723.000000</nbytes>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:     <max_nbytes>150000000.000000</max_nbytes>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:     <status>1</status>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:     <persistent_file_xfer>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:         <num_retries>20</num_retries>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:         <first_request_time>1687927102.472574</first_request_time>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:         <next_request_time>1688163869.678019</next_request_time>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:         <time_so_far>5887.330475</time_so_far>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:         <last_bytes_xferred>8561276.000000</last_bytes_xferred>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:         <is_upload>1</is_upload>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server:     </persistent_file_xfer>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: </file_transfer>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: </file_transfers>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: </boinc_gui_rpc_reply>
6/30/2023 1:50:28 PM |  | [http] [ID#0] Sent header to server: 
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: HTTP/1.1 200 OK
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Date: Fri, 30 Jun 2023 18:50:28 GMT
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Expires: -1
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Cache-Control: private, max-age=0
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Content-Type: text/html; charset=ISO-8859-1
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Content-Security-Policy-Report-Only: object-src 'none';base-uri 'self';script-src 'nonce-g26TqV-NdFTi9ZDmZpym1A' 'strict-dynamic' 'report-sample' 'unsafe-eval' 'unsafe-inline' https: http:;report-uri https://csp.withgoogle.com/csp/gws/other-hp
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: P3P: CP="This is not a P3P policy! See g.co/p3phelp for more info."
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Content-Encoding: gzip
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Server: gws
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: X-XSS-Protection: 0
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: X-Frame-Options: SAMEORIGIN
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Set-Cookie: 1P_JAR=2023-06-30-18; expires=Sun, 30-Jul-2023 18:50:28 GMT; path=/; domain=.google.com; Secure
12/31/1969 6:00:00 PM |  | 
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Set-Cookie: NID=511=d90zWbRMfd4_ZIKvWbcbyrh41vg3goXyxQ0qasa4qz3ZCx6zE26cLxlotdkQ_3crmBCjaBN0YeA96WRcCr5etkDBGkLver2gUlaSaDdrYWTTQUmJ3LYSvruu56-e8XLitSTmIqdnoh5L_2d6us9tQUlQF_kYbGwVkqkJEAAw44Q; expires=Sat, 30-Dec-2023 18:50:28 GMT; path=/; domain=.google.com; HttpOnly
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Alt-Svc: h3=":443"; ma=2592000,h3-29=":443"; ma=2592000
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: Transfer-Encoding: chunked
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server:
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: 00000001
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: 
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: 00000001
6/30/2023 1:50:28 PM |  | 
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: 00000001
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: 
6/30/2023 1:50:28 PM |  | [http] [ID#0] Received header from server: 00000001
6/30/2023 1:50:28 PM |  | [http] [ID#0] Info:  Connection #40 to host www.google.com left intact
6/30/2023 1:50:29 PM |  | Internet access OK - project servers may be temporarily down.
[/code]
20) Message boards : Number crunching : New work discussion - 2 (Message 68988)
Posted 26 Jun 2023 by Profile geophi
Post:
As the models are failing right at the start it's almost certainly a problem with the input files. Though normally I would expect to see a floating point exception error because of bad input values rather than a segmentation violation (which means a bad memory reference). However, some bad data, say a negative pressure reference might put a -ve value in a memory reference and cause a segv.

Without seeing the process traceback and the model log file it's v difficult to know. If the CPDN server decides to give me some more tasks I'll disable networking to keep the files so I can look at them. However, all my tasks' workunits all failed so I suspect it's a bad input problem.

I'll join the CPDN technical meeting tomorrow to find out more.

I only have 3 running on my Ryzen, but they are almost through 7 model months now. Of the work units associated with these tasks, two of the work units had two SEGV failure tasks each, very early in their runs, prior to my download. The third task running on my Ryzen had a similar early SEGV task failure prior to my downloading the 2nd task from that work unit. So, if it's an input file problem, that can't be the reason for the SEGV failures in the work units my three tasks came from. The work units are:

https://www.cpdn.org/workunit.php?wuid=12217926
https://www.cpdn.org/workunit.php?wuid=12216852
https://www.cpdn.org/workunit.php?wuid=12217357

Like Dave, my Ryzen is running a version of Ubuntu, with Windows BOINC running under Wine.


Next 20

©2024 climateprediction.net