climateprediction.net home page
Posts by BetelgeuseFive

Posts by BetelgeuseFive

1) Message boards : Number crunching : New work Discussion (Message 62915)
Posted 9 Nov 2020 by BetelgeuseFive
Post:
- When I first encountered this problem I thought it was related to 32-bit library issues, but I have doubts about this now. I recently switched from CentOS 7 to Ubuntu 18.04 LTS because some of the projects I am running were dynamically linked to newer libraries than provided with CentOS 7. In these cases I would get tasks, but they would error out because of the missing dynamic libraries. Does the CPDN server has any clues as to the dynamic libraries installed on my system ? I would think not as there are too many different versions for different Linux distributions, so my guess is that it would just send out work and it would error out because of the missing dynamic libraries.

Don't know about CentOS but the instructions in the pinned post in the Linux section give instructions to install the libraries that are missing for most.

If with any distro still getting problems with missing libraries, run ldd on the executable from CPDN and it will list anything missing. (I know from experience that it can then take a bit of detective work to find the package that supplies said library.)


You are missing something here: Catch-22

If you get no tasks you also do not get the executable (you get it when the first task is assigned). It is very hard to run ldd on an executable you do not have.

This is exactly the reason why I doubt the problem is caused by missing dynamic libraries: the CPDN server has no way of knowing if the correct libraries are present.
If have seen this with several other projects: you get a task that errors out and in the error log you can usually see it was caused by missing dynamic libraries. You install the missing libraries (if they are provided by the distribution you are using !) and you check if the next task works.

Tom
2) Message boards : Number crunching : New work Discussion (Message 62898)
Posted 8 Nov 2020 by BetelgeuseFive
Post:
Betelgeuse.

What are your settings for days of work to be stored? The N216 models can easily take 15 to 20 days to complete and if BOINC thinks that you haven't allowed enough time it might not get you any work. Also are you running any other work units from other projects?


OK, I received a task now..
My settings were store at least 0.5 days of work with an additional 0.1 days of work.
Set all other projects (I am running 12 projects at the moment) to no new tasks and changed settings to 1.5 + 1.5 days.
First try after this I got a task.

A couple notes/questions related to this:

- I enabled some of the event log debug options. This resulted in a huge number of lines in the event log, but nothing useful.
- Why does CPDN not issue work when I request it ? The settings say store at least a certain amount of work, there is nothing about a maximum.
- When I first encountered this problem I thought it was related to 32-bit library issues, but I have doubts about this now. I recently switched from CentOS 7 to Ubuntu 18.04 LTS because some of the projects I am running were dynamically linked to newer libraries than provided with CentOS 7. In these cases I would get tasks, but they would error out because of the missing dynamic libraries. Does the CPDN server has any clues as to the dynamic libraries installed on my system ? I would think not as there are too many different versions for different Linux distributions, so my guess is that it would just send out work and it would error out because of the missing dynamic libraries.

Thanks everybody for the help.

Tom
3) Message boards : Number crunching : New work Discussion (Message 62878)
Posted 7 Nov 2020 by BetelgeuseFive
Post:
There is a need for 32bit libraries as these are 32bit applications. See this post for commands to install the necessary libraries for common Linux distributions...

https://www.cpdn.org/forum_thread.php?id=8916#62038


I installed the 32-bit libraries for Ubuntu 18.04 LTS:

sudo apt-get install lib32ncurses5 lib32z1 lib32stdc++-6-dev

Everything installed fine, but I am still not receiving new work (same message as before):

Sat 07 Nov 2020 10:07:22 AM CET | climateprediction.net | update requested by user
Sat 07 Nov 2020 10:07:25 AM CET | climateprediction.net | Sending scheduler request: Requested by user.
Sat 07 Nov 2020 10:07:25 AM CET | climateprediction.net | Requesting new tasks for CPU
Sat 07 Nov 2020 10:07:27 AM CET | climateprediction.net | Scheduler request completed: got 0 new tasks
Sat 07 Nov 2020 10:07:27 AM CET | climateprediction.net | No tasks sent

Tom
4) Message boards : Number crunching : New work Discussion (Message 62866)
Posted 6 Nov 2020 by BetelgeuseFive
Post:
According to the server status page there are over 600 tasks available for UK Met Office HadAM4 at N216 resolution

From what I understand these are Linux tasks, but I am not getting any work when requesting it from my Linux system.

Are there any special requirements for these tasks that I am not aware of ?

Thanks,

Tom
5) Message boards : Number crunching : Download server (Message 58790)
Posted 21 Sep 2018 by BetelgeuseFive
Post:
I am also having download problems, got 3 tasks but I am no able to download a single one. Lots of messages like this:

21/09/2018 17:24:32 | climateprediction.net | Temporarily failed download of so2dms_rcp85_N96_2049_2060.gz: transient HTTP error

Tom
6) Message boards : Number crunching : Stuck upload issue (Message 57785)
Posted 17 Feb 2018 by BetelgeuseFive
Post:
I still have two stuck uploads from the following tasks:

https://www.cpdn.org/cpdnboinc/result.php?resultid=20919737
https://www.cpdn.org/cpdnboinc/result.php?resultid=20919722

Both uploads are appr. 105 Mb in size.
One of them is stuck at 53 Mb the other one at 46 Mb.

As no one seems to be willing to look into this or tell us what to do about it I have set no new tasks for CPDN until this is resolved.

Tom

17/02/2018 10:15:07 | climateprediction.net | Started upload of wah2_cam25_a03s_200405_18_689_011368580_0_r616315434_17.zip
17/02/2018 10:15:07 | climateprediction.net | Started upload of wah2_cam25_a047_200405_18_689_011368595_0_r2068114942_7.zip
17/02/2018 10:15:29 | | Project communication failed: attempting access to reference site

17/02/2018 10:15:29 | climateprediction.net | Temporarily failed upload of wah2_cam25_a03s_200405_18_689_011368580_0_r616315434_17.zip: transient HTTP error
17/02/2018 10:15:29 | climateprediction.net | Backing off 05:16:01 on upload of wah2_cam25_a03s_200405_18_689_011368580_0_r616315434_17.zip
17/02/2018 10:15:30 | | Internet access OK - project servers may be temporarily down.
17/02/2018 10:15:34 | | Project communication failed: attempting access to reference site
17/02/2018 10:15:34 | climateprediction.net | Temporarily failed upload of wah2_cam25_a047_200405_18_689_011368595_0_r2068114942_7.zip: transient HTTP error
17/02/2018 10:15:34 | climateprediction.net | Backing off 03:50:57 on upload of wah2_cam25_a047_200405_18_689_011368595_0_r2068114942_7.zip
17/02/2018 10:15:35 | | Internet access OK - project servers may be temporarily down.
7) Message boards : Number crunching : Stuck upload issue (Message 57682)
Posted 21 Jan 2018 by BetelgeuseFive
Post:
I have two stuck uploads, one of them for over a week, the other for a couple of days:

21/01/2018 10:41:08 | climateprediction.net | Started upload of wah2_cam25_a047_200405_18_689_011368595_0_r2068114942_7.zip
21/01/2018 10:41:32 | | Project communication failed: attempting access to reference site
21/01/2018 10:41:32 | climateprediction.net | Temporarily failed upload of wah2_cam25_a047_200405_18_689_011368595_0_r2068114942_7.zip: transient HTTP error
21/01/2018 10:41:32 | climateprediction.net | Backing off 03:59:04 on upload of wah2_cam25_a047_200405_18_689_011368595_0_r2068114942_7.zip
21/01/2018 10:41:35 | | Internet access OK - project servers may be temporarily down.
21/01/2018 10:42:07 | climateprediction.net | Started upload of wah2_cam25_a03s_200405_18_689_011368580_0_r616315434_17.zip
21/01/2018 10:42:31 | | Project communication failed: attempting access to reference site
21/01/2018 10:42:31 | climateprediction.net | Temporarily failed upload of wah2_cam25_a03s_200405_18_689_011368580_0_r616315434_17.zip: transient HTTP error
21/01/2018 10:42:31 | climateprediction.net | Backing off 04:12:33 on upload of wah2_cam25_a03s_200405_18_689_011368580_0_r616315434_17.zip
21/01/2018 10:42:32 | | Internet access OK - project servers may be temporarily down.

Anything I can do about this on my side or does this need to be resolved on the server side ?

Tom
8) Message boards : Number crunching : Completed task not marked as completed (Message 57397)
Posted 26 Nov 2017 by BetelgeuseFive
Post:
The following task was completed, but it does not show as completed on the website:

https://www.cpdn.org/cpdnboinc/result.php?resultid=20862120

Any clues ?

Thanks,

Tom
9) Message boards : Number crunching : MORE FAILED DOWNLOADS (Message 56164)
Posted 6 May 2017 by BetelgeuseFive
Post:
I don't think the problem is that the files are not there. The problem seems to be that the server cannot be found:

06/05/2017 10:00:39 | climateprediction.net | Temporarily failed download of waterfix.ancil.be.32.gz: connect() failed

Maybe after all the recent changes something is wrong with the DNS settings ?
10) Message boards : Number crunching : HadCM3 short errors (Message 52007)
Posted 30 May 2015 by BetelgeuseFive
Post:
Alan, While most of my "No Resubmission" tasks are the 1980s batch, a few are not, so it is necessary to check them all.


I don't know if I am referring to the same problem here, but I recently had a number of tasks that failed with an 'out of memory' message. Same message reported by other tasks for the same workunit.
Examples:

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/workunit.php?wuid=9406431
http://climateapps2.oerc.ox.ac.uk/cpdnboinc/workunit.php?wuid=9413766
http://climateapps2.oerc.ox.ac.uk/cpdnboinc/workunit.php?wuid=9408774

The last one in this list was sent out again yesterday. Seems like a waste of resources ...

Tom




©2024 climateprediction.net