Posts by Hal Bregg

1) Message boards : Number crunching : Scheduler request too recent (Message 61897)
Posted 3 Jan 2020 by Hal Bregg
Post:
ASAP

The expectation of at least one of the researchers is that people will only download the same number of tasks as they have processor cores, and run them immediately, without stopping to run work from other projects.
Then get another bunch and do the same.



It makes perfect sense to download an amount of work that can be done within a relatively short time. I have to admit that I used to download a few days' worth of jobs for a few different projects at the same time. Now that my hardware stock has decreased and is fairly old, I try to crunch only one or two projects at a time, and I have set up BOINC Manager to download new work once WUs are completed.

You mentioned returning work ASAP. This might be tricky for people who use old hardware for distributed computing.
2) Message boards : Number crunching : Scheduler request too recent (Message 61892)
Posted 3 Jan 2020 by Hal Bregg
Post:
I recall that the project does not actually use or need the deadlines. They are there just because the BOINC environment requires one.


While true that the project does not currently use deadlines, if they were to start using them, it would enable tasks to be re-issued in time to be of use to the researchers. At the time things were set up with long deadlines, all the work was generated and processed at Oxford. Now universities from all over the world generate the work and often do need it much more quickly. These days while they might still get credit, tasks returned a year after being issued are of no use to the scientists.


So what is a desirable return time for crunched data to be useful?
3) Message boards : Number crunching : Why did I get ZERO credit for these tasks? (Message 61620)
Posted 28 Nov 2019 by Hal Bregg
Post:
Trickles were uploaded and credit was granted, like you said, today.
4) Message boards : Number crunching : Why did I get ZERO credit for these tasks? (Message 61616)
Posted 27 Nov 2019 by Hal Bregg
Post:
Ah yes I see, credit has been granted.

I did not realize that the credit was only updated on a weekly basis. Thank you for the heads up. It was not a well-known fact to me, a returning "newcomer". But now I know, so thank you.


Does it mean that a valid, completed task is given zero credit initially and the correct credit is granted later?
5) Message boards : Number crunching : Slow progress rate for HadAM4 at N216 (Message 61445)
Posted 2 Nov 2019 by Hal Bregg
Post:
I let the task run and it seems to be behaving really well. It's progressing slowly, but similar tasks are running at pretty much the same speed on another host with a slower CPU.

I keep saving the VM state regularly to avoid losing work already done.
6) Questions and Answers : Unix/Linux : No work being downloaded on Linux host (Message 61430)
Posted 30 Oct 2019 by Hal Bregg
Post:
What are the messages saying (in the Event Log) ?


Just a standard message like this:
Tue 29 Oct 2019 20:21:37 GMT | climateprediction.net | Scheduler request completed: got 0 new tasks


However, not long after I created this topic I got one task, and 2 days later another one. I suspect that the server is sometimes picky when it comes to sending new work.
7) Questions and Answers : Unix/Linux : No work being downloaded on Linux host (Message 61413)
Posted 27 Oct 2019 by Hal Bregg
Post:
I am trying to run the project on a 64-bit Linux host with 32-bit libraries installed as per this thread
https://www.cpdn.org/forum_thread.php?id=8008#54197

but no tasks are being downloaded, despite the server status showing over 2000 of them ready to send.
Is there any particular reason why the client is not downloading?

I was able to download one task on a 32-bit host installed in a VM, but the running time is rather appalling.
8) Message boards : Number crunching : Slow progress rate for HadAM4 at N216 (Message 61404)
Posted 26 Oct 2019 by Hal Bregg
Post:
Also, I kept running 2 LHC@home tasks at the same time, which use VirtualBox. However, I suspended those for about 2 hours to see whether things would get better, but didn't notice any improvement in the crunching progress of the CPDN task.

Hummm. The LHC VBox tasks will take a lot of memory, at least CMS and ATLAS. When you suspend them, if you have "leave applications in memory" enabled, they will hang around in memory. So I wouldn't run them at all.
I don't even try to run the native LHC tasks on my machines with a lot more memory and cache. Just exit LHC entirely and run CPDN for a while.

Good luck with the switch out of VBox. I have never attempted such a thing.


I treat this as an experiment. Nothing more. I once tried to run CPDN on 64-bit Linux, but either it wasn't working at all or tasks were crashing unexpectedly after some time. I might try again in the future.
9) Message boards : Number crunching : Slow progress rate for HadAM4 at N216 (Message 61402)
Posted 26 Oct 2019 by Hal Bregg
Post:
That would be over 25 days per work unit, which is a bit slow for that CPU. I would expect at least twice that fast.
Are you running other projects? Normally VBox does not exact much of a penalty, but maybe it does not work well with N216.
The caching requirements for N216 are a bit strange.


Exactly. Nearly a month of continuous crunching, which I am not capable of doing at the moment. At the current speed it would take me more than that.

Anyway, I installed a 32-bit Debian-based CLI version of Linux, and I simply save the current state of the VM before turning off VirtualBox. It should not affect the progress of the task currently being processed. Also, I kept running 2 LHC@home tasks at the same time, which use VirtualBox. However, I suspended those for about 2 hours to see whether things would get better, but didn't notice any improvement in the crunching progress of the CPDN task.

What else struck me is that I crunched a few wah2 WUs on a Windows host with an Intel Celeron 2.16 GHz on board, and I usually needed around 7 days to complete those.

One last thing: I am familiar with the announcement Les Bayliss mentioned in his post, but I did not see any clear indication of how long the task would run. The running times posted by Jean-David Beyer indicate that I might end up crunching for more than a month at the current speed.
10) Message boards : Number crunching : Slow progress rate for HadAM4 at N216 (Message 61394)
Posted 26 Oct 2019 by Hal Bregg
Post:
Hello,

I decided to run the project on 32-bit Linux installed in a VM. I dedicated only one core of an i3-2100 and 4 GB of RAM to the VM. I got one HadAM4 at N216 task, but the progress rate is really slow. After nearly 6 hours of running the project, I had completed only 0.84%. Should I expect such a long running time for this task, or is it just my host?
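
As a rough check on the figures above, the total run time can be extrapolated linearly from the progress so far. This is only a minimal Python sketch: the function name is made up for illustration, and it assumes the progress rate stays constant for the whole task, which may not hold in practice.

```python
def estimate_total_hours(elapsed_hours: float, percent_done: float) -> float:
    """Linearly extrapolate the total run time from progress so far.

    Assumes a constant progress rate (an assumption, not something BOINC guarantees).
    """
    if percent_done <= 0:
        raise ValueError("percent_done must be positive")
    return elapsed_hours * 100.0 / percent_done


# Figures from the post: roughly 6 hours elapsed, 0.84% complete.
total = estimate_total_hours(6.0, 0.84)
print(f"{total:.0f} hours, about {total / 24:.1f} days")
```

With about 6 hours for 0.84%, this works out to roughly 714 hours, or close to 30 days of continuous crunching on this host.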
11) Questions and Answers : Unix/Linux : 8.09 UK Met Office HadAM4 at N144 resolution (Message 60617)
Posted 8 Jul 2019 by Hal Bregg
Post:
Hello,

I snatched one of those tasks on my Linux host, but after 2 days of crunching the WU failed with no credit granted.

Was the WU faulty?

https://www.cpdn.org/cpdnboinc/result.php?resultid=21733569


Three trickles have been reported so I imagine you will get credit for them by end of play tomorrow when the weekly credit script has run. (assuming all goes to plan!)


The "-ve theta detected" message means the model has produced an impossible climate, for example a negative air pressure. Some tasks in a batch often push the limits of what might happen, and if they are too far out this happens. That result is still useful.


Great to hear it was nothing to do with the host. I will carry on crunching.
12) Questions and Answers : Unix/Linux : 8.09 UK Met Office HadAM4 at N144 resolution (Message 60607)
Posted 6 Jul 2019 by Hal Bregg
Post:
Hello,

I snatched one of those tasks on my Linux host, but after 2 days of crunching the WU failed with no credit granted.

Was the WU faulty?

https://www.cpdn.org/cpdnboinc/result.php?resultid=21733569
13) Questions and Answers : Unix/Linux : Error while loading libstdc++ library (Message 59793)
Posted 12 Mar 2019 by Hal Bregg
Post:
On Ubuntu 16.04 LTS this command worked to satisfy the dependencies

sudo apt-get install lib32ncurses5 lib32z1 gcc-4.7-multilib

However, Mint 19.1 is based on Ubuntu 18.04 LTS so may have different dependencies.

You may be able to substitute

lib32stdc++-6-dev

for

gcc-4.7-multilib

in the above command. If that doesn't work, the output from that command may give you an explanation of what replacement dependency is needed.

As a last resort, you can also do a

sudo ldd on the hadcm3s executables (probably 2 of them with different names) and .so file in the (probably) /var/lib/boinc-client/projects/climateprediction.net directory to see what dependencies are left to be satisfied.


The gcc-4.7-multilib package was not available for me. However, gcc-4.8-multilib was available, so I installed that as well as lib32stdc++-6-dev. I am not getting any tasks at the moment, but once I get something to crunch I will post an update.
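
The ldd check suggested above can also be scripted. The following is a minimal Python sketch, not a tested tool: the function names are hypothetical, and the sample text is illustrative output written for this example, not output captured from a real host. It relies only on the fact that ldd marks unresolved dependencies with "=> not found".

```python
import subprocess
from typing import List


def missing_libraries(ldd_output: str) -> List[str]:
    """Return the library names that ldd reports as 'not found'."""
    missing = []
    for line in ldd_output.splitlines():
        if "=> not found" in line:
            # The library name is the text before the '=>' arrow.
            missing.append(line.split("=>")[0].strip())
    return missing


def check_binary(path: str) -> List[str]:
    """Run ldd on one executable or .so file and list unresolved dependencies."""
    result = subprocess.run(["ldd", path], capture_output=True, text=True, check=False)
    return missing_libraries(result.stdout)


# Illustrative sample only (assumed format, not real output from the host in question).
sample = (
    "\tlinux-gate.so.1 (0xf7f5e000)\n"
    "\tlibstdc++.so.6 => not found\n"
    "\tlibc.so.6 => /lib/i386-linux-gnu/libc.so.6 (0xf7d00000)\n"
)
print(missing_libraries(sample))
```

On a real host, check_binary would be pointed at each hadcm3s executable and .so file in the project directory, and any names it returns are the dependencies still to be installed.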
14) Questions and Answers : Unix/Linux : Error while loading libstdc++ library (Message 59782)
Posted 11 Mar 2019 by Hal Bregg
Post:
Hello,

I have five failed UK Met Office HadCM3 short v8.34 i686-pc-linux-gnu tasks with the same error

process exited with code 127 (0x7f, -129)</message>
<stderr_txt>
../../projects/climateprediction.net/hadcm3s_8.34_i686-pc-linux-gnu: error while loading shared libraries: libstdc++.so.6: cannot open shared object file: No such file or directory


The BOINC version is 7.9.3, running on 64-bit Linux Mint 19.1 with the 4.15.0-45-generic kernel.

Does it mean a particular library is missing?
15) Questions and Answers : Windows : Task status 'Didn't need' (Message 59654)
Posted 14 Feb 2019 by Hal Bregg
Post:
Thank you for the update.
16) Questions and Answers : Windows : Task status 'Didn't need' (Message 59643)
Posted 14 Feb 2019 by Hal Bregg
Post:
Hello,

I just finished a long wah2 task. Credit has been applied, but the status of the finished task shows as 'Didn't need', and 'Run time' and 'CPU time' both show 0.

https://www.cpdn.org/cpdnboinc/result.php?resultid=21449810