Message boards :
Number crunching :
HADAM3P not getting credits
Message board moderation
Author | Message |
---|---|
Send message Joined: 20 Dec 05 Posts: 14 Credit: 277,045 RAC: 0 |
I have been running climate CPDN for many years without significant issues. In April 5 HADAM3P Australia New Zealand v6.10 tasks were downloaded. Since then no credits have been allocated. One task has has failed and the other 4 have over 9 hours of elapsed time. I run other projects on the same machine and they are fine. |
Send message Joined: 7 Aug 04 Posts: 50 Credit: 548,730 RAC: 0 |
Hi Nigel, The ANZ tasks are a bit slower than others, presumably because they are more complex. The model I started a couple of days ago on a 2.5 GHz machine took 22hours to get to the first trickle, 6.8 sec/Ts where other tasks have been around 2 secs/Ts. They also only have 12 trickles and it needs to get to a trickle point to have credits awarded. If the model looks like it running fine then it probably is, but after 9 hours, it isn't far enough along to have trickled yet, therefore no credits yet, Patience is the key. |
Send message Joined: 20 Dec 05 Posts: 14 Credit: 277,045 RAC: 0 |
Hi Ray, Thanks for the confidence booster. It just seemed after a month and a half there should have been some credits. The most advanced is at 11:42:19 elapsed and 143:08:46 so should be almost there. I'll give it a while longer. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
On my machines, a complete model takes about 110 hours, a bit over 4 days. If yours are running that slowly, you're either not leaving the computer on for very long each day, you haven't given cpdn as much priority as your other projects, or there's something wrong with the computer. |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,187,965 RAC: 6,888 |
The task which failed on 3 May shows a huge number of "Suspend request from BOINC..." in stderr.txt You might perhaps review your BOINC preferences, so that the climate models have a clear run without being constantly interrupted. |
Send message Joined: 20 Dec 05 Posts: 14 Credit: 277,045 RAC: 0 |
The HADAM3P Australia New Zealand models seem to be absolutely huge. presumably that's why the deadline is 21st March 2015. This is running on a MAC which I also use for video editing so I need the resource hence the suspend events. The machine is turned off quite a bit as I also use other machines. I'll just let the tasks run and see what happens. |
Send message Joined: 20 Dec 05 Posts: 14 Credit: 277,045 RAC: 0 |
Hi! I now have a task over 9% complete and no credit. Seems that something is not working. Are there any logs I can check? I also had a failed task with 23,375.33 cpu sec and no credit. It seems like there is a problem here. I'm not too fussed about my credit but it could put off some people!!! |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The log to check is the Events log, which will have a message every time there's a scheduler request to upload a trickle_up file, which is the start of the credits process. Press: Control + shift + E to open this log. You could also read through some/all of this thread, which is all about the credits issue, although it's starting to drift a little. It's called: Credit updates?, and is just below your thread. |
Send message Joined: 20 Dec 05 Posts: 14 Credit: 277,045 RAC: 0 |
Hi Les, Event log does not mention trickles and is not much help. Is anyone else geting credit on this model or has someone forgotten to add trickles to it? :-) Thanks for the thread reference but that seems to be about external stats sites. I'm looking at the stats from boinc manager and the cpdn website.I'm assuming they are pretty authoritative. I will just let these tasks run and see what happens. |
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
hi Nigel, please copy and paste your event log to a pastebin and post the URL here - or in a private message to Les, if you think here is too public. The next step turn on debugging in your cc_config.xml - searching this message board via google ("site:climateapps2.oerc.ox.ac.uk cc_config.xml debugging"))ought to provide you with several sets of instructions on how to do that. |
Send message Joined: 22 Feb 06 Posts: 484 Credit: 29,602,471 RAC: 2,231 |
Trickles on the HADAM3P models appear to be less frequent than on the full coupled models. Typically first trickles on anz models are round about 8 - 9% completion rather than 2.5% for a full coupled model. As for credits..... |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,187,965 RAC: 6,888 |
Trickles on the HADAM3P models appear to be less frequent than on the full coupled models. Typically first trickles on anz models are round about 8 - 9% completion rather than 2.5% for a full coupled model. Err, yes. A full coupled model is simulating 40 years of weather. It reports in via a trickle once every year, or at each 2.5% increment. An AM3P model is simulating one year of weather, and reports in via a trickle once every (simulated, idealised, 30-day) month. So, a trickle every 100/12, or 8.333% increment. Those are exact figures: no 'typical' or 'appears' about it. If you care to dig a little deeper, you can see that for coupled models: First trickle at timestep 25,920 Last trickle at timestep 1,036,800 and for AM3P (ANZ) models: First trickle at timestep 11,819 Last trickle at timestep 138,539 |
Send message Joined: 15 May 09 Posts: 4342 Credit: 16,501,246 RAC: 5,648 |
If you are turning the computer off every day the log is certainly be default cleared and a blank one is started when BOINC starts up again. Don't know if this is relevant or not. |
Send message Joined: 1 Jan 07 Posts: 943 Credit: 34,187,965 RAC: 6,888 |
If you are turning the computer off every day the log is certainly be default cleared and a blank one is started when BOINC starts up again. Don't know if this is relevant or not. And logs for the previous session (and before that, and before that...) can be found in the files 'stdoutdae.txt' and 'stdoutdae.old' in the BOINC data directory. |
Send message Joined: 20 Dec 05 Posts: 14 Credit: 277,045 RAC: 0 |
Task finished with computing error! No more tasks to run at present. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
ALL of your tasks are failing. Possibly because you've left the setting: Suspend work if CPU usage is above 0 means no restriction at the default of 25%. Which is fine for other projects, but NOT here. |
©2024 climateprediction.net