climateprediction.net home page
Posts by Drago75

1) Message boards : Number crunching : Completing a WU? Impossible. What am i doing wrong? (Message 70023)
Posted 31 Oct 2023 by Drago75
Post:
This may have been answered already in another thread, but my question fits this subject. While crunching, the WUs send intermediate progress checkpoints back to your server. I believe they are referred to as "trickles", for which we are awarded credit. If a WU fails after such a trickle has been saved, is that WU sent out to another volunteer to continue from that point, or does it start from scratch?

Tom
2) Message boards : Number crunching : OpenIFS Discussion (Message 68617)
Posted 22 Mar 2023 by Drago75
Post:
This has probably been raised on a number of occasions, but it still puzzles me, so I would like to ask again. The project has 45,600 active work units which never seem to finish. Over the past few months I noticed that the majority of work is being completed within 10-14 days. Wouldn't it be a good idea to reduce their deadline to less than 4 weeks? Maybe even to 14 days? Those WUs run for approx. 18-24 hours and they don't seem to like being paused. The only real way to run them is either continuously or by interrupting them by sending the PC to standby. Either way, once started they should finish within days. When I look at the WAH units, they still allow one year for completion. If a calculation run takes that long, it isn't any faster than the real weather outside. So if the project's aim is to predict the weather of the future, don't the scientists need the data as quickly as possible? There seem to be a lot of crunchers here who would be willing to process more data but don't get enough work.
3) Message boards : Number crunching : no credit awarded? (Message 66584)
Posted 25 Nov 2022 by Drago75
Post:
ok, thanks Dave
4) Message boards : Number crunching : no credit awarded? (Message 66580)
Posted 25 Nov 2022 by Drago75
Post:
For the last nine N144 tasks I uploaded, I haven't received any credit so far. Here is one example: task no. 22240066.
Is that normal? On previous tasks I received credit in several stages according to completion, but this time none so far.
5) Message boards : Number crunching : New work discussion - 2 (Message 66451)
Posted 15 Nov 2022 by Drago75
Post:
Thanks, guys, for the input about the restart problem. I shut my hosts down because I like to run them on solar power if possible. For now I will try to make sure that each task runs at least 2 minutes after the last checkpoint, so that it is written to the SSD properly.
6) Message boards : Number crunching : New work discussion - 2 (Message 66413)
Posted 15 Nov 2022 by Drago75
Post:
I am getting a lot of invalid units. They produce a calculation error somewhere along the way. That usually happens when I restart my hosts in the morning. In the evening I always pause all work, then wait 30 seconds before I shut them down to make sure all data is written to the SSD correctly. The next morning I get some aborts. This happens on two AMD hosts running Linux Mint 20 and Ubuntu 20. I presume there is some issue with the checkpoints. Did anybody else notice that, too?
7) Message boards : Number crunching : New work discussion - 2 (Message 66357)
Posted 11 Nov 2022 by Drago75
Post:
My R9 is crunching 32 N144 units now. I noticed that checkpoint times seem somewhat erratic and are sometimes hours apart. Does anybody know their checkpoint pattern?
8) Questions and Answers : Unix/Linux : What does: "negative theta detected" mean? (Message 66279)
Posted 30 Oct 2022 by Drago75
Post:
ok thanks...
9) Questions and Answers : Unix/Linux : What does: "negative theta detected" mean? (Message 66274)
Posted 30 Oct 2022 by Drago75
Post:
My Ubuntu 20.04 laptop (R7-5800H with 16 GB) ran a single HadSM4 N144 unit, which errored out after six minutes with the following report. Was this due to missing libraries, which I have since installed?

<core_client_version>7.16.6</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)</message>
<stderr_txt>

Model crashed: ATM_DYN : NEGATIVE THETA DETECTED.
Model crashed: ATM_DYN : NEGATIVE THETA DETECTED.
Model crashed: ATM_DYN : NEGATIVE THETA DETECTED.
Model crashed: ATM_DYN : NEGATIVE THETA DETECTED.
Model crashed: ATM_DYN : NEGATIVE THETA DETECTED.
Model crashed: ATM_DYN : NEGATIVE THETA DETECTED.
Sorry, too many model crashes! :-(
13:13:23 (8405): called boinc_finish(22)

</stderr_txt>
]]>




©2024 climateprediction.net