Posts by marmot

1) Message boards : Number crunching : no credit awarded? (Message 68551)
Posted 3 Mar 2023 by marmot
Post:
It's been many years since I managed CPDN WU's.
Moved my Linux AntiX VM onto a server to let it do some climate modeling work.

And I'm still puzzled because the last conversation I had with Andy was that only trickles (for OpenIFS) are awarded credit, not completion. But I see what you mean. Richard's offered to take a closer look. Next time there's a tech meeting I'm in I'll bring it up, it will be more effective that way than myself or the moderators sending emails.


I have 4 OpenIFS marked valid, the trickles were all uploaded successfully (according to their log and my client event log), yet they all have 0 credit.
I'm not sure how to tell whether the WU's were only partially completed and the rest of the work went to another machine.
Since most of these lines show less than 100%, I'm guessing the model didn't complete and was moved on:
STATS FOR ALL TASKS
 NUM ROUTINE                                     CALLS  MEAN(ms)   MAX(ms)   FRAC(%)  UNBAL(%)
   0 CNT0     - COMPLETE EXECUTION                   1 ********* *********    100.00      0.00
   1 CNT4     - FORWARD INTEGRATION                  1 ********* *********     99.98      0.00
   8 SCAN2M - GRID-POINT DYNAMICS                 3200   14521.4   14521.4     43.02      0.00
   9 SPCM     - SPECTRAL COMP.                    2952    1842.4    1842.4      5.03      0.00
  10 SCAN2M - PHYSICS                             2953    9882.9    9882.9     27.02      0.00
  11 IOPACK   - OUTPUT P.P. RESULTS                247    6811.2    6811.2      1.56      0.00
  12 SPNORM   - SPECTRAL NORM COMP.                126      82.3      82.3      0.01      0.00
  13 SCAN2M - RADIATION CALC.                      985   82359.3   82359.3     75.10      0.00
  14 SUINIF                                          1   14351.2   14351.2      0.01      0.00
  17 GRIDFPOS IN CNT4                              247     362.0     362.0      0.08      0.00
  18 SUSPECG                                         1    3399.0    3399.0      0.00      0.00
  19 SUSPEC                                          1    3468.4    3468.4      0.00      0.00
  24 SUGRIDU                                         1    7905.6    7905.6      0.01      0.00
  25 SPECRT                                          1    1461.0    1461.0      0.00      0.00
  26 SUGRIDF                                         1    1516.0    1516.0      0.00      0.00
  27 RESTART FILES - WRITING                       123   13675.8   13675.8      1.56      0.00
  28 RESTART FILES - READING                         1       0.0       0.0      0.00      0.00
  29 SU4FPOS IN CNT4                               247       1.4       1.4      0.00      0.00
  30 DYNFPOS IN CNT4                               247   17375.5   17375.5      3.97      0.00
  31 POSDDH IN STEPO                                13      36.4      36.4      0.00      0.00
  37 CPGLAG   - SL COMPUTATIONS                   2953  -53919.1       0.0      0.00    147.40
  38 WAM      - TOTAL COST OF WAVE MODEL          2952   23517.5   23517.5     64.27      0.00
  39 SU0YOMB                                         1    1564.1    1564.1      0.00      0.00
  51 SCAN2M   - SL COMM. PART 1                   2953      59.5      59.5      0.16      0.00
  54 SPCM     - M TO S/S TO M TRANSP.             2952     367.6     367.6      1.00      0.00
  55 SPCIMPF  - S TO M/M TO S TRANSP.             2952      82.1      82.1      0.22      0.00
  56 SPNORM   - SPECTRAL NORM COMM.                126       1.3       1.3      0.00      0.00
 102 LTINV_CTL   - INVERSE LEGENDRE TRANSFORM    10094    1333.5    1333.5     12.46      0.00
 103 LTDIR_CTL   - DIRECT LEGENDRE TRANSFORM      6152    1427.9    1427.9      8.13      0.00
 106 FTDIR_CTL   - DIRECT FOURIER TRANSFORM       6152     228.6     228.6      1.30      0.00
 107 FTINV_CTL   - INVERSE FOURIER TRANSFORM     10094     233.5     233.5      2.18      0.00
 140 SULEG       - COMP. OF LEGENDRE POL.            2     127.7     127.7      0.00      0.00
 152 LTINV_CTL   - M TO L TRANSPOSITION          10094      59.8      59.8      0.56      0.00
 153 LTDIR_CTL   - L TO M TRANSPOSITION           6152      64.6      64.6      0.37      0.00
 157 FTINV_CTL   - L TO G TRANSPOSITION          10094      78.6      78.6      0.73      0.00
 158 FTDIR_CTL   - G TO L TRANSPOSITION           6152      65.8      65.8      0.37      0.00
 400 GSTATS                                     589499       0.0       0.0      0.00      0.00
 401 GSTATS HOOK                                564603       0.0       0.0      0.00      0.00
TOTAL MEASURED IMBALANCE =       0.0 SECONDS,  0.0 PERCENT
TOTAL WALLCLOCK TIME   108019.4 CPU TIME  503376.7 VECTOR TIME   503376.7


From Richard's last comment, I'm guessing the new credit script failed to catch these 4 WU's, or the scripts haven't run since they completed about 11 hours ago.

I see nothing abnormal about this run, so rather than starting a new thread, I guess this should be added to this conversation.
https://www.cpdn.org/result.php?resultid=22315582

--------
For Glenn Carver:
My BOINC credit was awarded coins, which translated into $1800 in fiat dollars that bought 3 used rack servers, which made me a more productive member of the BOINC community. Proof of work coins are still viable and the electricity goes into actual science work (and heating my home), like finding primes, but not currently climate research... which is a shame.
All labor that human hands and minds do must become paid work as AI rises to take over more duties, and we may eventually need to "pay" the AI's so human wages can compete with their "wages". Their wages will need to go to charities, or to fund basic monthly income for humans, as they take over more employment. Chat bots are already making inroads into help desk duties. Wealth disparities can lead societies to civil wars https://phys.org/news/2014-06-rich-poor-gap-civil-war.html and the disparities are growing, and that's not just the pandemic's effect.

So yeah, I at least want a cookie, or some credit, for my time spent on these WU's.
And great, you all found some people willing to pay for the modeling services.
If they are paying then send some of those funds our way because managing 400-800 BOINC computing cores is human work, not an AI's, yet... I'm 60, with a physics degree, yet looking at never being able to retire, and needing to work till I die.

And if you think my anger isn't appropriate, then stop making disparaging comments about users who like to get simple tokens of credit for the time, which is worth money, spent running your research... It's like I tell believers: "If you don't want your religion criticized then don't bring it up".
Do not tell us we should not even worry about getting credit.
We deserve credit and we also deserve cash for our labor.
2) Message boards : Number crunching : Move tasks between computers (Message 63152)
Posted 21 Dec 2020 by marmot
Post:
Since they are in a VM, that is your best situation for moving them, even to another box, as the VM is divorced from almost all specific hardware drivers and emulates the video, hard drive and NIC.

I've successfully moved a BOINC VM to another machine.
I have also successfully moved a hard drive with a Windows 7 install (it can be tricky and may require editing the registry to eliminate hard drive controller entries) to another machine and had BOINC continue from where it left off (CPU-only tasks).

Going to attempt a Windows X HD move (new motherboard, other died) but the WU's are long expired.
3) Message boards : Number crunching : Welcome back/checking if everything is working? (Message 63145)
Posted 20 Dec 2020 by marmot
Post:


https://www.climateprediction.net should be back up and will hopefully resolve any problems connecting a computer to it.


That laptop is communicating with that domain now. Thanks.

I could have edited the project .xml and changed it to cpdn.net to get an immediate fix but I did a normal install (instead of dropping all of BOINC in the user HOME directory) and so the files with the domain were locked. The root account could have taken ownership temporarily ... but I got lazy.

Gonna wait for a few days as the new Mint 19.3 install dropped into fallback mode running gaia@home. Maybe one of the widgets for CPU, temps or processes on the task bar crashed the OS (it didn't seem like an overheating issue at 65C). These CPDN WU's demand days of stability from what I've read.

Mint 20 is out (20.1 in beta) but for some reason Wine absolutely failed to function.
4) Message boards : Number crunching : Welcome back/checking if everything is working? (Message 63127)
Posted 18 Dec 2020 by marmot
Post:
Dedicating this Dell Latitude laptop to CPDN WU's.
Installed Linux Mint 20 and have run into an issue: climateprediction.net is inaccessible, so I continuously get 'project communication failed' (although I was able to connect to my account using the cpdn.org link).
It's not just my issue:
https://isdown.me/www.climateprediction.net reports the website is down.

This is the most recent thread a search turned up where a user mentioned this issue, so I posted here instead of starting a new thread.
5) Message boards : Number crunching : WU won't upload (Message 61607)
Posted 25 Nov 2019 by marmot
Post:

2: That is a hadcm3s.
Up until recently, when the problem was found and fixed, that type of model had a fault in the trickle_up code, whereby the 1st trickle was generated with the correct info, but all subsequent trickles were identical.
So the server code would get the 2nd (and all others) trickle, compare it with what it already had, then discard it as a duplicate.
So people only got credit for one trickle for this type of model.
And, as you've already received that, you won't be getting more credit.
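
The duplicate-discard behaviour described in the quote above can be illustrated with a minimal sketch. This is not the actual CPDN server code; the function name and the payload strings are made up purely to show why a client that keeps resending the first trickle would only ever be credited once.

```python
def credited_trickles(trickles):
    """Return the trickles a duplicate-discarding server would keep.

    Hypothetical illustration only: each trickle is modelled as a string
    payload, and any payload already seen is thrown away.
    """
    seen = set()
    kept = []
    for payload in trickles:
        if payload in seen:
            continue  # identical to a stored trickle: discarded, no credit
        seen.add(payload)
        kept.append(payload)
    return kept


# Faulty client: every trickle after the first repeats the first payload.
faulty = ["timestep=1", "timestep=1", "timestep=1"]
# Fixed client: each trickle carries new information.
fixed = ["timestep=1", "timestep=2", "timestep=3"]

print(len(credited_trickles(faulty)), len(credited_trickles(fixed)))
```

With the faulty payloads only one trickle survives, which matches the "credit for one trickle" outcome described in the quote.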



After aborting, likely because of the above error and subsequent patch, the WU is marked completed.

21710685	11381390	15 Jun 2019, 6:10:47 UTC	21 Nov 2019, 8:00:36 UTC	Completed	779,944.57	779,508.50	3,111.26	UK Met Office HadCM3 short v8.34 windows_intelx86
6) Message boards : Number crunching : WU won't upload (Message 61594)
Posted 22 Nov 2019 by marmot
Post:

1: The so called deadline is just a number to keep BOINC happy. It's NOT when the task is required. Which is: ASAP.


If required return is ASAP then maybe lessen the deadline to 10-20 days from over 200 days (IIRC).
People actually USE the deadline to make decisions when manually prioritizing WU's.

I would have let the WU complete or aborted it before shutting that machine down for 3 months during the summer (climate crisis: a/c to cool servers in a home when we're in the hottest global streak of years on record is just species' suicide).
7) Message boards : Number crunching : no new work units (Message 61588)
Posted 21 Nov 2019 by marmot
Post:
"even though it's set to additional 0.01 days of work (which should equate to every 14 minutes and appears to be the smallest increment BOINC accepts in that field)."

I thought that "set to additional days work" (actually "Store additional Days work" in my BOINC manager version), enabled you to download and store additional work units for your computer to crunch on future days.

Surely it has nothing to do with how frequently the requests for work are made - which is controlled by CPDN, not the BOINC Manager, to no more frequently than one hour as Les pointed out.



I asked at the main BOINC forums how to get boinc.exe to request tasks at its fastest rate and setting additional work to 0.01 was the response. You can check the forums; you'll likely find my handle 'marmot' in the search.

The other way to increase the frequency of requests was to exert control over the work cache so that it holds very few WU's: setting resource share to 0 on each project (some don't accept less than 1, and some projects are still so aggressive that their WU's dominate all other projects vying for spots) and, for projects with newer BOINC server software that have implemented the option, explicitly setting the number of downloaded WU's.

Local app_config.xml settings of <max_concurrent> or <project_max_concurrent> aren't reported to the server and can worsen cache flow, as the project sends enough work to fill all available cores yet only gets to run the max_concurrent number at once. Resource share 0 can be a crucial supplement to max_concurrent settings when you are assigning cores to multiple projects.
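
For anyone who hasn't seen one, here is a minimal sketch of the kind of app_config.xml being discussed, generated with Python's standard library. The app name "example_app" and the two limits are placeholders I made up; substitute the real application name from the project. The helper function name is also just for illustration.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name, max_concurrent, project_max_concurrent):
    """Build the text of a BOINC app_config.xml with two concurrency limits."""
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name
    # Limit on simultaneously running tasks of this one application.
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    # Limit on simultaneously running tasks across the whole project.
    ET.SubElement(root, "project_max_concurrent").text = str(project_max_concurrent)
    return ET.tostring(root, encoding="unicode")


# "example_app", 2 and 4 are placeholder values, not recommendations.
xml_text = build_app_config("example_app", 2, 4)
print(xml_text)
```

The output would be saved as app_config.xml in that project's directory. As noted above, these limits only affect what the client runs, not how much work the server sends.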
8) Message boards : Number crunching : WU won't upload (Message 61586)
Posted 21 Nov 2019 by marmot
Post:
WU was aborted.

Thanks for the answer.
9) Message boards : Number crunching : no new work units (Message 61501)
Posted 9 Nov 2019 by marmot
Post:


For Windows work, there's a "window of opportunity" which lasts for around 30 to 90 minutes, because of the huge numbers of Windows machines waiting for work.
Then it's back to waiting for a few weeks.


So we need to have a machine that has no work units running at all, CPDN the only project accepting work and set to only request particular WU's to have a shot at WU's we've personally never crunched before.

Even if I use 0 resource share on all the other projects and there is 1 WU running per core, and no WU's in queue, BOINC slows the request for work down to every 60 minutes even though it's set to additional 0.01 days of work (which should equate to every 14 minutes and appears to be the smallest increment BOINC accepts in that field).
I've played with leaving 1 or 2 of 8 cores open and the request rate is still slowed but maybe my brain is misremembering that test. I should try it again and see if the requests are every 14-15 minutes with 2 cores always open (only possible on a project with server controlled number of WU's downloaded).
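
The 14 minute figure is just a unit conversion of the 0.01 day setting, sketched below. Whether the client really turns that buffer size into a request interval is my assumption from the forum answer, not something this snippet demonstrates; the function name is made up.

```python
MINUTES_PER_DAY = 24 * 60


def buffer_days_to_minutes(days):
    """Convert a work-buffer setting given in days into minutes."""
    return days * MINUTES_PER_DAY


minutes = buffer_days_to_minutes(0.01)
print(f"0.01 days is about {minutes:.1f} minutes")
```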
10) Message boards : Number crunching : WU won't upload (Message 61499)
Posted 8 Nov 2019 by marmot
Post:
BOINC has been retrying the upload of the results on https://www.cpdn.org/result.php?resultid=21710685 for 10 days now.

Restarted the BOINC install but every other thing to try (reset project, detach/reattach) will lose the WU.

The WU is not past its deadline but sat idle during the summer while the computer was down for the hot months.
11) Message boards : Number crunching : Free-DC reports negative credits today for CPDN (Message 60419)
Posted 24 Jun 2019 by marmot
Post:
Any idea what happened with the export?

My CPID shows -19,129 today.
12) Questions and Answers : Getting started : Avatar issue (Message 60240)
Posted 29 May 2019 by marmot
Post:

And personally, I think that this board looks a lot cleaner without them.



You're missing out.

Majestic Alpine Marmot Surveys His Alpine Meadow painting is beautiful.

Woah, your security even broke IMG tag posting.

OK, an 8k JPG on IMGUR looks horrible because of magnification pixelation.

URL link https://imgur.com/MUW9i3I
13) Message boards : Number crunching : Error 22 on machine that successfully ran same WU type in April. (Message 60233)
Posted 28 May 2019 by marmot
Post:
Credits are awarded each time data is received from a task (unlike other projects, which require completed tasks). Your task apparently failed to report the first reporting point. Sorry about that - we all lose some minutes of processing that way...



I understand.

How hard would it be to write a script that recognizes "Model crashed: ATM_DYN : INVALID THETA DETECTED", awards a base 100 credits for the failed model, then lists these WU's as invalid?
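
A rough sketch of the idea, assuming the script only has the task's stderr text to work with. The function name, the status labels and the 100 credit constant are all hypothetical; this is not how the CPDN credit scripts actually work.

```python
CRASH_MARKER = "Model crashed: ATM_DYN : INVALID THETA DETECTED"
BASE_CREDIT = 100  # the flat amount suggested in this post, not a real CPDN value


def classify_failed_task(stderr_text):
    """Return (status, credit) for a failed task based on its stderr output."""
    if CRASH_MARKER in stderr_text:
        # Instability from the starting values, not a host fault:
        # mark the task invalid but still grant the base credit.
        return ("invalid", BASE_CREDIT)
    # Anything else stays an ordinary error with no automatic credit.
    return ("error", 0)


sample = (
    "Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048\n"
    "Sorry, too many model crashes! :-(\n"
)
print(classify_failed_task(sample))
```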

Guess the researchers are getting their Invalid Theta percentages, and scrutinizing other various failures, from a separate script that gathers statistics on all failed and invalid WU's.

It's just a thought from the standpoint that people getting the error won't waste time at helpdesk diagnostics trying to discover some issue with their machines. Minimal credit and marked invalid; people might just say "huh, that's odd" and not bother the helpdesk staff (like I did).
14) Message boards : Number crunching : Daily scores = more crunchers (Message 60232)
Posted 28 May 2019 by marmot
Post:
Ok, then I'll leave.


BYE BYE


Andy was working on an alternative script that wouldn't use so many resources but I have heard nothing of that since the big crash a few months ago.

.
.
.

A bit like Brexit, whatever happens, many will not be happy! Personally I think the current system is acceptable. When other important work is completed, it may be that Andy has time to return to work on the new script but until/unless that happens I doubt if there will be any change.


That alternate script would be appreciated.

I'd love to crunch this regularly; this was my first project when joining BOINC in 2007 after leaving SETI@Home in 1999. As of right now I've compromised: I maintain 300 magnitude for GRC as a way to pay for the other projects that aren't on our whitelist. The donated time went to RakeSearch, Primegrid and, in a vanity goal, an attempt to get my name on the 180 list at WUProps by giving 100 hours to a bunch of projects' WU's.

Hot season approaches and our local utility still has some gas and coal fired plants, so I'm shutting down machines rather than using A/C. Crypto currency shouldn't hasten climate change; and when it uses electricity, it should heat homes and provide useful information rather than attempting to crack senseless codes.

Aurum's attitude doesn't reflect the entire GRC community. We chose to focus on GRC, instead of other cryptocurrencies, because the science is important. Many off topic discussions, in our chat rooms, are about the advancements in science.
Many of us recognize an oncoming job loss to AI, and an ever widening world-wide wealth gap, and see that our managing machines for science has value. We should get paid for our work and are building a method to do so.

I see the woman heating her house in Siberia with computers grinding for BitCoin as a wasted opportunity, since she's gained heat and some money but the community didn't gain any advancement in scientific knowledge.

https://qz.com/1117836/bitcoin-mining-heats-homes-for-free-in-siberia/
15) Questions and Answers : Getting started : Avatar issue (Message 60229)
Posted 28 May 2019 by marmot
Post:
Still no success today after letting the server clear its cache.

Avatar won't upload and no error message; just a blank white screen.
16) Questions and Answers : Getting started : Avatar issue (Message 60215)
Posted 27 May 2019 by marmot
Post:
My avatar had been up here for several years and I noticed today it was black and white. Deleted it and tried to upload a color jpg, 87x99, <4kb, and all I get is a blank, white website page when clicking update.

The page is:

https://www.cpdn.org/cpdnboinc/edit_forum_preferences_action.php

So I can't successfully update the avatar and my old one is gone.
17) Message boards : Number crunching : Error 22 on machine that successfully ran same WU type in April. (Message 60214)
Posted 27 May 2019 by marmot
Post:
Just have to add that there was no error on the user-client side. Nothing the BOINC user has any control over.

The work unit is at worst invalid because of a failure in the data set. And even the failure of the model because of initial conditions is something learned. A failed experiment can still teach the scientist something about their research.

As such, these work units should complete as invalid WITH credit given.

The WU did take up minimum 30 minutes of a slot that another project could have had reserved time for.

Worked on 170+ different work units now and can't remember another WU end as a 0 credit error because the calculation ended in a null result.
This would be akin to assigning 0 credit because we didn't find a prime number in a SRBase search.
18) Message boards : Number crunching : Error 22 on machine that successfully ran same WU type in April. (Message 60196)
Posted 22 May 2019 by marmot
Post:
Model crashed: ATM_DYN : INVALID THETA DETECTED


They now know that the starting values used in that model run lead to an instability.

So it's an error from data set variable starting values.


This seemed to be a configuration error, but if this error can occur from data set conditions, then all is fine and just keep crunching.
<![CDATA[
<message>
The device does not recognize the command.
(0x16) - exit code 22 (0x16)</message>
<stderr_txt>



This particular WU is hard to get and it's disappointing that it ended so quickly. Thanks for the response.
19) Message boards : Number crunching : Error 22 on machine that successfully ran same WU type in April. (Message 60186)
Posted 22 May 2019 by marmot
Post:
The only change to the machine is an added RX 550 and it's downclocked from 24x to 8x due to warming weather.
No OS changes but the AMD driver.
Running at 31.5 GB commits of 32GB RAM and 28.7GB occupied private/working.
All 32 threads in use.
Plenty of free disk space.
Currently also running, and turning in valid results for, Amicable Numbers(GPU+cores), Sixtrack (LHC), RakeSearch and SRBase long.

Did some requirement change for this WU type?

From machine: https://www.cpdn.org/cpdnboinc/results.php?hostid=1347460
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
(0x16) - exit code 22 (0x16)</message>
<stderr_txt>

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
02:12:54 (1396): called boinc_finish(22)

</stderr_txt>
]]>
20) Message boards : Number crunching : Total Credit (Message 53833)
Posted 27 Mar 2016 by marmot
Post:
@Dave Jackson
As I said a few posts earlier, Andy thinks it is the pre-22011 credit that has gone, as you have been crunching since well before that it has hit you. If it isn't fixed soon, because of the weekend and public holidays, not much is likely to change till Tuesday.


I missed that. The amounts lost do seem to be about all the credit my old machine got in 2005-2008. Will see what happens later this week. My machines are focused on other projects now.

Are you all going to consider sending some kind of summary notification to our clients detailing what is the issue with credit so that people out there are updated on this issue?


@Iain Inglis

These WU are responsible for 70 to 95% of the power usage on any machine that they are run upon. It's not MARGINAL at all.


True enough, but in winter time you won't need to use the heaters so much, since the PC's heat your rooms :) And the research is valuable. I think it was marginal many years ago? Back when Athlon64's were the thing?

I read somewhere that a search on google uses as much electricity as a 60 watt light bulb does for 17 seconds.

Now back to topic :)

As an aside and not to divert the thread too greatly, the word "marginal" is sometimes used in marmot's sense of "small", but also means "at the margin" as in the expression "marginal cost". In addressing the question of 24/7 running, the word was used in the second sense and not the first: in effect, marmot argues that the marginal cost of running a CPDN model is "large" relative to the cost of the running an idle PC - that's actually a good thing, as it means the PC is efficient. However, it only produces a disagreement if "marginal" is allowed to have only one meaning, which it doesn't.


None of the included definitions, including the one from economics, uses "marginal" when the value in question becomes 4 (6 hours of gaming vs 24 hours of BOINC) to 20 (100% vs 5% wattage at idle) times the original base cost, as it does for the wattage used by BOINC WU's run 100% of the time on 100% of cores. BOINC energy usage overrides all other considerations and becomes the primary cost. But this isn't the topic of the thread; I just had to address a major fault of perception that seems common.
BOINC WU's are valuable but also the primary cost of electricity usage on computers running CPU/GPU intensive WU's.
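
The two multiples quoted above come straight from simple ratios, sketched here. The input numbers are the ones from this post (6 vs 24 hours of load per day, 5% vs 100% of full wattage), not measurements; the function name is made up.

```python
def cost_multiple(baseline, with_boinc):
    """How many times larger the BOINC figure is than the baseline figure."""
    return with_boinc / baseline


# 6 hours/day of gaming load vs 24 hours/day of BOINC load.
hours_multiple = cost_multiple(6, 24)
# Idle at 5% of full wattage vs BOINC at 100% of full wattage.
load_multiple = cost_multiple(5, 100)

print(hours_multiple, load_multiple)
```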





©2024 climateprediction.net