climateprediction.net home page
Posts by Jean-David Beyer

Posts by Jean-David Beyer

1) Message boards : Number crunching : What is "HadAM4 at N216 resolution v8.52"? (Message 62330)
Posted 21 Apr 2020 by Jean-David Beyer
Post:
The 4 Gigs refers to L3 cache.


So when I look at the processor using cpdn web site, and it says

CPU type GenuineIntel
Intel(R) Xeon(R) CPU E5-2603 0 @ 1.80GHz [Family 6 Model 45 Stepping 7]
Number of processors 4

Memory 15.5 GB
Cache 10240 KB


to which cache does it refer?
2) Message boards : Number crunching : What is "HadAM4 at N216 resolution v8.52"? (Message 62326)
Posted 20 Apr 2020 by Jean-David Beyer
Post:
HadAM4 at N216 resolution v8.52 is the new Linux only model. From what I’ve read it requires a lot of RAM (about 4GB per core) to run well.


I got four of those on my relatively slow machine. One of them crashed soon after the second trickle (in Message 62299 - Posted: 13 Apr 2020, 12:16:51 UTC ), but the other three seem to be running OK. One has done three trickles and the others, two each.

As far as RAM is concerned, they seem to take about the same as all N216 models.
VIRT RES SHR
1383m 1.3g 15m

Memory 15.5 GB
Cache 10240 KB
3) Message boards : Number crunching : UK Met Office HadAM4 at N216 resolution v8.52 failed 867 (Message 62308)
Posted 14 Apr 2020 by Jean-David Beyer
Post:
It's felt that you have a bad download of oxi.addfa

You'll need a new copy before you get more tasks.
Or, set the project to No new tasks before you finish any more, and let BOINC delete everything. Then you'll get it all again with the next lot.


Which one? I have three of these still running.
I have set the client to no new tasks.
Should I abort these tasks? Or just let them run? If oxi.addfa is bad, must I manually delete it, or will the boinc client manage to get a new one?

$ locate oxi.addfa
/home/boinc/projects/climateprediction.net/oxi.addfa.N216L38.gz
/home/boinc/projects/climateprediction.net/hadam4h_a0iu_209311_4_867_012012577/datain/ancil/oxi.addfa.N216L38
/home/boinc/projects/climateprediction.net/hadam4h_a1c9_209611_4_868_012016786/datain/ancil/oxi.addfa.N216L38
/home/boinc/projects/climateprediction.net/hadam4h_a1fw_209611_4_867_012013767/datain/ancil/oxi.addfa.N216L38
/home/boinc/slots/0/oxi.addfa.N216L38.gz
/home/boinc/slots/3/oxi.addfa.N216L38.gz
/home/boinc/slots/6/oxi.addfa.N216L38.gz
4) Message boards : Number crunching : UK Met Office HadAM4 at N216 resolution v8.52 failed 867 (Message 62299)
Posted 13 Apr 2020 by Jean-David Beyer
Post:
It did two trickles before this happened.
The complaints in stderr seem familiar, but I cannot find them here.

i686-pc-linux-gnu

Name hadam4h_a27q_210011_4_867_012014769_0
Workunit 12014769

<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFIN: Read Failed: Input/output error
BUFFIN: C I/O Error ferror - Unit 116 - Return code = 1

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/xnnuj.pipe_dummy                                                            
cpdnmonitor: error reading file /home/boinc/projects/climateprediction.net/hadam4h_a27q_210011_4_867_012014769/datain/ancil/oxi.addfa.N216L38

BUFFIN: Read Failed: Input/output error
BUFFIN: C I/O Error ferror - Unit 116 - Return code = 1

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/xnnuj.pipe_dummy                                                            
cpdnmonitor: error reading file /home/boinc/projects/climateprediction.net/hadam4h_a27q_210011_4_867_012014769/datain/ancil/oxi.addfa.N216L38

BUFFIN: Read Failed: Input/output error
BUFFIN: C I/O Error ferror - Unit 116 - Return code = 1

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/xnnuj.pipe_dummy                                                            
cpdnmonitor: error reading file /home/boinc/projects/climateprediction.net/hadam4h_a27q_210011_4_867_012014769/datain/ancil/oxi.addfa.N216L38

BUFFIN: Read Failed: Input/output error
BUFFIN: C I/O Error ferror - Unit 116 - Return code = 1

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/xnnuj.pipe_dummy                                                            
cpdnmonitor: error reading file /home/boinc/projects/climateprediction.net/hadam4h_a27q_210011_4_867_012014769/datain/ancil/oxi.addfa.N216L38

BUFFIN: Read Failed: Input/output error
BUFFIN: C I/O Error ferror - Unit 116 - Return code = 1

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/xnnuj.pipe_dummy                                                            
cpdnmonitor: error reading file /home/boinc/projects/climateprediction.net/hadam4h_a27q_210011_4_867_012014769/datain/ancil/oxi.addfa.N216L38

BUFFIN: Read Failed: Input/output error
BUFFIN: C I/O Error ferror - Unit 116 - Return code = 1

Model crashed: REPLANCA :I/O ERROR                                                                                                                                                                                                                                             tmp/xnnuj.pipe_dummy                                                            
Sorry, too many model crashes! :-(
06:12:41 (17668): called boinc_finish(22)

</stderr_txt>
]]>
5) Message boards : Number crunching : New work Discussion (Message 62284)
Posted 8 Apr 2020 by Jean-David Beyer
Post:
Last work completed in the first week in January. Nothing since. I guess the project has just about run out of steam.


I do not think it is out of steam. I got three N216 work units about a week ago, and one more a few days later. They are crunching right now. The first three have completed over 25% and each of the three have delivered a trickle.

There are a lot more in the queue:

UK Met Office HadAM4 at N216 resolution 3945
6) Questions and Answers : Unix/Linux : fedora 30 64 bit (Message 62177)
Posted 2 Mar 2020 by Jean-David Beyer
Post:
This model did not like to get interrupted:

UK Met Office HadAM4 at N144 resolution v8.08 i686-pc-linux-gnu

But the v8.09 fixed that problem. I have had no trouble with the N216 models.

I run RHEL6.10.
7) Message boards : Number crunching : New work Discussion (Message 62166)
Posted 27 Feb 2020 by Jean-David Beyer
Post:
Edit:My E5400 @ 2.70GHz, which must be one of the slowest computers still able to crunch the longest tasks will finish an N216 in under 6 months even when only used when i am at the computer. Cutting the deadline back to that rather than the 11 months when the task was sent would for me be the least we could do.


Mine is slower than yours.

GenuineIntel
Intel(R) Xeon(R) CPU E5-2603 0 @ 1.80GHz [Family 6 Model 45 Stepping 7]
Number of processors 4
Memory 15.5 GB
Cache 10240 KB

1,963,447.89 1,860,666.00 27,115.14 UK Met Office HadAM4 at N216 resolution v8.52
i686-pc-linux-gnu
8) Message boards : climateprediction.net Science : Climate change in the News (Message 62091)
Posted 9 Feb 2020 by Jean-David Beyer
Post:
+18C in Antarctica today. (11C higher than Cambridge!)


Since there is no such thing as climate change, there must be severe Orgone Fields sweeping over Antarctica to cause all their thermometers to read so much higher than the real temperatures.
9) Questions and Answers : Unix/Linux : *** Running 32bit CPDN from 64bit Linux - Discussion *** (Message 62076)
Posted 3 Feb 2020 by Jean-David Beyer
Post:
Ubuntu now includes 32-bit libraries by default.
HadAM4 now is 64-bit


Not on my 64-bit machine. All seem to be 32-bit.

$ file hadam4_8.09_i686-pc-linux-gnu hadam4_8.52_i686-pc-linux-gnu hadam4_um_8.09_i686-pc-linux-gnu hadam4_um_8.52_i686-pc-linux-gnu hadcm3s_8.36_i686-pc-linux-gnu hadcm3s_um_8.34_i686-pc-linux-gnu hadcm3s_um_8.36_i686-pc-linux-gnu

hadam4_8.09_i686-pc-linux-gnu: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped
hadam4_8.52_i686-pc-linux-gnu: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped
hadam4_um_8.09_i686-pc-linux-gnu: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped
hadam4_um_8.52_i686-pc-linux-gnu: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped
hadcm3s_8.36_i686-pc-linux-gnu: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped
hadcm3s_um_8.34_i686-pc-linux-gnu: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped
hadcm3s_um_8.36_i686-pc-linux-gnu: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped
10) Message boards : Number crunching : Scheduler request too recent (Message 61946)
Posted 10 Jan 2020 by Jean-David Beyer
Post:
Which doesn't take into account that most people joining over the past couple of years seem to have no particular interest in the climate models, and load up with lots of work from other projects as well.


I joined quite a while ago. I am particularly interested in climate models so my priorities are 50% climateprediction, 25% WCG, 13% Rosetta, and 12% Seti@home. When I can get all the work units I wanted, this ended up with three processors running CPDN, and the others taking their turns on the other processor.

These days there are large gaps in work-unit availability, so now I have Rosetta and Seti@home set to Won't Get New Tasks. I allow new tasks in CPDN until I get four work units, and then turn off getting new ones. I leave WCG on, but I accept only one work unit at a time on two of their jobs, and two at a time on one other. As a result, I seldom get any.

Right now, my deficit on CPDN work is so great that I am usually getting four work units at a time. (Right now I am running one N144 work unit and three N216 work units.) But when this does not happen, I get a work unit or two from WCG that usually go through in less than a day. If I run out of work, I enable either seti@home or rosetta just long enough to get some work units (usually 6 or 8 come through) and they grab the processors. By the time they are done, I have received some more CPDN work. If work units were available more of the time, I hope to find a way to just let them all run all the time without my fiddling around like this. I.e., like in the old days.
11) Message boards : Number crunching : UK Met Office HadAM4 at N216 resolution (Message 61865)
Posted 31 Dec 2019 by Jean-David Beyer
Post:
This one failed for me due to negative pressure.
I do have the required 32-bit libraries as evidenced by other 144 work have completed successfully, as have the 216 ones.

Task 21866633
Name hadam4_a06t_209110_6_856_011961960_3
Workunit 11961960

My failed one says.

Workunit 11961960
name hadam4_a06t_209110_6_856_011961960
application UK Met Office HadAM4 at N144 resolution
created 9 Dec 2019, 1:30:09 UTC
minimum quorum 1
initial replication 1
max # of error/total/success tasks 5, 5, 1
errors Too many total results

No need to do anything about this, I suppose: just another data point.
12) Message boards : Number crunching : What ever it was modeling it did it fast. (Message 61810)
Posted 25 Dec 2019 by Jean-David Beyer
Post:
Seems there sadly are not many Linux crunchers out there, or at least connected to this project in comparison to others.


I have been signed up for CPDN for ages, running only Linux. If memory serves, I went about a whole year without getting any CPDN work.
Presently, I am running one hadam4h and two hadam4 work units. Also one WCG work unit.
My normal intention is to run 50% CPDN, 25% WCG, and 12.5% each of Seti@home and rosetta@home. I enable these last two only when I have idle processor cores and cannot get either CPDN or WCG work units. When the latest hadam4 and hadam4h Linux units started coming out, I was running 100% CPDN for a while.

I do not know if you would count me as a Linux cruncher during the year when I crunched no CPDN work units, because I did not get any.
13) Message boards : climateprediction.net Science : Climate change in the News (Message 61706)
Posted 18 Dec 2019 by Jean-David Beyer
Post:
I wonder why a study was required for this. When I was in high school in the mid 1950s, I learned that the solubility of gasses in water went down with increasing temperature. I cannot imagine this has changed since then. The study may reveal to the ignorant just how bad the situation already is and that, unless we make a dramatic change in energy use, what the future is likely to be.
14) Message boards : climateprediction.net Science : Climate change in the News (Message 61635)
Posted 2 Dec 2019 by Jean-David Beyer
Post:
If they set it up as BOINC work units, the cloud of boinc users is already here and ready to go, no?
15) Message boards : Number crunching : New work Discussion (Message 61625)
Posted 30 Nov 2019 by Jean-David Beyer
Post:
All my N216s were 8.53


Oops! Typo. I meant 8.52.
16) Message boards : Number crunching : Why did I get ZERO credit for these tasks? (Message 61619)
Posted 27 Nov 2019 by Jean-David Beyer
Post:
And eventually all of this will be put into files for the external stats sites to collect later.


Some external sites refer to this reasonably promptly.

But this site:

https://boinc.netsoft-online.com/e107_plugins/boinc/get_user.php?cpid=549ef8ba1b5181207e1ad3a2ba626d83&html=1

does not do it hardly ever anymore. It thinks I have 3,075,732 credits while your site says I have 3,239,868; they are about 164,000 credits behind.
17) Message boards : Number crunching : New work Discussion (Message 61591)
Posted 21 Nov 2019 by Jean-David Beyer
Post:
My N144s that worked were all 809. the one 808 I got crashed.
All my N216s were 8.53; the first one was from 17 Oct 2019, 6:34:33 UTC
that completed 9 Nov 2019, 14:18:39 UTC that shows how slow (or fast) my 1.8 GHz Xeon processor it.

They certainly checkpoint more frequently than the, send trickles. The most recent checkpoint was about an hour ago, but that does not prove much.
18) Message boards : Number crunching : Validation pending for 9 years... (Message 61584)
Posted 20 Nov 2019 by Jean-David Beyer
Post:
I have 77 still pending for validation, but I am not worried either. They start sometime in 2010 and end in December 2015.
I have 157 valid ones. I have this much credit: 3,212,752

I wonder what it all means.
19) Message boards : Number crunching : Validation pending for 9 years... (Message 61578)
Posted 19 Nov 2019 by Jean-David Beyer
Post:
5 GB per core?! I do run a mixed number of memory light and memory heavy workloads, but so far the biggest one used "only" 2GB per core.


My biggest ones (hadam4_um_8.52_i686-pc-linux-gnu N216) each use 1385 Megabytes of virtual memory, but actually resident in RAM is only 1.3 Gigabytes.

Are you perhaps referring to OpenIFS models? I have not seen any of those yet.
20) Message boards : climateprediction.net Science : Misconfigured Machine? (Message 61576)
Posted 19 Nov 2019 by Jean-David Beyer
Post:
There is some discussion about this at the moment and whether sending emails to owners is GDPR compliant.


If I remember correctly, when signing up for this group, I checked a box saying it was OK to send me e-mails from the project. If this is correct, it seems to me the project could send people who did this e-mails.


Next 20

©2020 climateprediction.net