climateprediction.net home page
Posts by adrianxw
Posts by adrianxw
log in
21) Message boards : Number crunching : WU crashed, am investigating. (Message 44399)
Posted 14 Jun 2012 by Profile adrianxw
This morning, I was investigating the crash of a WU from RNA World. It was one of their XXL long runner units which doesn't checkpoint.

My CPDN wu also crashed during the night, coincidence?

The log of the CPDN shows nothing useful, but the RNA issue started at 01:38 with a "No heartbeat" message and various stops and starts for an hour.

I have also installed BOINC on my wifes laptop. This morning, there were a flock of "upload pending" issues which went away when I poked them.

I am wondering if something my ISP has done could be to blame. In January, we had to change from my homebrew setup, which has worked fine for years, to TDC's "Trio" pack, which includes a wired and wireless router.

What could be going on here?
22) Message boards : Number crunching : Iceworld Appeal (Message 38209)
Posted 28 Oct 2009 by Profile adrianxw
Okay, done. Current wu Ctrl-Q\'d.
23) Message boards : Number crunching : Iceworld Appeal (Message 38198)
Posted 28 Oct 2009 by Profile adrianxw
I\'d have helped, (note past tense), but what I came to the board to do was report an ice world, hadsm3fub_jowe_006398408 this one. I have suspended it rather than aborting in case there is anything to be gained here, but I doubt it. I do not have any backups of the model. If there is anything to recover, let me know. (It crunched for 250+ hours before \"freezing\").
24) Message boards : Number crunching : No work available. (Message 37906)
Posted 25 Aug 2009 by Profile adrianxw
I was seeing these messages...

25/08/2009 07:37:00 climateprediction.net Message from server: No work sent
25/08/2009 07:37:00 climateprediction.net Message from server: No work available for the applications you have selected. Please check your settings on the web site.

... a few days ago, so I dutifully attached to two other model types, to be told, \"No work available...\" eventually, like a couple of days later, a short wu arrived, but that has crunched to completion and again, this machine has no CPDN, despite asking every hour or so.
25) Message boards : Number crunching : Your Account/Tasks page - order changed. (Message 37589)
Posted 30 Jul 2009 by Profile adrianxw
Until mid May this year, when you downloaded a new wu, it's record went to the top of the list in "Your Account->Tasks". The wu I received 20th May was the last to do this. The one 23rd May is now top of the list with later ones continuing down the list.

I don't know if this was by design, but it is a bit of a nuisance.
26) Message boards : Number crunching : Trickle and credit delays (Message 37381)
Posted 28 Jun 2009 by Profile adrianxw
I think it's fairly clear that all of us have noticed. The reported values have been a little suspect since 23rd actually.
27) Message boards : Number crunching : Server Problem? (Message 36724)
Posted 16 Apr 2009 by Profile adrianxw
It is not just trickles that are getting that...

16/04/2009 09:05:54|climateprediction.net|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
16/04/2009 09:05:59|climateprediction.net|Scheduler request failed: Error 417

... it\'ll sort out, no worries.
28) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 36701)
Posted 12 Apr 2009 by Profile adrianxw
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=6193392
149066 of 259248

Next, not sure, pressed z and cannot see what you want... what it says is...

Atmos model time 13:00
Atmos model date 16:07/2059
Hours elapsed 0418:33:33 (2.26 s/TS) <-- maybe that!
Timestep 149066 of 259248 - Phase 3 of 3
Progress 85.83%

Not blue
Intel Quad under Windows XP
Not over clocked.

Hope that helps, was running on a system not visited everyday so may have been like that for 4-5 days at least.

Best wishes,
Adrian.
29) Message boards : Number crunching : Trickles and Credits (Message 36542)
Posted 29 Mar 2009 by Profile adrianxw
Cosmology@Home has also been offline for more than a week now, there was a note that they are working on it.

Credit coming through here now.
30) Message boards : Number crunching : Trickles and Credits (Message 36502)
Posted 27 Mar 2009 by Profile adrianxw
That timeline is the same as ours at Boinc@Denmark, ie. that this is the third such event recently.

Another effect I can see is the wu that finished 25/03 has not had it\'s final credit corrected, it still says 6,521.71 rather than the expected 6,805.26.

Shrugs, it has always sorted itself out.
31) Message boards : Number crunching : Trickles and Credits (Message 36479)
Posted 26 Mar 2009 by Profile adrianxw
Didn\'t run this morning. Les told me it was not planned, rather, \"just one of those things\".
32) Questions and Answers : Getting started : How credit works? (Message 36453)
Posted 24 Mar 2009 by Profile adrianxw
Okay Les, thanks.
33) Questions and Answers : Getting started : How credit works? (Message 36450)
Posted 24 Mar 2009 by Profile adrianxw
Credit didn\'t run last night. Same thing happened last week. Is this a new periodic operating procedure?

*** Edited - spelling! ***
34) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 34545)
Posted 6 Aug 2008 by Profile adrianxw
I recall well the older Met Office systems at Bracknel. The Cosmos, Cray II, the ETA-10 debarcle, Cray Y-MP etc. Not terribly productive, but a lot of memories stirred!

I really quite liked programming in Fortran. I\'m a C++ jock now though, haven\'t used Fortran professionally for more then 10 years. (Carl wanted to know if I wanted a job - is how I know a little about the code base!).
35) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 34539)
Posted 6 Aug 2008 by Profile adrianxw
Could be almost anything. Without a detailed knowledge of the codebase, I wouldn\'t know where to hazard a guess. I don\'t even know what compilers are used by the project, let alone options or libraries.

I know, (Carl told me), that there is a lot of Fortran in there. I also recall weirdness when, ~20 years back, I was porting a big Fortran-77 fluid dynamics application from a Gould 32/77 running MPX-32 to a VAX 11-750 under VMS. Theoretically, they were both standard compliant 32 bit systems with standard compliant compilers, but the application results differed from the same source data - we traced it to the runtime libraries and had to write our own versions of some of the mathematics functions.
36) Message boards : Number crunching : Odd looking graphic. (Message 34537)
Posted 6 Aug 2008 by Profile adrianxw
I don\'t normally look at the graphics, but as I had a couple of iceballs recently, when I see anything odd, I have started to do so. Today I found a wu at exactly 66.666% and not progressing. Now, I am aware that when it changes from phase to phase there is a period of \"different\" activity, so have not done anything about it yet, but the graphic looked odd...

37) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 34466)
Posted 1 Aug 2008 by Profile adrianxw
I asked this once before and don\'t think I ever got an answer. Can I somehow, (obviously not through the regular BOINC mechanism), get EXACTLY the same wu again to run on another machine? There is a second Q6600 machine, (B4 stepping and not OC\'d), sitting 1m from the G0 Q6600. It normally does not run CPDN but there is no reason why I shouldn\'t stick it on, and simply reduce the quota on the other quad to even things out.

I am happy to run it again as, although it may fail, I get the trickles and so am not really losing anything, and it might just help.

PM me if you like.
38) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 34462)
Posted 1 Aug 2008 by Profile adrianxw
I had 3 models on that quad. It has a 25% resource share, so theoretically, only 1 was running, but as LHC and Cels have no work at the moment, it gets more. The one that had been running pretty much all the time, and that which was furthest down the tree were the iceballs. The middle wu is running and trickling normally.

The machine is a Intel Q6600 and the advanced G0 stepping model. It is rated for 2.4GHz and clocked at 3.0GHz. It is in an open chasis with a huge Zalman CNPS9700 heatsink on it running at full speed. Speedfan shows the cores 38-39C but Speedfan under reports these chips by 15C so they are running at less then 55C which for a G0 is nothing. I\'ve torture tested the thing a couple of times without issue. It has crunched a good number of CPDN\'s. I don\'t think it exhibits any instability but you know what CPDN is like!

As always, if there is anything else you need, post or PM me.
39) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 34442)
Posted 31 Jul 2008 by Profile adrianxw
Back to back iceballs 7537005 and 7553089. Blue globe, slow down, all the usual.
40) Message boards : Number crunching : Disappearing cobblestones (Message 34211)
Posted 3 Jul 2008 by Profile adrianxw
Our members big pluses and minuses seem to have reversed themselves today.


Previous 20 · Next 20

Main page · Your account · Message boards


Copyright © 2019 climateprediction.net