climateprediction.net home page
Posts by KWSN - Sir Frank of the Wood

Posts by KWSN - Sir Frank of the Wood

21) Message boards : Number crunching : Compute Errors / Bad Work Units? (Message 47180)
Posted 27 Sep 2013 by KWSN - Sir Frank of the Wood
Post:
astroWX:

machine 1270234 seems to be spinning its wheels also...

frank
22) Message boards : Number crunching : Compute Errors / Bad Work Units? (Message 46977)
Posted 6 Sep 2013 by KWSN - Sir Frank of the Wood
Post:
astroWX:

machine 1184413 seems to also be chewing up work units without any useful results...

frank
23) Message boards : Number crunching : More Work (Message 46946)
Posted 2 Sep 2013 by KWSN - Sir Frank of the Wood
Post:
belfry:

les was making a "tongue in cheek" observation of the situation...it is the combination of machines/OSs/networks/jobs running on both ends (servers and crunchers) that discover obscure hairballs that choke things up and require lots of troubleshooting...he was not really blaming us volunteer crunchers...

frank
24) Message boards : Number crunching : Compute Errors / Bad Work Units? (Message 46930)
Posted 1 Sep 2013 by KWSN - Sir Frank of the Wood
Post:
...my machine made it to 96% on work unit 8537815 before keeling over with an exit code of 22...i think it was perhaps OS confusion caused by other things that were running, but don't know for sure...other machines gave up much sooner...

while looking at other folks progress on this work unit, i noticed that machine 1105670 has crashed on nearly every work unit it has attempted...any obvious reason for this situation ???

frank
25) Message boards : Cafe CPDN : Astonishing web pages (Message 46341)
Posted 1 Jun 2013 by KWSN - Sir Frank of the Wood
Post:
...and those menu items are just starters - here at the NC State Fair(Raleigh NC) we also have Fried Twinkies, Fried Candy Bars, and Fried Coca Cola...
26) Message boards : Number crunching : Download Failed (Message 45956)
Posted 18 Apr 2013 by KWSN - Sir Frank of the Wood
Post:
hello les

take a look at computer 1177150...

what would cause this situation ???

frank
27) Message boards : Number crunching : Reporting - Errors while computing - (Message 45913)
Posted 12 Apr 2013 by KWSN - Sir Frank of the Wood
Post:
hello les

i wondered what was going on - i checked to see how that work unit was processed by other folks, and i was the only person that had it...


so the download attempt was just a mistake ???

frank
28) Message boards : Number crunching : Reporting - Errors while computing - (Message 45903)
Posted 12 Apr 2013 by KWSN - Sir Frank of the Wood
Post:
hello

not sure if this will help in troubleshooting, but here is what i have:


4/11/2013 9:21:55 PM climateprediction.net Giving up on download of hadam3p_pnw_c1zs_1959_1_007935543.zip: file not found


frank
29) Message boards : Number crunching : Don't receive a packet (Message 45856)
Posted 9 Apr 2013 by KWSN - Sir Frank of the Wood
Post:
hello mamph

welcome to the project !!!

as dave jackson said, most of us also participate in other projects, in order to have something for our machines to process during the times CPDN does not have any work available...

check "server status" in the list over on the left side of the screen...

frank
30) Message boards : Number crunching : HadCM3N crashing at first trickle (Message 43380)
Posted 5 Nov 2011 by KWSN - Sir Frank of the Wood
Post:
(this may help somebody troubleshoot something...)


looked at wu 7717715...sent to 5 machines - 4 errored out in less than 60 secs...
all running Darwin ...

my XP machine has been crunching same WU for 38 hours so far...


sir frank
31) Message boards : Number crunching : NO WORK! (Message 42714)
Posted 30 Jul 2011 by KWSN - Sir Frank of the Wood
Post:
hello andrew

"server status" shows 348 HadAM3p-eu work units available...so being patient is probably about all we can do...
32) Message boards : Number crunching : Cancel high priority run now or later? (Message 42699)
Posted 28 Jul 2011 by KWSN - Sir Frank of the Wood
Post:
markus:

there are 40 trickles - each trickle is 311 credits;

therefore, you will receive about 12,440 credits for the completed
workunit...
33) Message boards : Number crunching : Account Updating! (Message 42694)
Posted 28 Jul 2011 by KWSN - Sir Frank of the Wood
Post:
Les:

reminds me of the old army security classification joke: CBAR (Classified Beyond All Recognition) which is a higher classification than Yankee White (which is supposed to be higher than anything known to mere mortals)...
34) Message boards : Number crunching : hadam3p_eu crash 45 seconds in. (Message 42516)
Posted 1 Jul 2011 by KWSN - Sir Frank of the Wood
Post:
Make that 5 workunits that failed in 63 to 74 seconds each...just noticed the 5th one...
35) Message boards : Number crunching : hadam3p_eu crash 45 seconds in. (Message 42514)
Posted 1 Jul 2011 by KWSN - Sir Frank of the Wood
Post:
basically the same here...workunit hadam3p_eu_4hvn_1999_1_007334252_1 ran for 1min 5sec...it was the last of 4 hadam3p_eu workunits that failed in about the same length of time...
36) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 42175)
Posted 14 May 2011 by KWSN - Sir Frank of the Wood
Post:
from Bill

"I suspect more problems than just scoring popped up, or they decided to rearrange/redesign some of the systems. Clearly the problem(s) were more complex than we suspected."

as a retired IT data cruncher, i can testify that most ANY problem in IT turns out to be worse than you think, going in...

IT rule(s) of thumb:

a one-hour fix will take at least a day, by the time you figure out what is REALLY wrong...

a one-day fix will probably take a week...

a one-week fix will take a month, by the time you fix all the things you broke while you were fixing the original problem...

a ninety-day conversion will take about a year, because the fool that made the first estimate doesn't know anything about the old system, the new system, or IT in general...

been there - both as a witness and as a worker bee...

37) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 42104)
Posted 3 May 2011 by KWSN - Sir Frank of the Wood
Post:
...indeed - a little "vigorish" of about a 10% bonus in the credit department would encourage folks to run the 50-day workunits...
38) Message boards : Number crunching : hadcm3n Shorter deadline? (Message 42057)
Posted 29 Apr 2011 by KWSN - Sir Frank of the Wood
Post:
Bernard:

what astrowx says is correct - i received one of the cm3n work units with an estimate of 2400 hours to completion...it actually is doing 2 percent per day, so it will finish in about 50 days...
39) Message boards : Number crunching : HadCM3 Full Resolution model low credits (Message 41923)
Posted 7 Apr 2011 by KWSN - Sir Frank of the Wood
Post:
...same here - 60 hours of crunching produces about 80 credits...my RAC is dropping like a rock...


Previous 20

©2024 climateprediction.net