climateprediction.net home page
Posts by old_user272

Posts by old_user272

21) Message boards : Number crunching : Sulphur Download? (Message 16656)
Posted 18 Oct 2005 by old_user272
Post:
My slow (750MHz) machine has just finished a slab model and downloaded a new Sulphur model which, I reckon, will take about 120 days to complete. Is there any point in running it?. I thought the idea was that the Sulphur models needed to be finished quickly so they could be used to seed the spinup models.

IAn
22) Questions and Answers : Windows : MS Antispyware beta removed trojan downloader, Boinc Ceased working (Message 15554)
Posted 30 Aug 2005 by old_user272
Post:

> Just an update: the problem is resolved at this point.
> From the restor point, I reran MS Antispware, removed the
> trojan downloader again, and rebooted. Boinc started fine this time.
> Perhaps the two issues were unrelated.

Or maybe not. I installed MS AntiSpyware a week or so ago and since then, after a restart/reboot, I have occasionally had the "cannot connect" error. I can't remember ever seeing that error before.

A reboot has always succeeded restarting BOINC though so I'm not too worried about it.

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
23) Questions and Answers : Windows : BOINC backups (Message 15463)
Posted 27 Aug 2005 by old_user272
Post:

Thanks for the link.

I have got Nick's script working but it appears the exit script can only close BOINC down if the manager is open and maximized on the desktop. If its minimized or in the icon tray then #AppActivate doesn't seem to work - and that's the way I normally run BOINC.

It's a long time since I've used the WSH so I will have to have a refresher - there may be a way round it.

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
24) Questions and Answers : Windows : BOINC backups (Message 15452)
Posted 27 Aug 2005 by old_user272
Post:
With the new loooong models that CPDN is issuing I think it might be wise if I changed my BOINC backup strategy - i.e. start having one :-)

I\'d prefer an automated backup, in the middle of the night, but was a bit worried about folders getting out of sync because of the large amount of data being shifted about. i.e.

1) The main BOINC folder being backed up.
2) Something happens that causes client_state.xml in the BOINC folder to be updated.
3) The folder containing the file that caused the client_state change is backed up.

In that situation the client_state file and the data file would be out of sync which might(?) cause a problem if the folders needed to be restored.

Has anyone else addressed this possibility or is it not worth worrying about (always remembering sod\'s law). Is there an easy way of programmatically pausing BOINC while the backup is copying files?

Ian

<img>
25) Questions and Answers : Windows : Negative Credit (Message 14764)
Posted 30 Jul 2005 by old_user272
Post:

&gt; It is possible you worked on some results that were unfortunately handed out
&gt; twice. The credit for these has recently been reassigned to whoever got
&gt; furthest with the model. This seems the fairest system.

It doesn't seem to work that way in all cases. Have a look at

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=872375

This w/u was allocated to two machines. For the first one and a bit phases my machine (321 (187529 after a merge)) sent in the trickle first, after that the other machine (5336) got in the trickle first.

I realised what was happening soon after my trickles stopped being recognised and aborted the model on my machine, leaving it to 5336 to complete.

However, the above result still shows up in my result list and is missing from 5336's result list, even though that is the machine that completed the model. I haven't noticed a drop in my total credits so can only assume that I'm still being, incorrectly, credited for that model.

Ian

<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
26) Message boards : Number crunching : Trickles are great -- but how about some Credits? (Message 14710)
Posted 29 Jul 2005 by old_user272
Post:

&gt; I suppose I could look for a result id and track back that way? - I never made
&gt; a note of any of them though.

I went to my user page and looked through all the results listed for my username. There were results for computer id 321 listed but when I clicked on the "321" link it came back with the "Couldn't find computer" message.

It looks like the result ids still refer back to the merged machines, and the total stats for the user are therefore correct, but there are no stats available for the merged machines themselves.

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
27) Message boards : Number crunching : Trickles are great -- but how about some Credits? (Message 14709)
Posted 29 Jul 2005 by old_user272
Post:
&gt; does the machine appear so you can "re-merge" it?

I tried to get the details for my old merged machine id (321) using the following but the server just came back with "Couldn't find computer".

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=321

I suppose I could look for a result id and track back that way? - I never made a note of any of them though.

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
28) Message boards : Number crunching : Trickles are great -- but how about some Credits? (Message 14704)
Posted 29 Jul 2005 by old_user272
Post:

&gt; kenfins rac &amp; total credit #'s (and probably a cast of thousands') should
&gt; be fixed now.

Does this mean that the results for merged machines are no longer available?. Due to a double allocated WU I detached/reattached and then merged one of my machines (id 321 -&gt; 187529). All of the results for 321 seem to have disappeared from my stats page - although the total credit I have is still correct.

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
29) Questions and Answers : Windows : Benchmak stopped model being crunched (Message 14640)
Posted 25 Jul 2005 by old_user272
Post:
Just noticed the following in the log of one of my machines running hadsm3 4.13

25/07/2005 07:26:37 128 Suspending computation and network activity - running CPU benchmarks
25/07/2005 07:26:37 129 Pausing result 13pm_100072003_1 (removed from memory)
25/07/2005 07:26:39 130 Running CPU benchmarks
25/07/2005 07:26:47 131 Aborting CPU benchmarks, one or more active tasks are still running.

I remember this happening once before on an earlier model and, I _think_, a different machine. The problem is that BOINCVIEW, and I suppose BOINC Manager?, show the model as running although it was never actually restarted after the benchmark abort. If you\'re not paying attention it\'s quite easy to miss the situation and end up with a machine sitting idle.

Is this a known problem?

Ian


<img>
30) Questions and Answers : Windows : Transition from hadsm 4.12 to 4.13 failed (Message 14638)
Posted 25 Jul 2005 by old_user272
Post:

&gt; Also has anyone successfully downloaded Windows hadsm3*_4.13_* files?

I finished a 4.12 model yesterday. The 4.13 files and a model were downloaded and started without any problem.

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
31) Message boards : Number crunching : Incorrect credit allocation (Message 13868)
Posted 26 Jun 2005 by old_user272
Post:
&gt; This problem is a misallocated WU problem. I see nothing different from other
&gt; cases.

OK, thanks.

&gt; The resultid has been sent to 2 computers (5336 and 321). It is
&gt; allocated to your computer 321. 5336 started later but is computing faster and
&gt; has overtaken you.

That makes sense. It would also explain why the WU doesn't show up in 5336's work list - I got it first :-)

I suspended when I saw the problem and the machine is happily crunching another WU (it has a problem with 4.45 and an ever increasing short term debt, in trying to sort that out I had already downloaded another two WUs). I'll leave it in that state until 5336 finishes - but might not that machine have a problem when it tries to upload the result if the machine id doesn't match the one the WU was allocated to?.

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
32) Message boards : Number crunching : Incorrect credit allocation (Message 13863)
Posted 26 Jun 2005 by old_user272
Post:
Thanks for confirming that there's somthing strange with that result ID

&gt; If you are doing a run that appears to be allocating credit to another
&gt; machine, it looks as if you possibly have a new variant of the problem in this
&gt; <a> href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=2577"&gt;thread</a>

I've previously read that thread a number of times and I'm still not sure exactly what problems it is describing and whether my problem is related.

I thought it worth starting a new thread though as the one you quote was related to the first outage (end of May) and not the current one.

Thanks

Ian
<img src='http://www.boincsynergy.com/images/stats/comb-942.jpg'>
33) Message boards : Number crunching : Incorrect credit allocation (Message 13860)
Posted 26 Jun 2005 by old_user272
Post:
Looks like the recent outage has caused another problem with misallocated credits.

I wondered why the trickes from one of my machines weren\'t showing up in it\'s \"received trickles\" list (I know the rest of the stats are broken but trickle acknowledgement seems to be OK) and the reason is that the trickes are being allocated to another machine.

Run ID - 872375
My box (321) shown as trickling up to 19th June
Computer Id 5336 shown as trickling since then.

What\'s the best thing to do? - I\'m tempted to abort the run.

Ian
34) Message boards : Number crunching : Time to give more time to Einstein! (Message 13757)
Posted 23 Jun 2005 by old_user272
Post:

&gt; On my last model, just before uploading, Les Bayliss pointed out that I had
&gt; missed a trickle in Phase 1 (!). Les helped me to manoever around a little and
&gt; tried to rescue it, with some modest success. Thyme Lawn assured me that the
&gt; science was intact, and the model would be productive. _BUT_ the Results for
&gt; the model show the server state "in progress" and the outcome "unknown". Even
&gt; today. (Could that change when the servers feel better? Maybe)

FWIW. I just looked through my results and found result ID 808760. The server has has only received 65 trickles but has a status of Over/Success - it appears to have missed 7 trickles during phase 1 without any ill effect. I also got exactly the same credit (6805.26) as models that were completed with 72 trickles.



Ian
35) Message boards : Number crunching : Time to give more time to Einstein! (Message 13671)
Posted 21 Jun 2005 by old_user272
Post:

&gt; Week.Am not getting ERROR messages and am 70%
&gt; done and close to phase 3 cannot waste 346 Hours
&gt; of Computer time.I say "PLEASE" let us know
&gt; what is going on!!!!!

Why do you think you are in danger of "wasting" 346 hours. The only way that will happen is if you abandon the model or CP disappears for good (and, IMHO, that is very unlikely).

Everyone seems to forget that trickles are completely irrelevant. You could download a model, get it running and only connect to the server again three (or whatever) weeks later when the model is complete - _never_ trickle at all. You would get exactly the same credit and CP would get exactly the same science as if you had trickled every day. I suspect there are people running CPDN on laptops who work that way.

I completely agree that the lack of feedback is regrettable, to say the least, but it is no reason for abandoning the project.

The only thing I would recommend, and I'm in this position at the moment, is that you don't let a model complete while the server is down - I've got one at 98% that I've just suspended and will only resume when the server is back.

Ian
36) Questions and Answers : Windows : no trickles for 3 days? when will be up? (Message 13594)
Posted 20 Jun 2005 by old_user272
Post:

&gt; I an not particularly concerned with the credits not being counted. I am
&gt; concerned that the computer time I am donating to the project is being
&gt; utilized. I don't know of any other indicator one can use to determine if the
&gt; computed data is actually making it back to the system other than looking for
&gt; trickles. I consider that an acknowledgment that the computations performed
&gt; have been received. If the data being sent back is still being captured,
&gt; fantastic.

Trickles are mainy for stats purposes and very little information is returned within one.

If the server doesn't see a trickle then it doesn't matter. Stats are calculated on the amount of work completed, i.e. the timestep in the model, and you will receive the correct credit for _all_ the work you have done when the server gets the _next_ trickle from you.

The only time (as far as I know) science information is sent back is at the end of each of the three phases, mainly the last phase. If the model completes, and the ~8 MiB of data generated is returned, the the full scientific value of the model run obtained, there is no wasted processor time at all.

Trickes are _just_ so that you get some feedback from the model, your experiment would run perfectly well without them.

&gt; A simple news posting on the WEB explaining that there is a
&gt; temporary problem but that all data is still being collected would go a long
&gt; way with the volunteer community.

Can't disagree with that at all.

Ian

37) Questions and Answers : Windows : stats update still (or again?) doesn\'t work (Message 12301)
Posted 4 May 2005 by old_user272
Post:
&gt; I've just uploaded another trickle for the model that has the missing trickle.
&gt; This one made it, but still no sign of the one that's missing.

Same here. My latest trickle has appeared but the 4 that were missed haven't.

Ian
38) Questions and Answers : Windows : stats update still (or again?) doesn\'t work (Message 12272)
Posted 4 May 2005 by old_user272
Post:

&gt; Since then the stats are showing no trickles for the machine which, as far as
&gt; I can tell, is still crunching correctly. I would have expected to see two or
&gt; three trickles by now.

I might have been a bit too quick off the mark there. I have just rebooted the system and BOINC seems to be doing a lot more downloading (of work units on the other projects that I run) than I would expect. It may me that something had gone wrong my end and the reboot cleared it.

It will be interesting to see if the missing trickes appear now.

Ian
39) Questions and Answers : Windows : stats update still (or again?) doesn\'t work (Message 12269)
Posted 4 May 2005 by old_user272
Post:
&gt; What is your host ID? I'd like to see that status for that PC.

I might be seeing the same thing. I finished a run at 20:20 on the 2/5 and it uploaded correctly (I had already downloaded a new model).

Since then the stats are showing no trickles for the machine which, as far as I can tell, is still crunching correctly. I would have expected to see two or three trickles by now.

Host ID is 290.

Ian
40) Questions and Answers : Windows : No work available (Message 10569)
Posted 8 Mar 2005 by old_user272
Post:
I had trouble downloading a new model this morning, for the same reason.

However I successfully downloaded a new model at 14:00ish UTC, so if you haven't yet got one it might be worth trying again.

Ian


Previous 20 · Next 20

©2024 climateprediction.net