Stuck Model?

Gordon Hartman
Joined: 31 Aug 04
Posts: 5
Credit: 351,767
RAC: 0
Message 32829 - Posted: 3 Mar 2008, 10:36:02 UTC

Task hadsm3fub_013j_005927676_0 using hadsm3 version 506

The task has been stuck at 85.941% for a week now. It goes a little higher, to 85.955%, then drops back to 85.941%. On the graphics it appears to be an ice world, all blue. Timestep 149910 of 259248, Date 04/08/2059 3:00 (1810 to 2050)

Do I need to abort this task?

Thanks

All that is necessary for the triumph of evil is that good men do nothing.


Iain Inglis
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 32830 - Posted: 3 Mar 2008, 11:40:29 UTC
Last modified: 3 Mar 2008, 11:42:37 UTC

Hi Gordon, and welcome to the message board.

All the models in that work unit have hit the same problem, though not everyone has noticed. The model appears to change into an ice world between timesteps 140,426 and 151,228. The other models aren't looping indefinitely, but are making very slow progress - about a week per trickle. They will eventually finish, but that will take about ten weeks - in which time the PC could possibly do four other slab models.

So, I would abort it.

Iain
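
A rough sketch of the arithmetic behind the ten-week estimate above, in Python. It assumes the trickle interval equals the spacing between the two timesteps quoted (151,228 - 140,426 = 10,802), that the phase ends at timestep 259,248 as in Gordon's post, and that the slowed-down rate stays at about one week per trickle. The function and constant names are purely illustrative.

```python
# Figures quoted in this thread. The trickle interval is inferred from the
# spacing of the two quoted timesteps; it is an assumption, not a project spec.
TIMESTEPS_PER_PHASE = 259_248
TRICKLE_INTERVAL = 151_228 - 140_426  # 10,802 timesteps
WEEKS_PER_TRICKLE = 1.0  # "about a week per trickle" (rough estimate)


def weeks_remaining(current_timestep: int) -> float:
    """Estimate the weeks left in the phase at the slowed-down rate."""
    remaining_steps = TIMESTEPS_PER_PHASE - current_timestep
    remaining_trickles = remaining_steps / TRICKLE_INTERVAL
    return remaining_trickles * WEEKS_PER_TRICKLE


print(weeks_remaining(151_228))  # 10.0
```

Ten trickles remain after timestep 151,228, which is where the figure of about ten weeks comes from.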

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 32832 - Posted: 3 Mar 2008, 18:52:25 UTC

Iain, if you think it's in order, I'll send private messages to the other people running the same workunit, though I'll wait till each person's model reaches the critical point and their timesteps & trickles show they've hit the same problem. When we notice a problem it seems perverse to let crunchers waste computer time on a doomed model. What do you think?

Iain Inglis
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 32834 - Posted: 3 Mar 2008, 19:10:18 UTC - in response to Message 32832.  

Iain, if you think it's in order, I'll send private messages to the other people running the same workunit, though I'll wait till each person's model reaches the critical point and their timesteps & trickles show they've hit the same problem. When we notice a problem it seems perverse to let crunchers waste computer time on a doomed model. What do you think?

They may be doing it deliberately, but I doubt it. When I first got an iceworld I thought I would try to finish it, as the slab Zips don't get uploaded until the end of the phase - but I didn't have the patience! I convinced myself that if the project really wanted that kind of data they would re-write the model ...

The bad news is that about one in seven of my slabs has gone awry somehow: if that applies across the whole project, then that's a lot of PMs - though, as you say, it could be limited to the work units of people who pitch up here with this problem. Caveat cruncher?

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 32837 - Posted: 3 Mar 2008, 23:08:05 UTC
Last modified: 3 Mar 2008, 23:10:04 UTC

I didn't mean I am going to trawl through every member's models. For one thing, the server status page says there are over three quarters of a million CPDN models in progress (!!)

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/server_status.php

I just meant the people running the same model as Gordon: now that we already know about them, we may as well make use of the knowledge. And when other people post about a similar problem, we could spend a moment looking at the trickles of other members running the same WU and then, if necessary, tell them about the problem. If some of these people are running BOINC as a service with no graphics, they're unlikely to notice the anomalies.


Iain Inglis
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 32838 - Posted: 3 Mar 2008, 23:38:31 UTC
Last modified: 3 Mar 2008, 23:48:00 UTC

Sure. I was only being awkward.

One thing to watch out for is that an iceworld may be limited to a particular processor/operating-system combination - so Intel/Windows may freeze, but AMD/Windows may not. The best check is to look for a significant increase in the S/TS (seconds per timestep) on the same combination and at the same timestep as the person who has spotted the problem - as has occurred with Gordon's work unit.

29 Feb 2008 12:58:57 814783 7220684 hadsm3fub_013j_005927676_9 3 183,634 2,275,557 3.2409
21 Feb 2008 21:11:33 814783 7220684 hadsm3fub_013j_005927676_9 3 172,832 1,786,674 2.5844
16 Feb 2008 04:51:06 814783 7220684 hadsm3fub_013j_005927676_9 3 162,030 1,297,841 1.9071
10 Feb 2008 10:10:37 814783 7220684 hadsm3fub_013j_005927676_9 3 151,228 808,730 1.2076
09 Feb 2008 14:13:54 814783 7220684 hadsm3fub_013j_005927676_9 3 140,426 738,815 1.1212


and

23 Feb 2008 02:06:23 830206 7220682 hadsm3fub_013j_005927676_7 3 151,228 1,505,392 2.2478
20 Feb 2008 00:45:04 830206 7220682 hadsm3fub_013j_005927676_7 3 140,426 1,378,477 2.0920


Thus, Chris Beaugrand and GOAL: Mexico's 1st place should get a heads-up.

PS Gordon has aborted the model.
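
The S/TS check described above can be sketched in Python as follows. This is only an illustration: it assumes the sixth and seventh columns of the trickle rows are the timestep and the cumulative CPU time in seconds, and that the last column is a cumulative average, so the slowdown is easier to see in the rate between consecutive trickles. The function name and data layout are made up for the example.

```python
# (timestep, cumulative CPU seconds) pairs copied from the trickle rows above,
# oldest first. Reading the sixth and seventh columns this way is an assumption.
TRICKLES = {
    "hadsm3fub_013j_005927676_9": [
        (140_426, 738_815),
        (151_228, 808_730),
        (162_030, 1_297_841),
        (172_832, 1_786_674),
        (183_634, 2_275_557),
    ],
    "hadsm3fub_013j_005927676_7": [
        (140_426, 1_378_477),
        (151_228, 1_505_392),
    ],
}


def incremental_s_per_ts(trickles):
    """Return (timestep, seconds per timestep since the previous trickle)."""
    rates = []
    for (ts_prev, cpu_prev), (ts_now, cpu_now) in zip(trickles, trickles[1:]):
        rates.append((ts_now, (cpu_now - cpu_prev) / (ts_now - ts_prev)))
    return rates


for name, rows in TRICKLES.items():
    print(name)
    for timestep, rate in incremental_s_per_ts(rows):
        print(f"  up to timestep {timestep:,}: {rate:.2f} s/TS")
```

On these figures the first result runs at roughly 6.5 s/TS over the interval ending at timestep 151,228 and roughly 45 s/TS after that, and the second at roughly 11.7 s/TS over the same interval, against cumulative averages of about 1.1 and 2.1 s/TS at timestep 140,426.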

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 32839 - Posted: 3 Mar 2008, 23:54:18 UTC

Yes, they were the ones I was thinking of sending a PM to, i.e. where the problem has already shown up. Then wait to see what happens with the others (assuming one remembers to look back a week later). There's no point in telling people about a potential problem that may not occur on their computer.

Iain Inglis
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 32840 - Posted: 3 Mar 2008, 23:58:50 UTC

Shall I do Chris, and you the potential Spanish speaker?

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 32851 - Posted: 4 Mar 2008, 13:50:25 UTC

Good idea.
