climateprediction.net home page
Posts by old_user2354

1) Message boards : Number crunching : sulphur model - Linux - Signal 11 (Message 19137)
Posted 10 Jan 2006 by old_user2354
Post:
In the BOINC Questions and Problems Linux forum, Tolu announced sulphur 4.23 which is supposed to have solved the signal 11 problem. Any models downloaded after late in the GMT day January 4th should have the new app (should see a 423 in the work tab of the BOINC GUI).

As for the credit, with the server being down from the 6th until today, it is likely the stats scripts weren't run since you had the one trickle. It is only being run once a day now, and should update before 12 GMT 10 January.


Now I'm waiting for my first 4.23 WU to be requested. Then I'll switch both PCs (and their one remaining model each) manually, as Honza described. Unless, of course, someone has posted a how-to already?
2) Message boards : Number crunching : Welcome back (Message 19079)
Posted 9 Jan 2006 by old_user2354
Post:
CPDN's servers seem to be back online (or else I couldn't post here). I guess there might be quite some load now that all the clients are trying to send their trickles.

Anyway: Welcome back online
3) Message boards : Number crunching : sulphur model - Linux - Signal 11 (Message 18578)
Posted 21 Dec 2005 by old_user2354
Post:
It's still happening to one of my Linux boxes. Any news from the staff about this problem? It's really frustrating to see another WU crash when I'm booting the PC...

[edit: Tpyo]
4) Message boards : Number crunching : sulphur model - Linux - Signal 11 (Message 18115)
Posted 12 Dec 2005 by old_user2354
Post:

The ID that you linked to crashed on the 9th with a signal 11. Must have been when the preference for "leave applications in memory when pre-empted" was set to "no", as that is the error that often occurs with a task switch or benchmark with that preference setting.


I think you're wrong on this point: I *got* this WU on 9 December. It was received (as in reported) today (12 Dec). And it's the most recent 'completed' WU from this computer, so I'm sticking with this result.


I've sent an e-mail to Tolu about the signal 11 errors with links to the forum threads on it. It may be a tough one to track down though.


It's certainly necessary to get this bug out of the programs. It would be really bad if a WU crashed sometime into the 3rd or 4th phase. That wouldn't be so easy to restore. I just hope the devs find out what's causing this bug. If they need some of the crashed directories, I have at least the one from the WU I already mentioned.

But now I'd better get some sleep ;-)
5) Message boards : Number crunching : sulphur model - Linux - Signal 11 (Message 18109)
Posted 12 Dec 2005 by old_user2354
Post:

Andre,
Which ResultID crashed?

I think it was this one. It's the right computer and the time is right, too.

I would really appreciate it if a developer could look into this matter, as it seems there's something seriously wrong with sulphur 4.22 for Linux :-/
6) Message boards : Number crunching : sulphur model - Linux - Signal 11 (Message 18083)
Posted 12 Dec 2005 by old_user2354
Post:
I, too, have something to report. I switched 'Leave Apps in memory' on just for CPDN. This morning, just a few minutes ago, another CPDN WU crashed when I started my PC. So please look into the program. There's a bug in there.
7) Message boards : Number crunching : sulphur model - Linux - Signal 11 (Message 17925)
Posted 9 Dec 2005 by old_user2354
Post:
CJOrtega wrote:
Is this a problem with Linux, the Linux api, or with the sulphur model?


I guess it's something in the sulphur code. I tried running only sulphur on one of my machines, but unfortunately that didn't work either. Not everyone can leave all applications in memory, so I think the developers should look at what in the code is causing a segmentation violation (signal 11).

I'm still waiting for a response from a developer, but none has replied to my message here.
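
For reference, signal 11 on Linux is SIGSEGV, the segmentation violation mentioned above. The following is only a minimal Python sketch, assuming a POSIX system; the helper names `describe_signal` and `run_and_report` and the self-crashing child command are hypothetical and have nothing to do with BOINC or the sulphur application itself. It shows how a process that was killed by a signal can be recognised from its exit status:

```python
import signal
import subprocess
import sys

# Hypothetical stand-in for a crashing application: a child Python
# process that sends SIGSEGV (signal 11) to itself.
CRASHING_CHILD = [
    sys.executable,
    "-c",
    "import os, signal; os.kill(os.getpid(), signal.SIGSEGV)",
]


def describe_signal(number):
    """Return the symbolic name of a signal number, e.g. 11 -> 'SIGSEGV'."""
    return signal.Signals(number).name


def run_and_report(cmd):
    """Run cmd and return the name of the signal that killed it, or None.

    On POSIX, subprocess reports death by signal N as returncode -N.
    """
    result = subprocess.run(cmd)
    if result.returncode < 0:
        return describe_signal(-result.returncode)
    return None


if __name__ == "__main__":
    print(run_and_report(CRASHING_CHILD))
```

This only identifies which signal ended a process; it says nothing about why the sulphur model segfaults when it is suspended.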
8) Questions and Answers : Unix/Linux : Sulphur crash after project switch (Message 17707)
Posted 4 Dec 2005 by old_user2354
Post:
version 5 does seem to be a bit better about suspending/resuming CPDN.

I can't completely agree with this. Both of my Linux boxes (SUSE 10.0 and Debian stable) run 5.2.13, and they still crash sulphur WUs pretty quickly. In fact, I haven't got any sulphur WU to trickle yet. It seems to me that there is some sort of bug in the sulphur client that makes it crash when the model is paused. That makes CPDN quite hard to use on Linux :-(
9) Message boards : Number crunching : sulphur version 4.22 released (Message 17613)
Posted 1 Dec 2005 by old_user2354
Post:
version 4.22 of the sulphur cycle is available.
We've improved the performance somewhat.
PS: Note there will shortly be a Mac release (before the weekend)
as well as a new Advanced viz update to resolve some of the current quirks.


I just hope that version doesn't hit the signal 11 that I got in this, this and this result. They all crashed when they were about to be suspended. I'm wondering whether something is wrong with my PC, given that it crashes sulphur WUs so often...
10) Message boards : Number crunching : Sulphur model hung (Message 17429)
Posted 25 Nov 2005 by old_user2354
Post:
Might it be possible you're having a problem similar to mine?
BTW, the dialog box that I got was one from the Fortran system.
11) Message boards : Number crunching : Sulphur crashed after phase 1 (Message 16459)
Posted 5 Oct 2005 by old_user2354
Post:
Well, I tried restoring from just before the end of phase 1 (in fact, I have a backup of the last checkpoint before the end), but it always errored out again. I'm not running any other projects on that PC because the LAN port seems to have trouble.
BTW, the error message that pops up is:

forrtl: severe (24): end-of-file during read, unit 20 file
(Path to boinc)\projects\climateprediction.net\467k_200294944\fort.20

Image              PC        Routine    Line     Source
sulphur_se_4.19_w  004F497F  Unknown    Unknown  Unknown
sulphur_se_4.19_w  004E1C8A  Unknown    Unknown  Unknown
sulphur_se_4.19_w  004E0999  Unknown    Unknown  Unknown
sulphur_se_4.19_w  004E0EC4  Unknown    Unknown  Unknown
sulphur_se_4.19_w  004D722E  Unknown    Unknown  Unknown
sulphur_se_4.19_w  0043BE9E  Unknown    Unknown  Unknown
sulphur_se_4.19_w  00435B8E  _anc_fld_  2381     pptoanc1.f
sulphur_se_4.19_w  004340E5  _pptoanc_  981      pptoanc1.f
sulphur_se_4.19_w  00403C2E  Unknown    Unknown  Unknown
sulphur_se_4.19_w  00403947  Unknown    Unknown  Unknown
sulphur_se_4.19_w  0052BA5B  Unknown    Unknown  Unknown
kernel32.dll       7C816D4F  Unknown    Unknown  Unknown
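
Most frames in a traceback like this are "Unknown"; the useful ones are the few that name a routine and a source line. As a rough sketch (the function name `known_frames` and the shortened `TRACE` sample are illustrative assumptions, not part of any CPDN or Intel Fortran tooling), a few lines of Python can pick those frames out of the pasted text:

```python
import re

# One traceback row: image name, 8-digit hex PC, routine, line, source file.
FRAME = re.compile(r"^(\S+)\s+([0-9A-Fa-f]{8})\s+(\S+)\s+(\S+)\s+(\S+)$")

# Shortened sample taken from the traceback above.
TRACE = """\
Image PC Routine Line Source
sulphur_se_4.19_w 0043BE9E Unknown Unknown Unknown
sulphur_se_4.19_w 00435B8E _anc_fld_ 2381 pptoanc1.f
sulphur_se_4.19_w 004340E5 _pptoanc_ 981 pptoanc1.f
kernel32.dll 7C816D4F Unknown Unknown Unknown
"""


def known_frames(trace):
    """Return (routine, line, source) for frames whose routine and line are known."""
    frames = []
    for row in trace.splitlines():
        match = FRAME.match(row.strip())
        if not match:
            continue  # header row or anything that is not a frame
        _image, _pc, routine, line_no, source = match.groups()
        if routine != "Unknown" and line_no.isdigit():
            frames.append((routine, int(line_no), source))
    return frames


if __name__ == "__main__":
    for routine, line_no, source in known_frames(TRACE):
        print(f"{routine} at {source}:{line_no}")
```

For the traceback above this leaves only `_anc_fld_` (pptoanc1.f, line 2381) and its caller `_pptoanc_` (pptoanc1.f, line 981), which is where the end-of-file read on unit 20 was reported.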
12) Message boards : Number crunching : Sulphur crashed after phase 1 (Message 16456)
Posted 5 Oct 2005 by old_user2354
Post:
This result just crashed with a message box indicating a Fortran error. (I could put a screenshot online if that helps.) I'm wondering what to do now. I'd like to upload at least the results of phase 1. It would be even better if I could somehow continue the WU (I have backups of my BOINC directory from the last 2 days)...

So, what can I do now?
13) Message boards : Number crunching : Is HyperThreading BAD for Climate? (Message 16173)
Posted 22 Sep 2005 by old_user2354
Post:
There seem to be a lot of cache misses and page faults when I run Climate BOINC on a two physical Xeon configuration with HyperThreading on. The smaller projects, like SETI and Protein, seem to do just fine with HyperThreading, but has anyone confirmed that Climate might do better with HyperThreading off?


Well, I know that two climate WUs running together on my P4 HT system perform quite badly. I think it might be because the two programs are trying to use the same 'parts' of the CPU and thus end up in a bottleneck. With enough other projects, it's quite easy to run climate together with some other work.

Just a non-professional opinion



