climateprediction.net home page
Posts by pe

1) Message boards : Number crunching : exit code 193 (0xc1) (Message 43300)
Posted 26 Oct 2011 by pe
Post:
Hi Greg,

Thank you for your input.
I read a bit of the page you linked. It seems my memory and hard disks are OK.
These two WUs were the first in a long time to error out.

Did you also notice that the tasks did not suspend properly?

greetz, pe.
2) Message boards : Number crunching : exit code 193 (0xc1) (Message 43269)
Posted 25 Oct 2011 by pe
Post:
Hi there,

Two of the models my computer was crunching crashed recently. The second has not yet been reported as crashed because the project is offline.
The first one:
name: hadcm3n_p3yz_1940_40_007420580_1
It states: exit code 193 (0xc1)
<stderr_txt>
etVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
.....

I did spot some strange behaviour before the models crashed: their status in BOINC Manager was 'waiting to run', but their elapsed times still changed and they still used half a core on my quad-core system. Then, after I rebooted the system (for an unrelated reason), the work units errored out. The first was about 80% complete and the second about 60%.
So I think there might be a problem with pausing and resuming the work units properly. (They are left in memory while suspended.)

Is anything known that causes these problems?

greetz, pe.