1)
Questions and Answers :
Unix/Linux :
Model ran fine up to 53% then ran into trouble
(Message 33006)
Posted 17 Mar 2008 by old_user204722 Post: Paul, Here\'s the work unit (unit ID is 6106824) http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=6106824 And here\'s the cpu ID (493017) http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=493017 Is there an easy way to restart a work unit from the start (delete some key status file and restart the app)? I had a power failure at the wrong time last Dec and it might have gotten into a state it couldn\'t recover from. If I recall correctly, it doesn\'t (or rather didn\'t) compute past 10:30am somewhere in the year 2043. Good luck. |
2)
Questions and Answers :
Unix/Linux :
Model ran fine up to 53% then ran into trouble
(Message 32996)
Posted 16 Mar 2008 by old_user204722 Post: Hi all, I\'ve run several models successfully in the past, however hadcm3iozn_cpzl_2000_80_135899450_3 started off like all the others, and ran to 53%. It would continue to run, however it would not progress beyond a certain point in model time. My stats for CPDN went from a 50 degree slope to flatline and stayed there for about 2 months. I had to abort this work unit. questions include: - who else has had this kind of problem? - how should I have handled this (e.g. notify the boinc team with the work in progress files?) so the root cause could be addressed? - would it have been better to figure out how to restart the unit from the beginning? - do I just abort and let it get another unit? |
©2024 climateprediction.net