climateprediction.net home page
UGH! Unrecoverable error!!!

UGH! Unrecoverable error!!!

Questions and Answers : Windows : UGH! Unrecoverable error!!!
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user176417

Send message
Joined: 17 Mar 06
Posts: 14
Credit: 81,978
RAC: 0
Message 28698 - Posted: 14 May 2007, 21:50:11 UTC

Well thats a first for me. I have been running this model since last fall and was within 600hrs of getting done. I get home today and notice the \"time to completion\" was over 1900hrs again. And the CPU time was only like 4 hrs. I checked the log and found that at 11AM or so CDT the model had hit an \"unrecoverable error\". So now I am back to square one. Hope they got some good data.

Would backing up my CP data help avoid this??
ID: 28698 · Report as offensive     Reply Quote
Profile MikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 28700 - Posted: 14 May 2007, 22:28:05 UTC


The data will be very useful, it\'s good that you got past 2043 (2040 is when a \'restart dump\' is uploaded to the server). Once the software is written, this will allow someone to rerun your model from 2040 onwards.

Your model crashed with an access violation, if you\'d had a backup you could have restored from that and continued again (not all crashes can be restored from backup, there are some which will always stop at the same point). If you do a backup it needs to be of the entire Boinc directory structure, when Boinc is shut down.
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 28700 · Report as offensive     Reply Quote
Profile mo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 28701 - Posted: 14 May 2007, 23:30:33 UTC

Zoomer, through my sig you can get to the READMEs. One of them is just about backups with a selection of methods all suitable for cpdn.
Cpdn news
ID: 28701 · Report as offensive     Reply Quote
old_user176417

Send message
Joined: 17 Mar 06
Posts: 14
Credit: 81,978
RAC: 0
Message 28739 - Posted: 16 May 2007, 16:49:54 UTC

Ok I had backed up a few nights ago as a test but I did it wrong (I need to learn how to READ). What I did was just back up the CP folder from the Boinc folder. Did not work of course, when I tried to restore the folder as a test of the method the thing got stuck in an infinate loop of trying to communicate and gettting deferred. Finally just aborted it, cleared it out and got a fresh model to run.

NOW when I backup I copy the whole Boinc folder over to my other HD and I also burn it to a CD. Dont want to lose a model run again :)
ID: 28739 · Report as offensive     Reply Quote
Profile mo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 28764 - Posted: 17 May 2007, 16:51:32 UTC

If you go to the project READMEs thro the link in my signature, you\'ll find one that\'s all about backups. I use Les\'s click-by-click easy manual method, but there are more sophisticated methods too. Les\'s method also works for restoring backups.

The READMEs are also at the top of the Number crunching section of this forum.
Cpdn news
ID: 28764 · Report as offensive     Reply Quote

Questions and Answers : Windows : UGH! Unrecoverable error!!!

©2024 climateprediction.net