Unrecoverable error for result hadcm3ohc_0pmr_05581578_1 ( - exit code 1073807364 (0x40010004))

old_user206348

Joined: 31 Oct 06
Posts: 3
Credit: 22,317
RAC: 0
Message 27557 - Posted: 27 Mar 2007, 15:41:46 UTC
Last modified: 27 Mar 2007, 15:54:04 UTC

what happened? it was going along just fine & then suddenly:

3/26/2007 5:09:12 PM|climateprediction.net|Resuming task hadcm3ohc_0pmr_05581578_1 using hadcm3 version 515 ... ...
3/26/2007 8:16:10 PM|climateprediction.net|Unrecoverable error for result hadcm3ohc_0pmr_05581578_1 ( - exit code 1073807364 (0x40010004))
3/26/2007 8:16:10 PM|climateprediction.net|Deferring scheduler requests for 1 minutes and 0 seconds ...
3/26/2007 8:16:14 PM||Rescheduling CPU: application exited
3/26/2007 8:16:14 PM|climateprediction.net|Computation for task hadcm3ohc_0pmr_05581578_1 finished ... ...
3/26/2007 10:40:14 PM|climateprediction.net|Sending scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
3/26/2007 10:40:14 PM|climateprediction.net|Reason: To report completed tasks
3/26/2007 10:40:14 PM|climateprediction.net|Reporting 1 tasks
3/26/2007 10:40:24 PM|climateprediction.net|Scheduler request succeeded

what does this mean?

it had been crunching for nearly 5mos. & was ~52% complete! now all of that is lost?!
astroWX
Volunteer moderator
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 27566 - Posted: 27 Mar 2007, 19:00:30 UTC

It's lost if you don't have a backup of the entire boinc folder.
It's recoverable with your backup copy if you have one.

The error is the ever popular -107... (Thanks for posting the pertinent details.)

Mike's post suggests ways to avoid crashes (Solutions to models crashing: -161 error, or -1073741819 (0xc0000005)):
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=4231
Les' comments for Exit Code -1 and -107... here:
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=4710#23372
Thyme Lawn's Testing Graphics Compatibility & driver update:
http://bbc.cpdn.org/forum_thread.php?id=1038&nowrap=true#3977
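
The exit codes in this thread appear both as decimal numbers and as hex values, and some are quoted as negative numbers. A minimal Python sketch of how the forms relate, assuming the codes are 32-bit values (the function name is hypothetical and not part of BOINC):

```python
def describe_exit_code(code: int) -> dict:
    """Return the unsigned, signed 32-bit, and hex views of an exit code."""
    unsigned = code & 0xFFFFFFFF  # wrap negative values into 0..2**32-1
    # Values with the top bit set read as negative in signed 32-bit form.
    signed = unsigned - 0x100000000 if unsigned >= 0x80000000 else unsigned
    return {"unsigned": unsigned, "signed": signed, "hex": f"0x{unsigned:08x}"}


# The code from this thread's title, and the one from Mike's post.
print(describe_exit_code(1073807364))
print(describe_exit_code(-1073741819))
```

This only shows that 1073807364 and 0x40010004 are the same number, and likewise -1073741819 and 0xc0000005; it says nothing about what caused the crash.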
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
old_user206348
Joined: 31 Oct 06
Posts: 3
Credit: 22,317
RAC: 0
Message 27590 - Posted: 28 Mar 2007, 3:51:01 UTC - in response to Message 27566.  
Last modified: 28 Mar 2007, 3:59:04 UTC

thanks for the info.

unfortunately, i haven't ever made a backup. :'-( oh well. if i decide to start a new task, i'll definitely begin making backups regularly.

so... will the partially completed data be used at all, or does the task have to be 100% complete for the data to be valid?

old_user17289
Joined: 13 Sep 04
Posts: 228
Credit: 354,979
RAC: 0
Message 27591 - Posted: 28 Mar 2007, 4:21:12 UTC

As far as I know, for results that have advanced as far as yours, the data is definitely usable and used.

Regarding backups, I used to do weekly backups, but after several crashes in the last few weeks, having to repeat an entire week of processing became too much. So I now do a backup after every trickle (about every 1½ days).

Important: when you do the backup, the model and BOINC manager must not be running. Back up the entire C:\Program Files\BOINC folder.
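
A minimal Python sketch of such a backup, assuming you have already exited BOINC (the script does not check for that). The function name, the destination folder, and the time-stamped naming scheme are illustrative assumptions, not the method from the project READMEs:

```python
import shutil
import time
from pathlib import Path


def backup_boinc(src: str, dest_root: str) -> Path:
    """Copy the whole BOINC folder into a new time-stamped folder under dest_root."""
    source = Path(src)
    if not source.is_dir():
        raise FileNotFoundError(f"BOINC folder not found: {source}")
    stamp = time.strftime("%Y%m%d-%H%M%S")
    target = Path(dest_root) / f"BOINC-backup-{stamp}"
    # copytree copies everything and raises an error if target already exists.
    shutil.copytree(source, target)
    return target


# Example call (both paths are assumptions; adjust to your machine):
# backup_boinc(r"C:\Program Files\BOINC", r"D:\boinc-backups")
```

Keeping each backup in its own dated folder means an older copy is still there if the newest one turns out to have been taken after the model had already gone wrong.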
mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 27596 - Posted: 28 Mar 2007, 12:20:43 UTC

Yes, all the results are useful and are used, particularly full 10-year periods (which have uploaded the bigger decadal trickles).

For easy backup instructions by Les, go through my signature to the project READMEs. In the README about avoiding crashes see item #1.

There's also a whole README about backups, with a selection of methods.
Cpdn news
old_user206348
Joined: 31 Oct 06
Posts: 3
Credit: 22,317
RAC: 0
Message 27634 - Posted: 30 Mar 2007, 13:32:49 UTC

thanks for the replies & info.
old_user211598
Joined: 4 Dec 06
Posts: 2
Credit: 14,174
RAC: 0
Message 28087 - Posted: 24 Apr 2007, 10:50:46 UTC

Hi there,

I keep getting this, three times in a row now, with different downloaded tasks:

24.04.2007 10:57:36|climateprediction.net|Reason: Unrecoverable error for result hadcm3inct_cksw_1920_160_05872300_1 (The device does not recognize the command. (0x16) - exit code 22 (0x16))

I don't know if it is the same error or another one, but it happens not long after the task starts, maybe three to four hours.

Please help me,

Woods
Pooh Bear 27
Joined: 5 Feb 05
Posts: 465
Credit: 1,914,189
RAC: 0
Message 28089 - Posted: 24 Apr 2007, 12:03:25 UTC - in response to Message 28087.  


You do not have enough memory to run Climate. A minimum of 512 MB is needed; you only have 384 MB.

Until you can upgrade your memory, stop running Climate models.

old_user211598
Joined: 4 Dec 06
Posts: 2
Credit: 14,174
RAC: 0
Message 28729 - Posted: 16 May 2007, 8:51:49 UTC - in response to Message 28089.  



Thank you. I just upgraded memory and it works just fine.
mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 28765 - Posted: 17 May 2007, 16:58:06 UTC
Last modified: 17 May 2007, 16:59:26 UTC

Adding extra memory was a very good idea.

There's lots of useful advice about avoiding model crashes in the project READMEs (link in my signature). But don't read everything.....

In the Running the model README, I recommend the top tips.

And in the README about avoiding crashes, I recommend item #1 by Les about how to back up the contents of the boinc folder (so if your model crashes you can restore the backup and continue the same model). Plus item #5 by Mike (what we must do and what we mustn't do!).
Cpdn news
old_user5628
Joined: 31 Aug 04
Posts: 4
Credit: 594,388
RAC: 0
Message 28953 - Posted: 24 May 2007, 23:51:12 UTC - in response to Message 28087.  



I am getting the same error and I have 1.5GB of RAM and 15 GB free space on my HD -- any ideas would be appreciated....

Doing my small part for scientific research



MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 28959 - Posted: 25 May 2007, 7:47:43 UTC

You've had one of these (which I've never seen before; disk problems?):

<core_client_version>5.8.16</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Not a JPEG file: starts with 0x01 0xda
Not a JPEG file: starts with 0x01 0xda

</stderr_txt>
]]>



And the rest were these (all crashed quite quickly):

...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=408, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=408, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=408, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=408, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=408, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=408, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(

</stderr_txt>
]]>


I have a few questions...

* Which antivirus do you use? Sophos can be a problem.
* Do you run anything which takes 100% of CPU time for a long while (e.g., games, video encoding, ...)?
* Is there anything in the Boinc message log from the time that these crashed?
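
For the last question, here is a rough Python sketch for pulling the "Unrecoverable error" lines out of a saved copy of the message log. It assumes the pipe-separated layout seen in the logs posted in this thread (timestamp|project|message); the regular expression and the function name are my own and not anything BOINC provides:

```python
import re

# Matches e.g. "Unrecoverable error for result NAME (TEXT - exit code 22 (0x16))"
ERROR_RE = re.compile(
    r"Unrecoverable error for result (?P<result>\S+) "
    r"\((?P<text>.*?)- exit code (?P<code>-?\d+) \((?P<hex>0x[0-9a-fA-F]+)\)\)"
)


def find_errors(log_lines):
    """Yield (timestamp, result name, exit code) for each unrecoverable-error line."""
    for line in log_lines:
        match = ERROR_RE.search(line)
        if match:
            timestamp = line.split("|", 1)[0]  # text before the first pipe
            yield timestamp, match.group("result"), int(match.group("code"))
```

Lines without an "Unrecoverable error" entry are skipped, so the output is a short list of what failed and when, which is easier to post here than the whole log.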

We think that exit code 22 'hides' the original error, which is why there are so many of them (as the error code -161s used to).

I'm a volunteer and my views are my own.
News and Announcements and FAQ
old_user5628
Joined: 31 Aug 04
Posts: 4
Credit: 594,388
RAC: 0
Message 28960 - Posted: 25 May 2007, 8:19:22 UTC - in response to Message 28959.  




I use Norton and this system is used mainly for browsing and BOINC - nothing which would use 100% of CPU time. Here is the log for the most recent unit:

5/24/2007 11:37:43 AM|climateprediction.net|Restarting task hadcm3inct_co7i_1920_160_35871578_1 using hadcm3i version 540
5/24/2007 11:40:21 AM|climateprediction.net|Deferring communication for 1 min 0 sec
5/24/2007 11:40:21 AM|climateprediction.net|Reason: Unrecoverable error for result hadcm3inct_co7i_1920_160_35871578_1 (The device does not recognize the command. (0x16) - exit code 22 (0x16))
5/24/2007 11:40:27 AM|climateprediction.net|Computation for task hadcm3inct_co7i_1920_160_35871578_1 finished
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_1.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_2.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_3.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_4.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_5.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_6.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_7.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_8.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_9.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_10.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_11.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_12.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_13.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_14.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_15.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|climateprediction.net|Output file hadcm3inct_co7i_1920_160_35871578_1_16.zip for task hadcm3inct_co7i_1920_160_35871578_1 absent
5/24/2007 11:40:27 AM|SETI@home|Resuming task 04mr05ab.17213.19520.90902.3.99_3 using setiathome_enhanced version 515

Any ideas?
Doing my small part for scientific research

MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 28964 - Posted: 25 May 2007, 13:03:01 UTC



What was just before the restarting task bit? Was that when Boinc was started?

5/24/2007 11:37:43 AM|climateprediction.net|Restarting task hadcm3inct_co7i_1920_160_35871578_1 using hadcm3i version 540

I'm a volunteer and my views are my own.
News and Announcements and FAQ
old_user5628
Joined: 31 Aug 04
Posts: 4
Credit: 594,388
RAC: 0
Message 29051 - Posted: 29 May 2007, 4:46:50 UTC - in response to Message 28964.  




This was when switching projects...This machine is running Seti, Einstein, ABC and this project.
Doing my small part for scientific research

MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 29055 - Posted: 29 May 2007, 7:09:37 UTC


Norton antivirus looks like it could be a cause; see the 'crashes and other problems' README posting in the following forum.

You'll need to add the Boinc folder to *both* of Norton's exclusion lists, so that it doesn't scan the Boinc folder (and lock the files while Boinc is trying to use them).

http://www.climateprediction.net/board/viewforum.php?f=36
I'm a volunteer and my views are my own.
News and Announcements and FAQ
old_user5628
Joined: 31 Aug 04
Posts: 4
Credit: 594,388
RAC: 0
Message 29139 - Posted: 4 Jun 2007, 5:37:53 UTC - in response to Message 29055.  




OK, tried this and still got a code 22... Any other suggestions? I've put Climate on hold for now till this gets figured out....
Doing my small part for scientific research

MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 29140 - Posted: 4 Jun 2007, 7:04:05 UTC


It might also be worth trying some of the other suggestions in the posts, for example updating OpenGL and your graphics drivers to the most recent version (as in the instructions for the -107... errors), suspending boinc before running games/video encoding, and exiting boinc before shutdown.

At the moment most of the error codes are being hidden by the error code 22s. For example, the actual error might be a 1073807364 (0x40010004), but the one being shown is invariably 22 (0x16), which makes it hard to diagnose specific problems.

I'm a volunteer and my views are my own.
News and Announcements and FAQ

©2024 climateprediction.net