climateprediction.net home page
Task stored idle on the local disk forever?

Task stored idle on the local disk forever?

Questions and Answers : Windows : Task stored idle on the local disk forever?
Message board moderation

To post messages, you must log in.

AuthorMessage
alnitak

Send message
Joined: 9 Sep 08
Posts: 1
Credit: 1,265,194
RAC: 124
Message 35218 - Posted: 10 Oct 2008, 8:44:50 UTC

I\'d like to ask for a task stored on my local disk. I had two cores set in my virtual machine (Win XP under VMWare) and there were two tasks of climateprediction running. Then I had to set my virtual machine to use only one core. So only one task was running and the second one remained stored on the disk. When I was able to set again two cores for my virtual machine, a new task of climateprediction was downloaded and there seems to be no way to start computing of the stored task again. It\'s a pity because there were tens of hours already computed in it. Is there a way to run this task again? Thanks in advance for any answer.
ID: 35218 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 35221 - Posted: 10 Oct 2008, 21:34:21 UTC

Four Models assigned to the machine. One crashed in September, one today. From today\'s error report:
Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO

I\'m not familiar with virtual machines but I suspect an incomplete backup when the VM was set to use a single core. (Perhaps a VM user will add a correction to this.)
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 35221 · Report as offensive     Reply Quote
Profile Ananas
Volunteer moderator

Send message
Joined: 31 Oct 04
Posts: 336
Credit: 3,316,482
RAC: 0
Message 35222 - Posted: 10 Oct 2008, 22:55:24 UTC
Last modified: 10 Oct 2008, 23:06:01 UTC

Several \"no heartbeat\" messages too, either BOINC didn\'t get it right when one CPU VP has been taken away and didn\'t communicate with one model anymore or they have been a result of that evil bug that makes BOINC unresponsive for several minutes when a DNS server is unreachable.

That DNS bug is a real WU killer and can destroy all work in a complete network within a few hours. If you catch such a problem quick enough, it will probably help to disable network access until the DNS server is back.


edit : If it has been a problem of the changing CPU VPs, it might help to

- reduce the \"On multiprocessors, use at most\" setting to the new count
- make BOINC contact the CPDN server
- stop BOINC
- change the CPU VP setting of the VM
- restart BOINC

The difference would be, that the additional task is not sleeping in memory anymore - a task that doesn\'t run will usually not crash so easily.

I\'m not sure if the first two steps are necessary - but in this case I would do it anyway, just in case.
ID: 35222 · Report as offensive     Reply Quote

Questions and Answers : Windows : Task stored idle on the local disk forever?

©2024 climateprediction.net