Posts by enginerd


1) Message boards : Number crunching : Sulphur units constantly failing (Message 21241)
Posted 13 Mar 2006 by enginerd
Post:
>>This is why big business make daily backups of their computer data.

so does my small business...

unfortunately, after the last run failed, i didn't want to touch this wu at all. however, i had to restart the computer, and it failed within about 6 hours.
2) Message boards : Number crunching : Sulphur units constantly failing (Message 21237)
Posted 13 Mar 2006 by enginerd
Post:
i just had one die halfway through phase 4 - any help??!? this is the second of my sulphur units to fail. it was after a restart, but i suspended cpdn first. no backup. :(
result # 1754289


<core_client_version>5.2.13</core_client_version>
<message><file_xfer_error>
<file_name>sulphur_j55s_100893152_0_4.zip</file_name>
<error_code>-161</error_code>
<error_message></error_message>
</file_xfer_error>
<file_xfer_error>
<file_name>sulphur_j55s_100893152_0_5.zip</file_name>
<error_code>-161</error_code>
<error_message></error_message>
</file_xfer_error>

</message>
3) Message boards : Number crunching : end process (Message 19608)
Posted 24 Jan 2006 by enginerd
Post:
ok Thyme Lawn

thanks for offering to help, i found sulphur_e0yj_000654427.zip and the client_state.xml file, but am unclear about what to do with them. any help (from you or anybody else) would be greatly appreciated.

-christo

ps. i also pm'd you on the TP forums.
4) Message boards : Number crunching : end process (Message 19474)
Posted 20 Jan 2006 by enginerd
Post:
>>have you got a backup copy of the BOINC folder?

ummm.....
from before the wu started!

is there any way to delete some recent output files to trick boinc into letting me start this wu again?? 40.5 days is a lot of lost crunching.

actually the process i ended was a non-running sulphur thread (aborted long ago) that was still present in the task manager. when i accidentally killed it, it took the running thread with it.
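
The backup question in this thread comes down to keeping a dated copy of the whole BOINC folder, so that a long-running workunit can be rolled back to an earlier state. A minimal sketch of that idea follows. The function name and both folder paths are hypothetical, and it assumes BOINC has been exited before the copy is made, since copying files that are still being written is not safe.

```python
import shutil
import time
from pathlib import Path


def backup_boinc_folder(boinc_dir, backup_root, stamp=None):
    """Copy the whole BOINC folder into a new, timestamped directory.

    Assumption: the BOINC client is not running while this copy is made.
    """
    src = Path(boinc_dir)
    if not src.is_dir():
        raise FileNotFoundError(f"BOINC folder not found: {src}")

    # A timestamp in the name keeps several generations of backups apart.
    stamp = stamp or time.strftime("%Y%m%d-%H%M%S")
    dest = Path(backup_root) / f"BOINC-backup-{stamp}"

    # copytree refuses to write into an existing directory, so an older
    # backup is never silently overwritten.
    shutil.copytree(src, dest)
    return dest


# Hypothetical usage (both paths are assumptions, adjust to your install):
# backup_boinc_folder(r"C:\Program Files\BOINC", r"D:\backups")
```

Restoring would be the reverse copy, again with BOINC shut down. Whether the project accepts a result resumed from an older copy is not something this sketch can guarantee.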
5) Message boards : Number crunching : end process (Message 19469)
Posted 20 Jan 2006 by enginerd
Post:
how exactly can i restore?
6) Message boards : Number crunching : end process (Message 19454)
Posted 20 Jan 2006 by enginerd
Post:
i accidentally did an "end this process" on my sulphur workunit (stupid non-optical mouse had dirt all in it). are there any options to get this workunit going again? can i back it up a few model days??? or is all hope lost?

;-^-(

sulphur_e0yj_000654427_0

Result ID 1329886
Workunit 871996
7) Message boards : Number crunching : dammit (Message 17633)
Posted 2 Dec 2005 by enginerd
Post:
is there any way i can restart this result and try to let it finish? sorry for the hassle but i lost 18 days of compute time.
8) Message boards : Number crunching : dammit (Message 17631)
Posted 2 Dec 2005 by enginerd
Post:
so my workunit (754048) errored out with 3:43 left to go when someone printed to this computer. oh well, only a couple hundred hours lost!!! any idea why?? for some reason it was removed from memory just before it completed. this has never been a problem before.

Exit status -5 (0xfffffffb)


<core_client_version>4.45</core_client_version>
<message> - exit code -5 (0xfffffffb)
</message>
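
The two forms of the exit status in the report above, -5 and 0xfffffffb, are the same 32-bit value: the hex form is the two's-complement bit pattern of the signed number. A short sketch of the conversion in both directions (the helper names are made up for illustration):

```python
def to_unsigned32(code):
    """Return the 32-bit two's-complement bit pattern of a signed exit code."""
    return code & 0xFFFFFFFF


def to_signed32(value):
    """Interpret a 32-bit bit pattern as a signed integer."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value represents a negative number.
    return value - 0x100000000 if value & 0x80000000 else value


print(f"{to_unsigned32(-5):#010x}")   # 0xfffffffb
print(to_signed32(0xFFFFFFFB))        # -5
```

This only explains the notation. It says nothing about why the application exited with that code.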

9) Message boards : Number crunching : Windows shutdown procedure (Message 17318)
Posted 23 Nov 2005 by enginerd
Post:
>>Does the shutdown procedure in the FAQ still apply to the latest version of Boinc (5.2.7)? Or is it safe now to shut down without exiting the program first?


i pretty much just shut down whenever. the computer stays on mon-fri and i haven't had any problems. but i think it goes:

suspend hadsm
quit boinc
restart

or just

suspend hadsm
restart
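
The first sequence above (suspend, quit BOINC, then restart the machine) could be scripted roughly as follows. This is a sketch under assumptions: it uses the `boinccmd` command-line tool that ships with later BOINC clients, and nothing in this thread confirms that 5.2.7 has an equivalent. It suspends the whole project rather than a single model. The project URL is a placeholder you would replace with the one shown in your own client. By default it only prints the commands.

```python
import subprocess


def shutdown_commands(project_url, boinccmd="boinccmd"):
    """Build the suspend-then-quit command sequence (nothing is executed)."""
    return [
        [boinccmd, "--project", project_url, "suspend"],  # stop crunching first
        [boinccmd, "--quit"],                             # then ask the client to exit
    ]


def run_shutdown(project_url, dry_run=True):
    """Print the commands, or run them in order when dry_run is False."""
    cmds = shutdown_commands(project_url)
    for cmd in cmds:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True stops the sequence if a step fails.
            subprocess.run(cmd, check=True)
    return cmds


# Hypothetical usage; the URL is a placeholder, not a confirmed project URL:
# run_shutdown("http://example.invalid/", dry_run=True)
```

Restarting Windows itself is left as a manual step, so the script never reboots the machine on its own.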
10) Questions and Answers : Windows : runaway WU (Message 15057)
Posted 12 Aug 2005 by enginerd
Post:
ok, boinc has reset. client error. too bad it didn't figure that out for a week, that's half a w.u.
11) Questions and Answers : Windows : runaway WU (Message 15039)
Posted 11 Aug 2005 by enginerd
Post:
hi
my model (hadsm4.13, boinc 4.45, 0xan_100063606_1) was happily crunching at 56% complete until about a week ago, with 200-something hours left. now it shows 26% complete with 850 hours to go, WTF!!! it hasn't trickled for about a week. can anyone speculate on the cause? should i cancel this wu? it has created 720MB of files on my computer, and it should have finished by now.
thanks
christo
p4 3.2ghz
ddr2 533mhz 512x2
xp pro sp2
12) Questions and Answers : Windows : wu uploaded, but status=in progress (Message 11425)
Posted 25 Mar 2005 by enginerd
Post:
hi
i recently finished a model, but when i check it in my account page it says "in progress." there are 72 trickles, and i got credit, but why does the model outcome say "unknown?" i have the log where it says the wu was uploaded...

2005-03-10 01:38:09 [climateprediction.net] Restarting result 2jfg_100139691_4 using hadsm3 version 4.04
2005-03-10 02:17:32 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2005-03-10 02:17:35 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2005-03-10 02:17:35 [climateprediction.net] Project prefs: using your defaults
2005-03-10 02:20:25 [climateprediction.net] Computation for result 2jfg_100139691 finished
2005-03-10 02:20:27 [climateprediction.net] Started upload of 2jfg_100139691_4_1.zip
2005-03-10 02:20:27 [climateprediction.net] Started upload of 2jfg_100139691_4_2.zip
2005-03-10 02:21:37 [climateprediction.net] Finished upload of 2jfg_100139691_4_1.zip
2005-03-10 02:21:37 [climateprediction.net] Throughput 22146 bytes/sec
2005-03-10 02:21:37 [climateprediction.net] Started upload of 2jfg_100139691_4_3.zip
2005-03-10 02:21:53 [climateprediction.net] Finished upload of 2jfg_100139691_4_2.zip
2005-03-10 02:21:53 [climateprediction.net] Throughput 23028 bytes/sec
2005-03-10 02:21:53 [climateprediction.net] Started upload of 2jfg_100139691_4_4.zip
2005-03-10 02:23:07 [climateprediction.net] Finished upload of 2jfg_100139691_4_4.zip
2005-03-10 02:23:07 [climateprediction.net] Throughput 21768 bytes/sec
2005-03-10 02:23:07 [climateprediction.net] Started upload of 2jfg_100139691_4_5.zip
2005-03-10 02:23:14 [climateprediction.net] Finished upload of 2jfg_100139691_4_3.zip
2005-03-10 02:23:14 [climateprediction.net] Throughput 23725 bytes/sec
2005-03-10 02:23:20 [climateprediction.net] Finished upload of 2jfg_100139691_4_5.zip
2005-03-10 02:23:20 [climateprediction.net] Throughput 23377 bytes/sec
2005-03-10 03:20:26 [climateprediction.net] Starting result 0kn7_000047048_1 using hadsm3 version 4.10

...can anyone help?
christo

now using boinc 4.25
p4 2.4ghz 512mb ram
xp home sp2
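
One way to double-check a log like the one in this post is to confirm that every "Started upload" line has a matching "Finished upload" line and to average the reported throughput. The sketch below assumes the line layout shown in the post (date, time, project name in brackets, then the message), and the function name is made up. It can only confirm what the client logged, not what the server recorded as the model outcome.

```python
import re

# date, time, [project], message -- the layout assumed from the pasted log
LINE = re.compile(r"^\S+ \S+ \[[^\]]+\] (.*)$")


def summarize_uploads(log_text):
    """Report which uploads finished and the mean logged throughput."""
    started, finished, rates = set(), set(), []
    for raw in log_text.splitlines():
        match = LINE.match(raw.strip())
        if not match:
            continue  # ignore blank lines and anything in another format
        msg = match.group(1)
        if msg.startswith("Started upload of "):
            started.add(msg[len("Started upload of "):])
        elif msg.startswith("Finished upload of "):
            finished.add(msg[len("Finished upload of "):])
        elif msg.startswith("Throughput "):
            rates.append(int(msg.split()[1]))  # bytes/sec
    return {
        "finished": sorted(finished),
        "unfinished": sorted(started - finished),
        "mean_rate": sum(rates) / len(rates) if rates else None,
    }
```

Pasting the log text into `summarize_uploads` returns the finished and unfinished file names. An empty `unfinished` list means the client logged a completion for every upload it started.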
13) Questions and Answers : Windows : Workunit there but CPDN grabbed a new one... (Message 11302)
Posted 23 Mar 2005 by enginerd
Post:
i have the same problem... i am trying to move a run from my home computer to work and can't get boinc to recognize the old model. i copy /projects, /slots, and all the cpdn .xml type files, then copy them straight into the boinc folder on the work machine. when started, boinc manager just sits there.
if i use the installer to "repair", it downloads another model, so i have to detach before it crunches.

boinc 4.25 at work
model ran on 4.25 at home

p.s. can the /locale stuff except for default language be deleted safely?



