climateprediction.net home page
Arrrggghhh Carl - error during final results upload, again... :-(

Arrrggghhh Carl - error during final results upload, again... :-(

Questions and Answers : Windows : Arrrggghhh Carl - error during final results upload, again... :-(
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 3985 - Posted: 12 Sep 2004, 23:52:20 UTC
Last modified: 17 Sep 2004, 0:13:26 UTC

Close to 18 days crunching on <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&amp;resultid=24713&amp;phase=AT#graph">Result #24713</a> and the client throws an "upload error" during final results upload. :-( I wonder if this happened because I'm running BOINC Alpha v4.08 Carl..? Hmmnn, oh well, I guess I'll find out in an hour or so when 'Susan' completes her first BOINC model...

Result #24713 <i>has</i> showed up in the 'Last 10 results returned' list - "Run Information Received: 12 Sep 2004 21:26:01 UTC." :-?

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 3985 · Report as offensive     Reply Quote
Profile Honza
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 390
Credit: 2,475,242
RAC: 0
Message 4005 - Posted: 13 Sep 2004, 8:03:01 UTC

What kind of error, Nick?
Temporarily failed upload of ...?
http://www.climateprediction.net/board/viewtopic.php?t=2321
ID: 4005 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4015 - Posted: 13 Sep 2004, 11:00:10 UTC
Last modified: 13 Sep 2004, 11:02:21 UTC

Susan's <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=24658">Result #24658</a> threw the same error. :-(

Honza: The error is listed at both the above links - I can't copy them here because it has XML tags &amp; doesn't display properley. Basically, the CP-boinc servers refused to accept the "*_0_1.zip" file because "(Output file exceeded size limit)" both times.

It lists as an "Unrecoverable error" although it is also shown as one of the "Last 10 results returned". :-?

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4015 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 4017 - Posted: 13 Sep 2004, 12:14:01 UTC - in response to Message 4005.  
Last modified: 13 Sep 2004, 13:54:48 UTC

oh I know what that is, hopefully the upload will go through soon, give me an hour. I had made a lot of regional means added to the first file but the limit is 1MB for upload; I just have to recompile and distribute to the upload servers (about an hour or so).

OK, I have just updated the upload servers to allow this slightly larger first file through, hopefully it will go through now? I'm not sure if BOINC "gives up" immediately on this error or the file is still there. If it gives up the uploading and the *_1.zip file is there can you just email it to me? Thanks!
ID: 4017 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4037 - Posted: 13 Sep 2004, 16:49:48 UTC - in response to Message 4017.  
Last modified: 13 Sep 2004, 16:56:38 UTC

&gt; oh I know what that is, hopefully the upload will go through soon, give me an
&gt; hour. I had made a lot of regional means added to the first file but the
&gt; limit is 1MB for upload; I just have to recompile and distribute to the upload
&gt; servers (about an hour or so).
&gt;
&gt; OK, I have just updated the upload servers to allow this slightly larger first
&gt; file through, hopefully it will go through now? I'm not sure if BOINC "gives
&gt; up" immediately on this error or the file is still there. If it gives up the
&gt; uploading and the *_1.zip file is there can you just email it to me? Thanks!

Nope, it's just gone I'm afraid - that wouldn't matter much for a SETI work_unit result but it's a <i>big</i> waste of resources for CP-boinc. :-( Pity it doesn't archive the file like classic CPDN does - oh well, good to know it's fixed now anyways...

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4037 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 4050 - Posted: 13 Sep 2004, 20:06:55 UTC - in response to Message 4037.  
Last modified: 13 Sep 2004, 20:07:11 UTC

damn I wanted to see that too, I think yours was the first since I added the "regional means" from launch, a lot of fun stuff in there, so we get back averages on 29 regions on something like 100 fields. Oh well, that's what happened when I get rushed, they told me the "great idea" of adding regional means like a week before launch and I worked 7 days straight to get it out in the launch version. Sorry about that, but I guess it could have been worse (i.e. if I found out there will be 15K "smallexecs errors" on regional means, ack!)

ID: 4050 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4058 - Posted: 13 Sep 2004, 21:16:50 UTC - in response to Message 4050.  
Last modified: 13 Sep 2004, 21:17:44 UTC

&gt; ...Sorry about that, but I guess it could have been worse.

:Shrug: C'est la vie - just bad luck I had two machines finish a run at much the same time before you could fix it...

&gt; (i.e. if I found out there will be 15K "smallexecs errors" on regional means, ack!)

Yeah, that would have been decidedly <i>ouch</i>..! ;-)

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4058 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4222 - Posted: 16 Sep 2004, 23:46:59 UTC
Last modified: 17 Sep 2004, 0:47:19 UTC

Aaarrrrrggggghhhh, <i>again</i>;

<img src="http://cpdn.tuxie.org/uk_nick/CP-boinc/Dilly_CP-boinc_error.png">

Dilly this time, with the <i>exact same error code</i> "-131" - <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=24494">Result #24494</a> - it has also showed up okay in the "Last 10 Results Returned: " - That's three now from my machines. :-(

Gah, I thought this was fixed - 'Alison' is due to finish her first CP-boinc run at 04:12 and 'Amanda' at 08:38 this morning - I hope they're not going to throw errors too..!?!? (Is that file size checked at this end too or only by the CP-boinc servers..?)

Looking through the last 10 results returned I see that most are uploading okay but some others are throwing this same error. eg. <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&amp;resultid=26620&amp;phase=AT#graph">Result #26620</a>. (#26620 was run under BOINC v4.05, so it's not happening because I'm running BOINC v4.09)

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>

PS: I have a backup of the whole CP-boinc folder from 40 minutes before results upload - any point in trying to run it through again Carl..?
ID: 4222 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4230 - Posted: 17 Sep 2004, 4:35:33 UTC
Last modified: 17 Sep 2004, 4:38:03 UTC

And again, 'Alison' this time, same oversize file upload error "-131" for file "*_1.zip" - <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&amp;resultid=26653&amp;phase=AT#graph">Result #26653</a>. :-(

I have temporarily disabled Amanda's network access so that she cannot attempt final results upload as yet...

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4230 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4254 - Posted: 17 Sep 2004, 16:15:02 UTC

Amanda has also thrown the exact same error <i>whilst network access was disabled</i>, so it's <i>this</i> end, not the servers Carl. :?

<img src="http://cpdn.tuxie.org/uk_nick/CP-boinc/Amanda_CP-boinc_error.png">

This is all that is in her "\climateprediction.net" folder:
<img src="http://cpdn.tuxie.org/uk_nick/CP-boinc/Amanda_CP_folder_contents.png">
So "016j_300026516_0_1.zip" has already been deleted, even with network access disabled...

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4254 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4256 - Posted: 17 Sep 2004, 17:04:33 UTC
Last modified: 17 Sep 2004, 17:05:33 UTC

And <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&amp;resultid=30197&amp;phase=AT#graph">another one</a> - this one was under BOINC v4.05, so it's not just me, nor the change to BOINC v4.08 ~ 4.09...

(Heh, looks like a 'cold equator' too. ;-)

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4256 · Report as offensive     Reply Quote
Profile Thyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 4257 - Posted: 17 Sep 2004, 17:05:13 UTC - in response to Message 4254.  
Last modified: 17 Sep 2004, 17:09:09 UTC

&gt; Amanda has also thrown the exact same error <i>whilst network access was
&gt; disabled</i>, so it's <i>this</i> end, not the servers Carl. :?
&gt;
&gt; So "016j_300026516_0_1.zip" has already been deleted, even with network access
&gt; disabled...

I've tracked this one down in the BOINC source code, and the problem seems to be in the client_state.xml file. There are a couple of {max_nbytes} entries in the file for each of the output files, and the values for the _1.zip result are 1000000.000000 and 1000000 in my files (set to 5000000.000000 and 5000000 for the other 4 result files).

I would guess that it's possible to work-around the problem by stopping BOINC, manually editing the client_state files to increase the value of the field and restarting BOINC. But I couldn't possibly advise doing it unless you know exactly what you're doing ;-)

I'm afraid it looks like there's another general workunit problem, Carl :(

<a href="http://www.teampicard.net"><img src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a>
ID: 4257 · Report as offensive     Reply Quote
Profile Honza
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 390
Credit: 2,475,242
RAC: 0
Message 4258 - Posted: 17 Sep 2004, 17:16:12 UTC - in response to Message 4257.  
Last modified: 17 Sep 2004, 17:17:39 UTC

Good tip, Thyme Lawn.
Nick, you can also try max_nbytes 0.000000 as it states at apps files; means no limitation i guess.

&gt; I've tracked this one down in the BOINC source code, and the problem seems to
&gt; be in the client_state.xml file. There are a couple of {max_nbytes} entries in
&gt; the file for each of the output files, and the values for the _1.zip result
&gt; are 1000000.000000 and 1000000 in my files (set to 5000000.000000 and 5000000
&gt; for the other 4 result files).
&gt;
&gt; I would guess that it's possible to work-around the problem by stopping BOINC,
&gt; manually editing the client_state files to increase the value of the field and
&gt; restarting BOINC. But I couldn't possibly advise doing it unless you know
&gt; exactly what you're doing ;-)
&gt;
&gt; I'm afraid it looks like there's another general workunit problem, Carl :(
&gt;
&gt; <a href="http://www.teampicard.net"><img> src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"&gt;</a><a> href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3"&gt;Join
&gt; us here</a>
&gt;
ID: 4258 · Report as offensive     Reply Quote
Profile Honza
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 390
Credit: 2,475,242
RAC: 0
Message 4260 - Posted: 17 Sep 2004, 17:16:53 UTC - in response to Message 4258.  
Last modified: 17 Sep 2004, 17:17:13 UTC

ID: 4260 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4261 - Posted: 17 Sep 2004, 17:20:28 UTC - in response to Message 4257.  
Last modified: 17 Sep 2004, 17:24:10 UTC

Thyme Lawn wrote:
&gt; I've tracked this one down in the BOINC source code, and the problem seems to
&gt; be in the client_state.xml file. There are a couple of {max_nbytes} entries in
&gt; the file for each of the output files, and the values for the _1.zip result
&gt; are 1000000.000000 and 1000000 in my files (set to 5000000.000000 and 5000000
&gt; for the other 4 result files).

Okay, that looks like it - both 'Helen' and 'Tracy' also have the figure 1000000 instead of 5000000 so I guess I oughta edit them.


&gt; I would guess that it's possible to work-around the problem by stopping BOINC,
&gt; manually editing the client_state files to increase the value of the field and
&gt; restarting BOINC. But I couldn't possibly advise doing it unless you know
&gt; exactly what you're doing ;-)

Hmmn, what text editor is going to alter those figures without screwing something else up, as 'notepad' is liable to do. :?


&gt; I'm afraid it looks like there's another general workunit problem, Carl :(

This must have been present at a certain period only TL - all the models I've downloaded recently are okay.

Carl: Is it worthwhile editing this figure in the backups I have from 'Dilly', 'Alison' &amp; 'Amanda' then re-running them or should I just leave it..?

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4261 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 4263 - Posted: 17 Sep 2004, 17:23:55 UTC - in response to Message 4261.  

I have changed the server side to accept up to 5MB for any CPDN/BOINC .zip file, but the remaining problem seems to be a batch of old workunits that I had the first file as 1MB upper limit. You can try the edit "1" to "5" for that _1.zip as Thyme Lawn pointed out, however don't do it on the bit as that will cause a validation error on the server (the servers are OK up to 5MB on all files, I've changed them all over).

ID: 4263 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4265 - Posted: 17 Sep 2004, 17:26:29 UTC - in response to Message 4263.  

&gt; You can try the edit "1" to "5" for
&gt; that _1.zip as Thyme Lawn pointed out, however don't do it on the bit

Carl: I don't understand what you mean by "don't do it on the bit"..?

&gt; as that
&gt; will cause a validation error on the server (the servers are OK up to 5MB on
&gt; all files, I've changed them all over).

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4265 · Report as offensive     Reply Quote
Profile Honza
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 390
Credit: 2,475,242
RAC: 0
Message 4266 - Posted: 17 Sep 2004, 17:29:12 UTC - in response to Message 4263.  

Carl, do you suggest that such value had been changed lately? Are those already uploaded models from beta? I'm quite confused there...

&gt; I have changed the server side to accept up to 5MB for any CPDN/BOINC .zip
&gt; file, but the remaining problem seems to be a batch of old workunits that I
&gt; had the first file as 1MB upper limit. You can try the edit "1" to "5" for
&gt; that _1.zip as Thyme Lawn pointed out, however don't do it on the bit as that
&gt; will cause a validation error on the server (the servers are OK up to 5MB on
&gt; all files, I've changed them all over).
ID: 4266 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4267 - Posted: 17 Sep 2004, 17:39:48 UTC - in response to Message 4265.  

&gt; &gt; You can try the edit "1" to "5" for
&gt; &gt; that _1.zip as Thyme Lawn pointed out, however don't do it on the bit
&gt;
&gt; Carl: I don't understand what you mean by "don't do it on the bit"..?
&gt;
&gt; &gt; as that
&gt; &gt; will cause a validation error on the server (the servers are OK up to 5MB
&gt; on
&gt; &gt; all files, I've changed them all over).

Okay, - don't alter the figure in the 'signed xml' segment is what you meant...

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4267 · Report as offensive     Reply Quote
Profile old_user156
Avatar

Send message
Joined: 5 Aug 04
Posts: 186
Credit: 1,612,182
RAC: 0
Message 4269 - Posted: 17 Sep 2004, 17:49:07 UTC
Last modified: 17 Sep 2004, 18:01:21 UTC

Carl: Okay, I've edited 'Helen' (23:01) and 'Tracy' (16:43) that are due to finish within 24 hours - all the rest already have 5000000 in there...

Honza: All the models I've downloaded recently already have the 5000000 figure in there - it only seems to be from around the time of that 'fortran namelist' problem when I had to reset the project on a number of machines.

<a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&amp;team=off&amp;trans=off"></a>
ID: 4269 · Report as offensive     Reply Quote
1 · 2 · Next

Questions and Answers : Windows : Arrrggghhh Carl - error during final results upload, again... :-(

©2024 climateprediction.net