climateprediction.net home page
Model Finished&Uploaded, Server claims Outcome "unknown"?

Model Finished&Uploaded, Server claims Outcome "unknown"?

Message boards : Number crunching : Model Finished&Uploaded, Server claims Outcome "unknown"?
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user3434
Avatar

Send message
Joined: 30 Aug 04
Posts: 77
Credit: 1,785,934
RAC: 0
Message 10763 - Posted: 12 Mar 2005, 12:22:19 UTC
Last modified: 12 Mar 2005, 12:25:05 UTC

<a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=369911">http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=369911</a>

What shall I make out of that, it's complete but the server seems to think different (?)
Scientific Network : 44800 MHz - 77824 MB - 1970 GB
ID: 10763 · Report as offensive     Reply Quote
Profile geophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2169
Credit: 64,555,907
RAC: 5,858
Message 10767 - Posted: 12 Mar 2005, 13:59:38 UTC - in response to Message 10763.  
Last modified: 12 Mar 2005, 14:00:08 UTC

&gt;
&gt; What shall I make out of that, it's complete but the server seems to think
&gt; different (?)
&gt;
Normally I'd say "It should change the status of the model to success the next time your PC contacts the server." However, it looks like you have trickled once since then so it should have changed it then. Quite strange.
ID: 10767 · Report as offensive     Reply Quote
old_user3434
Avatar

Send message
Joined: 30 Aug 04
Posts: 77
Credit: 1,785,934
RAC: 0
Message 10772 - Posted: 12 Mar 2005, 16:34:47 UTC - in response to Message 10767.  

Hmm...

It shows the CPU Time as 0, although all Trickles reported normal CPU Times.

I'll just have to wait what happens with the WorkUnit, but I sure wouldn't want it to expire one day with the outcome "No Reply" :(
Scientific Network : 44800 MHz - 77824 MB - 1970 GB
ID: 10772 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 10776 - Posted: 12 Mar 2005, 19:18:42 UTC

It may not get resolved. My W/U ID#248708 has been stuck in that condition since 11 Feb. Seems, for reasons of timing or whatever, some fall through the cracks.
<a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=467341"> Here. </a>
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 10776 · Report as offensive     Reply Quote
old_user3434
Avatar

Send message
Joined: 30 Aug 04
Posts: 77
Credit: 1,785,934
RAC: 0
Message 10781 - Posted: 13 Mar 2005, 0:10:02 UTC - in response to Message 10776.  
Last modified: 13 Mar 2005, 0:11:28 UTC

Hmpf...

Well, looks like those ~818 hours worth of CPU-time are wasted then <img src="http://www.falconfly.de/img/tongue.gif">
Scientific Network : 44800 MHz - 77824 MB - 1970 GB
ID: 10781 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 10783 - Posted: 13 Mar 2005, 1:04:31 UTC - in response to Message 10781.  

&gt; Hmpf...
&gt;
&gt; Well, looks like those ~818 hours worth of CPU-time are wasted then <img> src="http://www.falconfly.de/img/tongue.gif"&gt;

I'm not ready to make that assumption. Credit is posted -- a function of the 72nd Trickle upload.

In my case, the five summary files went up okay -- so they were in the hands of the server. My hope is that they found their way into the database for which they were intended and all that was lost is logging in our accounts after the fact.

That said, your assessment is as valid as mine. But let's hope for the best, eh?

If Tolu weren't doing the work of a team of programmers, he could look into it and advise us. However, I wouldn't think of troubling him with this issue for now. (He has help on the way, but it is months down the pipeline. Sometinme after that, perhaps some of these niggling problems will be addressed.)

Regards,
Jim

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 10783 · Report as offensive     Reply Quote
Profile Thyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 10837 - Posted: 14 Mar 2005, 7:41:05 UTC - in response to Message 10772.  

&gt; I'll just have to wait what happens with the WorkUnit, but I sure wouldn't
&gt; want it to expire one day with the outcome "No Reply" :(

Check if you can find the string '3bci_100176230' anywhere in your client_state.xml file. If you can't the result has definitely been returned to (and acknowledged by) the upload server.
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 10837 · Report as offensive     Reply Quote
DennisW

Send message
Joined: 7 Sep 04
Posts: 1
Credit: 303,879
RAC: 0
Message 10866 - Posted: 14 Mar 2005, 16:28:25 UTC - in response to Message 10776.  

&gt; It may not get resolved. My W/U ID#248708 has been stuck in that condition
&gt; since 11 Feb. Seems, for reasons of timing or whatever, some fall through the
&gt; cracks.
&gt; <a> href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=467341"&gt;
&gt; Here. </a>
&gt;I've got one like that which finished on 30 November 2004, its got the same credits as one I finished on 9 November that is successful. And I'm getting the no work available message which has been posted elsewhere so I've put one machine on another project. And 2 machines can't trickle, says project is down. I think the end is nigh, forget 2012!
ID: 10866 · Report as offensive     Reply Quote
old_user3434
Avatar

Send message
Joined: 30 Aug 04
Posts: 77
Credit: 1,785,934
RAC: 0
Message 11129 - Posted: 19 Mar 2005, 15:39:14 UTC - in response to Message 10837.  
Last modified: 19 Mar 2005, 15:45:31 UTC

&gt; Check if you can find the string '3bci_100176230' anywhere in your
&gt; client_state.xml file. If you can't the result has definitely been returned
&gt; to (and acknowledged by) the upload server.

I did not find any reference to that in the client_state.xml

But :
The boinc/projects/www.climateprediction.net/ DIR has a SubDirectory named 3bci_100176230
This still seems to hold all of the Model Files, all dated 11 Mar 06:30

That's the morning where it finished the Model. From my previous experiences with other machines which finished while I was monitoring them, I saw alot of Disk Activity (and on Win32 the Screensaver reporting to prepare the Data to be sent or something like that), compressing all Files before transfer.

So I assume something went wrong with the Communication to the Server, since all those MegaBytes should have been deleted after being successfully transferred to the Server (consumes quite alot of HD space actually)
----------------
Anything I can do get this finished Model back into the reporting loop ?
The way it looks like, the Server is just waiting for some "final Signal", if I could induce this Signal, make the process complete and have HD space feed up again after that, I'd be more than happy to do it :)

As the Model Result Data is fully accessible on the Server, looks like it would just need a little Trick to complete.
Scientific Network : 44800 MHz - 77824 MB - 1970 GB
ID: 11129 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 11136 - Posted: 19 Mar 2005, 16:47:44 UTC

The results sent back are just a part of the data. 330Meg or so get left on your computer in the form of
lots of zip files. If this is a problem they can be archived to cd / dvd.
The scientists may like the look of your results and want the rest.

****************

&gt; it would just need a little Trick to complete.

The trick is to do an Update. Otherwise, you will get the same effect the next time you trickle.

Les

ID: 11136 · Report as offensive     Reply Quote
old_user3434
Avatar

Send message
Joined: 30 Aug 04
Posts: 77
Credit: 1,785,934
RAC: 0
Message 11142 - Posted: 19 Mar 2005, 18:23:38 UTC - in response to Message 11136.  

&gt; The trick is to do an Update. Otherwise, you will get the same effect the next
&gt; time you trickle.
&gt;
&gt; Les

Update ?
Doing that just gets the latest Preferences off the Server, but does not affect the semi-/un-finished Workunit.

The machine Trickled already several times with the new Model, with no abnormal behaviour.


Scientific Network : 44800 MHz - 77824 MB - 1970 GB
ID: 11142 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 11145 - Posted: 19 Mar 2005, 18:45:11 UTC

Tricky. Still, according to the records, your computer didn't spend any time on it. ;)

You're not alone with a 'finishing' problem. See <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/results.php?hostid=44284"> this</a> Mac user's page.
Lots of models where the trickles, total credits, and graphs SEEM to indicate they are complete, but are listed as errors.

Les

ID: 11145 · Report as offensive     Reply Quote
old_user412

Send message
Joined: 7 Aug 04
Posts: 18
Credit: 70,985
RAC: 0
Message 11148 - Posted: 19 Mar 2005, 19:14:05 UTC

Yes, I've had this workunit <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1757">http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1757</a> stuck in a similar situation since last November.
The server claims to have received the result in Sept when the model crashed. But I managed to restart it from a backup and crunch it till the end. All the trickles went through and the end of model files all uploaded sucessfully, but the server still says Outcome Unknown!
Alan
ID: 11148 · Report as offensive     Reply Quote
Profile Thyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 11220 - Posted: 21 Mar 2005, 7:47:41 UTC - in response to Message 11148.  

&gt; The server claims to have received the result in Sept when the model crashed.
&gt; But I managed to restart it from a backup and crunch it till the end. All the
&gt; trickles went through and the end of model files all uploaded sucessfully, but
&gt; the server still says Outcome Unknown!

The result status can't be updated after the first upload (it's a restriction enforced by the BOINC database interface code), but the second set of result files for the model will have been accepted.
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 11220 · Report as offensive     Reply Quote

Message boards : Number crunching : Model Finished&Uploaded, Server claims Outcome "unknown"?

©2024 climateprediction.net