climateprediction.net home page
Posts by old_user57798

Posts by old_user57798

1) Questions and Answers : Unix/Linux : non reporting system (Message 10643)
Posted 10 Mar 2005 by old_user57798
Post:
> > Do you have any trickle_up_*.xml files in your
> projects/climateprediction.net
> > directory Kyle?
> >
> > If you do then it's definitely some kind of networking problem and the
> stderr
> > and stdout files in your BOINC directory should hopefully give some
> indication
> > of what's causing the problem.
> >
> No, nothing that has trickle_up_ in it. stderr and stdout are empty. Dates on
> all files are no newer than the date I did the restart.
>
> kyle
>

This system still appears to be hung. Although there is lots of CPU time it never gets past:

Starting model in /home/kyle/boinc/projects/climateprediction.net...
Created shared memory region key = 25260
Env
Used=LD_LIBRARY_PATH=/home/kyle/boinc/projects/climateprediction.net:/home/kyle/adabas/lib:/home/kyle/adabas/lib::/usr/local/lib:/usr/lib:/lib
Copying files for startup...
2005-03-08 15:12:16 [climateprediction.net] Scheduler RPC to
http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded 2005-03-08 15:12:16 [climateprediction.net] Project prefs: no separate
prefs for work; using your defaults
Starting model ID 087m_000035684 Phase 1
Waiting for model startup, this may take a minute...
Stack size=48.00 MB
087m_000035684 - PH 1 TS 000001 - 00/00/0000 00:00 - H:M:S=0000:00:00
AVG= 0.00 DLT= 0.00

No files, no errors nothing. Anyone have any idea what is up?

kyle
2) Questions and Answers : Unix/Linux : non reporting system (Message 10531)
Posted 7 Mar 2005 by old_user57798
Post:
> Do you have any trickle_up_*.xml files in your projects/climateprediction.net
> directory Kyle?
>
> If you do then it's definitely some kind of networking problem and the stderr
> and stdout files in your BOINC directory should hopefully give some indication
> of what's causing the problem.
>
No, nothing that has trickle_up_ in it. stderr and stdout are empty. Dates on all files are no newer than the date I did the restart.

kyle
3) Questions and Answers : Unix/Linux : non reporting system (Message 10520)
Posted 7 Mar 2005 by old_user57798
Post:
Hi again;

I've got a Debian system which has been running for five days (I see 99% cpu time on hadsm3um on a 1.7 MHz box), but I see no new files and it has not reported data (no trickles). I have a second computer which is working fine, trickles etc. with what I think are the same set up. I restarted three days ago with -update_prefs climateprediction.net; do not have the fortran problem reported by others(which shows up in stderr_um.txt). Any suggestions?

kyle
4) Questions and Answers : Unix/Linux : How to force a trickle!? (Message 10373)
Posted 4 Mar 2005 by old_user57798
Post:
> > The tag is "min_rpc_time"
>
> Hi Josh, thank you for providing that info.
>
> For all my computers it says "0
>

Hi - now I'm confused. I have one Linux box that is daily sending trickles but another that appears to be running (CPU maxed out on hadsm3...) for several days with no returns at all. The min_rpc_time says "0" for both....?? I stoped and restarted the non trickling one with "-update_prefs climateprediction.net" but it did not report anything.

kyle
5) Questions and Answers : Unix/Linux : crash with code 251 (Message 10295)
Posted 3 Mar 2005 by old_user57798
Post:
> Hi kyle
> Are you using a network drive for the files?
> If so, this is a bad move.
>
> Les
>

No. And now, through no change on my part (except to restart) the model is running (for about 24 hrs). I notice in the 'Nature' article that some parameter choices are unstable and cause the calculation to crash. Could possibly the first set of model parameters on this machine been unstable and now I have a different set of parameters? How often does something like that happen? Is there a way to tell if that happened?

Thanks,

kyle
6) Questions and Answers : Unix/Linux : crash with code 251 (Message 10220)
Posted 2 Mar 2005 by old_user57798
Post:
I've got boinc/climate running on a couple of windoz OSs and a linux box but am having trouble with a second linux box (GenuineIntel Intel(R) Pentium(R) 4 CPU 1700MHz). I've run mprime for 24 hrs on this machine without difficulties (as recomended elsewhere) but (with same Debian version OS as the other one) I get "Model crash ....(process exited with code 251) .... Defering communication for 1 hr..." and no processing going on. Any suggestions?

kyle




©2024 climateprediction.net