climateprediction.net home page
Posts by old_user35920

1) Questions and Answers : Unix/Linux : Computation Error on "Show Graphics" (Message 23472)
Posted 5 Jul 2006 by old_user35920
Post:
The last set of messages suggest that BOINC caused the crash:
scan: boinc_ufs_cpdnout1.zip
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Can't set up shared mem: -1
GLUT: Fatal Error in screensaver: could not open display:
Can't set up shared mem: -1
GLUT: Fatal Error in screensaver: could not open display:
Can't set up shared mem: -1
GLUT: Fatal Error in screensaver: could not open display:
CPDN Monitor - Quit request from BOINC...


But there were errors in several places before this, starting with:
scan: boinc_ufs_cpdnout1.zip

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfo.pjd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfo.pid8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfo.pfd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.phd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.pgd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.ped8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.pdd8c10 to netcdf format.


And this is yet another example of why 'the old hands' have been saying for ages:
Make backups!



Hi,

Thanks for the reply. So now there is a backup solution in place ;). Once a day there will be an update from now on. I am using backup-manager for that, and before starting the update the BOINC client is stopped; the home partition for the BOINC client is then remounted read-only after waiting for 180 seconds (as I don't yet know how to tell when the app has really finished after the shutdown is requested). Please tell me: does the BOINC client wait until the running app has definitely shut down, or does it forward the shutdown request and then immediately shut down itself? Maybe a stupid question, but I want to do it right this time ;)
.
.
.
And sorry for wasting that much work. Some questions though:

What happens to WUs that finished with a client error? Are they ever retransmitted to another client? Or are they gone and never computed again? It would be a pity if important data was never collected because of a user's fault.


Correction:


Once a day there will be a _BACKUP_ from now on. I am using backup-manager for that, and before starting the _BACKUP_ ...

;)
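
The fixed 180-second wait described above could be replaced by polling until the science application has actually exited. What follows is only a minimal sketch, assuming a Linux system with /proc mounted; the helper names and the process name in the usage comment are hypothetical and not taken from the post. It does not answer whether the BOINC client itself waits for its applications.

```python
import os
import time


def process_running(name):
    """Return True if any process in /proc has this command name.

    Assumes Linux. /proc/<pid>/comm holds at most 15 characters,
    so longer executable names must be given in truncated form.
    """
    for entry in os.listdir("/proc"):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join("/proc", entry, "comm")) as f:
                if f.read().strip() == name:
                    return True
        except OSError:
            # The process exited between listdir() and open(); skip it.
            continue
    return False


def wait_until(condition, timeout=180.0, interval=1.0):
    """Poll condition() until it returns True or timeout seconds pass.

    Returns True if the condition was met, False on timeout.
    """
    deadline = time.monotonic() + timeout
    while True:
        if condition():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


# Hypothetical usage in a backup script, after asking the client to stop:
#   if wait_until(lambda: not process_running("hypothetical_app")):
#       ...remount read-only and start the backup...
```

The 180 seconds then become an upper bound instead of a fixed delay, and the backup can be skipped if the application is still running when the timeout expires.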
2) Questions and Answers : Unix/Linux : Computation Error on "Show Graphics" (Message 23466)
Posted 5 Jul 2006 by old_user35920
Post:
The file conversion shouldn't result in a client error.
Most likely it was the display initialization,
which isn't really done by the cpdn app.
Could you please provide details of your graphics card?


Hi,

my graphics card is a GeForce3Ti200 AGP with 64MB of RAM.

But I want to add that I was fiddling around with my xorg.conf to find out how to get the binary nvidia driver to work again. I later realized that the nvidia module wasn't loaded, but rather the nv driver for X provided by Xorg. I had all config parameters set up to start the nvidia module, but I forgot to change the name of the display driver module to "nvidia" in the "Device" section of the xorg.conf. So the "normal" nv module was loaded with a set of parameters tweaked according to nvidia's README:

Section "Module"
    Load "bitmap"
    Load "dbe"
    Load "ddc"
    #Load "dri"
    #Load "GLCore"
    Load "evdev"
    Load "extmod"
    Load "freetype"
    Load "glx"
    Load "int10"
    Load "record"
    Load "type1"
    Load "v4l"
    Load "vbe"
EndSection
.
.
.
Section "Device"
    Identifier "Nvidia GeForce3 Ti 200"
    Driver "nv"
    VideoRam 65536
EndSection

At least I think the config looked like that, as the one I used for testing wasn't backed up either :/


Maybe that will help too.
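
A mismatch like the one described above (the Device section still naming "nv" instead of "nvidia") can be spotted by listing the Driver entry of every Device section. This is a minimal sketch, assuming the simple Section/EndSection layout shown in the excerpt; the function name is hypothetical, and the comment stripping does not handle a "#" inside a quoted value.

```python
import re


def device_drivers(xorg_conf_text):
    """Map each Device section's Identifier to its Driver value."""
    drivers = {}
    in_device = False
    identifier = driver = None
    for raw in xorg_conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        m = re.match(r'(?i)Section\s+"([^"]+)"', line)
        if m:
            # A new section starts; remember whether it is a Device section.
            in_device = m.group(1).lower() == "device"
            identifier = driver = None
            continue
        if line.lower() == "endsection":
            if in_device and identifier is not None:
                drivers[identifier] = driver
            in_device = False
            continue
        if in_device:
            m = re.match(r'(?i)(Identifier|Driver)\s+"([^"]*)"', line)
            if m and m.group(1).lower() == "identifier":
                identifier = m.group(2)
            elif m:
                driver = m.group(2)
    return drivers
```

Run against the excerpt above, the result would show the driver as "nv", which is the entry that had not been changed to "nvidia".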
3) Questions and Answers : Unix/Linux : Computation Error on "Show Graphics" (Message 23464)
Posted 5 Jul 2006 by old_user35920
Post:
The last set of messages suggest that BOINC caused the crash:
scan: boinc_ufs_cpdnout1.zip
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Can't set up shared mem: -1
GLUT: Fatal Error in screensaver: could not open display:
Can't set up shared mem: -1
GLUT: Fatal Error in screensaver: could not open display:
Can't set up shared mem: -1
GLUT: Fatal Error in screensaver: could not open display:
CPDN Monitor - Quit request from BOINC...


But there were errors in several places before this, starting with:
scan: boinc_ufs_cpdnout1.zip

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfo.pjd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfo.pid8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfo.pfd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.phd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.pgd8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.ped8c10 to netcdf format.

pp2netcdf crashed: Error in getting file type
Error in converting file dataout/08akfa.pdd8c10 to netcdf format.


And this is yet another example of why 'the old hands' have been saying for ages:
Make backups!



Hi,

Thanks for the reply. So now there is a backup solution in place ;). Once a day there will be an update from now on. I am using backup-manager for that, and before starting the update the BOINC client is stopped; the home partition for the BOINC client is then remounted read-only after waiting for 180 seconds (as I don't yet know how to tell when the app has really finished after the shutdown is requested). Please tell me: does the BOINC client wait until the running app has definitely shut down, or does it forward the shutdown request and then immediately shut down itself? Maybe a stupid question, but I want to do it right this time ;)
.
.
.
And sorry for wasting that much work. Some questions though:

What happens to WUs that finished with a client error? Are they ever retransmitted to another client? Or are they gone and never computed again? It would be a pity if important data was never collected because of a user's fault.
4) Questions and Answers : Unix/Linux : Computation Error on "Show Graphics" (Message 23385)
Posted 26 Jun 2006 by old_user35920
Post:
Hello,

Could anybody involved in the development of the CPDN application please have a look at Result-ID 5110074?

The story:

Yesterday I upgraded my kernel and wanted to give the binary nvidia driver another try. I wanted to check whether I could view the graphics of CPDN.

Just after I hit the "Show Graphics" button in boincmgr, the graphics showed up for about half a second and disappeared, and a "Computation error" occurred just after that. Afterwards I realized that the module had failed to install, which might have caused the error. But IMHO showing the graphics should not influence the computation in such a way.

All those precious CPU cycles :(

Please have a look at that Result and see the error message. And please keep me informed ;)

Thanks for your interest.

Greetings
SC

PS: If there is any info missing in this post, please let me know. I'll see if I can get it ;)
5) Questions and Answers : Unix/Linux : signal 11 when scheduler request failed (Message 20057)
Posted 9 Feb 2006 by old_user35920
Post:
Hello,

My sulphur model with result ID 1610846 just crashed when there was a problem with my internet connection.

Here is a snippet of the log:

<snip>
...
Mi 08 Feb 2006 19:50:52 CET|Einstein@Home|Computation for result r1_1159.0__1226_S4R2a_0 finished
Mi 08 Feb 2006 19:50:52 CET|climateprediction.net|Resuming result sulphur_ij30_000864540_0 using sulphur_cycle version 422
Mi 08 Feb 2006 19:50:55 CET|Einstein@Home|Started upload of r1_1159.0__1226_S4R2a_0_0
Mi 08 Feb 2006 19:50:56 CET|LHC@home|Sending scheduler request to http://lhcathome-sched1.cern.ch/scheduler/cgi
Mi 08 Feb 2006 19:50:56 CET|LHC@home|Reason: To fetch work
Mi 08 Feb 2006 19:50:56 CET|LHC@home|Requesting 86400 seconds of new work
Mi 08 Feb 2006 19:51:37 CET||Couldn't resolve hostname [lhcathome-sched1.cern.ch]
Mi 08 Feb 2006 19:51:37 CET||Couldn't resolve hostname [einstein.phys.uwm.edu]
Mi 08 Feb 2006 19:51:37 CET|climateprediction.net|Unrecoverable error for result sulphur_ij30_000864540_0 (process got signal 11)
Mi 08 Feb 2006 19:51:37 CET||request_reschedule_cpus: process exited
Mi 08 Feb 2006 19:51:37 CET|Einstein@Home|Temporarily failed upload of r1_1159.0__1226_S4R2a_0_0: can't resolve hostname
Mi 08 Feb 2006 19:51:37 CET|Einstein@Home|Backing off 1 minutes and 0 seconds on upload of file r1_1159.0__1226_S4R2a_0_0
Mi 08 Feb 2006 19:51:37 CET|climateprediction.net|Computation for result sulphur_ij30_000864540_0 finished
Mi 08 Feb 2006 19:51:37 CET|Einstein@Home|Starting result r1_1159.0__1408_S4R2a_3 using albert version 440
Mi 08 Feb 2006 19:51:37 CET|LHC@home|Scheduler request to http://lhcathome-sched1.cern.ch/scheduler/cgi failed with a return value of -113
Mi 08 Feb 2006 19:51:37 CET|LHC@home|No schedulers responded
...
<snap>
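
To pick the relevant entries out of a longer log, the lines can be split into their three fields. This is a minimal sketch, assuming the "timestamp|project|message" layout seen in the snippet above (inferred from these lines only, not from any BOINC documentation); the function names are hypothetical.

```python
def parse_line(line):
    """Split one log line into (timestamp, project, message).

    Returns None for lines that lack the two '|' separators,
    such as the '...' placeholders in the snippet.
    """
    parts = line.rstrip("\n").split("|", 2)
    if len(parts) != 3:
        return None
    return tuple(parts)


def unrecoverable_errors(lines):
    """Return the parsed entries whose message reports an unrecoverable error."""
    hits = []
    for line in lines:
        entry = parse_line(line)
        if entry is not None and entry[2].startswith("Unrecoverable error"):
            hits.append(entry)
    return hits
```

Entries with an empty project field, such as the hostname resolution failures, come back with an empty string in the second position.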

There are also some errors in the stderr about GLUT and X connection problems, but I think they are old, because I had a permission problem some time ago that is now solved.

It might be worth mentioning that my general preferences are set to "leave in memory when preempted".

The BOINC client version is 5.2.13, and the client is also attached to SETI@home, LHC@home and Einstein@home.

Can anybody tell me why it happened and how to keep it from happening again?

Thanks in advance.

Cheers



