Posts by old_user28498

1) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 21053)
Posted 4 Mar 2006 by old_user28498
Post:
(I was away on travel and I did not read the posts above until now)

I believe the problem with the GUI not updating might be related to the warning messages 4.21 produces at startup when it tries to set pointers to the shared memory segment it allocates. At first I thought the only side effect would be the lack of graphics, but I think the GUI also uses this memory segment to exchange information with the BOINC client. As I do not use the GUI much, because I find it faster to edit the client_state.xml file, I had not noticed this problem.

On the other hand, my experience is that the client seems to be crunching workunits fine on Red Hat Linux (so Fedora should be all right too) and MEPIS. I have no direct experience with other Linux distributions besides Knoppix, which seems to be all right too. However, I have noticed odd behaviour from other CPDN applications in the past. For example, the hadcm3 spinup application produces a lot of zombie processes when running on Fedora, something that does not happen on other Linux distributions.

Sorry I cannot be of more help.
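
For anyone who wants to check for the zombie processes mentioned above, here is a minimal sketch that scans /proc for processes in state 'Z'. The function names are my own, and it assumes a Linux /proc filesystem; it only reports zombies and does not explain why the hadcm3 spinup application leaves them behind.

```python
import os

def parse_stat(stat_text):
    """Return (pid, command, state, parent_pid) from one /proc/<pid>/stat record."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or parentheses, so split around the LAST closing parenthesis.
    open_idx = stat_text.index("(")
    close_idx = stat_text.rindex(")")
    pid = int(stat_text[:open_idx])
    command = stat_text[open_idx + 1:close_idx]
    fields = stat_text[close_idx + 1:].split()
    return pid, command, fields[0], int(fields[1])

def find_zombies(proc_root="/proc"):
    """List (pid, command, parent_pid) for every process in state 'Z'."""
    zombies = []
    if not os.path.isdir(proc_root):
        return zombies  # no /proc here (not Linux), nothing to scan
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as handle:
                pid, command, state, ppid = parse_stat(handle.read())
        except (OSError, ValueError, IndexError):
            continue  # the process exited, or the record was unreadable
        if state == "Z":
            zombies.append((pid, command, ppid))
    return zombies

if __name__ == "__main__":
    for pid, command, ppid in find_zombies():
        print(f"zombie pid={pid} command={command} parent={ppid}")
```

The parent PID is printed because a zombie only goes away once its parent collects it, so the parent is usually the process to look at.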
2) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20943)
Posted 1 Mar 2006 by old_user28498
Post:
LS Diseño

Thanks. Worked great. There were some complications because I had both 4.22 and 4.23 entries in my client_state file, but I figured it out. Running much faster than the 4.22 executables that were crunching before.



Glad it worked for you! I also have a host with 4.22, and I left it at that as it was crunching through phase 3 already.
3) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20862)
Posted 28 Feb 2006 by old_user28498
Post:
OK. Try:

Instructions to downgrade from sulphur 4.23 to 4.21 on Linux
4) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20859)
Posted 28 Feb 2006 by old_user28498
Post:

I found these files at the URLs below

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_4.21_i686-pc-linux-gnu
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_4.21_i686-pc-linux-gnu.so
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_data_4.21_i686-pc-linux-gnu.zip
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_um_4.21_i686-pc-linux-gnu.zip

but we'll need the file_signature information for those mentioned in the xml file.


You also need sulphur_se_4.21_i686-pc-linux-gnu.zip

The signature is:

(file_info)
(name)sulphur_4.21_i686-pc-linux-gnu(/name)
(nbytes)4259364.000000(/nbytes)
(max_nbytes)0.000000(/max_nbytes)
(status)1(/status)
(executable/)
(signature_required/)
(file_signature)
765a986e0c67b95cab0da0605be738abdeaa9791abeb36b20e596ff344044513
ed1a2cfa86bec894646149fda4b5da473b56d8d6b60acbb435fa46aa43787e2b
cf7e7d3e0cf0e8e00338257752e9f1e67363d3597d6b26414122d0cb89a7a6ce
0d3c6fb8b82ce3144a5f3762872eace49007cecbd04f7439d4891a7dca32a07b
.
(/file_signature)
(url)http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_4.21_i686-pc-linux-gnu(/url)
(/file_info)
(file_info)
(name)sulphur_4.21_i686-pc-linux-gnu.so(/name)
(nbytes)8281400.000000(/nbytes)
(max_nbytes)0.000000(/max_nbytes)
(status)1(/status)
(executable/)
(signature_required/)
(file_signature)
91f02c74278d1d3da254aaf0d33d3141ace1ea503ace9f52e3e8ea9436985048
c8d05408802b1c232303929358442549cb817f3b49cb59b8fc24ae7479d1ae31
d50102c97c4048faeb0424fad737790d95268885a627204c18a6537be8f6de83
eb4f2daa311af2512a1d94a2a332d9932a4af6a319d26358109ccec9f996f805
.
(/file_signature)
(url)http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_4.21_i686-pc-linux-gnu.so(/url)
(/file_info)
(file_info)
(name)sulphur_data_4.21_i686-pc-linux-gnu.zip(/name)
(nbytes)24047629.000000(/nbytes)
(max_nbytes)0.000000(/max_nbytes)
(status)1(/status)
(signature_required/)
(file_signature)
91e4e438a17842b73758007a92c4ffea63b9ade84211fd34038689b7b672eee8
d80e2a6c146465ce74c6aad4edab79c539a9dfddecbbc9ed93385bcfe8c533cd
3d2ff4f996e980262cfb5cd523b7acdec0352d19b46545407a3a67de6edf2c6e
8b63a505d5e2a9b89a2e6f6167a49cac0d0b04ce93772e25802807b0dc997dc1
.
(/file_signature)
(url)http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_data_4.21_i686-pc-linux-gnu.zip(/url)
(/file_info)
(file_info)
(name)sulphur_um_4.21_i686-pc-linux-gnu.zip(/name)
(nbytes)4563697.000000(/nbytes)
(max_nbytes)0.000000(/max_nbytes)
(status)1(/status)
(signature_required/)
(file_signature)
44040e3d2012e67ece56d07b9ec38910d5e903eeb0093bbda9a7cc7047c00a8d
7f8ec82724b0fc1f2cc3f66f274c5bfe90f10397b18972e71af3111ff025037f
50b5dbad2ea09bd16505c580aace29a846bcb110b547bbd173cb6c8aefad5a4a
e6dd8c33064330452ecaaa42aad23c7daceb2234b5221637454391829091338f
.
(/file_signature)
(url)http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_um_4.21_i686-pc-linux-gnu.zip(/url)
(/file_info)
(file_info)
(name)sulphur_se_4.21_i686-pc-linux-gnu.zip(/name)
(nbytes)5639582.000000(/nbytes)
(max_nbytes)0.000000(/max_nbytes)
(status)1(/status)
(signature_required/)
(file_signature)
09a841d58f5030fbd709c9027d05789a4ea1b2d9e2b3f5d9f5bb51c99128a918
a1c4e005635588ad44301705c947678ee3022053eb9661bd07d759ea9a9fe979
ef154e7b1041ec7bb51f96ed1482a35f8c5591eb9b78da92401b7c14600d40a4
e615c73c55843b82d3c17a3a3a4e2f5b09e61333aeaff1e79a3a4c3bf75d43e0
.
(/file_signature)
(url)http://climateapps2.oucs.ox.ac.uk/cpdnboinc/download/sulphur_se_4.21_i686-pc-linux-gnu.zip(/url)
(/file_info)

And you also have to add the (app_version):

(app_version)
(app_name)sulphur_cycle(/app_name)
(version_num)421(/version_num)
(file_ref)
(file_name)sulphur_4.21_i686-pc-linux-gnu(/file_name)
(main_program/)
(/file_ref)
(file_ref)
(file_name)sulphur_4.21_i686-pc-linux-gnu.so(/file_name)
(open_name)sulphur_4.21_i686-pc-linux-gnu.so(/open_name)
(/file_ref)
(file_ref)
(file_name)sulphur_data_4.21_i686-pc-linux-gnu.zip(/file_name)
(open_name)sulphur_data_4.21_i686-pc-linux-gnu.zip(/open_name)
(/file_ref)
(file_ref)
(file_name)sulphur_um_4.21_i686-pc-linux-gnu.zip(/file_name)
(open_name)sulphur_um_4.21_i686-pc-linux-gnu.zip(/open_name)
(/file_ref)
(file_ref)
(file_name)sulphur_se_4.21_i686-pc-linux-gnu.zip(/file_name)
(open_name)sulphur_se_4.21_i686-pc-linux-gnu.zip(/open_name)
(/file_ref)
(/app_version)


Written in anger: BBCode tags suck!

Anyway, I have an HTML page with the downgrade instructions and the XML changes to the client_state.xml file listed above.
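
Since the snippets above are written with parentheses in place of angle brackets, here is a minimal sketch for turning them back into XML before pasting them into client_state.xml. The function name and the pattern are my own assumptions: the pattern only matches tags made of lowercase letters and underscores, which covers the tags listed above, but it would also rewrite an ordinary one-word parenthesis such as "(yes)", so it should only be run on the snippets themselves.

```python
import re

# Matches (tag), (/tag) and (tag/), where tag is lowercase letters and underscores.
PAREN_TAG = re.compile(r"\((/?[a-z_]+/?)\)")

def restore_angle_brackets(text):
    """Replace parenthesised pseudo-tags with real angle-bracket tags."""
    return PAREN_TAG.sub(r"<\1>", text)

# A few lines copied from the (app_version) block above.
sample = """(file_ref)
(file_name)sulphur_4.21_i686-pc-linux-gnu(/file_name)
(main_program/)
(/file_ref)"""

print(restore_angle_brackets(sample))
```

The signature lines and the lone "." contain no parentheses, so they pass through unchanged.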
5) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20853)
Posted 28 Feb 2006 by old_user28498
Post:

In order for others who are very persistent to be able to do this, they would need access to the 4.21 executables, zip files, and the associated changes in the snippets of code in the xml files. I know Honza wrote something on upgrading here so I imagine one can attempt to follow that (with appropriate changes) to downgrade, but the files are needed (downloadable) and the file signatures.


Certainly. Do you know where I can upload the files?
6) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20850)
Posted 28 Feb 2006 by old_user28498
Post:
I have decided to process the sulphur workunits using 4.21 until the coupled model becomes available or a new/old version of the linux application is released as 4.24+. I do not know if I will be able to upload the result, but I can always revert to 4.23 for doing that.

Let's see what happens...



It seems to work. I just uploaded the intermediate xx...xxxx_1.zip file at the end of phase 1.
7) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20714)
Posted 24 Feb 2006 by old_user28498
Post:
To follow up with this issue, I remember the team re-released hadsm3 (slab) 4.04 as 4.13 when 4.10, 4.11 and 4.12 were found unstable, much to my disappointment because I never had problems with them and 4.1x were much faster on Athlon 64 processors.

Do you know if re-releasing sulphur 4.21 as 4.24 involves a lot of work? Does it require regenerating the workunits that are queued to be processed? The answer is probably yes :-(
8) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20713)
Posted 24 Feb 2006 by old_user28498
Post:
I just dropped the 4.21 executables in the climateprediction.net directory, and edited the client_state.xml file to add the (file_info) and (app_version) bits. Then I changed the version of the application in each of the (workunits). When I restarted the client, it complained about a couple of shared memory symbols (I believe this is because of the graphics, which I do not use), but it continued running with 4.21 all right.

So far, the clients I modified have trickled, but I have not yet reached the point where I have to upload a file.

Edit: Replaced the angle brackets with parentheses (), as the former do not show.
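
The last editing step described above (changing the application version in each workunit) can be sketched roughly as follows. This is only an illustration under stated assumptions: it assumes each workunit block in client_state.xml carries app_name and version_num elements, that 4.23 is stored as 423 (by analogy with 421 for 4.21), and the workunit name in the sample is made up. The function name is my own.

```python
import re

# Non-greedy match of one whole <workunit> ... </workunit> block.
WORKUNIT_BLOCK = re.compile(r"<workunit>.*?</workunit>", re.DOTALL)

def set_workunit_version(state_text, app_name, old_version, new_version):
    """Rewrite version_num inside matching <workunit> blocks only."""
    def rewrite(match):
        block = match.group(0)
        if f"<app_name>{app_name}</app_name>" not in block:
            return block  # a workunit for some other application
        return block.replace(
            f"<version_num>{old_version}</version_num>",
            f"<version_num>{new_version}</version_num>",
        )
    return WORKUNIT_BLOCK.sub(rewrite, state_text)

# Hypothetical fragment: the layout and the workunit name are assumptions.
sample = """<app_version>
<app_name>sulphur_cycle</app_name>
<version_num>423</version_num>
</app_version>
<workunit>
<name>example_wu</name>
<app_name>sulphur_cycle</app_name>
<version_num>423</version_num>
</workunit>
"""

patched = set_workunit_version(sample, "sulphur_cycle", 423, 421)
print(patched)
```

Only the workunit block changes; the existing 4.23 (app_version) entry is left alone. It is probably wise to stop the client and keep a copy of client_state.xml before editing the real file.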
9) Questions and Answers : Unix/Linux : Linux Sulphur 4.23 Unstable (Message 20673)
Posted 23 Feb 2006 by old_user28498
Post:
I have decided to process the sulphur workunits using 4.21 until the coupled model becomes available or a new/old version of the linux application is released as 4.24+. I do not know if I will be able to upload the result, but I can always revert to 4.23 for doing that.

Let's see what happens...

10) Message boards : Number crunching : Trickles are great -- but how about some Credits? (Message 14703)
Posted 29 Jul 2005 by old_user28498
Post:
Carl

First of all, thanks a lot!

It looks like the updater fixed the problems with the misallocated WUs (both ways: the ones I crunched for others and the one someone crunched for me). However, some old work units are now not accounted for. For example, host 76637 is missing credits for WU 374239 (3ef6_100180254_0), the very first one it completed, on Dec 8, 2004. Also, host 173068 (which was merged from a previous incarnation of the computer that crashed a disk; I do not have the old host ID, sorry) is missing all the workunits it worked on before the current one, i.e. 374121 (3eby_100180136_0), 409173 (220x_100116913_2) and 568076 (2d5d_200131472_0, incomplete because of the crash).

I wonder whether there are more cases like these. I'll keep looking.

However, the total credit seems to be correct (and is about 30K cobbles above the sum of the individual credits of all my computers).

Regards,

LS

11) Message boards : Number crunching : Announcement: Database residual problem - misallocated WUs (Message 13284)
Posted 9 Jun 2005 by old_user28498
Post:
Many thanks, Thyme Lawn. I was looking for something like stdout or stderr to check whether there were messages clarifying this issue, but without any luck. I have no stdout file in either the BOINC directory or the climateprediction.net one (BOINC 4.19, hadsm 4.13, Linux). If this is just a scheduler matter, then all is well.

LS

12) Message boards : Number crunching : Announcement: Database residual problem - misallocated WUs (Message 13278)
Posted 9 Jun 2005 by old_user28498
Post:
Do we still have problems with misallocated WUs?

My host 158490 has, allegedly, received result 923715, 001u_600025051_1 (created 7 Jun 2005 21:18:18 UTC, sent 8 Jun 2005 23:15:12 UTC). However, the host does not have any file named 001u_600025051.zip in its climateprediction.net directory where WUs are queued for crunching.

If I see credits coming I'll check the hostid reported by the trickles.

LS

13) Message boards : Number crunching : Announcement: Database residual problem - misallocated WUs (Message 12750)
Posted 21 May 2005 by old_user28498
Post:
Well, this is what happened with <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=69015">host 69015</a> (see posts above in this same thread):

Calculations for result <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=786821">786821</a> were completed and, by the look of it here, the upload went through without any problem. However, the plots for phase 3 do not appear on the result page. No science information seems to be lost, as the <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/trickle.php?resultid=786821">trickles</a> do point to the host doing the crunching (69015). If the science team ever wants to look at the result, they can find the right host there. Host 69015 is now crunching the next WU allocated (but the previous one is still 'in progress', as I reported above, even though it is in fact completed).

Interestingly enough, I now find myself at the opposite end of the problem as well. My host <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=129362">129362</a> received result <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=856358">856358</a> last night, and if you look at the <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/trickle.php?resultid=856358">trickles for 856358</a> you can see that this result appears to have been sent already to host <a>55941</a>, which is not mine. I ended up with a trickle's worth of credit right away, as that host had already uploaded one.

So now I am considering temporarily suspending result 856358 to see if the other host keeps at it (my computer won't be idle meanwhile, as it is a multiprocessor machine). Does anyone know if sending a STOP signal to hadsm3um has any adverse effect?
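
For reference, sending the STOP signal and later resuming can be sketched as below. This only shows the mechanics on a throwaway child process; it says nothing about whether hadsm3um tolerates being stopped, which is the open question. The function names are my own, and the example assumes a POSIX system.

```python
import os
import signal
import subprocess
import sys
import time

def pause(pid):
    """Suspend a process; SIGSTOP cannot be caught or ignored by the target."""
    os.kill(pid, signal.SIGSTOP)

def resume(pid):
    """Let a previously stopped process continue."""
    os.kill(pid, signal.SIGCONT)

def demo():
    """Pause and resume a stand-in child; return True if it is still alive."""
    child = subprocess.Popen(
        [sys.executable, "-c", "import time; time.sleep(30)"]
    )
    try:
        pause(child.pid)
        time.sleep(0.2)   # the child is frozen during this interval
        resume(child.pid)
        time.sleep(0.2)
        return child.poll() is None
    finally:
        child.kill()      # clean up the stand-in process
        child.wait()

if __name__ == "__main__":
    print("child survived pause/resume:", demo())
```

With the real model, the PID passed to pause() would be that of the running hadsm3um process.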

All this was running Linux, BOINC 4.19 and HADSM 4.13, by the way.

Cheers,

LS
14) Message boards : Number crunching : Announcement: Database residual problem - misallocated WUs (Message 12718)
Posted 20 May 2005 by old_user28498
Post:
I will report tomorrow on the completion of result <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/trickle.php?resultid=786821">786821</a>, and I will also back up the whole directory as crandles suggested. I hope the result can be uploaded and used. I consider the credit problem only a minor matter.

Regards,

LS
15) Message boards : Number crunching : Announcement: Database residual problem - misallocated WUs (Message 12708)
Posted 20 May 2005 by old_user28498
Post:
I have one result with this problem. Some others were allocated but I have deleted them already. This is the info on the troublesome one:

Result: 2svq_300152062_1 (result id 786821)

Host ID working on the result: 69015; it has got to phase 3 step 226842 so far, so I am completing it. No credit is being granted and it is not showing in my results list. The previous result by this computer (69015) is completed but is shown as 'in progress' in the results list (this is result 758126 = 2ofa_200146229_1).

The database shows, however, that 2svq_300152062_1 (786821) was allocated to host 73891, which is not mine, and which got a download error when getting this work unit (that host is now working on a different WU).

Two other work units I have deleted already are:

ResultID  WU                HostID  DB_Status  Notes
853059    2y7f_300159031_0  129362  Unsent     Deleted after ph 1/10802
853400    2ygs_300159372_0  129362  Unsent     Deleted before starting

Host 76637 also got an 'unsent' unit queued which I deleted, but I do not have the information on that one anymore.

Thanks and regards,

LS (user id 28498)
16) Questions and Answers : Unix/Linux : FYI: hadsm3_4.13 (Message 11721)
Posted 9 Apr 2005 by old_user28498
Post:
A dual-processor machine of mine with Opteron 246s just downloaded a new version of hadsm3, 4.13. Seconds per timestep went up from 2.0x (4.11, 4.12) to 2.60 (two hours more per trickle). By the way, I never had any stability problems with any of the previous versions (4.04, 4.11 and 4.12 all ran fine for me).
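
As a rough check of the "two hours more per trickle" figure, the arithmetic can be sketched as below. Two values are assumptions and not stated in the post: 10802 timesteps per trickle, and 2.00 seconds per timestep standing in for "2.0x".

```python
# Assumption: one trickle is sent every 10802 timesteps (not stated in the post).
TIMESTEPS_PER_TRICKLE = 10802

def extra_hours_per_trickle(old_s_per_step, new_s_per_step,
                            steps=TIMESTEPS_PER_TRICKLE):
    """Extra wall-clock hours per trickle caused by a slower timestep."""
    return (new_s_per_step - old_s_per_step) * steps / 3600.0

# 2.00 s/step is a stand-in for the "2.0x" quoted for 4.11 and 4.12.
print(round(extra_hours_per_trickle(2.00, 2.60), 2))
```

Under those assumptions the slowdown comes to about 1.8 hours per trickle, which is consistent with the roughly two hours quoted.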
