Posts by Deborah Goldsmith

1) Message boards : climateprediction.net Science : Misconfiguration e-mail (Message 43068)
Posted 28 Sep 2011 by Deborah Goldsmith
Post:
Bump? Could you please reset my host?

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=6011
2) Message boards : climateprediction.net Science : Misconfiguration e-mail (Message 43013)
Posted 26 Sep 2011 by Deborah Goldsmith
Post:
Hi,

I got another email saying a host had been removed. This host URL is:

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=6011

I deleted climateprediction.net from the host, then reattached to the project. Can you try turning the host back on so we can see if that worked?

I also did this on my third remaining host, even though it hasn't been turned off yet, as a precautionary measure.

Thanks,
Deborah
3) Message boards : climateprediction.net Science : Misconfiguration e-mail (Message 42906)
Posted 15 Sep 2011 by Deborah Goldsmith
Post:
Hi,

I got an email saying my host had been removed. The host URL is:

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=615938

I followed the advice elsewhere and deleted climateprediction.net from the host, then reattached to the project. Can you try turning the host back on so we can see if that worked?

Thanks,
Deborah
4) Questions and Answers : Macintosh : Can't get any decent results, detached from project. (Message 17571)
Posted 30 Nov 2005 by Deborah Goldsmith
Post:
hadsm 4.12 did run fine as long as I ran boinc via a shell script, with a ulimit -n 1024, to up the maximum open file limit to 1024 (with the default of 255 it exits with a 'no more file handles' error).


This was a problem in the BOINC core client, fixed in 5.2.8 or later.
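
For clients older than that fix, the `ulimit -n 1024` wrapper quoted above could also be written as a small launcher program. This is only a sketch, assuming a POSIX system; `raise_open_file_limit` is a hypothetical helper name, not part of BOINC, and the target of 1024 simply mirrors the value in the quoted workaround.

```cpp
#include <sys/resource.h>  // getrlimit, setrlimit, RLIMIT_NOFILE (POSIX)
#include <cassert>

// Raise the soft limit on open file descriptors to at least `target`,
// capped by the hard limit. Returns the soft limit in effect afterwards,
// or 0 if the current limit could not be queried.
// Hypothetical helper for illustration; not a BOINC function.
rlim_t raise_open_file_limit(rlim_t target) {
    assert(target > 0);

    struct rlimit lim;
    if (getrlimit(RLIMIT_NOFILE, &lim) != 0) {
        return 0;  // could not read the current limits
    }
    if (lim.rlim_cur >= target) {
        return lim.rlim_cur;  // already high enough, nothing to do
    }

    // An unprivileged process may raise its soft limit only up to the hard limit.
    rlim_t wanted = (target < lim.rlim_max) ? target : lim.rlim_max;

    struct rlimit updated = lim;
    updated.rlim_cur = wanted;
    if (setrlimit(RLIMIT_NOFILE, &updated) != 0) {
        return lim.rlim_cur;  // request refused, old limit still applies
    }
    return wanted;
}
```

A launcher would call `raise_open_file_limit(1024)` and then start the BOINC client with `execvp`. Resource limits are kept across `exec` and inherited by child processes, so the client and the applications it starts would see the raised limit.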
5) Questions and Answers : Macintosh : Stack overflow / SIGBUS (Message 17059)
Posted 8 Nov 2005 by Deborah Goldsmith
Post:
The crash is happening in the CPDN application(s), so it's a particular bug in that application. That's why it's not happening in other BOINC apps. It's no reflection on the CPDN programmers; the PowerPC has a memory architecture that's harder to deal with than some other platforms. It's tripped up lots of good programmers.

If it really is a concurrency problem (and that's speculation on my part), setting one processor in your BOINC prefs will not help. That just affects how many BOINC apps you run simultaneously. The CPDN app is using multiple threads, and even if you say "just use one processor" those threads can still run on different processors, which is what can cause such a problem. To stop the problem, you'd have to actually disable one of your processors (which you can do from the command line by changing NVRAM or using the Developer Tools).

There was another post by someone who says they saw this on a single-processor G5 (iMac G5), so it's possible it's not a concurrency error.
6) Questions and Answers : Macintosh : Client errors (Message 17003)
Posted 6 Nov 2005 by Deborah Goldsmith
Post:
See the thread "Stack overflow/SIGBUS" in this forum. I believe that this is a problem that only occurs on multiprocessor machines. If anyone knows of this happening on a single-processor machine, please speak up.

I've tried repeatedly to contact the developers to help track it down, but with no success. My speculation is that this is a concurrency error due to the "weak" memory model of the PowerPC processor (meaning that reads and writes to main memory can appear out of order between the two processors on a multiprocessor system). Code that runs fine on Intel multiprocessor systems (which have a "strong" memory model) often crashes on PowerPC multiprocessors. The fix is to use appropriate concurrency controls (mutex, atomic instructions, etc.) when accessing shared memory.
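
To make the kind of fix described above concrete, here is a minimal C++11 sketch of one thread handing data to another through an atomic flag. It is not the CPDN code; `StatusBox`, `publish`, `wait_and_read` and `run_handoff` are hypothetical names chosen for illustration. The release store and acquire load are what keep the payload write from appearing out of order on a weakly ordered processor such as PowerPC.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Shared state passed between two threads (hypothetical example type).
struct StatusBox {
    int payload = 0;                 // ordinary, non-atomic shared data
    std::atomic<bool> ready{false};  // publication flag
};

// Writer side: store the data first, then set the flag with release
// ordering, so the payload write cannot become visible after the flag.
void publish(StatusBox& box, int value) {
    assert(!box.ready.load(std::memory_order_relaxed));  // publish at most once
    box.payload = value;
    box.ready.store(true, std::memory_order_release);
}

// Reader side: spin until the flag is set. The acquire load pairs with the
// release store above, so the payload read that follows sees the written value.
int wait_and_read(StatusBox& box) {
    while (!box.ready.load(std::memory_order_acquire)) {
        std::this_thread::yield();
    }
    return box.payload;
}

// Runs one reader and one writer thread and returns what the reader saw.
int run_handoff(int value) {
    StatusBox box;
    int seen = -1;
    std::thread reader([&] { seen = wait_and_read(box); });
    std::thread writer([&] { publish(box, value); });
    writer.join();
    reader.join();  // join makes the reader's write to `seen` visible here
    return seen;
}
```

If the flag were a plain `bool` written and read without any ordering, the reader could see `ready == true` while still reading a stale `payload` on a weakly ordered multiprocessor. Guarding both fields with a `std::mutex` would be an equally valid way to get the same guarantee.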
7) Questions and Answers : Macintosh : Client errors (Message 16789)
Posted 26 Oct 2005 by Deborah Goldsmith
Post:
There seems to be a problem running the climateprediction.net application on dual-processor G5 Macs; it appears to have a concurrency error of some kind. There doesn't seem to be much interest in diagnosing or fixing the problem.

I'd like to participate but at this point the application just doesn't work on some Mac models.

If you're seeing problems on a machine that does not have a dual processor, then it could be something else. All my non-DP machines seem to be running climateprediction.net just fine.
8) Questions and Answers : Macintosh : Stack overflow / SIGBUS (Message 16623)
Posted 14 Oct 2005 by Deborah Goldsmith
Post:
Well, clearly, I meant the crt.c that is in the crash backtrace, which also, clearly, is part of the cpdn application. Since the file in question seems to be BOINC-specific (what with function names like checkBOINCStatus), and not part of the actual scientific code, I thought it might be available. Perhaps not. If it is available, I thought I might be able to track down the concurrency issue that seems to be causing the crash. PowerPC has a weak memory model that can cause crashes where the same code works on x86 multiprocessor machines.

Yes, I have installed the necessary libraries. That's not the issue here.
9) Questions and Answers : Macintosh : Stack overflow / SIGBUS (Message 16621)
Posted 14 Oct 2005 by Deborah Goldsmith
Post:
This crash is happening in the regular model, too, so I don't think that workaround would work.

Is the source for that file (crt.c) online anywhere? I'd like to look at it and see if I can spot a concurrency problem.
10) Questions and Answers : Macintosh : Stack overflow / SIGBUS (Message 16617)
Posted 14 Oct 2005 by Deborah Goldsmith
Post:
I'm still seeing this crash 100% on my dual G5 (the only machine it's happening on). The machine has been unusable for climateprediction.net for over a month due to this issue. None of my single-processor G4s are seeing this crash.

Given that it's happening in a thread other than the main thread, and that it's only happening on a dual processor machine, I think it's probably a concurrency error. It might be due to the weak memory synchronization model on PowerPC.

Prior versions of the climateprediction.net application ran great on this machine, so I think it was a change introduced in 4.12 or later. Wasn't graphics support first introduced in that version?

Is there an older version of the app that I can run using app_info.xml?
11) Questions and Answers : Macintosh : Stack overflow / SIGBUS (Message 16327)
Posted 29 Sep 2005 by Deborah Goldsmith
Post:
The stacktrace indicates the problem might have been in the graphics program.
Is this a repetitive behaviour?

Yes, it's happening every time.
12) Questions and Answers : Macintosh : Stack overflow / SIGBUS (Message 16280)
Posted 26 Sep 2005 by Deborah Goldsmith
Post:
I seem to have gotten past the FORTRAN library problem, but now I'm seeing this when trying to run the sulphur cycle model:

Insufficient Memory/Stack Space Available!
2005-09-26 16:58:53 [climateprediction.net] Unrecoverable error for result 47cd_b00296413_0 (process got signal 10)
2005-09-26 16:58:53 [climateprediction.net] Unrecoverable error for result 47cd_b00296413_0 (process got signal 10)

The crashing thread's backtrace was:
Thread 1 Crashed:
0 libGL.dylib 0x92f3f6c0 glDeleteTextures + 48
1 ...r_4.21_powerpc-apple-darwin 0x0001e23c graphics_thread_cleanup + 432 (crt.c:300)
2 ...r_4.21_powerpc-apple-darwin 0x00005a64 app_cleanup() + 36 (crt.c:300)
3 ...r_4.21_powerpc-apple-darwin 0x00006f58 checkBOINCStatus(bool) + 228 (crt.c:300)
4 ...r_4.21_powerpc-apple-darwin 0x000054dc mainLoop() + 76 (crt.c:300)
5 ...r_4.21_powerpc-apple-darwin 0x00004fbc worker() + 1844 (crt.c:300)
6 ...r_4.21_powerpc-apple-darwin 0x0004c810 foobar(void*) + 60 (graphics_impl.C:75)
7 libSystem.B.dylib 0x9002b130 _pthread_body + 108
13) Questions and Answers : Macintosh : Error 251 (0xfb) (Message 16163)
Posted 22 Sep 2005 by Deborah Goldsmith
Post:
Are you running the new "BOINC Manager"? Macs seem to have a problem with losing the Fortran run-time environment, mainly the path to the Fortran run-time libraries. Tolu is looking into a fix for this.

One of my machines is hitting this problem constantly, and basically can't do climateprediction.net at all any more. Is a fix coming soon?
14) Questions and Answers : Macintosh : "Missing signature" error with BOINC 4.19 client (Message 11017)
Posted 17 Mar 2005 by Deborah Goldsmith
Post:
I switched back to the BOINC 4.19 client to avoid the missing signature error, but I'm still seeing it there:
2005-03-16 16:27:46 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2005-03-16 16:27:46 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2005-03-16 16:27:56 [climateprediction.net] Finished download of hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] Throughput 192751 bytes/sec
2005-03-16 16:27:56 [climateprediction.net] Started download of 0zfv_000066414.zip
error: verify_file: decrypt_public error 1025
2005-03-16 16:27:56 [climateprediction.net] signature verification error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] signature verification error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] Checksum or signature error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] Checksum or signature error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:57 [climateprediction.net] Finished download of 0zfv_000066414.zip
2005-03-16 16:27:57 [climateprediction.net] Throughput 15806 bytes/sec
2005-03-16 16:27:58 [climateprediction.net] Finished download of hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] Throughput 322475 bytes/sec
error: verify_file: decrypt_public error 1025
2005-03-16 16:27:58 [climateprediction.net] signature verification error for hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] signature verification error for hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] Checksum or signature error for hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] Checksum or signature error for hadsm3data_4.03_powerpc-apple-darwin.zip
15) Questions and Answers : Macintosh : "Missing signature" error with BOINC 4.25 client (Message 11016)
Posted 17 Mar 2005 by Deborah Goldsmith
Post:
OK, I switched back to 4.19, and I'm still getting this:

2005-03-16 16:27:46 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2005-03-16 16:27:46 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2005-03-16 16:27:56 [climateprediction.net] Finished download of hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] Throughput 192751 bytes/sec
2005-03-16 16:27:56 [climateprediction.net] Started download of 0zfv_000066414.zip
error: verify_file: decrypt_public error 1025
2005-03-16 16:27:56 [climateprediction.net] signature verification error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] signature verification error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] Checksum or signature error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:56 [climateprediction.net] Checksum or signature error for hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:57 [climateprediction.net] Finished download of 0zfv_000066414.zip
2005-03-16 16:27:57 [climateprediction.net] Throughput 15806 bytes/sec
2005-03-16 16:27:58 [climateprediction.net] Finished download of hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] Throughput 322475 bytes/sec
error: verify_file: decrypt_public error 1025
2005-03-16 16:27:58 [climateprediction.net] signature verification error for hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] signature verification error for hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] Checksum or signature error for hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-16 16:27:58 [climateprediction.net] Checksum or signature error for hadsm3data_4.03_powerpc-apple-darwin.zip
16) Questions and Answers : Macintosh : "Missing signature" error with BOINC 4.25 client (Message 10818)
Posted 13 Mar 2005 by Deborah Goldsmith
Post:
I've had one of my machines repeatedly run into the following problem since I upgraded to BOINC 4.25 client.

2005-03-12 16:34:22 [climateprediction.net] Requesting 17280.00 seconds of work
2005-03-12 16:34:22 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2005-03-12 16:34:23 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2005-03-12 16:34:24 [climateprediction.net] Started download of hadsm3se_4.03_powerpc-apple-darwin.zip
2005-03-12 16:34:24 [climateprediction.net] Started download of hadsm3um_4.03_powerpc-apple-darwin.zip
2005-03-12 16:34:32 [climateprediction.net] Finished download of hadsm3um_4.03_powerpc-apple-darwin.zip
2005-03-12 16:34:32 [climateprediction.net] Throughput 368096 bytes/sec
2005-03-12 16:34:32 [climateprediction.net] Started download of hadsm3data_4.03_powerpc-apple-darwin.zip
2005-03-12 16:34:32 [climateprediction.net] Application file hadsm3um_4.03_powerpc-apple-darwin.zip missing signature
2005-03-12 16:34:32 [climateprediction.net] Application file hadsm3um_4.03_powerpc-apple-darwin.zip missing signature
2005-03-12 16:34:32 [climateprediction.net] This BOINC client cannot accept unsigned application files
2005-03-12 16:34:32 [climateprediction.net] This BOINC client cannot accept unsigned application files
2005-03-12 16:34:32 [climateprediction.net] Checksum or signature error for hadsm3um_4.03_powerpc-apple-darwin.zip
2005-03-12 16:34:32 [climateprediction.net] Checksum or signature error for hadsm3um_4.03_powerpc-apple-darwin.zip
2005-03-12 16:34:32 [climateprediction.net] Unrecoverable error for result 0tdz_000058492_1 (app_version download error: couldn't get input files:

hadsm3um_4.03_powerpc-apple-darwin.zip
-123
missing signature

)
2005-03-12 16:34:32 [climateprediction.net] Unrecoverable error for result 0tdz_000058492_1 (app_version download error: couldn't get input files:

hadsm3um_4.03_powerpc-apple-darwin.zip
-123
missing signature

)
2005-03-12 16:34:33 [climateprediction.net] Deferring communication with project for 58 seconds

It basically did this repeatedly until the server told it to go away. Now the server is saying it has no work.

Is this a known problem?



