climateprediction.net home page
Model crashing...is it me?

Model crashing...is it me?

Questions and Answers : Unix/Linux : Model crashing...is it me?
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user73

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 14,887
RAC: 0
Message 51 - Posted: 5 Aug 2004, 16:47:42 UTC
Last modified: 5 Aug 2004, 16:50:07 UTC

Hi
I just installed the new BOINC 4.02 and registered for the project. Everything loaded up fine and the model started to crunch. However, after just a few steps it crashed. The next model did the same. Am I doing something wrong?
After the second model crashed BOINC even crashed! (this is the first time I have ever seen this!)

(Only difference is that I'm running as root, but I don't expect that to have any influence?)

Complete log:
2004-08-05 18:59:54 [---] General prefs: from climateprediction.net (last modified 2004-08-05 18:53:04)
2004-08-05 18:59:54 [---] General prefs: no separate prefs for home; using your defaults
2004-08-05 18:59:54 [climateprediction.net] Project prefs: no separate prefs for home; using your defaults
2004-08-05 18:59:54 [climateprediction.net] Finished download of 006g_000025217.zip
2004-08-05 18:59:54 [climateprediction.net] Approximate throughput 17894.461215 bytes/sec
2004-08-05 18:59:54 [climateprediction.net] Starting computation for result 006g_000025217_0 using hadsm3 version 4.02
Starting model in /root/boinc/projects/climateprediction.net...
Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip
creating: 006g_000025217/dataout/
inflating: 006g_000025217/dataout/thist
creating: 006g_000025217/jobs/
inflating: 006g_000025217/jobs/control.stashc
inflating: 006g_000025217/jobs/double.stashc
inflating: 006g_000025217/jobs/Recona.12
inflating: 006g_000025217/jobs/Recona.13
inflating: 006g_000025217/jobs/spec3a_lw_3_asol2c_hadcm3
inflating: 006g_000025217/jobs/spec3a_sw_3_asol2b_hadcm3
inflating: 006g_000025217/jobs/spin.stashc
inflating: 006g_000025217/jobs/yabsd.ihist
inflating: 006g_000025217/jobs/yabsd.PRESM_A
extracting: 006g_000025217/jobs/yabsd.PRESM_O
extracting: 006g_000025217/jobs/yabsd.PRESM_S
extracting: 006g_000025217/jobs/yabsd.PRESM_W
creating: 006g_000025217/tmp/
inflating: 006g_000025217/tmp/cache2
inflating: 006g_000025217/tmp/cp.namelists
extracting: 006g_000025217/tmp/pipe_dummy
creating: 006g_000025217/viz/
inflating: 006g_000025217/viz/globe.rgb
inflating: 006g_000025217/registration_license.txt
creating: 006g_000025217/datain/
creating: 006g_000025217/datain/ancil/
creating: 006g_000025217/datain/ancil/ctldata/
creating: 006g_000025217/datain/ancil/ctldata/stasets/
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01001218
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01002207
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003236
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003237
extracting: 006g_000025217/datain/ancil/ctldata/stasets/X01003254
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003255
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003274
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003275
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003276
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003277
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003278
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003279
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003280
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003281
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003286
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005207
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005208
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005222
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005223
inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01010206
creating: 006g_000025217/datain/ancil/ctldata/STASHmaster/
inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_A
inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_O
inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_S
inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_W
inflating: 006g_000025217/datain/ancil/qrclim.icedp.32
inflating: 006g_000025217/datain/ancil/qrclim.newsst5.32
inflating: 006g_000025217/datain/ancil/qrclim.ozone_preind_corr
inflating: 006g_000025217/datain/ancil/qrclim.uvcurr.32
creating: 006g_000025217/datain/dumps/
inflating: 006g_000025217/datain/dumps/slab32_1810.start
inflating: 006g_000025217/datain/lats
inflating: 006g_000025217/datain/ppcodes
Archive: 006g_000025217.zip
inflating: 006g_000025217/jobs/climate.spin
inflating: 006g_000025217/jobs/climate.cont
inflating: 006g_000025217/jobs/climate.doub
inflating: 006g_000025217/jobs/ncatts.cpdc
Created shared memory region key = 24630
Env Used=LD_LIBRARY_PATH=/root/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib
Copying files for startup...
In pre_initialise_phase (part 1 of 3)
In initialise_phase (part 2 of 3)
In startup_phase (part 3 of 3)
Starting model ID 006g_000025217 Phase 1
Stack size=48.00 MB
Waiting for model startup, this may take a minute...
006g_000025217 - PH 1 TS 000001 - 00/00/0000 00:00 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00
006g_000025217 - PH 1 TS 000003 - 01/12/1810 01:30 - H:M:S=0000:00:10 AVG= 3.50 DLT= 5.25
006g_000025217 - PH 1 TS 000004 - 01/12/1810 02:00 - H:M:S=0000:00:11 AVG= 2.88 DLT= 1.00
006g_000025217 - PH 1 TS 000005 - 01/12/1810 02:30 - H:M:S=0000:00:12 AVG= 2.50 DLT= 1.00
006g_000025217 - PH 1 TS 000007 - 01/12/1810 03:30 - H:M:S=0000:00:14 AVG= 2.07 DLT= 1.00
Model crashed...retrying...
adding: ncatts.cpdc (deflated 72%)
adding: climate.cont (deflated 79%)
adding: climate.cpdc (deflated 79%)
adding: climate.doub (deflated 79%)
adding: climate.spin (deflated 79%)
adding: 006g_000025217.xml (deflated 70%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: stderr_um.txt (deflated 74%)
adding: yabsd.out (deflated 93%)
adding: restart.day (deflated 43%)
2004-08-05 19:00:14 [climateprediction.net] Unrecoverable error for result 006g_000025217_0 (process exited with code 251 (0xfb))
2004-08-05 19:00:14 [climateprediction.net] Unrecoverable error for result 006g_000025217_0 (process exited with code 251 (0xfb))
2004-08-05 19:00:14 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-05 19:00:14 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-05 19:00:14 [climateprediction.net] Computation for result 006g_000025217 finished
2004-08-05 19:00:14 [climateprediction.net] Started upload of 006g_000025217_0_1.zip
2004-08-05 19:00:14 [climateprediction.net] Started upload of 006g_000025217_0_2.zip
2004-08-05 19:00:14 [climateprediction.net] Finished upload of 006g_000025217_0_1.zip
2004-08-05 19:00:14 [climateprediction.net] Approximate throughput 5931.765245 bytes/sec
2004-08-05 19:00:15 [climateprediction.net] Started upload of 006g_000025217_0_3.zip
2004-08-05 19:00:15 [climateprediction.net] Finished upload of 006g_000025217_0_2.zip
2004-08-05 19:00:15 [climateprediction.net] Approximate throughput 19671.998588 bytes/sec
2004-08-05 19:00:15 [climateprediction.net] Started upload of 006g_000025217_0_4.zip
2004-08-05 19:00:16 [climateprediction.net] Finished upload of 006g_000025217_0_3.zip
2004-08-05 19:00:16 [climateprediction.net] Approximate throughput 5908.106585 bytes/sec
2004-08-05 19:00:16 [climateprediction.net] Started upload of 006g_000025217_0_5.zip
2004-08-05 19:00:16 [climateprediction.net] Finished upload of 006g_000025217_0_4.zip
2004-08-05 19:00:16 [climateprediction.net] Approximate throughput 5754.167789 bytes/sec
2004-08-05 19:00:19 [climateprediction.net] Finished upload of 006g_000025217_0_5.zip
2004-08-05 19:00:19 [climateprediction.net] Approximate throughput 25372.677485 bytes/sec
2004-08-05 19:01:15 [---] CPU scheduler starvation imminent; requesting more work
2004-08-05 19:01:15 [climateprediction.net] Requesting 6399 seconds of work
2004-08-05 19:01:15 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-05 19:01:16 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2004-08-05 19:01:16 [climateprediction.net] Started download of 006q_000025227.zip
2004-08-05 19:01:16 [climateprediction.net] Finished download of 006q_000025227.zip
2004-08-05 19:01:16 [climateprediction.net] Approximate throughput 25149.116738 bytes/sec
2004-08-05 19:01:16 [climateprediction.net] Starting computation for result 006q_000025227_0 using hadsm3 version 4.02
Starting model in /root/boinc/projects/climateprediction.net...
Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip
creating: 006q_000025227/dataout/
inflating: 006q_000025227/dataout/thist
creating: 006q_000025227/jobs/
inflating: 006q_000025227/jobs/control.stashc
inflating: 006q_000025227/jobs/double.stashc
inflating: 006q_000025227/jobs/Recona.12
inflating: 006q_000025227/jobs/Recona.13
inflating: 006q_000025227/jobs/spec3a_lw_3_asol2c_hadcm3
inflating: 006q_000025227/jobs/spec3a_sw_3_asol2b_hadcm3
inflating: 006q_000025227/jobs/spin.stashc
inflating: 006q_000025227/jobs/yabsd.ihist
inflating: 006q_000025227/jobs/yabsd.PRESM_A
extracting: 006q_000025227/jobs/yabsd.PRESM_O
extracting: 006q_000025227/jobs/yabsd.PRESM_S
extracting: 006q_000025227/jobs/yabsd.PRESM_W
creating: 006q_000025227/tmp/
inflating: 006q_000025227/tmp/cache2
inflating: 006q_000025227/tmp/cp.namelists
extracting: 006q_000025227/tmp/pipe_dummy
creating: 006q_000025227/viz/
inflating: 006q_000025227/viz/globe.rgb
inflating: 006q_000025227/registration_license.txt
creating: 006q_000025227/datain/
creating: 006q_000025227/datain/ancil/
creating: 006q_000025227/datain/ancil/ctldata/
creating: 006q_000025227/datain/ancil/ctldata/stasets/
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01001218
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01002207
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003236
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003237
extracting: 006q_000025227/datain/ancil/ctldata/stasets/X01003254
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003255
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003274
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003275
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003276
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003277
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003278
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003279
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003280
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003281
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003286
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005207
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005208
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005222
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005223
inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01010206
creating: 006q_000025227/datain/ancil/ctldata/STASHmaster/
inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_A
inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_O
inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_S
inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_W
inflating: 006q_000025227/datain/ancil/qrclim.icedp.32
inflating: 006q_000025227/datain/ancil/qrclim.newsst5.32
inflating: 006q_000025227/datain/ancil/qrclim.ozone_preind_corr
inflating: 006q_000025227/datain/ancil/qrclim.uvcurr.32
creating: 006q_000025227/datain/dumps/
inflating: 006q_000025227/datain/dumps/slab32_1810.start
inflating: 006q_000025227/datain/lats
inflating: 006q_000025227/datain/ppcodes
Archive: 006q_000025227.zip
inflating: 006q_000025227/jobs/climate.spin
inflating: 006q_000025227/jobs/climate.cont
inflating: 006q_000025227/jobs/climate.doub
inflating: 006q_000025227/jobs/ncatts.cpdc
Created shared memory region key = 24840
Env Used=LD_LIBRARY_PATH=/root/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib
Copying files for startup...
In pre_initialise_phase (part 1 of 3)
In initialise_phase (part 2 of 3)
In startup_phase (part 3 of 3)
Starting model ID 006q_000025227 Phase 1
Stack size=48.00 MB
Waiting for model startup, this may take a minute...
006q_000025227 - PH 1 TS 000001 - 01/12/1810 00:30 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00
006q_000025227 - PH 1 TS 000002 - 01/12/1810 01:00 - H:M:S=0000:01:14 AVG=37.30 DLT=74.61
Model crashed...retrying...
adding: ncatts.cpdc (deflated 72%)
adding: climate.cont (deflated 79%)
adding: climate.cpdc (deflated 79%)
adding: climate.doub (deflated 79%)
adding: climate.spin (deflated 79%)
adding: 006q_000025227.xml (deflated 70%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: stderr_um.txt (deflated 74%)
adding: yabsd.out (deflated 100%)
adding: restart.day (deflated 43%)
2004-08-05 19:02:37 [climateprediction.net] Unrecoverable error for result 006q_000025227_0 (process exited with code 251 (0xfb))
2004-08-05 19:02:37 [climateprediction.net] Unrecoverable error for result 006q_000025227_0 (process exited with code 251 (0xfb))
2004-08-05 19:02:37 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-05 19:02:37 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-05 19:02:37 [climateprediction.net] Computation for result 006q_000025227 finished
2004-08-05 19:02:37 [climateprediction.net] Started upload of 006q_000025227_0_1.zip
2004-08-05 19:02:37 [climateprediction.net] Started upload of 006q_000025227_0_2.zip
2004-08-05 19:02:38 [climateprediction.net] Error on file upload: invalid signature
2004-08-05 19:02:38 [climateprediction.net] Error on file upload: invalid signature
2004-08-05 19:02:38 [climateprediction.net] Permanently failed upload of 006q_000025227_0_1.zip
2004-08-05 19:02:38 [climateprediction.net] Giving up on upload of 006q_000025227_0_1.zip: server rejected file
2004-08-05 19:02:38 [climateprediction.net] Giving up on upload of 006q_000025227_0_1.zip: server rejected file
SIGSEGV: segmentation violation
Exiting...
ID: 51 · Report as offensive     Reply Quote
Pconfig

Send message
Joined: 5 Aug 04
Posts: 84
Credit: 76,646
RAC: 0
Message 52 - Posted: 5 Aug 2004, 16:49:15 UTC

PC overclocked?
ID: 52 · Report as offensive     Reply Quote
old_user73

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 14,887
RAC: 0
Message 59 - Posted: 5 Aug 2004, 17:20:02 UTC - in response to Message 52.  
Last modified: 5 Aug 2004, 17:21:18 UTC

No - and the error is reproducable - it does this every time.
The upload error crashes the core client and every time I get a model downloaded it crashes after a few steps.

------------------------------
Run 2
------------------------------
2004-08-05 19:29:53 [climateprediction.net] Started upload of 006q_000025227_0_3.zip
2004-08-05 19:29:54 [climateprediction.net] Started upload of 006q_000025227_0_4.zip
2004-08-05 19:29:55 [climateprediction.net] Error on file upload: invalid signature
2004-08-05 19:29:55 [climateprediction.net] Error on file upload: invalid signature
2004-08-05 19:29:55 [climateprediction.net] Permanently failed upload of 006q_000025227_0_3.zip
2004-08-05 19:29:55 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: server rejected file
2004-08-05 19:29:55 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: server rejected file
SIGSEGV: segmentation violation
Exiting...

------------------------------
Run 3
------------------------------
2004-08-05 19:34:08 [climateprediction.net] Started upload of 006q_000025227_0_2.zip
2004-08-05 19:34:08 [climateprediction.net] Started upload of 006q_000025227_0_3.zip
HTTP::init_post2: couldn't get file size
2004-08-05 19:34:09 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: File downloaded was not the correct file or was garbage from bad URL
2004-08-05 19:34:09 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: File downloaded was not the correct file or was garbage from bad URL
2004-08-05 19:34:09 [climateprediction.net] Started upload of 006q_000025227_0_4.zip
2004-08-05 19:34:10 [climateprediction.net] Error on file upload: invalid signature
2004-08-05 19:34:10 [climateprediction.net] Error on file upload: invalid signature
2004-08-05 19:34:10 [climateprediction.net] Permanently failed upload of 006q_000025227_0_2.zip
2004-08-05 19:34:10 [climateprediction.net] Giving up on upload of 006q_000025227_0_2.zip: server rejected file
2004-08-05 19:34:10 [climateprediction.net] Giving up on upload of 006q_000025227_0_2.zip: server rejected file
SIGSEGV: segmentation violation
Exiting...

------------------------------
Run ...6
------------------------------
2004-08-05 19:36:16 [climateprediction.net] Started download of 006x_000025234.zip
HTTP::init_post2: couldn't get file size
2004-08-05 19:36:16 [climateprediction.net] Giving up on upload of 006q_000025227_0_5.zip: File downloaded was not the correct file or was garbage from bad URL
2004-08-05 19:36:16 [climateprediction.net] Giving up on upload of 006q_000025227_0_5.zip: File downloaded was not the correct file or was garbage from bad URL
2004-08-05 19:36:17 [climateprediction.net] Finished download of 006x_000025234.zip
2004-08-05 19:36:17 [climateprediction.net] Approximate throughput 7020.394246 bytes/sec
2004-08-05 19:36:17 [climateprediction.net] Starting computation for result 006x_000025234_0 using hadsm3 version 4.02
Starting model in /root/boinc/projects/climateprediction.net...
Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip
creating: 006x_000025234/dataout/
inflating: 006x_000025234/dataout/thist
creating: 006x_000025234/jobs/
inflating: 006x_000025234/jobs/control.stashc
inflating: 006x_000025234/jobs/double.stashc
inflating: 006x_000025234/jobs/Recona.12
inflating: 006x_000025234/jobs/Recona.13
inflating: 006x_000025234/jobs/spec3a_lw_3_asol2c_hadcm3
inflating: 006x_000025234/jobs/spec3a_sw_3_asol2b_hadcm3
inflating: 006x_000025234/jobs/spin.stashc
inflating: 006x_000025234/jobs/yabsd.ihist
inflating: 006x_000025234/jobs/yabsd.PRESM_A
extracting: 006x_000025234/jobs/yabsd.PRESM_O
extracting: 006x_000025234/jobs/yabsd.PRESM_S
extracting: 006x_000025234/jobs/yabsd.PRESM_W
creating: 006x_000025234/tmp/
inflating: 006x_000025234/tmp/cache2
inflating: 006x_000025234/tmp/cp.namelists
extracting: 006x_000025234/tmp/pipe_dummy
creating: 006x_000025234/viz/
inflating: 006x_000025234/viz/globe.rgb
inflating: 006x_000025234/registration_license.txt
creating: 006x_000025234/datain/
creating: 006x_000025234/datain/ancil/
creating: 006x_000025234/datain/ancil/ctldata/
creating: 006x_000025234/datain/ancil/ctldata/stasets/
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01001218
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01002207
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003236
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003237
extracting: 006x_000025234/datain/ancil/ctldata/stasets/X01003254
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003255
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003274
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003275
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003276
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003277
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003278
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003279
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003280
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003281
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003286
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005207
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005208
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005222
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005223
inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01010206
creating: 006x_000025234/datain/ancil/ctldata/STASHmaster/
inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_A
inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_O
inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_S
inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_W
inflating: 006x_000025234/datain/ancil/qrclim.icedp.32
inflating: 006x_000025234/datain/ancil/qrclim.newsst5.32
inflating: 006x_000025234/datain/ancil/qrclim.ozone_preind_corr
inflating: 006x_000025234/datain/ancil/qrclim.uvcurr.32
creating: 006x_000025234/datain/dumps/
inflating: 006x_000025234/datain/dumps/slab32_1810.start
inflating: 006x_000025234/datain/lats
inflating: 006x_000025234/datain/ppcodes
Archive: 006x_000025234.zip
inflating: 006x_000025234/jobs/climate.spin
inflating: 006x_000025234/jobs/climate.cont
inflating: 006x_000025234/jobs/climate.doub
inflating: 006x_000025234/jobs/ncatts.cpdc
Created shared memory region key = 24810
Env Used=LD_LIBRARY_PATH=/root/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib
Copying files for startup...
In pre_initialise_phase (part 1 of 3)
In initialise_phase (part 2 of 3)
In startup_phase (part 3 of 3)
Starting model ID 006x_000025234 Phase 1
Stack size=48.00 MB
Waiting for model startup, this may take a minute...
006x_000025234 - PH 1 TS 000001 - 01/12/1810 00:30 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00
006x_000025234 - PH 1 TS 000002 - 01/12/1810 01:00 - H:M:S=0000:00:09 AVG= 5.00 DLT=10.00
Model crashed...retrying...
adding: ncatts.cpdc (deflated 72%)
adding: climate.cont (deflated 78%)
adding: climate.cpdc (deflated 79%)
adding: climate.doub (deflated 78%)
adding: climate.spin (deflated 79%)
adding: 006x_000025234.xml (deflated 70%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: stderr_um.txt (deflated 74%)
adding: yabsd.out (deflated 93%)
adding: restart.day (deflated 43%)
2004-08-05 19:36:34 [climateprediction.net] Unrecoverable error for result 006x_000025234_0 (process exited with code 251 (0xfb))
2004-08-05 19:36:34 [climateprediction.net] Unrecoverable error for result 006x_000025234_0 (process exited with code 251 (0xfb))
ID: 59 · Report as offensive     Reply Quote
old_user73

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 14,887
RAC: 0
Message 66 - Posted: 5 Aug 2004, 17:47:38 UTC - in response to Message 59.  

Even after a complete wipeout of the BOINC directory including client_state etc. did it do it again...

Running Linux Gentoo on a 2.6.5r1 kernel
P4 hyperthreaded 2.4Ghz but running only one simulation at a time.
ID: 66 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 92 - Posted: 5 Aug 2004, 19:19:46 UTC

Hi, Janus,

I haven't heard of anyone running it in root (sounds dangerous, security-wise). Might be worth a try in your /home directory -- if for no other reason than to eliminate a possible conflict associated with root privileges.

Otherwise. it sounds like one for Carl.
________________________________________________
Washing one's hands of the conflict between the powerful and the powerless means to side with the powerful, not to be neutral.
-- Paulo Freire (1921-1997), educator, author.
ID: 92 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 137 - Posted: 6 Aug 2004, 4:35:21 UTC

Hi Janus: when I get into the office today I will see if I can find your upload server, the interesting stuff should be in the "yabsd.out" which is sent up. My guess is the model can be flakey with overclocking, or perhaps there is another library that I forgot to compile into the model (i.e. I tried to statically link in everything so different Linux versions wouldn't cause problems with the "sensitive" model)
ID: 137 · Report as offensive     Reply Quote
old_user73

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 14,887
RAC: 0
Message 158 - Posted: 6 Aug 2004, 9:51:54 UTC - in response to Message 137.  
Last modified: 6 Aug 2004, 10:20:10 UTC

I have been looking into it a bit more and found two errors. I don't know anything about the sourcecode, so can't say if they are important or not:

stderr_um.txt:
forrtl: info: Fortran error message number is 63.
forrtl: warning: Could not open message catalog: ifcore_msg.cat.
forrtl: info: Check environment variable NLSPATH and protection of /usr/lib/ifcore_msg.cat.

yabsd.out:
Model completed with the following :
Error Code : 1
Message : P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.
And a list of a LOT of points with negative pressure.

Also in another run these errors are preceeded by:
DAS- K_WEAK reset: NaN NaN
DAS- K_WEAK reset: NaN NaN
DAS- K_WEAK reset: NaN NaN
DAS- K_WEAK reset: NaN NaN
DAS- LSP_FORM- QCL and DELTA not updated: NaN 1800.000
NaN

I'm going to try to run a few models today again and see if the error is the same.

[Update]
I have now installed the client under another user on the same system and it fails the same way - runs a few steps and then dies and uploads. So it wasn't because I was running it as root.
ID: 158 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 298 - Posted: 7 Aug 2004, 10:11:05 UTC - in response to Message 158.  

OK from those error messages obviously the Fortran code is causing trouble. Do you have a /usr/lib/ifcore_msg.cat? I think it's an Intel Fortran library, so perhaps it's a conflict with other libraries you may have installed?
ID: 298 · Report as offensive     Reply Quote
old_user73

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 14,887
RAC: 0
Message 790 - Posted: 12 Aug 2004, 7:55:48 UTC - in response to Message 298.  

> OK from those error messages obviously the Fortran code is causing trouble.
> Do you have a /usr/lib/ifcore_msg.cat? I think it's an Intel Fortran library,
> so perhaps it's a conflict with other libraries you may have installed?

No I haven't got that file - at least I couldn't find it in the specified location.
ID: 790 · Report as offensive     Reply Quote
Profile old_user175

Send message
Joined: 5 Aug 04
Posts: 36
Credit: 2,559,795
RAC: 0
Message 1155 - Posted: 17 Aug 2004, 22:24:00 UTC

Nope, Not just you. I have the same specific problem on forge - a dual 3.06G HT xeon machine with no other system loading of importance. this box is running 2.6.7-latest.smp FC2 linux, no ifcore libs here either.

This is **entire output** up to the failure:

2004-08-17 18:18:48 [---] Starting BOINC client version 4.02 for i686-pc-linux-gnu
2004-08-17 18:18:48 [climateprediction.net] Project prefs: using your defaults
2004-08-17 18:18:48 [climateprediction.net] Host ID not assigned yet
2004-08-17 18:18:48 [---] General prefs: from climateprediction.net (last modified 2004-08-15 16:37:46)
2004-08-17 18:18:48 [---] General prefs: using your defaults
2004-08-17 18:18:48 [---] Running CPU benchmarks
2004-08-17 18:18:48 [---] Suspending computation and network activity - running CPU benchmarks
2004-08-17 18:19:49 [---] Benchmark results:
2004-08-17 18:19:49 [---] Number of CPUs: 4
2004-08-17 18:19:49 [---] 653 double precision MIPS (Whetstone) per CPU
2004-08-17 18:19:49 [---] 1248 integer MIPS (Dhrystone) per CPU
2004-08-17 18:19:49 [---] Finished CPU benchmarks
2004-08-17 18:19:50 [---] Resuming computation and network activity
2004-08-17 18:19:50 [---] CPU scheduler starvation imminent; requesting more work
2004-08-17 18:19:51 [---] CPU scheduler starvation imminent; requesting more work
2004-08-17 18:19:51 [climateprediction.net] Requesting 691200 seconds of work
2004-08-17 18:19:51 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-17 18:19:51 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2004-08-17 18:19:51 [---] General prefs: from climateprediction.net (last modified 2004-08-15 16:37:46)
2004-08-17 18:19:51 [---] General prefs: no separate prefs for home; using your defaults
2004-08-17 18:19:51 [climateprediction.net] Project prefs: no separate prefs for home; using your defaults
2004-08-17 18:19:52 [climateprediction.net] Started download of hadsm3_4.02_i686-pc-linux-gnu
2004-08-17 18:19:52 [climateprediction.net] Started download of hadsm3se_4.02_i686-pc-linux-gnu.zip
2004-08-17 18:19:55 [---] CPU scheduler starvation imminent; requesting more work
2004-08-17 18:19:55 [climateprediction.net] Requesting 691200 seconds of work
2004-08-17 18:19:55 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-17 18:19:57 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2004-08-17 18:19:57 [---] CPU scheduler starvation imminent; requesting more work
2004-08-17 18:19:57 [climateprediction.net] Requesting 691200 seconds of work
2004-08-17 18:19:57 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-17 18:19:58 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2004-08-17 18:19:58 [---] CPU scheduler starvation imminent; requesting more work
2004-08-17 18:19:58 [climateprediction.net] Requesting 691200 seconds of work
2004-08-17 18:19:58 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-17 18:19:59 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2004-08-17 18:20:06 [climateprediction.net] Finished download of hadsm3_4.02_i686-pc-linux-gnu
2004-08-17 18:20:06 [climateprediction.net] Approximate throughput 80435.695686 bytes/sec
2004-08-17 18:20:06 [climateprediction.net] Started download of hadsm3um_4.02_i686-pc-linux-gnu.zip
2004-08-17 18:20:31 [climateprediction.net] Finished download of hadsm3se_4.02_i686-pc-linux-gnu.zip
2004-08-17 18:20:31 [climateprediction.net] Approximate throughput 97371.666877 bytes/sec
2004-08-17 18:20:31 [climateprediction.net] Started download of hadsm3data_4.02_i686-pc-linux-gnu.zip
2004-08-17 18:20:32 [climateprediction.net] Finished download of hadsm3um_4.02_i686-pc-linux-gnu.zip
2004-08-17 18:20:32 [climateprediction.net] Approximate throughput 100634.645973 bytes/sec
2004-08-17 18:20:32 [climateprediction.net] Started download of 03n2_000029703.zip
2004-08-17 18:20:32 [climateprediction.net] Finished download of 03n2_000029703.zip
2004-08-17 18:20:32 [climateprediction.net] Approximate throughput 24594.815698 bytes/sec
2004-08-17 18:20:32 [climateprediction.net] Started download of 005z_000025200.zip
2004-08-17 18:20:33 [climateprediction.net] Finished download of 005z_000025200.zip
2004-08-17 18:20:33 [climateprediction.net] Approximate throughput 23927.738069 bytes/sec
2004-08-17 18:20:33 [climateprediction.net] Started download of 0300_000028873.zip
2004-08-17 18:20:33 [climateprediction.net] Finished download of 0300_000028873.zip
2004-08-17 18:20:33 [climateprediction.net] Approximate throughput 19755.729853 bytes/sec
2004-08-17 18:20:33 [climateprediction.net] Started download of 0052_000025167.zip
2004-08-17 18:20:34 [climateprediction.net] Finished download of 0052_000025167.zip
2004-08-17 18:20:34 [climateprediction.net] Approximate throughput 18356.052990 bytes/sec
2004-08-17 18:20:58 [climateprediction.net] Finished download of hadsm3data_4.02_i686-pc-linux-gnu.zip
2004-08-17 18:20:58 [climateprediction.net] Approximate throughput 165114.078013 bytes/sec
2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 03n2_000029703_1 using hadsm3 version 4.02
2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 005z_000025200_1 using hadsm3 version 4.02
Starting model in /misc/boinc/projects/climateprediction.net...
2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 0300_000028873_1 using hadsm3 version 4.02
2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 0052_000025167_1 using hadsm3 version 4.02
Starting model in /misc/boinc/projects/climateprediction.net...
Archive: hadsm3se_4.02_i686-pc-linux-gnu.zip
inflating: ./hadsm3se_4.02_i686-pc-linux-gnu Archive: hadsm3se_4.02_i686-pc-linux-gnu.zip
Starting model in /misc/boinc/projects/climateprediction.net...
Archive: hadsm3um_4.02_i686-pc-linux-gnu.zip
inflating: ./hadsm3um_4.02_i686-pc-linux-gnu Starting model in /misc/boinc/projects/climateprediction.net...
Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip
creating: 005z_000025200/dataout/
inflating: 005z_000025200/dataout/thist
creating: 005z_000025200/jobs/
inflating: 005z_000025200/jobs/control.stashc
inflating: 005z_000025200/jobs/double.stashc
inflating: 005z_000025200/jobs/Recona.12
inflating: 005z_000025200/jobs/Recona.13
inflating: 005z_000025200/jobs/spec3a_lw_3_asol2c_hadcm3
inflating: 005z_000025200/jobs/spec3a_sw_3_asol2b_hadcm3
inflating: 005z_000025200/jobs/spin.stashc
inflating: 005z_000025200/jobs/yabsd.ihist
inflating: 005z_000025200/jobs/yabsd.PRESM_A
extracting: 005z_000025200/jobs/yabsd.PRESM_O
extracting: 005z_000025200/jobs/yabsd.PRESM_S
extracting: 005z_000025200/jobs/yabsd.PRESM_W
creating: 005z_000025200/tmp/
inflating: 005z_000025200/tmp/cache2 inflating: ./hadsm3se_4.02_i686-pc-linux-gnu
inflating: 005z_000025200/tmp/cp.namelists
extracting: 005z_000025200/tmp/pipe_dummy
creating: 005z_000025200/viz/
inflating: 005z_000025200/viz/globe.rgb
inflating: 005z_000025200/registration_license.txt
creating: 005z_000025200/datain/
creating: 005z_000025200/datain/ancil/
creating: 005z_000025200/datain/ancil/ctldata/
creating: 005z_000025200/datain/ancil/ctldata/stasets/
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01001218
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01002207
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003236
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003237
extracting: 005z_000025200/datain/ancil/ctldata/stasets/X01003254
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003255
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003274
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003275
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003276
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003277
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003278
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003279
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003280
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003281
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003286
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005207
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005208
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005222
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005223
inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01010206
creating: 005z_000025200/datain/ancil/ctldata/STASHmaster/
inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_A
inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_O
inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_S
inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_W
inflating: 005z_000025200/datain/ancil/qrclim.icedp.32
inflating: 005z_000025200/datain/ancil/qrclim.newsst5.32
inflating: 005z_000025200/datain/ancil/qrclim.ozone_preind_corr
inflating: 005z_000025200/datain/ancil/qrclim.uvcurr.32
inflating: ./viz
inflating: ./libGL.so.1
creating: 005z_000025200/datain/dumps/
inflating: 005z_000025200/datain/dumps/slab32_1810.start
inflating: ./viz
inflating: ./libGL.so.1
Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip
creating: 0052_000025167/dataout/
inflating: 0052_000025167/dataout/thist
creating: 0052_000025167/jobs/
inflating: 0052_000025167/jobs/control.stashc
inflating: 0052_000025167/jobs/double.stashc
inflating: 0052_000025167/jobs/Recona.12
inflating: 0052_000025167/jobs/Recona.13
inflating: 0052_000025167/jobs/spec3a_lw_3_asol2c_hadcm3
inflating: 0052_000025167/jobs/spec3a_sw_3_asol2b_hadcm3
inflating: 0052_000025167/jobs/spin.stashc
inflating: 0052_000025167/jobs/yabsd.ihist
inflating: 0052_000025167/jobs/yabsd.PRESM_A
extracting: 0052_000025167/jobs/yabsd.PRESM_O
extracting: 0052_000025167/jobs/yabsd.PRESM_S
extracting: 0052_000025167/jobs/yabsd.PRESM_W
creating: 0052_000025167/tmp/
inflating: 0052_000025167/tmp/cache2
inflating: ./libGLU.so.1
inflating: ./libglut.so.3
inflating: 0052_000025167/tmp/cp.namelists

extracting: 0052_000025167/tmp/pipe_dummy
inflating: ./libGLU.so.1 creating: 0052_000025167/viz/
inflating: 0052_000025167/viz/globe.rgb
inflating: ./hadsm3viz_4.02_i686-pc-linux-gnu
inflating: 005z_000025200/datain/lats
inflating: 005z_000025200/datain/ppcodes
Archive: 005z_000025200.zip

inflating: 005z_000025200/jobs/climate.spin inflating: ./libglut.so.3
inflating: 005z_000025200/jobs/climate.cont
inflating: 005z_000025200/jobs/climate.doub
inflating: 005z_000025200/jobs/ncatts.cpdc
Created shared memory region key = 24390

inflating: ./hadsm3viz_4.02_i686-pc-linux-gnu
inflating: 0052_000025167/registration_license.txt
creating: 0052_000025167/datain/
creating: 0052_000025167/datain/ancil/
creating: 0052_000025167/datain/ancil/ctldata/
creating: 0052_000025167/datain/ancil/ctldata/stasets/
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01001218
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01002207
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003236
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003237
extracting: 0052_000025167/datain/ancil/ctldata/stasets/X01003254
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003255
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003274
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003275
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003276
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003277
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003278
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003279
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003280
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003281
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003286
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005207
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005208
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005222
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005223
inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01010206
creating: 0052_000025167/datain/ancil/ctldata/STASHmaster/
inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_A
Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip
creating: 0300_000028873/dataout/
inflating: 0300_000028873/dataout/thist
creating: 0300_000028873/jobs/
inflating: 0300_000028873/jobs/control.stashc

inflating: 0300_000028873/jobs/double.stashc inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_O
inflating: 0300_000028873/jobs/Recona.12
inflating: 0300_000028873/jobs/Recona.13
inflating: 0300_000028873/jobs/spec3a_lw_3_asol2c_hadcm3
inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_S
inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_W
inflating: 0052_000025167/datain/ancil/qrclim.icedp.32
inflating: 0300_000028873/jobs/spec3a_sw_3_asol2b_hadcm3
inflating: 0300_000028873/jobs/spin.stashc
inflating: 0300_000028873/jobs/yabsd.ihist
inflating: 0300_000028873/jobs/yabsd.PRESM_A
extracting: 0300_000028873/jobs/yabsd.PRESM_O
extracting: 0300_000028873/jobs/yabsd.PRESM_S
extracting: 0300_000028873/jobs/yabsd.PRESM_W
creating: 0300_000028873/tmp/
inflating: 0300_000028873/tmp/cache2
inflating: 0052_000025167/datain/ancil/qrclim.newsst5.32
Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip
creating: 03n2_000029703/dataout/
inflating: 03n2_000029703/dataout/thist
creating: 03n2_000029703/jobs/
inflating: 03n2_000029703/jobs/control.stashc
inflating: 03n2_000029703/jobs/double.stashc
inflating: 03n2_000029703/jobs/Recona.12

inflating: 03n2_000029703/jobs/Recona.13 inflating: 0052_000025167/datain/ancil/qrclim.ozone_preind_corr
inflating: 03n2_000029703/jobs/spec3a_lw_3_asol2c_hadcm3
inflating: 03n2_000029703/jobs/spec3a_sw_3_asol2b_hadcm3
inflating: 03n2_000029703/jobs/spin.stashc
inflating: 03n2_000029703/jobs/yabsd.ihist
inflating: 03n2_000029703/jobs/yabsd.PRESM_A
extracting: 03n2_000029703/jobs/yabsd.PRESM_O
extracting: 03n2_000029703/jobs/yabsd.PRESM_S
extracting: 03n2_000029703/jobs/yabsd.PRESM_W
creating: 03n2_000029703/tmp/
inflating: 03n2_000029703/tmp/cache2
inflating: 0052_000025167/datain/ancil/qrclim.uvcurr.32
creating: 0052_000025167/datain/dumps/
inflating: 0052_000025167/datain/dumps/slab32_1810.start
inflating: 0300_000028873/tmp/cp.namelists
extracting: 0300_000028873/tmp/pipe_dummy
creating: 0300_000028873/viz/
inflating: 0300_000028873/viz/globe.rgb Env Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib
Copying files for startup...
In pre_initialise_phase (part 1 of 3)
In initialise_phase (part 2 of 3)
In startup_phase (part 3 of 3)

inflating: 03n2_000029703/tmp/cp.namelists adding: ncatts.cpdc
(deflated 72%)
extracting: 03n2_000029703/tmp/pipe_dummy
creating: 03n2_000029703/viz/
adding: climate.cont inflating: 03n2_000029703/viz/globe.rgb (deflated 79%)
adding: climate.doub (deflated 79%)
adding: climate.spin (deflated 79%)
adding: 005z_000025200.xml (deflated 66%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)

inflating: 0300_000028873/registration_license.txt
creating: 0300_000028873/datain/
creating: 0300_000028873/datain/ancil/
creating: 0300_000028873/datain/ancil/ctldata/
creating: 0300_000028873/datain/ancil/ctldata/stasets/
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01001218
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01002207
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003236
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003237
extracting: 0300_000028873/datain/ancil/ctldata/stasets/X01003254
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003255
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003274
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003275
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003276
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003277
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003278
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003279
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003280
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003281
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003286
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005207
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005208
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005222
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005223
inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01010206
creating: 0300_000028873/datain/ancil/ctldata/STASHmaster/
inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_A
inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_O
inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_S
inflating: 03n2_000029703/registration_license.txt
inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_W
creating: 03n2_000029703/datain/
creating: 03n2_000029703/datain/ancil/
creating: 03n2_000029703/datain/ancil/ctldata/

creating: 03n2_000029703/datain/ancil/ctldata/stasets/
inflating: 0300_000028873/datain/ancil/qrclim.icedp.32 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01001218
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01002207
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003236
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003237
extracting: 03n2_000029703/datain/ancil/ctldata/stasets/X01003254
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003255
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003274
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003275
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003276
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003277
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003278
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003279
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003280
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003281
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003286
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005207
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005208
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005222
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005223
inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01010206
creating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/
inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_A
inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_O
inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_S
inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_W
inflating: 03n2_000029703/datain/ancil/qrclim.icedp.32
inflating: 0300_000028873/datain/ancil/qrclim.newsst5.32
inflating: 0300_000028873/datain/ancil/qrclim.ozone_preind_corr
inflating: 0300_000028873/datain/ancil/qrclim.uvcurr.32
inflating: 03n2_000029703/datain/ancil/qrclim.newsst5.32
inflating: 0052_000025167/datain/lats
inflating: 0052_000025167/datain/ppcodes
Archive: 0052_000025167.zip
inflating: 0052_000025167/jobs/climate.spin
inflating: 0052_000025167/jobs/climate.cont
inflating: 0052_000025167/jobs/climate.doub
inflating: 0052_000025167/jobs/ncatts.cpdc
Created shared memory region key = 24070

inflating: 03n2_000029703/datain/ancil/qrclim.ozone_preind_corr
creating: 0300_000028873/datain/dumps/
inflating: 0300_000028873/datain/dumps/slab32_1810.start
inflating: 03n2_000029703/datain/ancil/qrclim.uvcurr.32
creating: 03n2_000029703/datain/dumps/
inflating: 03n2_000029703/datain/dumps/slab32_1810.start
inflating: 0300_000028873/datain/lats
inflating: 0300_000028873/datain/ppcodes
Archive: 0300_000028873.zip
inflating: 0300_000028873/jobs/climate.spin
inflating: 0300_000028873/jobs/climate.cont
inflating: 0300_000028873/jobs/climate.doub
inflating: 0300_000028873/jobs/ncatts.cpdc
Created shared memory region key = 24340
Env Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib
Copying files for startup...
In pre_initialise_phase (part 1 of 3)
In initialise_phase (part 2 of 3)
In startup_phase (part 3 of 3)
adding: ncatts.cpdc (deflated 72%)
adding: climate.cont (deflated 79%)
adding: climate.doub (deflated 79%)
adding: climate.spin (deflated 79%)
adding: 0052_000025167.xml
inflating: 03n2_000029703/datain/lats (deflated 66%)
adding: ncatts.cpdc
inflating: 03n2_000029703/datain/ppcodes (deflated 72%)

adding: ncatts.cpdc (deflated 72%)
Archive: 03n2_000029703.zip
inflating: 03n2_000029703/jobs/climate.spin adding: ncatts.cpdc
inflating: 03n2_000029703/jobs/climate.cont (deflated 72%)

inflating: 03n2_000029703/jobs/climate.doub
inflating: 03n2_000029703/jobs/ncatts.cpdc
Created shared memory region key = 24565
Env Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib
Copying files for startup...
In pre_initialise_phase (part 1 of 3)
In initialise_phase (part 2 of 3)
In startup_phase (part 3 of 3)
adding: ncatts.cpdc (deflated 72%)
adding: climate.cont (deflated 79%)
adding: climate.doub (deflated 78%)
adding: climate.spin (deflated 79%)
adding: 0300_000028873.xml (deflated 66%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdcEnv Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib
Copying files for startup...
In pre_initialise_phase (part 1 of 3)
In initialise_phase (part 2 of 3)
In startup_phase (part 3 of 3)
(deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: climate.cont (deflated 79%)
adding: climate.doub (deflated 79%)
adding: climate.spin (deflated 79%)
adding: 03n2_000029703.xml (deflated 66%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
adding: ncatts.cpdc (deflated 72%)
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 03n2_000029703_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 03n2_000029703_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Computation for result 03n2_000029703 finished
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 005z_000025200_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 005z_000025200_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Started upload of 03n2_000029703_1_1.zip
2004-08-17 18:20:59 [climateprediction.net] Started upload of 03n2_000029703_1_2.zip
2004-08-17 18:20:59 [climateprediction.net] Computation for result 005z_000025200 finished
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0300_000028873_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0300_000028873_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Computation for result 0300_000028873 finished
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0052_000025167_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0052_000025167_1 (process exited with code 251 (0xfb))
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-17 18:20:59 [climateprediction.net] Computation for result 0052_000025167 finished
2004-08-17 18:20:59 [climateprediction.net] Finished upload of 03n2_000029703_1_1.zip
2004-08-17 18:20:59 [climateprediction.net] Approximate throughput 4598.970499 bytes/sec
2004-08-17 18:21:00 [climateprediction.net] Started upload of 03n2_000029703_1_3.zip
2004-08-17 18:21:00 [climateprediction.net] Finished upload of 03n2_000029703_1_2.zip
2004-08-17 18:21:00 [climateprediction.net] Approximate throughput 27627.143844 bytes/sec
2004-08-17 18:21:00 [climateprediction.net] Started upload of 03n2_000029703_1_4.zip
2004-08-17 18:21:00 [climateprediction.net] Finished upload of 03n2_000029703_1_3.zip
2004-08-17 18:21:00 [climateprediction.net] Approximate throughput 3673.207244 bytes/sec
2004-08-17 18:21:00 [climateprediction.net] Started upload of 03n2_000029703_1_5.zip
2004-08-17 18:21:00 [climateprediction.net] Finished upload of 03n2_000029703_1_4.zip
2004-08-17 18:21:00 [climateprediction.net] Approximate throughput 4316.120637 bytes/sec
2004-08-17 18:21:00 [climateprediction.net] Started upload of 005z_000025200_1_1.zip
2004-08-17 18:21:01 [climateprediction.net] Finished upload of 03n2_000029703_1_5.zip
2004-08-17 18:21:01 [climateprediction.net] Approximate throughput 4631.697766 bytes/sec
2004-08-17 18:21:01 [climateprediction.net] Started upload of 005z_000025200_1_2.zip
2004-08-17 18:21:01 [climateprediction.net] Finished upload of 005z_000025200_1_1.zip
2004-08-17 18:21:01 [climateprediction.net] Approximate throughput 4700.718916 bytes/sec
2004-08-17 18:21:01 [climateprediction.net] Started upload of 005z_000025200_1_3.zip
2004-08-17 18:21:01 [---] Received signal 2
2004-08-17 18:21:01 [---] Exit requested by user

I'm planning a re-install of FC2 on this box tomorrow - the 3ware raid controller gave me some fits the first time, so this will be a more surgical install of the system - it is the only one of my "big iron" doing this.

jsc (Xcamel)
ID: 1155 · Report as offensive     Reply Quote
Helmer Bryd

Send message
Joined: 16 Aug 04
Posts: 147
Credit: 7,911,957
RAC: 8,948
Message 1297 - Posted: 20 Aug 2004, 1:31:05 UTC

I have problems on my both Linux boxes, RH8 and FC1, keeps crashing.
They are downclocked now to moderate settings but still unstable in CPDN, hmm..?
ID: 1297 · Report as offensive     Reply Quote
old_user73

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 14,887
RAC: 0
Message 1553 - Posted: 24 Aug 2004, 6:15:27 UTC - in response to Message 1297.  

> I have problems on my both Linux boxes, RH8 and FC1, keeps crashing.
> They are downclocked now to moderate settings but still unstable in CPDN,
> hmm..?

Now this is weird since I tested it on RH9 and it worked fine... RH8 and FC1 shares quite a lot of code with RH9...
ID: 1553 · Report as offensive     Reply Quote
old_user73

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 14,887
RAC: 0
Message 4565 - Posted: 23 Sep 2004, 20:26:16 UTC - in response to Message 1553.  

Ok, problem solved!

It turned out to be a problem in the 2.6.5-gentoo-r1 kernel scheduler (I guess). Updating to the new 2.6.8-gentoo-r4 solved all problems! =)
ID: 4565 · Report as offensive     Reply Quote
Jean-David Beyer

Send message
Joined: 5 Aug 04
Posts: 1055
Credit: 16,516,801
RAC: 955
Message 4598 - Posted: 24 Sep 2004, 12:22:44 UTC - in response to Message 298.  

> OK from those error messages obviously the Fortran code is causing trouble.
> Do you have a /usr/lib/ifcore_msg.cat? I think it's an Intel Fortran library,
> so perhaps it's a conflict with other libraries you may have installed?
>
I DO NOT HAVE THIS PROBLEM. (In fact, ClimatePrediction seems to be running OK for me.)

I run Red Hat Enterprise Linux 3 ES.

But I am curious. I have no ifcore_msg.cat anywhere on my system. I hve no ifcore anywhere on my system. I have only the following .cat files on my system.

$ locate .cat
/homeB/jdbeyer/W95/quickenw/Intellic.cat
/opt/IBM/db2/V8.1/msg/en_US.iso88591/db2icons.cat
/opt/IBM/db2/V8.1/msg/en_US.iso88591/db2inst.cat
/opt/IBM/db2/V8.1/msg/en_US.iso88591/db2install.cat
/opt/IBM/db2/V8.1/msg/en_US.iso88591/db2istring.cat
/usr/src/linux-2.4.21-20.EL/drivers/usb/.catc.o.flags
/usr/src/linux-2.4.21-20.EL/fs/hfs/.catalog.o.flags
/usr/src/linux-2.4.21-15.0.3.EL/drivers/usb/.catc.o.flags
/usr/src/linux-2.4.21-15.0.3.EL/fs/hfs/.catalog.o.flags
/usr/share/apps/ksgmltools2/docbook/xml-dtd-4.1.2/docbook.cat
/usr/share/linuxdoc-tools/linuxdoc-tools.catalog
/etc/sgml/sgml-docbook.cat
/etc/sgml/xml-docbook.cat
/etc/sgml/sgml-docbook-3.0-1.0-17.2.cat
/etc/sgml/sgml-docbook-3.1-1.0-17.2.cat
/etc/sgml/sgml-docbook-4.0-1.0-17.2.cat
/etc/sgml/sgml-docbook-4.1-1.0-17.2.cat
/etc/sgml/xml-docbook-4.1.2-1.0-17.2.cat
/etc/sgml/sgml-docbook-4.2-1.0-17.2.cat
/etc/sgml/xml-docbook-4.2-1.0-17.2.cat

Can you really assume the existance of such files in all Linux distributions? My guess is that Red Hat Enterprise Linux is "fairly standard", whatever that may mean.

Also, I am not sure what you mean by Intel Fortran Library since most Linux systems I know of run GNU compilation systems (e.g., gcc, g++, g77).

I assume that by statically linking the client applications, you are evading this problem, but if so, why ask the O.P. about that particular library (if that is what it is)? I thought libraries ended in .a or .so... Perhaps your post came before you started statically linking.
ID: 4598 · Report as offensive     Reply Quote

Questions and Answers : Unix/Linux : Model crashing...is it me?

©2024 climateprediction.net