Questions and Answers :
boinc runs rosetta fine although climateprediction crashes at startup?
Posted 6 Jun 2006 by old_user189455
Logs included below although the story is I have multiple xeon servers running linux, all running boinc & previously 100% rosetta. I was wanting to run some climateprediction although get what appears to be the same crash on every server.
Run trustix 2.2 as the distribution (designed for minimal install servers), kernel 2.6, glibc 2.3 all the essentials although no x windows as its not needed and I dont want boinc graphics.
ldd on the climate binary shows all required libraries are present:
libpthread.so.0 => /lib/libpthread.so.0 (0x40019000)
libc.so.6 => /lib/libc.so.6 (0x4006b000)
libdl.so.2 => /lib/libdl.so.2 (0x4019b000)
/lib/ld-linux.so.2 => /lib/ld-linux.so.2 (0x40000000)
logs as follows:
Unzipping workunit data file...
Created shared memory region key = 87855 of size 655036 bytes
Reconstructing ancillary file DMSNH3SO2A1B_19182082N, this may take a minute!
Reconstructing ancillary file SULPC_OXIDANTS_19_A2_1990, this may take a minute!
Reconstructing ancillary file ozone_hadcm3_1919_2082, this may take a minute!
Can\'t setup pointer to .so shared memory hadcm3trans_5.08_i686-pc-linux-gnu: undefined symbol: setupSharedMem!
Can\'t setup pointer to .so graphics cleanup hadcm3trans_5.08_i686-pc-linux-gnu: undefined symbol: graphics_thread_cleanup
Copying files for startup...
Copying climate.cpdc files...
Starting model ID hadcm3lbm_avh7_05276622 Phase 1
Climate model starting - use graphics to monitor progress.
Or visit the website to see the graphs for this run.
Preparing for restart...
Rewinding a model-day...
Cleaning up graphics data...
Detaching shared memory...
2006-06-06 18:22:16 [---] Rescheduling CPU: application exited
2006-06-06 18:22:16 [climateprediction.net] Computation for task hadcm3lbm_avh7_05276622_1 finished
2006-06-06 18:22:16 [rosetta@home] Resuming task FRA_t311_CASP7_hom001_2_1b0nA_IGNORE_THE_REST_244_663_3_0 using rosetta version 516
2006-06-06 18:22:17 [climateprediction.net] Unrecoverable error for result hadcm3lbm_avh7_05276622_1 (file_xfer_error
error code -161
Same error continues with various filenames:
2006-06-06 18:22:17 [climateprediction.net] Deferring scheduler requests for 1 minutes and 0 seconds
2006-06-06 18:23:20 [climateprediction.net] Sending scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2006-06-06 18:23:20 [climateprediction.net] Reason: To report completed tasks
2006-06-06 18:23:20 [climateprediction.net] Requesting 172800 seconds of new work, and reporting 1 completed tasks
2006-06-06 18:23:26 [climateprediction.net] Scheduler request succeeded
2006-06-06 18:23:26 [climateprediction.net] Message from server: No work sent
2006-06-06 18:23:26 [climateprediction.net] Message from server: (reached daily quota of 2 results)