climateprediction.net home page
signal 11 when scheduler request failed

signal 11 when scheduler request failed

Questions and Answers : Unix/Linux : signal 11 when scheduler request failed
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user35920

Send message
Joined: 13 Jan 05
Posts: 6
Credit: 48,878
RAC: 0
Message 20057 - Posted: 9 Feb 2006, 2:33:22 UTC

Hello,

my sulphur model with result-ID 1610846 just crashed, when there was a problem with my internet connection.

Here is a snippet of the log:

<snip>
...
Mi 08 Feb 2006 19:50:52 CET|Einstein@Home|Computation for result r1_1159.0__1226_S4R2a_0 finished
Mi 08 Feb 2006 19:50:52 CET|climateprediction.net|Resuming result sulphur_ij30_000864540_0 using sulphur_cycle version 422
Mi 08 Feb 2006 19:50:55 CET|Einstein@Home|Started upload of r1_1159.0__1226_S4R2a_0_0
Mi 08 Feb 2006 19:50:56 CET|LHC@home|Sending scheduler request to http://lhcathome-sched1.cern.ch/scheduler/cgi
Mi 08 Feb 2006 19:50:56 CET|LHC@home|Reason: To fetch work
Mi 08 Feb 2006 19:50:56 CET|LHC@home|Requesting 86400 seconds of new work
Mi 08 Feb 2006 19:51:37 CET||Couldn\'t resolve hostname [lhcathome-sched1.cern.ch]
Mi 08 Feb 2006 19:51:37 CET||Couldn\'t resolve hostname [einstein.phys.uwm.edu]
Mi 08 Feb 2006 19:51:37 CET|climateprediction.net|Unrecoverable error for result sulphur_ij30_000864540_0 (process got signal 11)
Mi 08 Feb 2006 19:51:37 CET||request_reschedule_cpus: process exited
Mi 08 Feb 2006 19:51:37 CET|Einstein@Home|Temporarily failed upload of r1_1159.0__1226_S4R2a_0_0: can\'t resolve hostname
Mi 08 Feb 2006 19:51:37 CET|Einstein@Home|Backing off 1 minutes and 0 seconds on upload of file r1_1159.0__1226_S4R2a_0_0
Mi 08 Feb 2006 19:51:37 CET|climateprediction.net|Computation for result sulphur_ij30_000864540_0 finished
Mi 08 Feb 2006 19:51:37 CET|Einstein@Home|Starting result r1_1159.0__1408_S4R2a_3 using albert version 440
Mi 08 Feb 2006 19:51:37 CET|LHC@home|Scheduler request to http://lhcathome-sched1.cern.ch/scheduler/cgi failed with a return value of -113
Mi 08 Feb 2006 19:51:37 CET|LHC@home|No schedulers responded
...
<snap>

There are also some errors in the stderr about GLUT and X-connection problems but i think they are old, because I had a permission problem some time ago and it is solved now.

It might be worth mentioning that my general preferences are set to \"leave in memory when preempted\".

The BOINC client version is 5.2.13, and the client is also attached to SETI@home, LHC@home and Einstein@home.

Can anybody tell me, why it happened and also how to make it not happen again?

Thanks in advance.

Cheers
ID: 20057 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 21094 - Posted: 6 Mar 2006, 3:19:27 UTC

Sorry that no one replied to this post before. (There\'s been a bit of distraction...)

Not enough to go on, actually. Given the Einstein/LHC/CPDN troubles all in a pot makes me wonder whether the machine is overclocked and one glitch brought down the lot.

Given no recent posts, I assume you\'ve sorted the problem(s). What did you find? (Nice to see that you\'re still Trickling.)
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 21094 · Report as offensive     Reply Quote
Profile geophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2167
Credit: 64,403,322
RAC: 5,085
Message 21107 - Posted: 6 Mar 2006, 22:34:53 UTC

Unfortunately, unless he\'s downgraded to sulphur 4.21, it would appear he will crash again about halfway through the first phase.
ID: 21107 · Report as offensive     Reply Quote

Questions and Answers : Unix/Linux : signal 11 when scheduler request failed

©2024 climateprediction.net