Misconfiguration e-mail

Message boards : climateprediction.net Science : Misconfiguration e-mail

old_user520254
Joined: 30 May 08
Posts: 1
Credit: 59,224
RAC: 0
Message 45606 - Posted: 6 Mar 2013, 10:05:21 UTC

Apparently I have a "misconfigured BOINC", but I don't know what to do.

Therefore, I have upgraded BOINC to 7.0.31, without configuring anything.
I'm using OS X 10.8.2.

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=1183075

Could you please advise me?
ID: 45606
Iain Inglis
Volunteer moderator
Joined: 16 Jan 10
Posts: 1079
Credit: 6,906,534
RAC: 6,466
Message 45607 - Posted: 6 Mar 2013, 12:22:53 UTC
Last modified: 6 Mar 2013, 12:23:58 UTC

That machine hasn't been active for a while, so I'm not sure why you've got the message now. However, if you look at the task record for that computer (here) you will see a long series of crashes. This is caused by a Mac permissions problem (described here). The solution is to reset the project (or remove it) each time BOINC is upgraded. So, as you've just upgraded, a reset is in order.

The problem is shown in the task logs, such as this extract from a HADCM3N crash:
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 139275) failed!
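An `execl(...) failed!` line like the one above is what you would expect if the model binary cannot be executed. As a rough diagnostic only, the sketch below lists files in the project folder that have no execute bit set. The folder path is taken from the log extract, and the function name is made up for illustration. Data files are legitimately non-executable, and the real cause may be ownership rather than the execute bit, so the reset described above remains the fix.

```python
import stat
from pathlib import Path

# Assumed location, copied from the log extract; adjust for your installation.
PROJECT_DIR = Path(
    "/Library/Application Support/BOINC Data/projects/climateprediction.net"
)


def files_missing_exec_bit(directory):
    """Return names of regular files in `directory` with no execute bit set."""
    exec_bits = stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH
    missing = []
    for entry in sorted(Path(directory).iterdir()):
        # Only regular files are checked; sub-folders are skipped.
        if entry.is_file() and not (entry.stat().st_mode & exec_bits):
            missing.append(entry.name)
    return missing


if __name__ == "__main__":
    if PROJECT_DIR.is_dir():
        for name in files_missing_exec_bit(PROJECT_DIR):
            print("not executable:", name)
    else:
        print("project directory not found:", PROJECT_DIR)
```

If a file whose name looks like a model application (for example one beginning `hadcm3n_um_`) shows up in the output, that would be consistent with the permissions problem.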

The good news is that the new HADAM3P applications arriving shortly will not have this problem.

If you post back here noting the reset (or remove/re-attach) then I'll pass the message back to the project and they'll reset that machine's quota to the normal value.
ID: 45607
Belfry
Joined: 19 Apr 08
Posts: 179
Credit: 4,306,992
RAC: 0
Message 46444 - Posted: 18 Jun 2013, 13:17:21 UTC

This user's AMD machines error out all the time with a variety of stderr messages.

He/she's our #2 RAC contributor. Would be nice if a few of those machines were producing better science.
ID: 46444
Greg van Paassen
Joined: 17 Nov 07
Posts: 142
Credit: 4,271,370
RAC: 0
Message 46450 - Posted: 18 Jun 2013, 21:43:56 UTC - in response to Message 46444.  

Hi Belfry,

there was some discussion about this with Ba two weeks ago on this thread. Ba is aware of the problem and agrees with you, I think.
ID: 46450
Greg van Paassen
Joined: 17 Nov 07
Posts: 142
Credit: 4,271,370
RAC: 0
Message 46451 - Posted: 18 Jun 2013, 21:55:32 UTC

My favourite is EDGeSUser. 5307 and 2469 models crashed, none even started successfully. Host IDs 1218239 and 1218323.
ID: 46451
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 4314
Credit: 16,379,331
RAC: 3,596
Message 46457 - Posted: 19 Jun 2013, 12:55:55 UTC

The wingman on one I am downloading at the moment, 393617, is crashing everything in sight with Darwin.
ID: 46457
old_user168399
Joined: 24 Feb 06
Posts: 5
Credit: 459,249
RAC: 0
Message 46479 - Posted: 21 Jun 2013, 17:11:54 UTC

Have just received message that computer is misconfigured and is crashing climate models. Told to post here for advice, post what exactly?
Told no need to post database entry for my computer so what do I post?

Confused.
ID: 46479
Iain Inglis
Volunteer moderator
Joined: 16 Jan 10
Posts: 1079
Credit: 6,906,534
RAC: 6,466
Message 46480 - Posted: 21 Jun 2013, 17:22:56 UTC - in response to Message 46479.  

Mark,

In this case the fix is simple. Macs have a permissions problem that causes models to crash on starting. A project reset should re-establish the permissions.

This process needs to be repeated each time BOINC Manager is upgraded.

Iain
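For anyone who prefers the command line to BOINC Manager, a reset can probably also be requested through `boinccmd`, the command-line tool shipped with BOINC. This is a minimal sketch, not project-endorsed instructions: it assumes `boinccmd` is on the PATH, that the client is running, and that `http://climateprediction.net/` is the project URL your client is attached to. The helper names are made up for illustration.

```python
import subprocess

# Assumed project URL; check the URL shown for the project in BOINC Manager.
PROJECT_URL = "http://climateprediction.net/"


def project_op_command(project_url, op):
    """Build a `boinccmd --project URL op` argument list."""
    allowed = {"reset", "detach", "update"}
    if op not in allowed:
        raise ValueError(f"unsupported operation: {op}")
    return ["boinccmd", "--project", project_url, op]


def run(cmd, dry_run=True):
    """Print the command; execute it only when dry_run is False."""
    print(" ".join(cmd))
    if dry_run:
        return 0
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # Dry run by default: shows the command without touching the client.
    run(project_op_command(PROJECT_URL, "reset"))
```

The dry-run default is deliberate, since a reset discards any work in progress for the project.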
ID: 46480
MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 46481 - Posted: 21 Jun 2013, 17:26:55 UTC
Last modified: 21 Jun 2013, 17:31:02 UTC

It looks like any climate models downloaded to your PC are crashing out with a permission problem. The recommended solution is to detach from and reattach to CPDN. This will force a new copy of the executables to be downloaded, and these will be given the proper permissions.

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7231&nowrap=true#42579

Normally this problem is caused by updating the version of BOINC Manager on your PC, but it looks like yours has not been updated recently. However, at least one other person has experienced this problem without having done an update (and a detach/reattach fixed that too).
http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7385


--- Edit: Iain is much quicker off the mark than me :-)
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 46481
old_user168399
Joined: 24 Feb 06
Posts: 5
Credit: 459,249
RAC: 0
Message 46483 - Posted: 21 Jun 2013, 18:00:02 UTC - in response to Message 46481.  

Ok, guys thanks for the advice.
Tried resetting CPDN with no effect, so tried detaching. Well, that part worked (funny how easy it is to bin something), but now I cannot re-attach (surprise, surprise). I keep getting an error message telling me there was a failure to attach to project.
ID: 46483
MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 46484 - Posted: 21 Jun 2013, 18:38:27 UTC


Ah yeah, I forgot about that: there is a bug with attaching which they are still working to fix. The easiest workaround is to copy the account_climateprediction.net.xml file back into the BOINC folder, if you still have a copy of it. The next alternative is to attach using the BOINC command line, and the final alternative is to wait until attaching is fixed.
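As a rough idea of what the command-line route looks like, the sketch below builds a `boinccmd --project_attach` call. It is only an illustration under several assumptions: `boinccmd` is on the PATH, the BOINC client is running, `http://climateprediction.net/` is the right project URL, and `YOUR_ACCOUNT_KEY` is a placeholder for the account key shown on your account page on the project web site. The helper names are made up, and whether this route gets around the current attach bug is not confirmed here.

```python
import subprocess

# Assumed values: replace both before doing a real run.
PROJECT_URL = "http://climateprediction.net/"
ACCOUNT_KEY = "YOUR_ACCOUNT_KEY"  # placeholder, not a real key


def attach_command(project_url, account_key):
    """Build the boinccmd argument list for attaching to a project."""
    return ["boinccmd", "--project_attach", project_url, account_key]


def run(cmd, dry_run=True):
    """Print the command; execute it only when dry_run is False."""
    print(" ".join(cmd))
    if dry_run:
        return 0
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # Dry run by default: prints the command so it can be checked first.
    run(attach_command(PROJECT_URL, ACCOUNT_KEY))
```

The printed line can also be typed directly into a terminal. On some installations `boinccmd` may need to be run from the BOINC data folder so that it can find the client's password file.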


http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7558


I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 46484
Iain Inglis
Volunteer moderator
Joined: 16 Jan 10
Posts: 1079
Credit: 6,906,534
RAC: 6,466
Message 46485 - Posted: 21 Jun 2013, 18:40:42 UTC

Doing the 'detach' may have been precipitate, though it's too late now. If you look at the details of that machine (here) you will see towards the bottom a field called Maximum daily WU quota per CPU, which is currently set to -1. That needs to be reset to a positive value by the project staff before the machine will get any more models. So the 'reset' might well have worked.

I'll pass on the message that the machine should no longer be a serial crasher.

Dealing with the attaching problem is discussed here, among others.

[Oops, Mike and I treading on each other's toes ...]
ID: 46485
old_user168399
Joined: 24 Feb 06
Posts: 5
Credit: 459,249
RAC: 0
Message 46486 - Posted: 21 Jun 2013, 20:30:56 UTC - in response to Message 46485.  

Ok, guys, suspected that detaching might have been a bit previous but...heigh-ho.

Just about remember / understand what a command line is, but have no idea how or, more relevantly, where to do it. In BOINC, but how?
ID: 46486
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 46487 - Posted: 21 Jun 2013, 20:57:18 UTC - in response to Message 46486.  

There's a wiki about it here.
There's a copy of it in the programs part of my BOINC setup, so I guess it's part of the standard download.

As for Reset/Detach; a year or so back, it was found that the Reset didn't work, so the full Detach was needed. I don't know if recent versions of OS/BOINC are different.

The original sticky post about the problem is here, at the top of the Macintosh section of this board.


ID: 46487
old_user168399
Joined: 24 Feb 06
Posts: 5
Credit: 459,249
RAC: 0
Message 46494 - Posted: 22 Jun 2013, 9:15:20 UTC - in response to Message 46487.  

Thanks all who have tried to sort / explain the issue here.
But in effect it means no more CPDN for me or my computer, as I have a snowball's chance in Hell of being able to do the command line fix myself.

If I had somebody sat next to me to walk me through it I could do it - I'm not that incapable. I have searched BOINC and tried following up on the suggested links for potential ways in, but...
For the 'what goes on under the computer bonnet' peeps out there the fix is probably easy-peasy.
Hope the same does not happen to SETI as I have been running that for some time now and would be disappointed if I had to stop.
ID: 46494
Iain Inglis
Volunteer moderator
Joined: 16 Jan 10
Posts: 1079
Credit: 6,906,534
RAC: 6,466
Message 46495 - Posted: 22 Jun 2013, 12:05:14 UTC - in response to Message 46494.  

Mark: It might be a good idea to subscribe to the news thread, as there's bound to be a post there when this attach problem is finally sorted out ...
ID: 46495
old_user168399
Joined: 24 Feb 06
Posts: 5
Credit: 459,249
RAC: 0
Message 46497 - Posted: 22 Jun 2013, 15:31:18 UTC - in response to Message 46495.  

Iain, I have subscribed as suggested and will await news.
I tried following links in the FAQs about command line arguments - my server could not find any of them at boinc-wiki!

Patience will need to be a virtue. Cheers for the help.
ID: 46497
tednetbook
Joined: 4 Dec 10
Posts: 1
Credit: 2,466,608
RAC: 0
Message 47086 - Posted: 17 Sep 2013, 23:08:22 UTC

climateprediction.net notification:

Dear tednetbook
Your computer (host # 1186330) described below appears to have a misconfigured BOINC
installation and is crashing models. Would you please have a look at it?

If you need assistance, please post in this thread on our BOINC forums and we will suggest a way to fix the problem. You may post in any language:

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=6880

Please include this link so that we may more easily find your computer:

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=1186330

When you have applied the fix please post to say so. Until the problem is fixed no more work will be sent to your computer.


Sincerely,
The climateprediction.net team
ID: 47086
Lockleys
Joined: 13 Jan 07
Posts: 195
Credit: 10,581,566
RAC: 0
Message 47121 - Posted: 19 Sep 2013, 9:36:38 UTC
Last modified: 19 Sep 2013, 9:38:49 UTC

One of my wingmen with Computer 985494 has a very long string of failures. He's obviously got problems. Probably worth a look.

EDIT. And also the other wingman with Computer 1286362 has a similar long string of failures.
ID: 47121
mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 47147 - Posted: 21 Sep 2013, 18:58:57 UTC

Hello Tednetbook

Thank you for posting. Here are your tasks:

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/results.php?hostid=1186330

The models you're running at the moment seem to be doing nicely but quite a few models have crashed in the past on this computer (your other's doing very well). If you look at the computer's page of tasks, then click on a task's (model's) ID in the left-hand column, then on the model page click on stderr+ you'll see the BOINC messages for that model/task. Not all of the messages are errors!

BOINC doesn't always produce exactly the same error code for tasks that crash at the same moment, probably caused by the same problem. In addition, some error codes can be caused by lots of different problems, so it isn't always easy to diagnose definite problems.

However, I see that more than once more than one model seems to have crashed at the same time. Some crashed models in this category have the error code 25.
Code 25 can be caused by the whole computer crashing, e.g. a bluescreen event, or by the computer spontaneously restarting or shutting down without your permission.

Has this computer in fact suffered this problem occasionally?

If this is the case I'd recommend some general stability testing and cleaning up. Run Disk Cleanup, Defragmenter, Windows' own memory tester and Chkdsk. If you can't find any of these programs on the computer, please ask us where they are.

Do all of this after suspending all the tasks in BOINC Manager then exiting from BOINC completely. To exit completely, right-click on the BOINC icon in the System Tray (notification area) then select Exit. The icon will disappear.

Another possible cause of crashed models could be that you don't usually completely exit from BOINC before shutting down or restarting the computer. It isn't intuitively obvious that this should be done, but not exiting from BOINC before shutting down will sooner or later crash some models.

It's also a good idea to exclude BOINC from the computer's antivirus scans. Or else only allow the AV to carry out scans when you choose (rather than according to a schedule when BOINC could be running), and exit from BOINC before running scans.

Please let us know how you're getting on with this so we can ask Jonathan to unminus your computer asap.
Cpdn news
ID: 47147