New work Discussion

Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7149
Credit: 22,254,291
RAC: 18,428
Message 58645 - Posted: 26 Aug 2018, 1:46:01 UTC

Batch 738 had a setup error (INANCILA, which means mismatched data files).
A message was posted asking people to abort those tasks.

******************

Batch 742, the sam25's.
Yes, there were a lot of failures with these.
The project person checked everything and couldn't find anything wrong.
And I had run several that already had 1 or 2 failures, and there weren't any problems.
So we decided it was most likely just people's computers, and a sensitive modeling area. And I've since run lots of sam25's that had failed on other computers, all with no problems.

Possibly a lot of people were/are running BOINC with the default "training wheels" settings. This apparently works well with other projects, but is a computing hazard with cpdn.
ID: 58645
Alex Plantema
Joined: 3 Sep 04
Posts: 122
Credit: 25,428,769
RAC: 13,745
Message 58646 - Posted: 26 Aug 2018, 7:02:24 UTC

Now you posted it in this thread. That isn't where people are looking for it when they need it. That's why I suggested using a central place for it, like the batch list.
ID: 58646
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7149
Credit: 22,254,291
RAC: 18,428
Message 58647 - Posted: 26 Aug 2018, 7:35:29 UTC
Last modified: 26 Aug 2018, 7:42:21 UTC

I post where people ask about something.
Let's wait and see if there are any further problems before fixing a posting place.
ID: 58647
Bill F
Joined: 17 Jan 09
Posts: 66
Credit: 952,287
RAC: 807
Message 58650 - Posted: 28 Aug 2018, 2:43:28 UTC - in response to Message 58641.  
Last modified: 28 Aug 2018, 2:45:21 UTC

Les

That i5-5200U system is a Dell Inspiron laptop. Not real well endowed at 6 GB RAM, and it is not crunching for about 5 hours per day (heat of the day, hot room issue).

It is also crunching for 12 other projects.

If the numbers still look like Training Wheel numbers I am open to suggestions.

Thanks
Bill F
ID: 58650
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7149
Credit: 22,254,291
RAC: 18,428
Message 58658 - Posted: 28 Aug 2018, 21:35:06 UTC

There are 3 options in "Computing preferences", which are set for the majority of projects in a way so as not to scare off new crunchers.

One is Suspend when non-BOINC CPU usage is above
This is set to stop BOINC when the person uses the computer.

Two is Use at most 100% of the CPUs

Three is Leave non-GPU tasks in memory while suspended

All of which apparently work nicely for other projects at the default settings, but not cpdn.

Here, One should be set to 100%.
(Just look at all of the Suspend requests from BOINC in your tasks.)

Two should be set to something less than the total number of cores in the processor.
From experiments, I've found that my hyperthreaded Intels work best (fastest), when set to just use the real cores. (4 out of 8.)

Three should be set to Yes. This allows for the possibility of saving the current image when cpdn is swapped out for another project's task.
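
As a rough sketch of the three settings above, the Python below works out the values for a given machine and renders them as an XML fragment. The tag names (`suspend_cpu_usage`, `max_ncpus_pct`, `leave_apps_in_memory`) are assumed to match the ones BOINC uses in its `global_prefs_override.xml` file, and the helper names `cpdn_prefs` and `to_override_xml` are made up for illustration, so check against the BOINC documentation before relying on it.

```python
from xml.etree import ElementTree as ET


def cpdn_prefs(logical_cpus: int, threads_per_core: int = 2) -> dict:
    """Return preference values following the advice in this post.

    logical_cpus: number of CPUs BOINC sees (8 on a hyperthreaded quad core).
    threads_per_core: assumed to be 2 for hyperthreaded Intel processors.
    """
    real_cores = max(1, logical_cpus // threads_per_core)
    return {
        # One: "Suspend when non-BOINC CPU usage is above", set to 100 as advised.
        "suspend_cpu_usage": 100.0,
        # Two: "Use at most N% of the CPUs", limited to the real cores.
        "max_ncpus_pct": 100.0 * real_cores / logical_cpus,
        # Three: "Leave non-GPU tasks in memory while suspended", set to yes.
        "leave_apps_in_memory": 1,
    }


def to_override_xml(prefs: dict) -> str:
    """Render the values as a <global_preferences> fragment (assumed layout)."""
    root = ET.Element("global_preferences")
    for name, value in prefs.items():
        ET.SubElement(root, name).text = str(value)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # 8 logical CPUs with 2 threads per core gives 4 real cores, i.e. 50%.
    print(to_override_xml(cpdn_prefs(logical_cpus=8)))
```

For the "4 out of 8" case mentioned above, this gives a CPU limit of 50%. The same three options can of course be set by hand in the BOINC Manager instead.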

**********************

Whatever the problem, crashing every task it gets is not very useful.
It would be better used just for the other projects on it.
ID: 58658
Hammy
Joined: 24 Aug 07
Posts: 7
Credit: 722,205
RAC: 0
Message 58660 - Posted: 28 Aug 2018, 22:18:28 UTC
Last modified: 28 Aug 2018, 22:43:32 UTC

Can somebody help me by explaining why I don't get any work for Climate, please? I have been with the project since 2007, but for many months now I get nothing. Some while ago, one unit did turn up, was processed and weeks later has still not shown up in my user stats. Do I have a problem, or is this whole thing shutting down?

My computer is fine, a Mac, and I don't have this issue with most of the other projects I am linked to.
ID: 58660
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7149
Credit: 22,254,291
RAC: 18,428
Message 58661 - Posted: 28 Aug 2018, 23:49:17 UTC - in response to Message 58660.  

Simple.
There's no work for Macs, as the programs currently being used haven't been ported to the Macs. They tried, but had lots of problems.
Also, see this post in the Macintosh section of this board.
ID: 58661
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 2634
Credit: 3,143,917
RAC: 335
Message 58662 - Posted: 29 Aug 2018, 6:15:02 UTC - in response to Message 58661.  

There's no work for Macs, as the programs currently being used haven't been ported to the Macs.


The two batches of hadam3cs tasks were available for Mac but unless you were trying to download them at the right time you could easily have missed out.

https://www.cpdn.org/cpdnboinc/workunit.php?wuid=11622111 This is one completed by a Mac. However, my crystal ball is not telling me anything about more of this task type in the pipeline at the moment. (Actually it isn't telling me anything about other batches in the pipeline either.)
ID: 58662
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 2634
Credit: 3,143,917
RAC: 335
Message 58678 - Posted: 2 Sep 2018, 9:26:03 UTC

A couple more batches have been released, 752 and 753: 3-month and 16-month pnw25 tasks respectively.
ID: 58678
Iain Inglis
Volunteer moderator
Joined: 16 Jan 10
Posts: 1007
Credit: 4,416,747
RAC: 15,472
Message 58732 - Posted: 8 Sep 2018, 16:08:33 UTC
Last modified: 8 Sep 2018, 16:10:00 UTC

A large batch of 13,500 Pacific North-West models has been released, Batch #754 (batch list) - so the queue is pretty large.
ID: 58732
Iain Inglis
Volunteer moderator
Joined: 16 Jan 10
Posts: 1007
Credit: 4,416,747
RAC: 15,472
Message 58747 - Posted: 11 Sep 2018, 10:42:26 UTC
Last modified: 11 Sep 2018, 21:01:03 UTC

New batches of 200 11-month and 10 120-month work units have just come out for a new region: batch #755 and batch #756 for the Atlantic region at 50 km (new region, batch list).

[Edit: Corrected from one to two batches.]
ID: 58747
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 2634
Credit: 3,143,917
RAC: 335
Message 58783 - Posted: 20 Sep 2018, 12:36:23 UTC

There is now a batch of 28 month pnw25 tasks out there. Batch 757

Some of these are already on their third attempt with a combination of download errors and being aborted. I suspect this is the download server being out of action referred to elsewhere.
ID: 58783
mmonnin
Joined: 28 May 17
Posts: 41
Credit: 4,563,779
RAC: 14,345
Message 58789 - Posted: 20 Sep 2018, 22:15:36 UTC - in response to Message 58783.  

There is now a batch of 28 month pnw25 tasks out there. Batch 757

Some of these are already on their third attempt with a combination of download errors and being aborted. I suspect this is the download server being out of action referred to elsewhere.


I've had some of these for 3 days waiting on the download server.
ID: 58789
Iain Inglis
Volunteer moderator
Joined: 16 Jan 10
Posts: 1007
Credit: 4,416,747
RAC: 15,472
Message 58854 - Posted: 15 Oct 2018, 13:10:40 UTC

There's a new batch of 200 24-month HADCM3S models, Batch #759, which work on Linux and Mac as well as Windows (batch list).
ID: 58854
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 2634
Credit: 3,143,917
RAC: 335
Message 58855 - Posted: 15 Oct 2018, 14:15:50 UTC - in response to Message 58854.  
Last modified: 15 Oct 2018, 14:23:01 UTC

There's a new batch of 200 24-month HADCM3S models, Batch #759,


All gone now and having poked around a little it looks like they are failing to download, as in this work unit.

https://www.cpdn.org/cpdnboinc/workunit.php?wuid=11651129

Edit: email sent.
ID: 58855
Jean-David Beyer
Joined: 5 Aug 04
Posts: 372
Credit: 3,473,738
RAC: 3,861
Message 58856 - Posted: 15 Oct 2018, 19:09:18 UTC - in response to Message 58855.  

I got two that are currently running on RHEL 6.10, 64-bit.

Red Hat Enterprise Linux Server release 6.10 (Santiago)
Kernel: 2.6.32-754.6.3.el6.x86_64

These are the required libraries.
$ ldd hadcm3s_8.34_i686-pc-linux-gnu
linux-gate.so.1 => (0x00d87000)
libpthread.so.0 => /lib/libpthread.so.0 (0x00cef000)
libdl.so.2 => /lib/libdl.so.2 (0x00d0c000)
libstdc++.so.6 => /usr/lib/libstdc++.so.6 (0x00e39000)
libm.so.6 => /lib/libm.so.6 (0x00da1000)
libgcc_s.so.1 => /lib/libgcc_s.so.1 (0x006d2000)
libc.so.6 => /lib/libc.so.6 (0x00b4b000)
/lib/ld-linux.so.2 (0x56611000)
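
The binary is a 32-bit (i686) program, so on a 64-bit system the 32-bit versions of these libraries need to be installed. As a small sketch, assuming `ldd` marks an unresolved library with the text `not found`, the following Python scans `ldd` output for missing libraries. The function name `missing_libraries` and the sample text (including its "not found" line) are hypothetical and are not taken from the listing above.

```python
import re

# Matches lines such as "libm.so.6 => /lib/libm.so.6 (0x00da1000)"
# and "libfoo.so.1 => not found". Lines without "=>" are ignored.
_LDD_LINE = re.compile(r"^\s*(\S+)\s+=>\s+(not found|\S*)")


def missing_libraries(ldd_output: str) -> list[str]:
    """Return the names of libraries that ldd reports as 'not found'."""
    missing = []
    for line in ldd_output.splitlines():
        match = _LDD_LINE.match(line)
        if match and match.group(2) == "not found":
            missing.append(match.group(1))
    return missing


if __name__ == "__main__":
    # Hypothetical output with one unresolved library, for illustration only.
    sample = """\
    linux-gate.so.1 => (0x00d87000)
    libstdc++.so.6 => not found
    libm.so.6 => /lib/libm.so.6 (0x00da1000)
    /lib/ld-linux.so.2 (0x56611000)
    """
    print(missing_libraries(sample))
```

An empty list, as would be the case for the listing above, means every library was resolved.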

They have not crashed and each has 4 hours, 47 minutes CPU time consumed.

hadcm3s_ca116_190012_24_759_011651280_2
hadcm3s_ca317_190012_24_759_011651284_2
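
The task names appear to pack several fields into one string. The sketch below splits a name into its parts. The length (24 months), batch number (759) and workunit ID (11651280) match what is said elsewhere in this thread. The meaning of the remaining fields (a run identifier, a start date such as 190012, and a trailing result index) is a guess, and the names `TaskName` and `parse_task_name` are invented for illustration.

```python
from typing import NamedTuple


class TaskName(NamedTuple):
    model: str         # e.g. "hadcm3s"
    run_id: str        # assumed: identifier of the individual run
    start: str         # assumed: model start date as YYYYMM
    months: int        # length of the run in months
    batch: int         # batch number
    workunit_id: int   # workunit ID as used in the workunit URLs
    result_index: int  # assumed: trailing index of this result for the workunit


def parse_task_name(name: str) -> TaskName:
    """Split a task name of the form seen in this thread into its fields."""
    model, run_id, start, months, batch, wuid, index = name.split("_")
    return TaskName(model, run_id, start, int(months), int(batch), int(wuid), int(index))


if __name__ == "__main__":
    print(parse_task_name("hadcm3s_ca116_190012_24_759_011651280_2"))
```

Names with a different number of underscore-separated parts will raise a `ValueError`, which seems the safer behaviour for a layout that is only inferred.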
ID: 58856
Jean-David Beyer
Joined: 5 Aug 04
Posts: 372
Credit: 3,473,738
RAC: 3,861
Message 58860 - Posted: 15 Oct 2018, 22:13:41 UTC - in response to Message 58855.  

There's a new batch of 200 24-month HADCM3S models, Batch #759,

All gone now and having poked around a little it looks like they are failing to download.


Here are two of mine:
https://www.cpdn.org/cpdnboinc/workunit.php?wuid=11651280
https://www.cpdn.org/cpdnboinc/workunit.php?wuid=11651284

Each has almost eight hours on it.
ID: 58860
Dave Jackson
Volunteer moderator
Joined: 15 May 09
Posts: 2634
Credit: 3,143,917
RAC: 335
Message 58861 - Posted: 16 Oct 2018, 8:45:00 UTC - in response to Message 58860.  

From Sarah at the project:

ok so possibly the sync has copied files across now so downloads are ok. There has been some work going on on the infrastructure here by IT that is causing mounting issues which we think is what is causing this. We will try and resolve as soon as possible.
ID: 58861
Jean-David Beyer
Joined: 5 Aug 04
Posts: 372
Credit: 3,473,738
RAC: 3,861
Message 58862 - Posted: 16 Oct 2018, 16:30:04 UTC - in response to Message 58860.  

I even got a trickle from one of them:

16 Oct 2018 15:49:24 1256552 21338358 hadcm3s_ca317_190012_24_759_011651284_2 1 51,912 82,593 1.5910
ID: 58862
Jean-David Beyer
Joined: 5 Aug 04
Posts: 372
Credit: 3,473,738
RAC: 3,861
Message 58865 - Posted: 17 Oct 2018, 19:52:28 UTC - in response to Message 58862.  

They both completed OK.

https://www.cpdn.org/cpdnboinc/workunit.php?wuid=11651280

https://www.cpdn.org/cpdnboinc/workunit.php?wuid=11651284
ID: 58865

©2020 climateprediction.net