climateprediction.net home page
Posts by NewtonianRefractor

Posts by NewtonianRefractor

1) Message boards : Number crunching : many errors on my computer (Message 43791)
Posted 13 Feb 2012 by NewtonianRefractor
Post:
Sometime last year the department for which our 2 project people work, upgraded their Linux compiler.
The new one uses a newer version of one of the library files than is used by some Linux distros, as per the sticky post to which George referred.

So, no you're NOT producing useful results for the regional models, only for the coupled ocean models, which aren't being offered at the moment.



So the hadcm3n return valid data even with the errors in the stderr?
2) Message boards : Number crunching : many errors on my computer (Message 43789)
Posted 13 Feb 2012 by NewtonianRefractor
Post:
Sorry for the late reply.

Unfortunately I don't have admin rights on this machine. It's running Scientific Linux 5.4 (release date November 4, 2009)and is managed by the IT department.

Does this mean that the machine is not producing useful scientific results for this project? If so, then unfortunately I will just have to detach the machine and attach it to some other project.
3) Message boards : Number crunching : many errors on my computer (Message 43772)
Posted 11 Feb 2012 by NewtonianRefractor
Post:
My machine hostid=1170809 seems to error every HADAM3P model it tries, yet it processes the Coupled Model Full Resolution Ocean withput any problem.

Is there some issue with the machine?
4) Message boards : Number crunching : thank you Les Bayliss for Information in News and Announcements (Message 43353)
Posted 1 Nov 2011 by NewtonianRefractor
Post:
Can we get more info on the security incident that occurred on the server?
5) Questions and Answers : Unix/Linux : HadCM3N o series not compatible with Linux? (Message 43217)
Posted 14 Oct 2011 by NewtonianRefractor
Post:
I have an intel linux machine which had a serious amount of -193 crashes:
hostid=1170809
6) Message boards : Number crunching : Crunching Nonexistent Task (Message 43011)
Posted 26 Sep 2011 by NewtonianRefractor
Post:
I have the same thing happening here too. I have 2 tasks in my Boinc running, but they are not listed on the computer page on the website.

Host id: 1170809,

WU:

hadcm3n_ygn8_1940_40_007461995_3

hadcm3n_u5u8_1980_40_007459995_2
7) Message boards : Number crunching : help restoring backup (Message 42286)
Posted 30 May 2011 by NewtonianRefractor
Post:
Another quick question. What if you have to restore a backup that is several days old, so as the model runs it will try to resubmit trickles that have been already submitted. Is there any problem with that?
8) Message boards : Number crunching : help restoring backup (Message 42240)
Posted 24 May 2011 by NewtonianRefractor
Post:
My models crashed when my computer lost power. I had a backup of the BOINC directory, so I just deleted the old one and overwrote it with the backup. I got a message that the computer generated a new cross project id, and now the tasks are labeled as Client detached on the website. They are still running on the computer.

Did I do something wrong?
9) Message boards : Number crunching : How many total timesteps are the hadcm3n? (Message 42187)
Posted 17 May 2011 by NewtonianRefractor
Post:
I wanted it to be done by the research paper deadline or whatever the scientists were talking about.
10) Message boards : Number crunching : How many total timesteps are the hadcm3n? (Message 42184)
Posted 17 May 2011 by NewtonianRefractor
Post:
My question is how many total timesteps are in a hadcm3n model? One of my computers is doing about 25,000 timesteps per day, or 3.5 TS/s. How long would it take to do the entire model?

Will it be able to do it by the deadline in august? Oh and it's running 24/7.

Here's a link to the UW: resultid=12885657.
11) Message boards : Number crunching : How do I stop BOINC from requesting GPU tasks? (Message 41814)
Posted 17 Mar 2011 by NewtonianRefractor
Post:
As the title says, I want to stop boinc from always asking for GPU tasks.

I have an ati 5770 card, so boinc constantly asks for GPU tasks, which pollutes the message log and probably puts unneeded strain on the servers.

In the boinc client I set the options to 'Use GPU never', but it still always asks for the units.

I ended up setting the project to get no new task, but I want to not micromanage this project.

What should I do?
12) Message boards : Number crunching : Computer Erroring Out (Message 41431)
Posted 4 Jan 2011 by NewtonianRefractor
Post:
I ran checkdisk on the hard-drive with no errors. (it's a 2 TB raid 0 array)

I am running prime95 'blend' mode right now. I will run it for 48 hours.

13) Message boards : Number crunching : Computer Erroring Out (Message 41428)
Posted 4 Jan 2011 by NewtonianRefractor
Post:
I am overclocking the computer, but the problem is that if it is a hardware error related to this it manifests itself very rarely. The models made it to year 60, which is a lot of calculation to go without error. I imagine this is very difficult is not impossible for me to track down.
14) Message boards : Number crunching : Computer Erroring Out (Message 41426)
Posted 4 Jan 2011 by NewtonianRefractor
Post:
That's very interesting because when I accessed the computer it seemed that it was running fine. The up-time was 18 days (I rebooted before I left on winter vacation). Boinc was running and was responsive. It was interesting that it did not contact the server after the 22nd. In the message log it said that it was just running CPU benchmarks every once in a while.
15) Message boards : Number crunching : Computer Erroring Out (Message 41424)
Posted 3 Jan 2011 by NewtonianRefractor
Post:
So I was finally able to check my computer.

These are the boinc messages:



21-Dec-2010 12:46:50 [climateprediction.net] Requesting new tasks for GPU
21-Dec-2010 12:46:53 [climateprediction.net] Scheduler request completed: got 0 new tasks
21-Dec-2010 12:46:53 [climateprediction.net] Message from server: No work sent
21-Dec-2010 12:46:53 [climateprediction.net] Message from server: No work is available for UK Met Office HadSM3 Slab Model
21-Dec-2010 12:46:53 [climateprediction.net] Message from server: No work is available for HadCM3 Coupled Model Experiment Optimised File I/O
21-Dec-2010 12:46:53 [climateprediction.net] Message from server: No work available for the applications you have selected. Please check your settings on the web site.
21-Dec-2010 12:46:54 [climateprediction.net] Started upload of hadcm3igeo_w2kx_2000_80_06759712_0_6.zip
21-Dec-2010 12:49:43 [climateprediction.net] Finished upload of hadcm3igeo_w2kx_2000_80_06759712_0_6.zip
21-Dec-2010 13:29:59 [climateprediction.net] Computation for task hadcm3igeo_w2l0_2000_80_06759709_1 finished
21-Dec-2010 13:29:59 [climateprediction.net] Output file hadcm3igeo_w2l0_2000_80_06759709_1_7.zip for task hadcm3igeo_w2l0_2000_80_06759709_1 absent
21-Dec-2010 13:29:59 [climateprediction.net] Output file hadcm3igeo_w2l0_2000_80_06759709_1_8.zip for task hadcm3igeo_w2l0_2000_80_06759709_1 absent
21-Dec-2010 13:30:04 [climateprediction.net] Computation for task hadcm3igeo_w2kx_2000_80_06759712_0 finished
21-Dec-2010 13:30:04 [climateprediction.net] Output file hadcm3igeo_w2kx_2000_80_06759712_0_7.zip for task hadcm3igeo_w2kx_2000_80_06759712_0 absent
21-Dec-2010 13:30:04 [climateprediction.net] Output file hadcm3igeo_w2kx_2000_80_06759712_0_8.zip for task hadcm3igeo_w2kx_2000_80_06759712_0 absent
21-Dec-2010 13:30:19 [climateprediction.net] Computation for task hadcm3igeo_w2yc_2000_80_06759229_4 finished
21-Dec-2010 13:30:19 [climateprediction.net] Output file hadcm3igeo_w2yc_2000_80_06759229_4_6.zip for task hadcm3igeo_w2yc_2000_80_06759229_4 absent
21-Dec-2010 13:30:19 [climateprediction.net] Output file hadcm3igeo_w2yc_2000_80_06759229_4_7.zip for task hadcm3igeo_w2yc_2000_80_06759229_4 absent
21-Dec-2010 13:30:19 [climateprediction.net] Output file hadcm3igeo_w2yc_2000_80_06759229_4_8.zip for task hadcm3igeo_w2yc_2000_80_06759229_4 absent
21-Dec-2010 13:31:21 [climateprediction.net] Sending scheduler request: To fetch work.
21-Dec-2010 13:31:21 [climateprediction.net] Reporting 3 completed tasks, requesting new tasks for CPU
21-Dec-2010 13:31:29 [climateprediction.net] Scheduler request completed: got 3 new tasks
21-Dec-2010 13:31:31 [climateprediction.net] Started download of hadam3p_pnw_6.08_windows_intelx86.exe
21-Dec-2010 13:31:31 [climateprediction.net] Started download of hadam3p_pnw_um_6.08_windows_intelx86.zip
21-Dec-2010 13:31:37 [climateprediction.net] Finished download of hadam3p_pnw_6.08_windows_intelx86.exe
21-Dec-2010 13:31:37 [climateprediction.net] Started download of hadam3p_pnw_graphics_6.08_windows_intelx86.exe
21-Dec-2010 13:31:41 [climateprediction.net] Finished download of hadam3p_pnw_um_6.08_windows_intelx86.zip
21-Dec-2010 13:31:41 [climateprediction.net] Started download of hadam3p_pnw_se_6.08_windows_intelx86.zip
21-Dec-2010 13:31:47 [climateprediction.net] Finished download of hadam3p_pnw_se_6.08_windows_intelx86.zip
21-Dec-2010 13:31:47 [climateprediction.net] Started download of hadrm3p_pnw_um_6.08_windows_intelx86.zip
21-Dec-2010 13:31:48 [climateprediction.net] [error] File hadam3p_pnw_se_6.08_windows_intelx86.zip has wrong size: expected 987053, got 0
21-Dec-2010 13:31:48 [climateprediction.net] [error] Checksum or signature error for hadam3p_pnw_se_6.08_windows_intelx86.zip
21-Dec-2010 13:31:52 [climateprediction.net] Finished download of hadam3p_pnw_graphics_6.08_windows_intelx86.exe
21-Dec-2010 13:31:52 [climateprediction.net] Started download of hadam3p_pnw_data_6.08_windows_intelx86.zip
21-Dec-2010 13:31:53 [climateprediction.net] [error] File hadam3p_pnw_graphics_6.08_windows_intelx86.exe has wrong size: expected 2098176, got 0
21-Dec-2010 13:31:53 [climateprediction.net] [error] Checksum or signature error for hadam3p_pnw_graphics_6.08_windows_intelx86.exe
21-Dec-2010 13:31:54 [climateprediction.net] Finished download of hadam3p_pnw_data_6.08_windows_intelx86.zip
21-Dec-2010 13:31:54 [climateprediction.net] Started download of hadam3p_eu_xyo1_1960_1_007050137.zip
21-Dec-2010 13:31:55 [climateprediction.net] [error] File hadam3p_pnw_data_6.08_windows_intelx86.zip has wrong size: expected 75116, got 0
21-Dec-2010 13:31:55 [climateprediction.net] [error] Checksum or signature error for hadam3p_pnw_data_6.08_windows_intelx86.zip
21-Dec-2010 13:31:57 [climateprediction.net] Finished download of hadam3p_eu_xyo1_1960_1_007050137.zip
21-Dec-2010 13:31:57 [climateprediction.net] Started download of o3_A2_1959_2010_N96_f.anc.gz
21-Dec-2010 13:31:59 [climateprediction.net] [error] File hadam3p_eu_xyo1_1960_1_007050137.zip has wrong size: expected 12637, got 0
21-Dec-2010 13:31:59 [climateprediction.net] [error] Checksum or signature error for hadam3p_eu_xyo1_1960_1_007050137.zip
21-Dec-2010 13:32:00 [climateprediction.net] Finished download of hadrm3p_pnw_um_6.08_windows_intelx86.zip
21-Dec-2010 13:32:00 [climateprediction.net] Started download of ic19611020_10_N96.gz
21-Dec-2010 13:32:10 [climateprediction.net] Finished download of ic19611020_10_N96.gz
21-Dec-2010 13:32:10 [climateprediction.net] Started download of xaclfa.start.0000.gz
21-Dec-2010 13:32:10 [climateprediction.net] [error] File ic19611020_10_N96.gz has wrong size: expected 1314394, got 0
21-Dec-2010 13:32:10 [climateprediction.net] [error] Checksum or signature error for ic19611020_10_N96.gz
21-Dec-2010 13:32:25 [climateprediction.net] Finished download of o3_A2_1959_2010_N96_f.anc.gz
21-Dec-2010 13:32:25 [climateprediction.net] Started download of oxi.addfa.gz
21-Dec-2010 13:33:04 [climateprediction.net] Sending scheduler request: To report completed tasks.
21-Dec-2010 13:33:04 [climateprediction.net] Reporting 2 completed tasks, not requesting new tasks
21-Dec-2010 13:33:07 [climateprediction.net] Scheduler request completed
21-Dec-2010 13:33:09 [climateprediction.net] [error] File hadam3p_eu_6.08_windows_intelx86.exe has wrong size: expected 780288, got 0
21-Dec-2010 13:33:12 [climateprediction.net] Sending scheduler request: To fetch work.
21-Dec-2010 13:33:12 [climateprediction.net] Not reporting or requesting tasks
21-Dec-2010 13:33:13 [climateprediction.net] Scheduler request completed
21-Dec-2010 13:33:19 [climateprediction.net] Sending scheduler request: To fetch work.
21-Dec-2010 13:33:19 [climateprediction.net] Reporting 1 completed tasks, requesting new tasks for CPU
21-Dec-2010 13:33:21 [climateprediction.net] Scheduler request completed: got 0 new tasks
21-Dec-2010 13:33:21 [climateprediction.net] Message from server: No work sent
21-Dec-2010 13:33:21 [climateprediction.net] Message from server: No work is available for UK Met Office HadSM3 Slab Model
21-Dec-2010 13:33:21 [climateprediction.net] Message from server: No work is available for HadCM3 Coupled Model Experiment Optimised File I/O
21-Dec-2010 13:33:21 [climateprediction.net] Message from server: No work is available for UK Met Office HADAM3P European Region
21-Dec-2010 13:33:21 [climateprediction.net] Message from server: No work is available for UK Met Office HADAM3P Pacific North West
21-Dec-2010 13:33:21 [climateprediction.net] Message from server: (reached daily quota of 3 tasks)
16) Message boards : Number crunching : Computer Erroring Out (Message 41346)
Posted 23 Dec 2010 by NewtonianRefractor
Post:
For my Task 12009783, from the Workunit 6963025, there is a paired fast intel Xeon 5160 @ 3.00GHz on linux that is not far behind in calculation.

I guess I'll watch and see if it has any problems. On the other hand I do think it might be just a problem with my overclocked computer. I did dial down the overclock a bit and the PC ran fine for over 25 days with no problem before.
17) Message boards : Number crunching : Computer Erroring Out (Message 41342)
Posted 22 Dec 2010 by NewtonianRefractor
Post:
I changed the preferences, hopefully it works. The computer has not contacted the server since 22 Dec 2010 0:55:02 UTC, which is about 20 hours ago at the time of this posting.

Can someone please tell me what the stderr means for the crashed work-units? I got a good 25 days of computation on them before they crashed. That sucks.
18) Message boards : Number crunching : Computer Erroring Out (Message 41337)
Posted 22 Dec 2010 by NewtonianRefractor
Post:
In the Projects tab of your manager:
Click on climateprediction.net
Click the No new tasks button



I'm out of town, I don't have physical or remote access to the machine.
19) Message boards : Number crunching : Computer Erroring Out (Message 41335)
Posted 22 Dec 2010 by NewtonianRefractor
Post:
My main computer, hostid 1109774 was crunching 3 very long tasks, HadCM3 Coupled Model.

Task 12009797
Task 12009783
Task 12007385

They all errored out. Furthermore the computer is trashing all new assigned models.

I am out of town until the January third, so I can not manage the computer until then.

There are some strange stderr on all the 3 crashed wu.

Is there a way I can prevent the computer from downloading new tasks? I changed the computer preferences to not do work when the computer is idle,
so hopefully it works. I can also try to change the allow network usage to some time when the server is offline.
20) Message boards : Number crunching : why does the server regularly go offline? (Message 41193)
Posted 1 Dec 2010 by NewtonianRefractor
Post:
Why does the server go offline during the day (pacific time) and come back up during the night?

I assume it has something to do with server maintenance as in the UK it would go offline at night and come back online during the day?


Next 20

©2024 climateprediction.net