1)
Message boards :
Number crunching :
many errors on my computer
(Message 43791)
Posted 13 Feb 2012 by NewtonianRefractor Post: Sometime last year the department for which our 2 project people work upgraded their Linux compiler. So the hadcm3n models return valid data even with the errors in the stderr? |
2)
Message boards :
Number crunching :
many errors on my computer
(Message 43789)
Posted 13 Feb 2012 by NewtonianRefractor Post: Sorry for the late reply. Unfortunately I don't have admin rights on this machine. It's running Scientific Linux 5.4 (release date November 4, 2009) and is managed by the IT department. Does this mean that the machine is not producing useful scientific results for this project? If so, then unfortunately I will just have to detach the machine and attach it to some other project. |
3)
Message boards :
Number crunching :
many errors on my computer
(Message 43772)
Posted 11 Feb 2012 by NewtonianRefractor Post: My machine hostid=1170809 seems to error out on every HADAM3P model it tries, yet it processes the Coupled Model Full Resolution Ocean without any problem. Is there some issue with the machine? |
4)
Message boards :
Number crunching :
thank you Les Bayliss for Information in News and Announcements
(Message 43353)
Posted 1 Nov 2011 by NewtonianRefractor Post: Can we get more info on the security incident that occurred on the server? |
5)
Questions and Answers :
Unix/Linux :
HadCM3N o series not compatible with Linux?
(Message 43217)
Posted 14 Oct 2011 by NewtonianRefractor Post: I have an Intel Linux machine which has had a large number of -193 crashes: hostid=1170809 |
6)
Message boards :
Number crunching :
Crunching Nonexistent Task
(Message 43011)
Posted 26 Sep 2011 by NewtonianRefractor Post: I have the same thing happening here too. I have 2 tasks in my Boinc running, but they are not listed on the computer page on the website. Host id: 1170809, WU: hadcm3n_ygn8_1940_40_007461995_3 hadcm3n_u5u8_1980_40_007459995_2 |
7)
Message boards :
Number crunching :
help restoring backup
(Message 42286)
Posted 30 May 2011 by NewtonianRefractor Post: Another quick question. What if you have to restore a backup that is several days old, so that as the model runs it tries to resubmit trickles that have already been submitted? Is there any problem with that? |
8)
Message boards :
Number crunching :
help restoring backup
(Message 42240)
Posted 24 May 2011 by NewtonianRefractor Post: My models crashed when my computer lost power. I had a backup of the BOINC directory, so I just deleted the old one and overwrote it with the backup. I got a message that the computer generated a new cross project id, and now the tasks are labeled as Client detached on the website. They are still running on the computer. Did I do something wrong? |
9)
Message boards :
Number crunching :
How many total timesteps are the hadcm3n?
(Message 42187)
Posted 17 May 2011 by NewtonianRefractor Post: I wanted it to be done by the research paper deadline or whatever the scientists were talking about. |
10)
Message boards :
Number crunching :
How many total timesteps are the hadcm3n?
(Message 42184)
Posted 17 May 2011 by NewtonianRefractor Post: My question is how many total timesteps are in a hadcm3n model? One of my computers is doing about 25,000 timesteps per day, or about 3.5 seconds per timestep. How long would it take to do the entire model? Will it be able to do it by the deadline in August? Oh and it's running 24/7. Here's a link to the WU: resultid=12885657. |
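For anyone wanting to redo the arithmetic in the post above, here is a rough sketch. The rate of 25,000 timesteps per day comes from the post; the total number of timesteps is exactly what the question asks and is not known here, so `HYPOTHETICAL_TOTAL` below is a made-up placeholder, not a real hadcm3n figure. The function names are also just illustrative.

```python
SECONDS_PER_DAY = 86_400


def seconds_per_timestep(timesteps_per_day: float) -> float:
    """Convert a daily timestep rate into seconds spent per timestep."""
    return SECONDS_PER_DAY / timesteps_per_day


def days_to_finish(total_timesteps: int, timesteps_per_day: float, done: int = 0) -> float:
    """Days of 24/7 crunching left, given a total and the timesteps already done."""
    return (total_timesteps - done) / timesteps_per_day


rate = 25_000  # timesteps per day, as reported in the post

# 86,400 s / 25,000 timesteps is roughly 3.5 seconds per timestep
print(round(seconds_per_timestep(rate), 2))

# Placeholder only: substitute the real total once the project confirms it.
HYPOTHETICAL_TOTAL = 1_000_000
print(days_to_finish(HYPOTHETICAL_TOTAL, rate))
```

Comparing the result of `days_to_finish` with the number of days left before the deadline gives a quick yes/no, assuming the machine keeps the same rate the whole time.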
11)
Message boards :
Number crunching :
How do I stop BOINC from requesting GPU tasks?
(Message 41814)
Posted 17 Mar 2011 by NewtonianRefractor Post: As the title says, I want to stop BOINC from always asking for GPU tasks. I have an ATI 5770 card, so BOINC constantly asks for GPU tasks, which pollutes the message log and probably puts unneeded strain on the servers. In the BOINC client I set the option to 'Use GPU never', but it still always asks for the units. I ended up setting the project to get no new tasks, but I don't want to micromanage this project. What should I do? |
12)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41431)
Posted 4 Jan 2011 by NewtonianRefractor Post: I ran checkdisk on the hard-drive with no errors. (it's a 2 TB raid 0 array) I am running prime95 'blend' mode right now. I will run it for 48 hours. |
13)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41428)
Posted 4 Jan 2011 by NewtonianRefractor Post: I am overclocking the computer, but the problem is that if it is a hardware error related to this, it manifests itself very rarely. The models made it to year 60, which is a lot of calculation to go without error. I imagine this is very difficult, if not impossible, for me to track down. |
14)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41426)
Posted 4 Jan 2011 by NewtonianRefractor Post: That's very interesting because when I accessed the computer it seemed that it was running fine. The up-time was 18 days (I rebooted before I left on winter vacation). Boinc was running and was responsive. It was interesting that it did not contact the server after the 22nd. In the message log it said that it was just running CPU benchmarks every once in a while. |
15)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41424)
Posted 3 Jan 2011 by NewtonianRefractor Post: So I was finally able to check my computer. These are the boinc messages:
|
16)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41346)
Posted 23 Dec 2010 by NewtonianRefractor Post: For my Task 12009783, from the Workunit 6963025, there is a paired fast intel Xeon 5160 @ 3.00GHz on linux that is not far behind in calculation. I guess I'll watch and see if it has any problems. On the other hand I do think it might be just a problem with my overclocked computer. I did dial down the overclock a bit and the PC ran fine for over 25 days with no problem before. |
17)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41342)
Posted 22 Dec 2010 by NewtonianRefractor Post: I changed the preferences, hopefully it works. The computer has not contacted the server since 22 Dec 2010 0:55:02 UTC, which is about 20 hours ago at the time of this posting. Can someone please tell me what the stderr means for the crashed work-units? I got a good 25 days of computation on them before they crashed. That sucks. |
18)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41337)
Posted 22 Dec 2010 by NewtonianRefractor Post: In the Projects tab of your manager: I'm out of town, I don't have physical or remote access to the machine. |
19)
Message boards :
Number crunching :
Computer Erroring Out
(Message 41335)
Posted 22 Dec 2010 by NewtonianRefractor Post: My main computer, hostid 1109774, was crunching 3 very long tasks, HadCM3 Coupled Model: Task 12009797, Task 12009783, Task 12007385. They all errored out. Furthermore, the computer is trashing all newly assigned models. I am out of town until January third, so I cannot manage the computer until then. There is some strange stderr output on all 3 crashed WUs. Is there a way I can prevent the computer from downloading new tasks? I changed the computer preferences to not do work when the computer is idle, so hopefully it works. I can also try to change the allowed network usage to some time when the server is offline. |
20)
Message boards :
Number crunching :
why does the server regularly go offline?
(Message 41193)
Posted 1 Dec 2010 by NewtonianRefractor Post: Why does the server go offline during the day (Pacific time) and come back up during the night? I assume it has something to do with server maintenance, since in the UK that would mean it goes offline at night and comes back online during the day? |
©2024 climateprediction.net