climateprediction.net home page
Posts by old_user19523

Posts by old_user19523

1) Message boards : Number crunching : Sulphur units constantly failing (Message 21157)
Posted 8 Mar 2006 by old_user19523
Post:
The crashing definatly has to do with your computer. I think my last model crashed once but it had to have been a fluke, I have used so many different aplications at the same time as the model and not had a crash, that I dont know when to expect one.

Therefore if I am going to do a back up, it will probably be once a month, but my question is what needs to be backed up if I am to back up the data? And how should I back up the information? Should i just copy the files to a different location on my c drive?

If your system is unstable with high cpu usage then good luck. I wouldn\'t do graphics intensive game while I modeled. That and make sure your computer can breathe.

What does a crash look like for those of you who are having crashes?



I\'m Sorry but my computer is perfectly stable.

I can run climateprediction without problems for many hours. I tried prime95 with climateprediction 50/50 cpu time for 24hours without a problem.

The problem is when an higher priority program require 100% cpu the climateprediction application get out of sync (remember that climateprediction run with a very low priority).

I know it and now i stop boinc every time i know that another application will need 100% cpu time.

There is a post also on the boinc dev\'s mailing list about this issue
2) Message boards : Number crunching : Sulphur units constantly failing (Message 21133)
Posted 7 Mar 2006 by old_user19523
Post:
Frequent backups are a must!

daily backups and internet disabled is a must for 4.22 :)

I keep 3 days of backups to be sure 100% :)

it\'s a challenge for me to finish this workunit :)

i managed to finish the first phase :) i don\'t know who has done more work, the cpu or I with the backups ;)


It seems that some lucky bastards don\'t need to do extra work ... I do backups only once a month - just before steping up to next phase! Until now, no problem at all with Sulphur 4.22 & Boinc 5.3.x under Linux (& yes, connected to the Internet 24/7).


every time an application use 100% cpu for a while, the sulphur 4.22 crash.

it can be a game or another kind of application, the result is the same a crash.

I hope that after the BBC project release the dev\'s will correct this problem
3) Message boards : Number crunching : Sulphur units constantly failing (Message 20662)
Posted 23 Feb 2006 by old_user19523
Post:
If people can finish pahse one, there is extra info in it, (compared to slab), that is very usefull. After that, the team need all of the rest.

Frequent backups are a must!



daily backups and internet disabled is a must for 4.22 :)

I keep 3 days of backups to be sure 100% :)

it\'s a challenge for me to finish this workunit :)


i managed to finish the first phase :) i don\'t know who has done more work, the cpu or I with the backups ;)
4) Message boards : Number crunching : Sulphur units constantly failing (Message 19970)
Posted 4 Feb 2006 by old_user19523
Post:
If people can finish pahse one, there is extra info in it, (compared to slab), that is very usefull. After that, the team need all of the rest.

Frequent backups are a must!



daily backups and internet disabled is a must for 4.22 :)

I keep 3 days of backups to be sure 100% :)

it\'s a challenge for me to finish this workunit :)
5) Message boards : Number crunching : Sulphur units constantly failing (Message 19940)
Posted 3 Feb 2006 by old_user19523
Post:
Ever since getting sulphur 4.22 dl\'d to my STABLE machine, this has happened. Any explanations?


you should post the workunits, we can\'t see your results page

well, i think that sulphur 4.22 has some problems, i hope that the next experiment or next suplhur version will be more stable
6) Message boards : Number crunching : WUs constantly failing (Message 19936)
Posted 3 Feb 2006 by old_user19523
Post:
I occasionally run Doom while BOINC is running. The only problem is when it starts to benchmark. Then Doom slows right down and movement gets jerky. At least for me. The baddies seem to keep going. :(
When/if I wake up to it, I suspend Doom until the benchmark is finished.




Not every game eats 100% cpu, for example when i play with pes 5 the model continue to advance because the game doesn\'t need much cpu time :)

many games simply do this:

while (1) {
continue; // :)
}
7) Message boards : Number crunching : WUs constantly failing (Message 19935)
Posted 3 Feb 2006 by old_user19523
Post:
Perhaps for your friend a different project might be more appropriate. I know this sounds like heresy ... but, not all project are suitable for all computers and all people.

I have had decent luck running CPDN on all my PCs, occassional model crashes for various reasong, but, aa pretty decent track record. Heck I am about to complete my second Sulfur model in a coupld days (1 day 12 hours).

But, though you would think that it would be a better computer to run CPDN I have yet to complete a model on my PowerMac G5 ... bad computer? Bad program, gremlins? who knows. But, I just stopped and now run other projects on the PowerMac, it really shines at Einstein@Home ...

Again, this is the beauty of BOINC ...

Oh, and WCG uses the UD program if you like, or you can run thier two projects under BOINC like I do ...


For now i run climateprediction only in my home computer, where there is no internet access, it\'s the best project for a computer like this. I only have to backup boinc folder every day and once in a while transport with cdrw at work.

in my friend\'s computer i\'ll install boinc with WCG and einstein i think, surely i\'ll not install climateprediction. I like this project but needs to much user attentions.

BTW with my home computer i managed to do an old slab model, without backups and with a lot of gaming ;) http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=250218
8) Message boards : Number crunching : WUs constantly failing (Message 19906)
Posted 2 Feb 2006 by old_user19523
Post:
> 5) Start a standard game, in this case Splinter cell 1
> 6) After 30 min or 1 hour you exit from game
> 7) Model crashed

I would recommend you to shut down BOINC or at least suspend computation while gaming!


Yes i know it, at least for climateprediction. But you, I and several thousand of people can do this, but there are millions out there that can\'t do this.

To let the boinc platform be more attractive to normal users, must be more reliable also if you are playing a game :).

the average computer user, is capable to surf in internet, write an email and to install a program, of the other thing he doesn\'t care. I look on the forums, in the server status, results page and so on several times in a day, you can say that I\'m a boinc addicted person :)

An example of an ipothetical non expert user.

1) one friend tell him that can use is spare cpu time for something usefull.
2) He thinks \"why not?\"
3) install the boinc client (if he is capable)
4) He choose the projects he likes (now is better than before, but i\'m waiting for account managers :) )
5) He is sure that it don\'t need his attention and he forget about the boinc existence for a month
6) after that because for 1 Hour a day plays his favourite game, in a month has lost 30 climate models,lost time, wasted server resources, and no science done.
7) Deleted boinc and user lost.

My first DC project was UD, and i liked of it that it was an install and forget program, then over 1 year ago i switched to boinc because i liked its philosofy.
To make an example, I keep the UD client on a friend\'s computer where i have very infrequent access. I would like to install boinc as soon as i can, but for now to manage a remote client with dynamic ip it\'s a **** ** *** ***

P.S.
I\'m sorry for my bad english :(
9) Message boards : Number crunching : WUs constantly failing (Message 19892)
Posted 2 Feb 2006 by old_user19523
Post:
I\'m a programmer so i know that to report a possible bug it\'s better to give more details :)

My computer is an athlon 64 3200+ 754 pin 0,13u 2Ghz
motherboard abit kv8 with latest bios
1 GB of ram (2 ddr400 modules)

addon boards
DVB-S Board - skystar 2
Pinnacle board PCI-500
a standard realtek ethernet board
no sound card, using integrated one.

standard clock, also the memory timing are from the SPD settings.

I have no power supply problem, i have an enermax power supply(i don\'t remember the model :) )

no problem with the cpu overheating, i\'m using hyper 6 from cooler master, 950g of laminated copper :)

I have latest stable drivers of everything, andthe system is completely stable

I\'m running boinc version 5.2.13. with normal installation (no service) with automatic start

Os Windows Xp pro SP2 without any additional update.

How to reproduce these issue, it\'s simple:
1) you need a brother (maybe it\'s not strictly necessary :) )
2) turn on computer.
3) wait until the logon of windows appear
4) logon
5) Start a standard game, in this case Splinter cell 1
6) After 30 min or 1 hour you exit from game
7) Model crashed

This weekend i\'ll try to reproduce the model crashing myself to gather more specific details. And i want to try also if i can reproduce with another computer.
10) Message boards : Number crunching : WUs constantly failing (Message 19890)
Posted 2 Feb 2006 by old_user19523
Post:
ok, i din\'t knew this :) so I copy - filter - paste the old message that if you wish you can delete or keep hidden :)

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=1108852

Same workunit :)

as i said before, i let this workunit to crash to gather some useful info for avoiding crashes :).

in one crash the yabsd.out was still present, in the last part there was this :
FIXED LENGTH HEADER
-------------------
Dump format version-32768
UM Version No 401
Atmospheric data
On hybrid levels
Over global domain
Ancillary dataset
Exp No =-32768 Run Id =-32768
360-day calendar
Arakawa B grid
Year Month Day Hour Min Sec DayNo
Data time = 0 1 16 0 0 0 0
Validity time = 0 12 16 0 0 0 0
Creation time = 0 1 0 0 0 0 0
Start 1st dim 2nd dim 1st parm 2nd parm
Integer Consts 257 15 15
Real Consts 272 6 6
Level Dep Consts -32768 1 1 1 1
Row Dep Consts -32768 1 1 1 1
Column Dep Consts -32768 1 1 1 1
Fields of Consts -32768 1 1 1 1
Extra Consts -32768 1 1
History Block -32768 1 1
CFI No 1 -32768 1 1
CFI No 2 -32768 1 1
CFI No 3 -32768 1 1
Lookup Tables 278 64 912 64 912
Model Data 58881 6391296 6391296

LOOKUP TABLE
58368 64-bit words long
ANCILLARY_STEPSim(s_im) 5
INITMOS : MOS_OUTPUT_LENGTH = 1129
im,sm,ngroup,new_im,new_sm 1 1 48 T F
PPCTL: Opening preattached file on unit 60
PPCTL: Opening preattached file on unit 61
PPCTL: Opening preattached file on unit 62

PP_CTL: Error Buffering in Fixed length Header
Empty PP File in Climate Mode?

Error code = 0.00
Length requested = 0
Length actually transferred = 256
PPCTL: Opening preattached file on unit 63
PPCTL: Opening preattached file on unit 64
PPCTL: Opening preattached file on unit 65
PPCTL: Opening preattached file on unit 66
PPCTL: Opening preattached file on unit 67
PPCTL: Opening preattached file on unit 68

in the last crash there was only the stderr_um.txt file with this:

BUFFIN: C I/O Error - Return code = 16

naturally i backup everything so the climate model continue to advance and as you can see my machine continue to trickle :)

I don\'t think is a workunit problem, but a application problem that should be solved because how you can tell to normal people, that before playing some games or make something with an heavy load, that they must backup the boinc folder or shut down boinc?


Best Regards
Luigi
11) Message boards : Number crunching : WUs constantly failing (Message 19887)
Posted 2 Feb 2006 by old_user19523
Post:
why my post has been deleted? maybe it was long? or i should post it in the phpbb forum?
12) Message boards : Number crunching : WUs constantly failing (Message 19684)
Posted 27 Jan 2006 by old_user19523
Post:
You could try removing BOINC from the startup folder, so that, when your brother turns on the computer to play games, BOINC doesn\'t start.
When you want to run BOINC, start it manually by clicking on the boincmgr icon in the BOINC folder.



Yes i know :)

But for now it\'s fine in this way, i make a boinc backup every morning ;)

I want to keep the model crashing, so maybe i can find something useful to help programmers fixing issues like this. Because avoiding the start of boinc, or to make a backup is a suitable way for expert people, not for the normal user. In special mode when the workunits last several months :)

Now i have to find a way to avoid the cleanup after the model crash, to find the error in the yabds.out.

In the working model, in yabds.out there are errors similar to those on db\'s post, if i remember i\'ll copy them here tomorrow.
13) Message boards : Number crunching : WUs constantly failing (Message 19682)
Posted 27 Jan 2006 by old_user19523
Post:

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=1108852

this workunit crash everyday :)

this is a bad workunit or there is a bug on the application?

usually my computer is turned on in the evening and I turn off it in the morning, before turning it off i made a backup of boinc folder. sometimes my brother in the
afternoon play with some computer games and the evening i found the model crashed, unfortunately i don\'t find any yabd.out file beacuse it\'s all ready to send back the creshed result.

I\'m glad that i have not an internet connection at home ;)

My system is stable, i tried with prime95 and climateprediction for 18 Hours (50/50). and also with memtest86+.

maybe there is a problem when the model start and don\'t reach the first checkpoint because there is an evil game that eat every cpu cycle ;)

P.S.
I can see this morning that my english is worse that ever ;)




©2024 climateprediction.net