|
Info | Message |
---|---|
1) Message boards : Number crunching : Feedback on running OpenIFS large memory (16-25 Gb+) configurations requested
Message 71930 Posted 23 Jan 2025 by DJStarfox |
I'd be willing to try these models, under the following conditions: * CPDN server will respect BOINC compute limits (preferences) per computer and not give work for machines with insufficent resources. * Models will respect BOINC compute limits (max memory, CPU count, disk) and will not start if insufficent resource available. (i.e., in the event these limits change since initial download). * Checkpoint files are compressed (best effort). * OpenIFS models are 'opt-in' via project preferences * Sticky forum thread (can be read-only) for OpenIFS system requirements and warnings. FYI... I only have 32GB RAM and 6 cores (12 threads)... so would machines like mine be able to run them well enough? |
2) Message boards : Number crunching : Download issues
Message 68680 Posted 20 Apr 2023 by DJStarfox |
I think this is the same issue that I posted about: https://www.cpdn.org/cpdnboinc/forum_thread.php?id=9198 The web admin needs to install the certificate chain to the web server(s). Perhaps the moderators can merge these threads or link them. |
3) Message boards : Number crunching : SSL Cert for www.climateprediction.net
Message 68678 Posted 19 Apr 2023 by DJStarfox |
OK, looks like the web admin forgot to install the intermediate certificate. The web server and its intermediate certificate are supposed to be sent to the client for TLS session setup. See "certificate issues" here: https://www.ssllabs.com/ssltest/analyze.html?d=www.climateprediction.net |
4) Message boards : Number crunching : SSL Cert for www.climateprediction.net
Message 68677 Posted 19 Apr 2023 by DJStarfox |
Looks like www.climateprediction.net replaced their TLS/SSL certificate with "Let's Encrypt" (lasting only 3 months at a time). I'm getting an error saying, "The issuer certificate of a locally looked up certificate could not be found. No certificates could be verified." Can someone check this cert is publicly trusted? |
5) Message boards : Number crunching : The uploads are stuck
Message 67528 Posted 11 Jan 2023 by DJStarfox |
Might be a hung process taking over port 80. They'll have to stop the service, kill any orphaned processes, and restart the service. |
6) Message boards : Number crunching : The uploads are stuck
Message 67504 Posted 10 Jan 2023 by DJStarfox |
Do you mean <max_file_xfers_per_project>? If your Internet connection can do more than one, why not do it? Yes, that's what I mean: <max_file_xfers>4</max_file_xfers> <max_file_xfers_per_project>1</max_file_xfers_per_project> For normal HTTPS traffic, yes, you want about 4 connections per server, and most browsers do 4 to 8 connections at a time anyway, because most big websites are server farms (multiple servers that can all work in parallel). However, file transfers are a different beast and BOINC projects in particular are, as most are grant funded (i.e., run on minimal hardware). Your 1 allowed file transfer will still download or upload at the maximum possible speed, limited by the project's internet connection. It does no good to hammer the same project file server with multiple connections, if connection #2 runs at half speed, connection #3 at 1/3 speed, etc. In other words, it won't take longer for YOU but it will help the project server by only needing to serve 1 connect per client x 1000 active users, etc. |
7) Message boards : Number crunching : The uploads are stuck
Message 67497 Posted 10 Jan 2023 by DJStarfox |
Upload server update 9/1/23 10:49GMT Thanks for the update, Glenn. FYI... I set my "max uploads per project" to 1 in the cc_config.xml, which is what I recommend for everyone. |
8) Message boards : Number crunching : The uploads are stuck
Message 67496 Posted 10 Jan 2023 by DJStarfox |
Edit: Just realized if you can't write state file, any messing within BOINC might be hopeless. So have to find the space elsewhere from the system. Under your account's computing preferences, you can set set "Leave at least x GB free" (of disk space) to make sure there is enough left for uploads, etc. |
9) Message boards : Number crunching : The uploads are stuck
Message 67322 Posted 4 Jan 2023 by DJStarfox |
The admins should plan for enough infrastructure to handle: * "Computers with recent credit" as per the server status page. Right now, that number is 968 computers. * With project backoff near 1 hour, that means: 16 uploads per minute average * Total file size 224MB each model, means server needs to handle 3.5G per minute during peak time. |
10) Message boards : Number crunching : The uploads are stuck
Message 67216 Posted 2 Jan 2023 by DJStarfox |
Uploads still stuck for me. Hopefully, server can be fixed today or tomorrow. |
11) Message boards : Number crunching : Big models
Message 63463 Posted 2 Feb 2021 by DJStarfox |
Ya know... rather than making each model so big (multiple GB per task), could the programmers simply have each task share more files in common? That way each model takes less space, if that makes sense. |
12) Message boards : Number crunching : Please fix the deadlines!
Message 63095 Posted 4 Dec 2020 by DJStarfox |
If you have 1000 credit on project 1 and 1,000,000 credit on project 2, Bonic will try to have project 1 "catch up" to project 2. If what you say is true, then that design is faulty. The resource share should try to equalize the recent average credit (RAC), not total credit. |
13) Message boards : Number crunching : New work Discussion
Message 63018 Posted 25 Nov 2020 by DJStarfox |
I got five (5) of those HadAM4h tasks, but each one is consuming ~4GB of disk space, which far exceeds the 10G max I have in BOINC settings. Seems like a bug in the task scheduler... I had to abort a few to avoid running out of disk space on the /var partition. :( |
14) Message boards : Number crunching : Preferences - project options missing
Message 60943 Posted 19 Sep 2019 by DJStarfox |
I too am surprised the application selection feature is completely gone now. https://www.cpdn.org/prefs.php?subset=project Various old models would take up 10's of gigabytes. At various points, I would have to periodically delete old models in order to download any new units. By selecting only a few models, this was less of a problem. Also, in the past, the researchers were able to use different platforms as a way to validate a model. Since floating point calculations, etc. varied slightly, it helped them tune the models with a bigger variety of data points. If such an emergent result is no longer useful, then I would highly recommend they post the minimum requirements per platform. (For example, these could go on the CPDN homepage next to the how to join section.) |
15) Message boards : Number crunching : What Happened ???
Message 58153 Posted 25 Apr 2018 by DJStarfox |
The entire cpdn.org website was down for an extended time. I visited climateprediction.net several times but did not see any notice about downtime. |
16) Message boards : Number crunching : What Happened ???
Message 58148 Posted 24 Apr 2018 by DJStarfox |
When this site shuts down, people switch to the BOINC site, in particular the Projects section, top post, which is: News on Project Outages, where a message, usually from Andy, is posted. First of all, I (and likely many others) did not know there was a specific forum on some other site, somewhere, where a cryptic message about some maintenance was posted.... Furthermore, the site has been down for over a month! That sounds much worse than some routine maintenance, so I consider that post deceptive. I'm glad the site is back, and I hope such a long outage does not happen again. In the future, a post on the HOME PAGE of climateprediction.net would be much more appropriate and visible to the community. |
17) Message boards : Number crunching : Volunteer to work with the BOINC Release Manger on Linux Instructions?
Message 57862 Posted 26 Feb 2018 by DJStarfox |
I'm most familiar with RHEL/Centos 6 and 7 and have been running BOINC on Red Hat based distributions for 8+ years. I'm sure i could dedicate an hour or two to write some documentaion for BOINC. Please have Richard reach out to me if he needs anything. |
18) Message boards : Number crunching : Site Back Up
Message 57824 Posted 19 Feb 2018 by DJStarfox |
Anyone know why the site was offline for so long? |
19) Message boards : Number crunching : New work Discussion
Message 57684 Posted 21 Jan 2018 by DJStarfox |
Speaking of new work, I just got a single hadcm3s model after many months of nothing. Seems very random. Now, the server's queue is 27 but I have no idea which model these are. |
20) Questions and Answers : Unix/Linux : Shutting down for re-boot.
Message 57337 Posted 7 Nov 2017 by DJStarfox |
Dave, I've had better luck with the following global compute preference checked: "Leave non-GPU tasks in memory while suspended" = yes If that is disabled on your account, try enabling it and see if that improves things. |
©2025 cpdn.org