climateprediction.net home page
Posts by KAMasud

21) Message boards : Number crunching : Hardware requirements for upcoming models (Message 65969)
Posted 24 Aug 2022 by KAMasud
Post:
Good idea. Just make Boinc sit in the naughty corner (I wish someone else also) and talk to it.
22) Message boards : Number crunching : upload failure zip file not found (Message 65903)
Posted 20 Aug 2022 by KAMasud
Post:
Try:
sudo apt update
The magic bullet.
23) Message boards : Number crunching : Certificate issue. (Message 65882)
Posted 19 Aug 2022 by KAMasud
Post:
You may need to update your VM Linux image.
I have an Ubuntu VM using VB. I update the guest distribution every couple of weeks or so and have never had a certificate issue with it so it may be as simple as running
sudo apt update
sudo apt upgrade
in the guest OS.


:) Thank you, Dave. That got rid of that thing.
24) Message boards : Number crunching : Certificate issue. (Message 65875)
Posted 19 Aug 2022 by KAMasud
Post:
Richard, that certificate issue is with HADSM4 on Linux in a VM (OS: Mint).
Windows is also involved, but with something else. Here is the log:
18/08/2022 9:35:34 pm | | Suspending computation - CPU is busy
18/08/2022 9:35:44 pm | | Resuming computation
18/08/2022 10:08:54 pm | climateprediction.net | Sending scheduler request: To send trickle-up message.
18/08/2022 10:08:54 pm | climateprediction.net | Requesting new tasks for CPU
18/08/2022 10:08:57 pm | climateprediction.net | Scheduler request completed: got 0 new tasks
18/08/2022 10:08:57 pm | climateprediction.net | No tasks sent
18/08/2022 10:08:57 pm | climateprediction.net | Project requested delay of 3636 seconds
18/08/2022 10:09:01 pm | climateprediction.net | Started upload of wah2_nz25_a07s_198705_25_936_012150042_0_r724561270_9.zip
18/08/2022 10:14:08 pm | climateprediction.net | Temporarily failed upload of wah2_nz25_a07s_198705_25_936_012150042_0_r724561270_9.zip: transient HTTP error
18/08/2022 10:14:08 pm | climateprediction.net | Backing off 00:02:18 on upload of wah2_nz25_a07s_198705_25_936_012150042_0_r724561270_9.zip
18/08/2022 10:14:09 pm | | Project communication failed: attempting access to reference site
18/08/2022 10:14:11 pm | | Internet access OK - project servers may be temporarily down.
19/08/2022 12:40:47 am | climateprediction.net | Started upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_14.zip
19/08/2022 12:40:47 am | climateprediction.net | Started upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_19.zip
19/08/2022 12:45:54 am | climateprediction.net | Temporarily failed upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_14.zip: transient HTTP error
19/08/2022 12:45:54 am | climateprediction.net | Backing off 05:37:33 on upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_14.zip
19/08/2022 12:45:54 am | climateprediction.net | Temporarily failed upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_19.zip: transient HTTP error
19/08/2022 12:45:54 am | climateprediction.net | Backing off 05:44:54 on upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_19.zip
19/08/2022 12:45:55 am | | Project communication failed: attempting access to reference site
19/08/2022 12:45:56 am | | Internet access OK - project servers may be temporarily down.
19/08/2022 1:26:36 am | climateprediction.net | Sending scheduler request: To send trickle-up message.
19/08/2022 1:26:36 am | climateprediction.net | Requesting new tasks for CPU
19/08/2022 1:26:38 am | climateprediction.net | Scheduler request completed: got 0 new tasks
19/08/2022 1:26:38 am | climateprediction.net | Project has no tasks available
19/08/2022 1:26:38 am | climateprediction.net | Project requested delay of 3636 seconds
19/08/2022 1:26:44 am | climateprediction.net | Started upload of wah2_nz25_a0i4_199005_25_936_012150414_0_r1403527717_23.zip
19/08/2022 1:31:51 am | climateprediction.net | Temporarily failed upload of wah2_nz25_a0i4_199005_25_936_012150414_0_r1403527717_23.zip: transient HTTP error
19/08/2022 1:31:51 am | climateprediction.net | Backing off 00:02:57 on upload of wah2_nz25_a0i4_199005_25_936_012150414_0_r1403527717_23.zip
19/08/2022 1:31:52 am | | Project communication failed: attempting access to reference site
19/08/2022 1:31:54 am | | Internet access OK - project servers may be temporarily down.
19/08/2022 1:32:40 am | climateprediction.net | Started upload of wah2_nz25_a1m2_199905_25_936_012151852_0_r334298915_15.zip
19/08/2022 1:37:48 am | climateprediction.net | Temporarily failed upload of wah2_nz25_a1m2_199905_25_936_012151852_0_r334298915_15.zip: transient HTTP error
19/08/2022 1:37:48 am | climateprediction.net | Backing off 00:03:12 on upload of wah2_nz25_a1m2_199905_25_936_012151852_0_r334298915_15.zip
19/08/2022 1:37:49 am | | Project communication failed: attempting access to reference site
19/08/2022 1:37:50 am | | Internet access OK - project servers may be temporarily down.


You are correct, there is a dual conversation going on. The mods can separate them.
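
A log like the one above can be boiled down with a short shell sketch. It assumes the Event Log text has been copied into a file in the same "date | project | message" layout shown here; the file name eventlog.txt and the function names are only illustrations, not anything BOINC provides.

```shell
#!/bin/sh
# Summarise upload trouble from a saved BOINC event log read on stdin.
# Assumption: lines look like the ones pasted in this post.

# Print "<count> <filename>" for every file whose upload failed.
failed_uploads() {
    grep 'Temporarily failed upload of ' \
        | sed -e 's/.*Temporarily failed upload of //' -e 's/: .*$//' \
        | sort | uniq -c | awk '{print $1, $2}'
}

# Print "<HH:MM:SS> <filename>" for every back-off the client chose.
backoffs() {
    sed -n 's/.*Backing off \([0-9:]*\) on upload of \(.*\)$/\1 \2/p'
}
```

Usage would be something like `failed_uploads < eventlog.txt` and `backoffs < eventlog.txt`. It only counts what the client logged; it says nothing about why the server refused the upload.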
25) Message boards : Number crunching : Certificate issue. (Message 65866)
Posted 19 Aug 2022 by KAMasud
Post:
HADSM4.
scheduler request failed; ssl peer certificate or ssh remote key was not ok.
Another problem is that on the first attempt to access the server page, I land somewhere else.
Intermittent service is another.
Just had a hadsm4 zip go through with no problems. If you are still getting problems, could you please post the output in the event log with http debug enabled?


Linux and Internet problems. I will try later but this line is standard: scheduler request failed; ssl peer certificate or ssh remote key was not ok.
26) Message boards : Number crunching : Uploads slow and overtake the system (Message 65858)
Posted 18 Aug 2022 by KAMasud
Post:
Not me.
18/08/2022 8:51:21 am | climateprediction.net | Sending scheduler request: To send trickle-up message.
18/08/2022 8:51:21 am | climateprediction.net | Not requesting tasks: don't need (CPU: job cache full; NVIDIA GPU: ; Intel GPU: )
18/08/2022 8:51:23 am | climateprediction.net | Scheduler request completed
18/08/2022 8:51:23 am | climateprediction.net | Project requested delay of 3636 seconds
18/08/2022 10:53:25 am | climateprediction.net | Started upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_24.zip
18/08/2022 10:53:25 am | climateprediction.net | Started upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_14.zip
18/08/2022 10:58:32 am | climateprediction.net | Temporarily failed upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_14.zip: transient HTTP error
18/08/2022 10:58:32 am | climateprediction.net | Backing off 03:21:12 on upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_14.zip
18/08/2022 10:58:32 am | climateprediction.net | Temporarily failed upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_24.zip: transient HTTP error
18/08/2022 10:58:32 am | climateprediction.net | Backing off 00:05:37 on upload of wah2_nz25_a2db_200605_25_936_012152833_0_r1306077880_24.zip
18/08/2022 10:58:33 am | | Project communication failed: attempting access to reference site
18/08/2022 10:58:35 am | | Internet access OK - project servers may be temporarily down.
27) Message boards : Number crunching : Uploads slow and overtake the system (Message 65855)
Posted 18 Aug 2022 by KAMasud
Post:
HADSM4.
scheduler request failed; ssl peer certificate or ssh remote key was not ok.
Another problem is that on the first attempt to access the server page, I land somewhere else.
Intermittent service is another.
28) Message boards : Number crunching : Lost tasks (Message 65847)
Posted 17 Aug 2022 by KAMasud
Post:
KAMasud -

This probably won't solve your problem, but you might want to merge your computers. Do you really have 27 different computers that have contacted the CPDN website in the last 30 days?

Blue screen of death sounds like a non-CPDN issue.

Go to your Homepage on the CPDN website.
Click on Computers on this account.
At the bottom there is a "Merge by name" feature.

After this you might try the "Remove project/Add Project" again


I would love to merge my computers and did give it a try. Something has changed. It merges by name and all the names are different, computer generated. So, life goes on.
29) Message boards : Number crunching : Lost tasks (Message 65842)
Posted 16 Aug 2022 by KAMasud
Post:
I got the Blue Screen of death. Now I have reinstalled the OS and Boinc but everything is different. If I detach and reattach, I do not think it will change anything, or will it? Confused over the issue.
30) Message boards : Number crunching : Lost tasks (Message 65839)
Posted 15 Aug 2022 by KAMasud
Post:
I upgraded my eighth-gen machine to Windows 11 and the computer promptly crashed, losing 36 tasks. They are in limbo now.
31) Message boards : Number crunching : New work Discussion (Message 65735)
Posted 1 Aug 2022 by KAMasud
Post:
Another possibility, is that you're running Linux in a VM on a Windows machine.

I've found it best not to make life too complicated for a computer running climate models.

That is certainly my experience. I lose a lot more CPDN work units running Ubuntu 20.04.4 under WSL/Win10 than I do on a native Ubuntu 20.04 machine.
And there is nothing wrong with BOINC 7.16.6 for CPDN, or the later ones either (7.18.1, 7.20.2) that I have found.
Layered software, such as VMs, does make for a more complicated life. With an ubuntu VM on VBox/Win10, the finger of blame points towards the automatic monthly Windows updates for the majority of our model crashes. From 88 cpdn models, 71 have completed, 14 have crashed during an unannounced Windoze update, 3 crashed during a hard reboot to recover an unresponsive Windoze. We now pause Windoze updates for as long as possible and once a month ungraciously close the ubuntu VM to do all software updates in one go. This seems to have improved the success rate.


I have paused all Windows updates for five weeks.
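
For what it is worth, the figures quoted above (88 models: 71 completed, 14 crashed during a Windows update, 3 crashed during a hard reboot) can be turned into percentages with a small awk sketch. The helper name share is made up for this example.

```shell
#!/bin/sh
# Outcome shares for the quoted figures: 71 + 14 + 3 = 88 models.
share() {
    # $1 = count, $2 = total; prints the percentage to one decimal place
    awk -v n="$1" -v t="$2" 'BEGIN { printf "%.1f\n", 100 * n / t }'
}

echo "completed:      $(share 71 88)%"
echo "update crashes: $(share 14 88)%"
echo "reboot crashes: $(share 3 88)%"
```

So roughly four in five models finished, and most of the losses in that sample coincided with Windows updates.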
32) Message boards : Number crunching : New work Discussion (Message 65734)
Posted 1 Aug 2022 by KAMasud
Post:
I am still not willing to commit a wholesale massacre of WUs in order to change versions. Also, I am new to the World of Linux. Somebody, whoever it is, should keep the Boinc version up to date.


I don't think I have ever had tasks crash as a result of changing versions, as far as I can tell they are no more likely to crash after a BOINC version change than any other instance of stopping and restarting the client.


I am not well versed in Linux. I know changing versions does not crash WUs. I just do not know how to go about it in Linux, and it will be a massacre. I can, however, follow instructions. There might be others out there also following instructions. If someone can put up an instructions page, we will be happy to follow it.
Yes, I do know my RAM is a bit low, but to increase it you have to shut down Boinc (I am running VMs). These WUs hate being shut down with a passion. However, I have noticed that if I "save the machine state" and then exit, these WUs are quite happy (also, they do not revert to the last checkpoint), but "save the machine state" does not allow a person to increase the RAM. You have to shut down Boinc to do it. It is something like catching a tiger by its tail.
The version of Boinc is not the culprit. It is shutting down these WUs and restarting them. Now that I know this, I will just keep running these machines until they drop dead or complete the tasks issued.
33) Message boards : Number crunching : New work Discussion (Message 65720)
Posted 1 Aug 2022 by KAMasud
Post:
Actually, it's the responsibility of the software packager (in this case, the Linux repository maintainers) to take an appropriate snapshot of the BOINC source code, and package it in a way that suits their distribution and its management tools. That's the way the Linux world works.


Welcome, Richard to the conversation.
I am still not willing to commit a wholesale massacre of WUs in order to change versions. Also, I am new to the World of Linux. Somebody, whoever it is, should keep the Boinc version up to date.

https://boinc.berkeley.edu/wiki/Installing_BOINC
Debian
Open a terminal and enter the following command:

sudo apt-get install boinc-client boinc-manager

Ubuntu
Instructions are here.

Instructions for Ubuntu are the same as for Debian.

Please help out those of us who do not know anything about Linux. In the meantime, I will babysit these WUs.
34) Message boards : Number crunching : New work Discussion (Message 65716)
Posted 31 Jul 2022 by KAMasud
Post:
If the Boinc site is giving instructions to do "sudo apt-get install boinc-client boinc-manager", then it is their responsibility to provide the proper version wherever this command gets it from.
Anyway, what's done, is done. I am not going to abort just to change versions. As it is, even if I look or sneeze too hard, the WU goes into error.
35) Message boards : Number crunching : New work Discussion (Message 65713)
Posted 31 Jul 2022 by KAMasud
Post:
I think the problem may be the BOINC version.

For the BOINC site, it says that version is experimental.

So, where did you get version 7.16.6?



(sudo apt-get install boinc-client boinc-manager)
Is anything wrong with that? If it is, then the instructions given on the Boinc web site should be changed for us. Some WUs are normal.
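
A rough way to see whether an installed version is older than some other version is sketched below. On Debian or Ubuntu, `apt-cache policy boinc-client` should show the installed and candidate versions; the sketch only compares two version strings. The function name version_lt is hypothetical, and 7.20.2 is just a later version mentioned elsewhere in these posts, not a recommendation.

```shell
#!/bin/sh
# version_lt A B: succeeds when version A sorts strictly before version B.
# Assumption: "sort -V" (version sort, a GNU coreutils extension) is available.
version_lt() {
    [ "$1" != "$2" ] &&
        [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$1" ]
}

installed="7.16.6"   # the version reported in this thread
wanted="7.20.2"      # illustrative comparison target only

if version_lt "$installed" "$wanted"; then
    echo "installed $installed is older than $wanted"
else
    echo "installed $installed is at least $wanted"
fi
```

This only tells you which number is older; it does not say that the older version is the cause of any crash.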
36) Message boards : Number crunching : New work Discussion (Message 65706)
Posted 30 Jul 2022 by KAMasud
Post:
What is a buffin?

<core_client_version>7.16.6</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...

Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/xnnuj.pipe_dummy

Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/xnnuj.pipe_dummy

Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/xnnuj.pipe_dummy

Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/xnnuj.pipe_dummy

Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/xnnuj.pipe_dummy

Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/xnnuj.pipe_dummy
Sorry, too many model crashes! :-(
03:39:26 (2934): called boinc_finish(22)

</stderr_txt>
]]>
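
To pull the basics out of a stderr dump like this one, a small sketch along these lines might help. It assumes the stderr text has been saved to a file (stderr.txt is a made-up name) and that the lines look like those above; the function names are illustrative.

```shell
#!/bin/sh
# Read a task's stderr text on stdin.

# Number of "Model crashed:" lines reported by the monitor.
crash_count() { grep -c 'Model crashed:'; }

# Exit code passed to boinc_finish(), if that line is present.
exit_code() { sed -n 's/.*called boinc_finish(\([0-9]*\)).*/\1/p'; }
```

Run as `crash_count < stderr.txt` and `exit_code < stderr.txt`. For the output posted here that would give six crashes and exit code 22.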
37) Message boards : Number crunching : New work Discussion (Message 65700)
Posted 29 Jul 2022 by KAMasud
Post:
We all have our own idiosyncrasy. Water under the bridge.
Still not sure what rule I broke!


Peter, life is unsure and sometimes not fair. We have all been through it.
I miss Mo'v and our conversations, but?
38) Message boards : Number crunching : New work Discussion (Message 65698)
Posted 27 Jul 2022 by KAMasud
Post:
We all have our own idiosyncrasy. Water under the bridge.
39) Message boards : Number crunching : New work Discussion (Message 65695)
Posted 27 Jul 2022 by KAMasud
Post:
[titter] Boinc is a wonderful platform which never ever goes wrong and everything always works perfectly.

__________
What was that all about? LOL
40) Message boards : Number crunching : New work Discussion (Message 65691)
Posted 26 Jul 2022 by KAMasud
Post:
hadam4h? I have looked up the applications. This "h" seems to be new. Predicted days to completion: 103 days. Any ideas? Whatever, it will be fun.


It won't take 103 days.

New machine/BOINC install? That's about what the N216 tasks are estimated to take on "default" performance numbers, before benchmarks have been run. After a while, BOINC will run the benchmarks, and any new tasks downloaded will have a more reasonable estimate, but the ones already present won't update to the new estimate.

It should take 15 days or so, depending on your hardware.

________________

Good to know that it will not take up to 103 days :). It was something new.
Les, the person who completed that task, might not be paying much attention.
Thank you.


©2024 climateprediction.net