Posts by Alan K

1) Message boards : Number crunching : Batch 1005 WAH2 NZ region (Message 70623)
Posted 22 days ago by Alan K
Post:
Mine seem to be going OK:

06/03/2024 21:08:20 | climateprediction.net | Started upload of wah2_nz25_n316_201205_25_1005_012258088_0_r1313910418_19.zip
06/03/2024 21:08:20 | climateprediction.net | [file_xfer] URL: http://upload11.cpdn.org/cgi-bin/file_upload_handler
06/03/2024 21:08:22 | climateprediction.net | [file_xfer] http op done; retval 0 (Success)
06/03/2024 21:08:22 | climateprediction.net | [file_xfer] parsing upload response: <data_server_reply> <status>0</status> <file_size>0</file_size></data_server_reply>
06/03/2024 21:08:22 | climateprediction.net | [file_xfer] parsing status: 0
06/03/2024 21:08:22 | climateprediction.net | [fxd] starting upload, upload_offset 0
06/03/2024 21:08:57 | climateprediction.net | [file_xfer] http op done; retval 0 (Success)
06/03/2024 21:08:57 | climateprediction.net | [file_xfer] parsing upload response: <data_server_reply> <status>0</status></data_server_reply>
06/03/2024 21:08:57 | climateprediction.net | [file_xfer] parsing status: 0
06/03/2024 21:08:57 | climateprediction.net | [file_xfer] file transfer status 0 (Success)
06/03/2024 21:08:57 | climateprediction.net | Finished upload of wah2_nz25_n316_201205_25_1005_012258088_0_r1313910418_19.zip (90557393 bytes)
06/03/2024 21:08:57 | climateprediction.net | [file_xfer] Throughput 2577771 bytes/sec
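
For anyone wanting to pull the same figures out of their own event log, here is a minimal Python sketch. It assumes the pipe-separated `timestamp | project | message` layout shown above; the function name `finished_uploads` and the `SAMPLE` text are illustrative only and are not part of BOINC.

```python
import re
from datetime import datetime

# Patterns for the two message types of interest (layout assumed from the log above).
FINISHED = re.compile(r"Finished upload of (\S+) \((\d+) bytes\)")
THROUGHPUT = re.compile(r"Throughput (\d+) bytes/sec")


def finished_uploads(log_text):
    """Return (timestamp, filename, size_bytes, throughput) for each finished upload.

    throughput is None when no Throughput line follows the upload.
    """
    results = []
    for line in log_text.splitlines():
        parts = [p.strip() for p in line.split("|", 2)]
        if len(parts) != 3:
            continue  # not an event-log line
        stamp, _project, message = parts
        done = FINISHED.search(message)
        if done:
            when = datetime.strptime(stamp, "%d/%m/%Y %H:%M:%S")
            results.append([when, done.group(1), int(done.group(2)), None])
            continue
        rate = THROUGHPUT.search(message)
        if rate and results and results[-1][3] is None:
            # Attach the throughput to the most recent finished upload.
            results[-1][3] = int(rate.group(1))
    return [tuple(r) for r in results]


SAMPLE = """\
06/03/2024 21:08:20 | climateprediction.net | Started upload of wah2_nz25_n316_201205_25_1005_012258088_0_r1313910418_19.zip
06/03/2024 21:08:57 | climateprediction.net | Finished upload of wah2_nz25_n316_201205_25_1005_012258088_0_r1313910418_19.zip (90557393 bytes)
06/03/2024 21:08:57 | climateprediction.net | [file_xfer] Throughput 2577771 bytes/sec
"""

for when, name, size, rate in finished_uploads(SAMPLE):
    print(when, name, size, rate)
```

The date format string assumes the day/month/year order used in my log; a client with a different locale would need a different format.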
2) Message boards : Number crunching : The system cannot find the drive specified. (Message 70423)
Posted 17 Feb 2024 by Alan K
Post:
I had an error on a task from batch 1006 which had been running for about 12 hours. This is the stderr output:

<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
The system cannot find the drive specified.
(0xf) - exit code 15 (0xf)</message>
<stderr_txt>
modelGetExecutables: check control files, strTemp0 & 1 :
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_h484_201912_24_1006_012264284/jobs/xadae.namelists
C:\ProgramData\BOINC/projects/climateprediction.net/wah2_eas25_h484_201912_24_1006_012264284/jobs/xacxf.namelists
modelGetExecutables: unzipping control files : strInput & strTmp
wah2_eas25_h484_201912_24_1006_012264284.zip
wah2_eas25_h484_201912_24_1006_012264284/jobs
gstrDump[0] = generic_phase1_spinup_eas25_global_aabaka
gstrDump[1] = generic_phase1_spinup_eas25_regional_aabaka
global model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2am3m2_um_8.29_windows_intelx86.exe" wah2_eas25_h484_201912_24_1006_012264284 generic_phase1_spinup_eas25_global_aabaka ic19610807_10_N96 ALLclim_ancil_82months_OSTIA_temp_2017-01-01_2023-10-30 ALLclim_ancil_82months_OSTIA_ice_2017-01-01_2023-10-30 SO2DMS_N96_cmip6ssp245_2019-2030 oxi.addfa ozone_cmip6hist-ssp245_N96_1979_2031
regional model: command string: "C:\ProgramData\BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.29_windows_intelx86.exe" wah2_eas25_h484_201912_24_1006_012264284
cpdn_check_running: got RM PID of zero; ignoring this call and waiting for PID via shMem.
executeModelProcess: MonID=8284, GCM_PID=9744, RCM_PID=7216
Global Worker:: CPDN process is not running, exiting, bRetVal = T, checkPID = 9744, selfPID = 9744, iMonCtr = 1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = T, checkPID = 9744, selfPID = 7216, iMonCtr = 1

</stderr_txt>
]]>
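
The exit code is the useful part of that output: 15 (0xf) is the Windows system error code whose message text is "The system cannot find the drive specified." A minimal Python sketch for picking the code out of a saved stderr dump follows; the function name `parse_exit_code` is hypothetical, and the pattern assumes the `exit code N (0xH)` wording shown above.

```python
import re

# Assumed wording, taken from the <message> block above: "exit code 15 (0xf)".
EXIT_CODE = re.compile(r"exit code (\d+) \((0x[0-9a-fA-F]+)\)")


def parse_exit_code(stderr_text):
    """Return the decimal exit code found in the text, or None if absent."""
    match = EXIT_CODE.search(stderr_text)
    if match is None:
        return None
    decimal = int(match.group(1))
    from_hex = int(match.group(2), 16)
    if decimal != from_hex:
        # The two forms should agree; if not, the dump is probably damaged.
        raise ValueError("decimal and hex exit codes disagree")
    return decimal


print(parse_exit_code("(0xf) - exit code 15 (0xf)</message>"))
```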
3) Message boards : Number crunching : New Work 2024 (Message 70356)
Posted 12 Feb 2024 by Alan K
Post:
You could try setting "use computer time" to 100% for both the "in use" and "not in use" cases, to reduce the number of suspends.
4) Message boards : Number crunching : New Work 2024 (Message 70116)
Posted 15 Jan 2024 by Alan K
Post:
I have 7 tasks from the EAS25 batch: 4 are going OK and the other 3 have not yet started. For info: i7-4790K 4.00 GHz CPU, 24 GB RAM, Gigabyte motherboard (the machine is quite old), Windows 10.

There are two EAS batches 1001 and 1002.


Eight 1002 and two 1001 (picked up an extra 2, not repeats)
5) Message boards : Number crunching : New Work 2024 (Message 70113)
Posted 15 Jan 2024 by Alan K
Post:
I have 7 tasks from the EAS25 batch: 4 are going OK and the other 3 have not yet started. For info: i7-4790K 4.00 GHz CPU, 24 GB RAM, Gigabyte motherboard (the machine is quite old), Windows 10.
6) Message boards : Number crunching : Thanks and Merry Christmas. (Message 70092)
Posted 25 Dec 2023 by Alan K
Post:
And to you.
7) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69881)
Posted 15 Oct 2023 by Alan K
Post:
Some of my recent uploads:

15/10/2023 14:26:10 | climateprediction.net | Finished upload of wah2_eas25_a1rl_199612_24_996_012225837_0_r1703371973_13.zip (99338471 bytes)
15/10/2023 14:26:10 | climateprediction.net | [file_xfer] Throughput 181915 bytes/sec

15/10/2023 14:56:24 | climateprediction.net | Finished upload of wah2_eas25_a3is_200712_24_996_012228112_0_r685833828_13.zip (98900336 bytes)
15/10/2023 14:56:24 | climateprediction.net | [file_xfer] Throughput 198808 bytes/sec

15/10/2023 17:38:25 | climateprediction.net | Finished upload of wah2_eas25_a0q1_198912_24_996_012224485_2_r113967697_2.zip (99007240 bytes)
15/10/2023 17:38:25 | climateprediction.net | [file_xfer] Throughput 179704 bytes/sec

which have all gone through OK.
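
To put those throughput figures in context, here is a rough Python sketch that estimates how long each zip took. It assumes the reported throughput is an average over the whole transfer, which the log does not actually state; the name `upload_seconds` is just for illustration.

```python
def upload_seconds(size_bytes, bytes_per_sec):
    """Estimated transfer time, assuming the throughput is a whole-transfer average."""
    return size_bytes / bytes_per_sec


# (size in bytes, throughput in bytes/sec) copied from the three log excerpts above.
UPLOADS = [
    (99338471, 181915),
    (98900336, 198808),
    (99007240, 179704),
]

for size, rate in UPLOADS:
    secs = upload_seconds(size, rate)
    print(f"{size / 1e6:.1f} MB at {rate / 1e3:.0f} kB/s: about {secs / 60:.1f} min")
```

On these numbers each zip of roughly 99 MB takes somewhere between eight and ten minutes.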
8) Message boards : Number crunching : New work discussion - 2 (Message 69847)
Posted 13 Oct 2023 by Alan K
Post:
"[1]No matter how may different ways I use to stop it doing so, it keeps thwarting me."

The answer lies in settings for group permissions in the registry.
9) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69773)
Posted 11 Oct 2023 by Alan K
Post:
Lost 9 of my 12 tasks following a "planned" reboot. Three are unexplained, but the rest all failed with signal 11 (segmentation violation). One resend picked up this morning also failed with a signal 11 segmentation violation.
10) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69707)
Posted 8 Oct 2023 by Alan K
Post:
Trickles are still going through OK. Most zips have gone as well, but one is stuck:

08/10/2023 23:09:16 | climateprediction.net | Backing off 03:26:52 on upload of wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_3.zip

whereas the 4th, 5th and 6th zips have gone:

08/10/2023 11:21:55 | climateprediction.net | Finished upload of wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_4.zip (99468575 bytes)
08/10/2023 08:58:22 | climateprediction.net | Finished upload of wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_5.zip (99652526 bytes)
08/10/2023 20:21:02 | climateprediction.net | Finished upload of wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_6.zip (99535063 bytes)

Should I just abort the transfer or keep my fingers crossed that it will go at some point?
11) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69684)
Posted 7 Oct 2023 by Alan K
Post:
Trickle files uploading OK but zips are now getting stuck.

07/10/2023 08:17:46 | climateprediction.net | [fxd] starting upload, upload_offset -1
07/10/2023 08:17:46 | climateprediction.net | Started upload of wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_3.zip
07/10/2023 08:17:46 | climateprediction.net | [file_xfer] URL: http://upload7.cpdn.org/cgi-bin/file_upload_handler
07/10/2023 08:17:48 | climateprediction.net | [file_xfer] http op done; retval 0 (Success)
07/10/2023 08:17:48 | climateprediction.net | [error] Error reported by file upload server: [wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_3.zip] locked by file_upload_handler PID=3567801
07/10/2023 08:17:48 | climateprediction.net | [file_xfer] parsing upload response: <data_server_reply> <status>1</status> <message>[wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_3.zip] locked by file_upload_handler PID=3567801</message></data_server_reply>
07/10/2023 08:17:48 | climateprediction.net | [file_xfer] parsing status: -127
07/10/2023 08:17:48 | climateprediction.net | [file_xfer] file transfer status -127 (transient upload error)
07/10/2023 08:17:48 | climateprediction.net | Temporarily failed upload of wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_3.zip: transient upload error
07/10/2023 08:17:48 | climateprediction.net | [file_xfer] project-wide upload delay for 1913.964660 sec
07/10/2023 08:17:48 | climateprediction.net | Backing off 00:22:55 on upload of wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_3.zip
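
The server's reply in that log is a small XML fragment, so the status and message can be read with Python's standard library. This is only a sketch under the assumption that the reply is well-formed XML as it appears above; `parse_server_reply` is a made-up helper name, not a BOINC function.

```python
import xml.etree.ElementTree as ET


def parse_server_reply(xml_text):
    """Return (status, message) from a <data_server_reply> fragment.

    message is None when the reply carries no <message> element.
    """
    root = ET.fromstring(xml_text.strip())
    status_text = root.findtext("status")
    status = int(status_text) if status_text is not None else None
    return status, root.findtext("message")


# Reply text copied from the "parsing upload response" line above.
REPLY = (
    "<data_server_reply> <status>1</status> "
    "<message>[wah2_eas25_a49c_201212_24_996_012229068_0_r1003289668_3.zip] "
    "locked by file_upload_handler PID=3567801</message></data_server_reply>"
)

status, message = parse_server_reply(REPLY)
print(status, message)
```

In the log above, the server's status of 1 is followed by the client reporting -127 (transient upload error), which is why the upload is retried rather than abandoned.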
12) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69682)
Posted 6 Oct 2023 by Alan K
Post:
These are going OK. Another 8 picked up!
13) Message boards : Number crunching : Batch 996 Weather@Home2 East Asia25 (Message 69667)
Posted 5 Oct 2023 by Alan K
Post:
Picked up 4. Fingers crossed.
14) Message boards : climateprediction.net Science : Climate change in the News (Message 69610)
Posted 7 Sep 2023 by Alan K
Post:
It also helps to have a lot of European partners in your project.
15) Questions and Answers : Windows : Computation error when BOINC halts (Message 69370)
Posted 19 Jul 2023 by Alan K
Post:
As for Windows automatic updates - they are an absolute pain, and should be blocked - others have suggested ways of doing this.


This can be done using group policies in the registry so that Windows has to ask you to do the updates - if you feel up to it. Search the web for details.
16) Message boards : Number crunching : New work discussion - 2 (Message 69337)
Posted 16 Jul 2023 by Alan K
Post:
On checking, there would seem to be little difference between the machines. The failing machine is an i7 3770 (3.4 GHz) whereas mine is an i7 4790K (4.0 GHz). Both have similar amounts of RAM and are running the same version of Windows 10.
17) Message boards : Number crunching : New work discussion - 2 (Message 69335)
Posted 15 Jul 2023 by Alan K
Post:
Interesting: I have a resend from the NZ batch which, when I looked, had previously failed after the first zip file with a negative theta error. It has now got past the fourth zip on my machine!
18) Message boards : Number crunching : New work discussion - 2 (Message 69191)
Posted 9 Jul 2023 by Alan K
Post:
Still having problems with the out file, though the 25th zip has gone. The out file is stuck at 59% and it is going to upload7.
19) Message boards : Number crunching : New work discussion - 2 (Message 69112)
Posted 5 Jul 2023 by Alan K
Post:
Looks like the server is down again.

05/07/2023 07:43:48 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
05/07/2023 07:43:48 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
05/07/2023 07:43:48 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
05/07/2023 07:43:48 | climateprediction.net | Temporarily failed upload of wah2_eas25_a1t4_200011_25_994_012217782_2_r734268285_25.zip: transient HTTP error
05/07/2023 07:43:48 | climateprediction.net | [file_xfer] project-wide upload delay for 13849.787896 sec
05/07/2023 07:43:48 | climateprediction.net | Backing off 03:46:21 on upload of wah2_eas25_a1t4_200011_25_994_012217782_2_r734268285_25.zip
05/07/2023 07:43:48 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
05/07/2023 07:43:48 | climateprediction.net | Temporarily failed upload of wah2_eas25_a1t4_200011_25_994_012217782_2_r734268285_out.zip: transient HTTP error
05/07/2023 07:43:48 | climateprediction.net | Backing off 03:28:21 on upload of wah2_eas25_a1t4_200011_25_994_012217782_2_r734268285_out.zip
05/07/2023 07:43:50 | | Internet access OK - project servers may be temporarily down.
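
For working out when the client will next try, here is a small Python sketch that turns a "Backing off" line into a retry time. It assumes the `HH:MM:SS` backoff format and day/month/year timestamps seen above; `next_retry` is an illustrative name, and the real retry may also be held back by the project-wide upload delay shown in the log.

```python
import re
from datetime import datetime, timedelta

# Assumed wording from the log above: "Backing off HH:MM:SS on upload of <file>".
BACKOFF = re.compile(r"Backing off (\d+):(\d{2}):(\d{2}) on upload of (\S+)")


def next_retry(line):
    """Return (filename, delay, earliest_retry) for a backoff line, else None."""
    parts = [p.strip() for p in line.split("|", 2)]
    if len(parts) != 3:
        return None
    match = BACKOFF.search(parts[2])
    if match is None:
        return None
    hours, minutes, seconds = (int(match.group(i)) for i in (1, 2, 3))
    delay = timedelta(hours=hours, minutes=minutes, seconds=seconds)
    logged_at = datetime.strptime(parts[0], "%d/%m/%Y %H:%M:%S")
    return match.group(4), delay, logged_at + delay


LINE = (
    "05/07/2023 07:43:48 | climateprediction.net | Backing off 03:46:21 on upload of "
    "wah2_eas25_a1t4_200011_25_994_012217782_2_r734268285_25.zip"
)

name, delay, retry_at = next_retry(LINE)
print(name, int(delay.total_seconds()), retry_at)
```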
20) Message boards : Number crunching : OpenIFS Discussion (Message 68350)
Posted 15 Feb 2023 by Alan K
Post:
Is this related to the floating point issue?

Task 22307753

08:48:48 STEP 61 H= 15:15 +CPU= 23.513
[EC_DRHOOK:ubuntu:1:1:20673:20673] [20230214:084903:1676364543:1561.172] [signal_drhook@/home/abowery/Desktop/OpenIFS/oifs_43r3_bl/gc_oifs43r3-feature-cpdn/src/ifsaux/support/drhook.c:1538] Received signal#8 (SIGFPE) :: 4362MB (heap), 5076MB (maxrss), 0MB (maxstack), 0 (paging), nsigs = 1
[EC_DRHOOK:ubuntu:1:1:20673:20673] [20230214:084903:1676364543:1561.172] [signal_drhook@/home/abowery/Desktop/OpenIFS/oifs_43r3_bl/gc_oifs43r3-feature-cpdn/src/ifsaux/support/drhook.c:1542] Also activating Harakiri-alarm (SIGALRM=14) to expire after 500s elapsed to prevent hangs, nsigs = 1
[EC_DRHOOK:ubuntu:1:1:20673:20673] [20230214:084903:1676364543:1561.172] [signal_drhook@/home/abowery/Desktop/OpenIFS/oifs_43r3_bl/gc_oifs43r3-feature-cpdn/src/ifsaux/support/drhook.c:1544] Harakiri signal handler 'signal_harakiri' for signal#14 (SIGALRM) installed at 0x81f0c0 (old at (nil))
[EC_DRHOOK:ubuntu:1:1:20673:20673] [20230214:084903:1676364543:1561.172] [signal_drhook@/home/abowery/Desktop/OpenIFS/oifs_43r3_bl/gc_oifs43r3-feature-cpdn/src/ifsaux/support/drhook.c:1617] Signal#8 was caused by floating-point overflow [memaddr=0x1cc4a8f], nsigs = 1
[EC_DRHOOK:ubuntu:1:1:20673:20673] [20230214:084903:1676364543:1561.172] [signal_drhook@/home/abowery/Desktop/OpenIFS/oifs_43r3_bl/gc_oifs43r3-feature-cpdn/src/ifsaux/support/drhook.c:1686] Starting DrHook backtrace for signal#8, nsigs = 1
[EC_DRHOOK:ubuntu:1:1:20673:20673] [20230214:084903:1676364543:1561.172] [c_drhook_print_@/home/abowery/Desktop/OpenIFS/oifs_43r3_bl/gc_oifs43r3-feature-cpdn/src/ifsaux/support/drhook.c:3843] 4362 MB (maxheap), 5076 MB (maxrss), 0 MB (maxstack)
[EC_DRHOOK:ubuntu:1:1:20673:20673] [20230214:084903:1676364543:1561.172] [c_drhook_print_@/home/abowery/Desktop/OpenIFS/oifs_43r3_bl/gc_oifs43r3-feature-cpdn/src/ifsaux/support/drhook.c:3897] : MASTER
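
The key facts in that trace (signal number, cause, memory use) can be picked out with a few regular expressions. This is a hedged Python sketch: the patterns are inferred only from the lines quoted above, the `summarise_drhook` name is hypothetical, and the sample abbreviates the long `[EC_DRHOOK...]` prefix to `[...]` for readability.

```python
import re

# Patterns inferred from the DrHook lines quoted above, not from any DrHook documentation.
SIGNAL = re.compile(r"Received signal#(\d+) \((SIG[A-Z0-9]+)\)")
CAUSE = re.compile(r"Signal#(\d+) was caused by ([^\[]+?)\s*\[")
MEMORY = re.compile(r"(\d+)MB \(heap\), (\d+)MB \(maxrss\)")


def summarise_drhook(log_text):
    """Collect the signal number/name, stated cause and memory figures, where present."""
    summary = {}
    sig = SIGNAL.search(log_text)
    if sig:
        summary["signal"] = int(sig.group(1))
        summary["name"] = sig.group(2)
    cause = CAUSE.search(log_text)
    if cause:
        summary["cause"] = cause.group(2)
    mem = MEMORY.search(log_text)
    if mem:
        summary["heap_mb"] = int(mem.group(1))
        summary["maxrss_mb"] = int(mem.group(2))
    return summary


# Prefixes shortened to "[...]"; the message text is copied from the trace above.
SAMPLE = """\
[...] Received signal#8 (SIGFPE) :: 4362MB (heap), 5076MB (maxrss), 0MB (maxstack), 0 (paging), nsigs = 1
[...] Signal#8 was caused by floating-point overflow [memaddr=0x1cc4a8f], nsigs = 1
"""

print(summarise_drhook(SAMPLE))
```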

