climateprediction.net home page
Posts by Trotador

Posts by Trotador

1) Message boards : Number crunching : New work Discussion (Message 58992)
Posted 11 Nov 2018 by Trotador
Post:
And AGAIN they're only showing one trickle. :(
I've sent an email about it.

......


This is happening to me, I downloaded a bunch of units on the 6th, they tricked on the 7th and no more, one even finished and does not have anything else in the web page that the tricckle on 7th. The rest of units still have 5/6 days of crunching ahead left...

And today points only for the lonely trickle.
2) Message boards : Number crunching : New work Discussion (Message 56609)
Posted 1 Aug 2017 by Trotador
Post:
599's and 617's failing in my linux host

With SIGSEGV: segmentation violation
3) Message boards : Number crunching : New work Discussion (Message 56071)
Posted 14 Apr 2017 by Trotador
Post:
In my linux hosts most of them are ending in computing error after six seven hours crunching

<core_client_version>7.6.31</core_client_version>
<![CDATA[
<stderr_txt>
SIGSEGV: segmentation violation
Stack trace (12 frames):
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu(boinc_catch_signal+0x67)[0x839e357]
[0x2a999ca0]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x814442b]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x814b133]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8141220]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x813ff46]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8077583]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x831cd74]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8330985]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x833318a]
/home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8334c8d]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf7)[0x2a767637]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=38309, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
Calling boinc_finish...09:36:24 (38309): called boinc_finish(0)
In boinc_exit called with status 0
Calloing set_signal_exit_code with status 0

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_1.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_2.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_3.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_4.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_5.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_6.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_7.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_8.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_9.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_10.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_11.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_12.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_13.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_restart.zip</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
4) Message boards : Number crunching : Late November batch of Windows work (Message 53005)
Posted 1 Dec 2015 by Trotador
Post:
Data from downloaded and runing units

trickles:
Latest Trickles Received
Result ID 	Result Name 	Phase	Timestep 	 CPU Time (sec) 	Average (sec/TS)
19107519 	wah2_eu25_h1ao_197112_12_010205426_0 1 	23,339 	161,793 		6.9323
19107519 	wah2_eu25_h1ao_197112_12_010205426_0 1 	11,819 	82,173 		6.9526

Latest Trickles Received
Result ID 	Result Name 	Phase 	Timestep 	CPU Time (sec) 	Average (sec/TS)
19107486 	wah2_eu25_h0ik_197012_12_010205393_0 	1 	23,339 	161,913 	6.9374
19107486 	wah2_eu25_h0ik_197012_12_010205393_0 	1 	11,819 	82,149 	6.9506

Latest Trickles Received
Result ID 	Result Name 	Phase 	Timestep 	CPU Time (sec) 	Average (sec/TS)
19107361 	wah2_eu25_f2bd_195212_12_010202599_1 	1 	23,339 	161,386 	6.9149
19107361 	wah2_eu25_f2bd_195212_12_010202599_1 	1 	11,819 	82,047 	6.9420


zip files sizes:

16.407 wah2_eu25_j1ic_199112_12_010208514.zip
15.844 wah2_eu25_c0dm_192012_12_010197870.zip
15.631 wah2_eu25_f2bd_195212_12_010202599.zip
16.224 wah2_eu25_g2hd_196212_12_010204179.zip
15.735 wah2_eu25_g8il_196812_12_010205096.zip
15.629 wah2_eu25_h0ib_197012_12_010205384.zip
15.629 wah2_eu25_h0ik_197012_12_010205393.zip
15.631 wah2_eu25_h1ao_197112_12_010205426.zip
5) Message boards : Number crunching : Task at 100% but not finishing (Message 48801)
Posted 14 Apr 2014 by Trotador
Post:
No trace of upload in the stdout file. I can see the last trickle message but I can't find any later finished/upload message.

In the "dataout" folder of the task, there are two files with the name of the unit and ending in .nc that I guess should be the output data but no zip file.

Edit: these two files update their modification date every time I click boinc manager update
6) Message boards : Number crunching : Task at 100% but not finishing (Message 48795)
Posted 14 Apr 2014 by Trotador
Post:
When I click update there is no relevant message, just "sending scheduler reques requested by user" and "scheduler request completed". there is no files in the transfer tab.

I've looked in the cpdn directory and there is a folder for this unit. Two text files and several folders are inside. The files, stderr_um.txt is empty and stdout_um.txt contains what seems to be all the development of the task processing ending with "Model finished with xxxx CPU time... Closing model.." which looks like the task was properly finished.

The folders are full of files.

I can't state whether the zip file has been sent or not.

thanks
7) Message boards : Number crunching : Task at 100% but not finishing (Message 48792)
Posted 14 Apr 2014 by Trotador
Post:
Hi,

This task

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/result.php?resultid=16386669

is about to be 24 hours at 100% but boincmanager says it is still running. However, it is actually not using any CPU resource. I have shutdown the boincmanager and check that it does not kept stuck in memory. Suspending and restarting the task did not help either.

It seems that the computation has finished but for whatever reason it does not manage to close itself. I think that all crunching results should be already at cpdn server. Could you confirm and advise?. If I abort it will be reissued and maybe it is not necessary.

thanks


8) Message boards : Number crunching : ANOTHER UPLOAD PROBLEM (Message 47675)
Posted 27 Nov 2013 by Trotador
Post:
Trotador

Could you please have a look in client_state.xml with notepad, (better still, a copy of it :) ), look for the string 6w0r, and keep looking until you reach the line for zip13.
Then, a few lines earlier, there'll be a line ending in file_upload_handler.
Copy this line and paste it here.

This will tell me the exact url to which BOINC is trying to send the file.

Thanks.



this one?

<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083.zip</name>
<nbytes>12766.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>582a163389f911283352728c384377d6</md5_cksum>
<status>1</status>
<url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/hadam3p_pnw_6w0r_2002_1_007590083.zip</url>
</file_info>
<file_info>
<name>ic19611201_10_N96.gz</name>
<nbytes>1312884.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>aec9660d191182affe1d98eeda2d6abe</md5_cksum>
<status>1</status>
<url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=ic19611201_10_N96.gz</url>
</file_info>
<file_info>
<name>xaclfa.start.0000.gz</name>
<nbytes>26657809.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>a4a3d4947da96f3f37dff5c0878407c0</md5_cksum>
<status>1</status>
<url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=xaclfa.start.0000.gz</url>
</file_info>
<file_info>
<name>dchaba.start.pnw.b.0000.gz</name>
<nbytes>6083952.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>2f3d1c3e97c7eb668d15f3cb4e7e5fbc</md5_cksum>
<status>1</status>
<url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=dchaba.start.pnw.b.0000.gz</url>
</file_info>
<file_info>
<name>HadISST_SI_N96_2002_12_2004_01f.gz</name>
<nbytes>1085060.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>ca7d7f7c03b72c5b027071ce157bbe86</md5_cksum>
<status>1</status>
<url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=HadISST_SI_N96_2002_12_2004_01f.gz</url>
</file_info>
<file_info>
<name>HadISST_SST_N96_2002_12_2004_01f.gz</name>
<nbytes>3893018.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>8b234747ed142e60d75242a5b2cf1c3f</md5_cksum>
<status>1</status>
<url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=HadISST_SST_N96_2002_12_2004_01f.gz</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_1.zip</name>
<nbytes>7563497.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>339d82aa5b1763cde20d2388681a5f39</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_2.zip</name>
<nbytes>7577967.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>9e280e9af5046245a5681f1f95bc1d9d</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_3.zip</name>
<nbytes>7657353.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>a20b7679b13537c530b5595b3113b464</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_4.zip</name>
<nbytes>7737615.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>07595e77fffc4033bfaacba8d6597597</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_5.zip</name>
<nbytes>7785628.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>0e9e3fe10610d31f93d4224d11a3bfba</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_6.zip</name>
<nbytes>7713417.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>1e736f1ea5006028fcf796ae5cf2df53</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_7.zip</name>
<nbytes>7711829.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>296ef500ea92f563f31340e80b2a68e9</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_8.zip</name>
<nbytes>7707894.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>9b091c143608b03a7ba633cac0dccfd3</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_9.zip</name>
<nbytes>7800310.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>9ec5a615dfcac482a7e7c93cf421607d</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_10.zip</name>
<nbytes>7853736.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>602dc898a4537497b79d4c39b9f583e9</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_11.zip</name>
<nbytes>7896535.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>f36058919a4a856648010be9e23e2139</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_12.zip</name>
<nbytes>8044336.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>1fea2126fd65965d676e4e60ef50fdfe</md5_cksum>
<generated_locally/>
<status>0</status>
<uploaded/>
<upload_when_present/>
<url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url>
</file_info>
<file_info>
<name>hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip</name>
<nbytes>33737443.000000</nbytes>
<max_nbytes>150000000.000000</max_nbytes>
<md5_cksum>44c55ea026ea604749cc9ff8592c7fad</md5_cksum>
<generated_locally/>
<status>1</status>
<upload_when_present/>
<url>http://cpdn-restarts.oerc.ox.ac.uk/cgi-bin/file_upload_handler</url>
<persistent_file_xfer>
<num_retries>28</num_retries>
<first_request_time>1385379856.550667</first_request_time>
<next_request_time>1385534194.430921</next_request_time>
<time_so_far>1044.810202</time_so_far>
<last_bytes_xferred>33737658.000000</last_bytes_xferred>
</persistent_file_xfer>
</file_info>
9) Message boards : Number crunching : ANOTHER UPLOAD PROBLEM (Message 47665)
Posted 26 Nov 2013 by Trotador
Post:
Same issue with this unit. Should I abort?

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/workunit.php?wuid=7768213


mar 26 nov 2013 15:28:08 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
mar 26 nov 2013 15:28:44 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory
mar 26 nov 2013 15:28:44 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error
mar 26 nov 2013 15:28:44 CET climateprediction.net Backing off 2 hr 6 min 9 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
mar 26 nov 2013 17:34:53 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
mar 26 nov 2013 17:35:30 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory
mar 26 nov 2013 17:35:30 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error
mar 26 nov 2013 17:35:30 CET climateprediction.net Backing off 2 hr 38 min 0 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
mar 26 nov 2013 20:13:31 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
mar 26 nov 2013 20:14:07 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory
mar 26 nov 2013 20:14:07 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error
mar 26 nov 2013 20:14:07 CET climateprediction.net Backing off 1 hr 34 min 2 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
mar 26 nov 2013 21:35:50 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
mar 26 nov 2013 21:36:24 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory
mar 26 nov 2013 21:36:24 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error
mar 26 nov 2013 21:36:24 CET climateprediction.net Backing off 8 min 55 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip
10) Message boards : Number crunching : Several jobs uploads in project backoff (Message 46115)
Posted 29 Apr 2013 by Trotador
Post:
Yeah, here too with two wus in back-off mode...




©2024 climateprediction.net