|
1)
Message boards :
Number crunching :
New work Discussion
(Message 58992)
Posted 11 Nov 2018 by Trotador Post: And AGAIN they're only showing one trickle. :( This is happening to me, I downloaded a bunch of units on the 6th, they tricked on the 7th and no more, one even finished and does not have anything else in the web page that the tricckle on 7th. The rest of units still have 5/6 days of crunching ahead left... And today points only for the lonely trickle. |
2)
Message boards :
Number crunching :
New work Discussion
(Message 56609)
Posted 1 Aug 2017 by Trotador Post: 599's and 617's failing in my linux host With SIGSEGV: segmentation violation |
3)
Message boards :
Number crunching :
New work Discussion
(Message 56071)
Posted 14 Apr 2017 by Trotador Post: In my linux hosts most of them are ending in computing error after six seven hours crunching <core_client_version>7.6.31</core_client_version> <![CDATA[ <stderr_txt> SIGSEGV: segmentation violation Stack trace (12 frames): /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu(boinc_catch_signal+0x67)[0x839e357] [0x2a999ca0] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x814442b] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x814b133] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8141220] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x813ff46] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8077583] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x831cd74] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8330985] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x833318a] /home/antonio/BOINC/projects/climateprediction.net/wah2rm3m2t_um_8.25_i686-pc-linux-gnu[0x8334c8d] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf7)[0x2a767637] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=38309, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... Calling boinc_finish...09:36:24 (38309): called boinc_finish(0) In boinc_exit called with status 0 Calloing set_signal_exit_code with status 0 </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_1.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_2.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_3.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_13.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_eu50r_mqvo_201512_13_561_010984979_1_r777845073_restart.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
4)
Message boards :
Number crunching :
Late November batch of Windows work
(Message 53005)
Posted 1 Dec 2015 by Trotador Post: Data from downloaded and runing units trickles: Latest Trickles Received Result ID Result Name Phase Timestep CPU Time (sec) Average (sec/TS) 19107519 wah2_eu25_h1ao_197112_12_010205426_0 1 23,339 161,793 6.9323 19107519 wah2_eu25_h1ao_197112_12_010205426_0 1 11,819 82,173 6.9526 Latest Trickles Received Result ID Result Name Phase Timestep CPU Time (sec) Average (sec/TS) 19107486 wah2_eu25_h0ik_197012_12_010205393_0 1 23,339 161,913 6.9374 19107486 wah2_eu25_h0ik_197012_12_010205393_0 1 11,819 82,149 6.9506 Latest Trickles Received Result ID Result Name Phase Timestep CPU Time (sec) Average (sec/TS) 19107361 wah2_eu25_f2bd_195212_12_010202599_1 1 23,339 161,386 6.9149 19107361 wah2_eu25_f2bd_195212_12_010202599_1 1 11,819 82,047 6.9420 zip files sizes: 16.407 wah2_eu25_j1ic_199112_12_010208514.zip 15.844 wah2_eu25_c0dm_192012_12_010197870.zip 15.631 wah2_eu25_f2bd_195212_12_010202599.zip 16.224 wah2_eu25_g2hd_196212_12_010204179.zip 15.735 wah2_eu25_g8il_196812_12_010205096.zip 15.629 wah2_eu25_h0ib_197012_12_010205384.zip 15.629 wah2_eu25_h0ik_197012_12_010205393.zip 15.631 wah2_eu25_h1ao_197112_12_010205426.zip |
5)
Message boards :
Number crunching :
Task at 100% but not finishing
(Message 48801)
Posted 14 Apr 2014 by Trotador Post: No trace of upload in the stdout file. I can see the last trickle message but I can't find any later finished/upload message. In the "dataout" folder of the task, there are two files with the name of the unit and ending in .nc that I guess should be the output data but no zip file. Edit: these two files update their modification date every time I click boinc manager update |
6)
Message boards :
Number crunching :
Task at 100% but not finishing
(Message 48795)
Posted 14 Apr 2014 by Trotador Post: When I click update there is no relevant message, just "sending scheduler reques requested by user" and "scheduler request completed". there is no files in the transfer tab. I've looked in the cpdn directory and there is a folder for this unit. Two text files and several folders are inside. The files, stderr_um.txt is empty and stdout_um.txt contains what seems to be all the development of the task processing ending with "Model finished with xxxx CPU time... Closing model.." which looks like the task was properly finished. The folders are full of files. I can't state whether the zip file has been sent or not. thanks |
7)
Message boards :
Number crunching :
Task at 100% but not finishing
(Message 48792)
Posted 14 Apr 2014 by Trotador Post: Hi, This task http://climateapps2.oerc.ox.ac.uk/cpdnboinc/result.php?resultid=16386669 is about to be 24 hours at 100% but boincmanager says it is still running. However, it is actually not using any CPU resource. I have shutdown the boincmanager and check that it does not kept stuck in memory. Suspending and restarting the task did not help either. It seems that the computation has finished but for whatever reason it does not manage to close itself. I think that all crunching results should be already at cpdn server. Could you confirm and advise?. If I abort it will be reissued and maybe it is not necessary. thanks |
8)
Message boards :
Number crunching :
ANOTHER UPLOAD PROBLEM
(Message 47675)
Posted 27 Nov 2013 by Trotador Post: Trotador this one? <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083.zip</name> <nbytes>12766.000000</nbytes> <max_nbytes>0.000000</max_nbytes> <md5_cksum>582a163389f911283352728c384377d6</md5_cksum> <status>1</status> <url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/hadam3p_pnw_6w0r_2002_1_007590083.zip</url> </file_info> <file_info> <name>ic19611201_10_N96.gz</name> <nbytes>1312884.000000</nbytes> <max_nbytes>0.000000</max_nbytes> <md5_cksum>aec9660d191182affe1d98eeda2d6abe</md5_cksum> <status>1</status> <url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=ic19611201_10_N96.gz</url> </file_info> <file_info> <name>xaclfa.start.0000.gz</name> <nbytes>26657809.000000</nbytes> <max_nbytes>0.000000</max_nbytes> <md5_cksum>a4a3d4947da96f3f37dff5c0878407c0</md5_cksum> <status>1</status> <url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=xaclfa.start.0000.gz</url> </file_info> <file_info> <name>dchaba.start.pnw.b.0000.gz</name> <nbytes>6083952.000000</nbytes> <max_nbytes>0.000000</max_nbytes> <md5_cksum>2f3d1c3e97c7eb668d15f3cb4e7e5fbc</md5_cksum> <status>1</status> <url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=dchaba.start.pnw.b.0000.gz</url> </file_info> <file_info> <name>HadISST_SI_N96_2002_12_2004_01f.gz</name> <nbytes>1085060.000000</nbytes> <max_nbytes>0.000000</max_nbytes> <md5_cksum>ca7d7f7c03b72c5b027071ce157bbe86</md5_cksum> <status>1</status> <url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=HadISST_SI_N96_2002_12_2004_01f.gz</url> </file_info> <file_info> <name>HadISST_SST_N96_2002_12_2004_01f.gz</name> <nbytes>3893018.000000</nbytes> <max_nbytes>0.000000</max_nbytes> <md5_cksum>8b234747ed142e60d75242a5b2cf1c3f</md5_cksum> <status>1</status> <url>http://cpdn-downloads.oerc.ox.ac.uk/download/hadam3p/ancil/mirror.php?file=HadISST_SST_N96_2002_12_2004_01f.gz</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_1.zip</name> <nbytes>7563497.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>339d82aa5b1763cde20d2388681a5f39</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_2.zip</name> <nbytes>7577967.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>9e280e9af5046245a5681f1f95bc1d9d</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_3.zip</name> <nbytes>7657353.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>a20b7679b13537c530b5595b3113b464</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_4.zip</name> <nbytes>7737615.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>07595e77fffc4033bfaacba8d6597597</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_5.zip</name> <nbytes>7785628.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>0e9e3fe10610d31f93d4224d11a3bfba</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_6.zip</name> <nbytes>7713417.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>1e736f1ea5006028fcf796ae5cf2df53</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_7.zip</name> <nbytes>7711829.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>296ef500ea92f563f31340e80b2a68e9</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_8.zip</name> <nbytes>7707894.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>9b091c143608b03a7ba633cac0dccfd3</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_9.zip</name> <nbytes>7800310.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>9ec5a615dfcac482a7e7c93cf421607d</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_10.zip</name> <nbytes>7853736.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>602dc898a4537497b79d4c39b9f583e9</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_11.zip</name> <nbytes>7896535.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>f36058919a4a856648010be9e23e2139</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_12.zip</name> <nbytes>8044336.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>1fea2126fd65965d676e4e60ef50fdfe</md5_cksum> <generated_locally/> <status>0</status> <uploaded/> <upload_when_present/> <url>http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler</url> </file_info> <file_info> <name>hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip</name> <nbytes>33737443.000000</nbytes> <max_nbytes>150000000.000000</max_nbytes> <md5_cksum>44c55ea026ea604749cc9ff8592c7fad</md5_cksum> <generated_locally/> <status>1</status> <upload_when_present/> <url>http://cpdn-restarts.oerc.ox.ac.uk/cgi-bin/file_upload_handler</url> <persistent_file_xfer> <num_retries>28</num_retries> <first_request_time>1385379856.550667</first_request_time> <next_request_time>1385534194.430921</next_request_time> <time_so_far>1044.810202</time_so_far> <last_bytes_xferred>33737658.000000</last_bytes_xferred> </persistent_file_xfer> </file_info> |
9)
Message boards :
Number crunching :
ANOTHER UPLOAD PROBLEM
(Message 47665)
Posted 26 Nov 2013 by Trotador Post: Same issue with this unit. Should I abort? http://climateapps2.oerc.ox.ac.uk/cpdnboinc/workunit.php?wuid=7768213 mar 26 nov 2013 15:28:08 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip mar 26 nov 2013 15:28:44 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory mar 26 nov 2013 15:28:44 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error mar 26 nov 2013 15:28:44 CET climateprediction.net Backing off 2 hr 6 min 9 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip mar 26 nov 2013 17:34:53 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip mar 26 nov 2013 17:35:30 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory mar 26 nov 2013 17:35:30 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error mar 26 nov 2013 17:35:30 CET climateprediction.net Backing off 2 hr 38 min 0 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip mar 26 nov 2013 20:13:31 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip mar 26 nov 2013 20:14:07 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory mar 26 nov 2013 20:14:07 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error mar 26 nov 2013 20:14:07 CET climateprediction.net Backing off 1 hr 34 min 2 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip mar 26 nov 2013 21:35:50 CET climateprediction.net Started upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip mar 26 nov 2013 21:36:24 CET climateprediction.net [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: No such file or directory mar 26 nov 2013 21:36:24 CET climateprediction.net Temporarily failed upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip: transient upload error mar 26 nov 2013 21:36:24 CET climateprediction.net Backing off 8 min 55 sec on upload of hadam3p_pnw_6w0r_2002_1_007590083_2_13.zip |
10)
Message boards :
Number crunching :
Several jobs uploads in project backoff
(Message 46115)
Posted 29 Apr 2013 by Trotador Post: Yeah, here too with two wus in back-off mode... |
©2023 climateprediction.net