climateprediction.net home page
Posts by RichardRodd

Posts by RichardRodd

1) Questions and Answers : Windows : Two more models behaving strangely (Message 41498)
Posted 17 Jan 2011 by RichardRodd
Post:
Well the bizarre thing is that it only shows a Blue World SOMETIMES - usually when the Time to Complete zooms upwards. Right now TTC is running at 846 hours (Completed is 1018, 51%), but when TTC next sits at approx 1300 hours (% complete ~35%) the temperature (default?) graphics will then show a Blue World - I've not checked all graphic modes in that state - to be honest I can't remember how to!

So if you care to tell me how to check, I'll let you know next time it glitches.

In the meantime, the other model (12453603) just struggles onwards ...

Cheers - Richard
2) Questions and Answers : Windows : Two more models behaving strangely (Message 41491)
Posted 17 Jan 2011 by RichardRodd
Post:
12018622 a HadCM3 Coupled Model Experiment Optimised File I/O v6.04 - has been running for ever. I's progress and completion numbers fluctuate wildly and sometimes the graphics show a Blue World. At any particular level, Time To Complete seems to rise gently.. Weird.

On the other CPU I have 12453603 which may be suffering as a consequence and just seems to hover at 98 hours to completion without going anywhere.

Any ideas? I have stoppped and retarted BOINC as well as rebooted the m/c a number of times.

Thanks for any help -- Richard
3) Questions and Answers : Windows : HADAM3P Model stuck (Message 41400)
Posted 31 Dec 2010 by RichardRodd
Post:
Worked a treat - nothing like flushing out the old buffers!!
4) Questions and Answers : Windows : HADAM3P Model stuck (Message 41388)
Posted 30 Dec 2010 by RichardRodd
Post:
Hi,

Can anyone advise me whether these models can descend into Ice Worlds, and thereby merit aborting?

I'm running hadam3p_eu_w2q6_1989_1_006770086_1, Task 12314662, work unit 6973402. It's been hung at 99+% for a few days (not sure how long to be honest) and now I see the estimate to completion is slowly rising, while the data (http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=12314662) seems to suggest the temperatures are falling...

Advice please?

Many thanks - Richard
5) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 36509)
Posted 27 Mar 2009 by RichardRodd
Post:
Greetings, again.

Further to the post just above, I suspended those two tasks, and the waiting task that started up is behaving the same... a rising to-completion number from the outset.

Can someone help me fathom this out? Might it be because this m/c is running with quite a full hard drive - 10Gb free out of 70.

This is a 2-cpu m/c and BOINC hasn\'t downloaded a second task to me. I found a BOINC discussion thread (http://einstein.phys.uwm.edu/forum_thread.php?id=6389) that suggested BOINC knows this m/c is struggling, but I\'m not sure how to go about trouble-shooting...

Oh, I also upgraded from 6.4.5 to 6.4.7, so it should be a clean installation here.

All help gratefully received - as ever!

Many thanks - Richard
6) Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion (Message 36507)
Posted 27 Mar 2009 by RichardRodd
Post:
Greetings.

I\'ve just spotted a couple of very slow-running models. I should probably abort - can someone please advise...

1. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7673807
Current timestep = 49417 of 259248
s/TS = 56.6

2. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7731951
Current timestep = 39775 of 259248
s/TS = 3.64

Neither are blue / ice worlds, but the to-completion time is rising, so maybe they\'re turning blue...
It\'s an Intel machine runnig WinXP, not overclocked and has, prior to these two models, turned in a few complete runs.

Thanks - Richard
7) Message boards : Number crunching : A blue world for aborting...? (Message 36258)
Posted 28 Feb 2009 by RichardRodd
Post:
Thanks everyone for all those comments and suggestions. I\'ll start on the task list suggested and keep you posted, but it\'s a \'remote\' m/c, so it\'ll take a week or three...

Bye for now.
8) Message boards : Number crunching : A blue world for aborting...? (Message 36248)
Posted 28 Feb 2009 by RichardRodd
Post:
Greetings all.

I\'m posting back here after a week or three to ask for more advice, but I don\'t want to go over the top with this...!

I\'m wondering if the problem I reported above may well be this particular machine - the \'elderly\' Packard Bell. Each work unit loaded and run here seems to rapidly turn into a Blue World. I have uninstalled and reinstalled BOINC (now at Ver 6.4.5) but it seems to have made no difference.

I\'ve had a look at how the current task 6191307 is running elsewhere. The three users with the highest credit are still in Phase 1 with no reported precipitation. I now understand this is a warning sign mentioned above for a Blue World, but I\'m not sure at what stage: the last note above suggests at stage-end - which implies I should keep the faith until end of Stage 1. Anyway, the s/TS of these other users are reasonable and or steady, mine is already 8.33 and rising. But that might have something to do with the computer\'s general use - I don\'t know?

So here\'s my question. Is this actually indicative of (another) Blue World - so do I abort and try again? Or is this telling me I have a m/c that can\'t hack it?! Some of the posts above suggested the latter... This model is running steadily (after a fashion!) but at 74 CPU hours and 4% complete something is clearly not happy - and, as you\'d expect, the \'To completion\' figure just rises steadily, if imperceptably - now on 650 hours. On better behaved, albeit more powerful, members of my little \'family\' I expect to complete a model in some 400-500 hours, and do.

When I look at the m/c it seems unremarkable - XP (Home edition, admittedly) SP3. A few years old, a chip that\'s neither fast nor slow. BIOS...do we really care? I put 2Gb RAM in it, so it\'s not that. Plenty of space on the hard drive. No interference from other apps.

Any ideas anyone?

Many thanks as always - Richard

9) Message boards : Number crunching : A blue world for aborting...? (Message 36028)
Posted 26 Jan 2009 by RichardRodd
Post:
Hi there mo.v and thanks for your comprehensive advice.

When I hit my first ice world a couple of years ago, the advice then was just let it run because it *will* run to completion: I think I had a couple back then ... and they did, but only after some 1000+ cpu hours!

The tasks you suggest I abort are both only running with a S/TS less than 6 (how high that number was used to be a factor in advising whether or not to abort, I recall), and one has only approx 100 hours to go (the other 300). It seems a shame to kill them at this late stage IF IF IF ice world results DO contribute useful information, as was implied to me the first time around. But I gather from your penultimate paragraph that wisdom has moved on and they\'re now considered of no value.

I\'d appreciate your final view on that...

Anyhow, the task that sparked this thread was the first one on a new computer I\'ve just added to my \'family\'. But it\'s an ancient Packard Bell, so may not be up to the task, even though the raw spec suggested it\'d be okay - a 2.66GHz chip with 2GB RAM, not used for anything other than basic MS Office stuff - no hungry graphics games! So I\'ve got the next task running and will watch progress with interest.

Once again, many thanks for you interest.

Kind regards
Richard
10) Message boards : Number crunching : A blue world for aborting...? (Message 36008)
Posted 24 Jan 2009 by RichardRodd
Post:
Thanks Iain.

Sadly no backup - so I\'ve consigned this one to the bin...

I\'ll watch the progress of the next one with interest to see if there\'s some other problem there as you suspect. After which probably a BOINC reinstall... sigh!

All the best.
Richard
11) Message boards : Number crunching : A blue world for aborting...? (Message 36000)
Posted 24 Jan 2009 by RichardRodd
Post:
Hi.

I seem to have picked up a clutch of blue worlds on my small family of machines.

The first one is hadsm3fub_k78a_005974157_9 which I started late December. I only just spotted it\'s blue! It\'s never trickled, is already running at 90 s/TS and the graphics display tells me it\'s only got to model date 25/03/1811 (a cold day for March...!).

Can you advise me is this one for aborting? I\'m very happy to keep the faith - I have had blue worlds run for months and eventually complete, but I don\'t think I\'ve ever seen one this early in the cycle.

Thanks for your help - Richard
12) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 34691)
Posted 19 Aug 2008 by RichardRodd
Post:
It was a service installation.
13) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 34689)
Posted 19 Aug 2008 by RichardRodd
Post:
It was a service installation.
14) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 34681)
Posted 18 Aug 2008 by RichardRodd
Post:
Curioser and curioser...

So, no worries really: I uninstalled 6.2.18 and reinstalled 5.10.45 which picked up work-in-progress (neat, that) AND \'reinstated\' all comms working as before.

So summat changed between those two versions of BOINC as far as I can infer - but not on my system!!

Any ideas? \'Cos the day will come when I\'ll have to move off 5.10.45.

Richard
15) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 34680)
Posted 18 Aug 2008 by RichardRodd
Post:
Greetings again.

At the risk of yet more embarrassment...

It seemed I\'d avoided the problem I discussed here by simply killing CyberPatrol every time I chose manually to release trickles: inelegant, but it worked.

But now I\'ve upgraded to BOINC V6.2.18, ands I\'m back to square one - except this time with CyberPatrol definitely killed (as far as I can see...!) I\'m still getting all the comms error messages I was before.

Am I missing something? Is there a \'reset\' I can do?

I have followed all the seteps suggested above as before. Nothing else has changed in my setup, and with CyberPatrol killed, it seems I still can\'t establish comms with the BOINC server.

Your help once again much appreciated...

Richard
16) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 33306)
Posted 12 Apr 2008 by RichardRodd
Post:
Mike (et al), hi.

I tried the \'update\' just in case it did anything significantly - but I was already pretty sure it\'s just another way of kicking-off comms between client and server.

So I\'m still looking for a little more help here. This realtime CP monitor which I switched on *only* reports on webpages - so it \'sees\' me writing on this page, for instance. But trickles and the like, it *doesn\'t* see, but still intercepts.

Yesterday I mentioned the file sched_request_climateprediction.net.xml, and when I looked through it (with my almost layman\'s eyes) I spotted the three letters \'rpc\', which made me wonder about how the whole BOINC thing is architected - grid for sure, but maybe client-server using remote procedure calling? So then I start thinking that maybe we have a client-server app, with my client sending *out* requests (in which the CP app has no interest) and receiving stuff back from the server - an xml file which CP intercepts and corrupts, *even* though CP doesn\'t see it as web activity, it\'s still *interfering* (with the best of intentions!!).

This begins to scale up from my own personal situation to a wider question of whether there\'s a design issue relating to the co-hosting (if you will) of internet activity monitors (to genericise the CP app) with BOINC (or other grid-type client-server apps).

If I\'m right with this (i.e. BOINC being an RPC-based client-server app), I sort of think this needs to become a pretty technical discussion - which I might struggle to hold up at my end!

Again, all comments and feedback gratefully received. If anyone watching this thread *can* confirm the basic BOINC architecture, then at least I could go back to the CP folk and ask them why CP intercepts/corrupts inbound RPC calls. We might just have two incompatible technologies here - and just to have teased that out might be useful for the BOINC community as a whole.

Richard
17) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 33292)
Posted 11 Apr 2008 by RichardRodd
Post:
Hello again. Sorry to be away for a few days - work and stuff called. But now it\'s the weekend...!

Despite all the adverse comment about these internet guards :-) the CyberPatrol people seem genuinely keen to help me resolve this problemette. They asked me to run a utility that apparently monitors in realtime incoming and outgoing requests to, and traffic from, webpages. So I started the thing up, and asked BOINC to \'retry communications\', only to see this utility trap *nothing*! Okay, I hear all you real techies out there express no surprise at that!!

So please help me here to have at least one more dialogue with CP. Can you help me understand *to what* BOINC sends the file sched_request_climateprediction.net.xml? I get the impression that this file clearly seems to get through, because (it seems to me) that it\'s the *reply* (from wherever) that gets intercepted by CP. I\'m trying to understand *why* this CP monitor doesn\'t see the outgoing *or* incoming traffic.

Okay - all suggestion most gratefully received - as ever.

I\'ll get to all the coments about reinstalling BOINC in different forms when I run this thread to exhaustion.

Mike - I completely agree with your sentiments regarding workarounds, and public placing of PCs. Maybe those of us who grew up without all this PC stuff just have to let today\'s kids play with their generation\'s fire. I\'m sure in our turn we all had our own temptations... But, take it from me, it\'s distressing when one\'s kids seem drawn to the Dark Side. Maybe we\'ll have a rethink about placement within the family home -- and I do appreciate your comments.

Richard
18) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 33247)
Posted 7 Apr 2008 by RichardRodd
Post:
Argh...!!!

All I did was copy-and-paste - I never *read* the file!! So thanks for pointing out the obvious and mea culpa.

With a household of teenagers determined to plunder the darkest parts of the internet, I\'m afraid *this* protective parent has resorted to CyberPatrol. And running BOINC as a service means that the CP intercept (which normally displays on the screen) apparently gets caught in the sched_reply_climateprediction.net.xml file. Heavy-handed CP may be, as per the Register incident, but frankly better that than adult material on tap for my kids! But hey that\'s a personal thing and not for this forum...

Anyhow, if I can find a fix, we may want to capture the resolution for other similarly afflicted folk...?


Finally, if all else fails and I need to reinstall BOINC not as a service, how do I avoid losing the few hours work clocked-up on this m/c?

Thanks for all your help.
Richard
19) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 33242)
Posted 7 Apr 2008 by RichardRodd
Post:
Thanks for your help Iain - my technical stature plummets by the minute!

So here\'s the contents of the file sched_reply_climateprediction.net.xml --


<!-- saved from url=(0014)about:internet -->
<HTML xmlns:v=\"urn:schemas-microsoft-com:vml\">
<HEAD>
<TITLE>CyberPatrol Intercept</TITLE>
<META HTTP-EQUIV=\"Pragma\" CONTENT=\"no-cache\">
<META HTTP-EQUIV=\"Expires\" CONTENT=\"-1\">
<style>
v\\:* { behavior: url(#default#VML); }
</style>

<script language=\"javascript\" type=\"text/javascript\">
var g_bImageDisplayed = false;
var g_iTimer = 0;

function ImageLoadOK(objItem)
{
objItem.style.display = \"inline\";
g_bImageDisplayed = true;
}

function OnTimer()
{
if (g_iTimer != 0)
{
clearInterval(g_iTimer);
g_iTimer = 0;
}

if (g_bImageDisplayed == false)
{
var objVML = document.all.item(\"VmlImage\");
if (objVML != null)
{
objVML.style.display = \"inline\";
}
else
{
document.all.item(\"BackupText\").style.display = \"inline\";
}
}
}

function InsertImageCode(szFilename, iWidth, iHeight, szStrokeColour)
{
document.write(\'<div ID=\"StdImage\">\');
document.write(\'<IMG SRC=\"\' + szFilename + \'\" style=\"DISPLAY: none\" width=\"\' + iWidth + \'\" height=\"\' + iHeight + \'\" border=\"0\" onload=\"javascript:ImageLoadOK(this);\">\');
document.write(\'</div>\');
document.write(\'<!--[if gte IE 7]><DIV id=\"VmlImage\" style=\"DISPLAY: none\"><v:rect style=\"WIDTH: \' + iWidth + \'px; HEIGHT: \' + iHeight + \'px\" coordsize = \"21600,21600\" strokecolor = \"\' + szStrokeColour + \'\">\');
document.write(\'<v:imagedata src = \"\' + szFilename + \'\"></v:imagedata></v:rect>\');
document.write(\'</DIV><![endif]-->\');
document.write(\'<div ID=\"BackupText\" style=\"DISPLAY: none\">CyberPatrol</div>\');
}

g_iTimer = window.setInterval(\"OnTimer()\", 1000);

</script>
</HEAD>
<body bgcolor=\"#F2B12A\" text=\"#FFFFFF\" link=\"#770000\" vlink=\"#770000\" alink=\"#770000\">

<CENTER>
<table border=\"1\" bgcolor=\"#FFFFFF\" width=\"400\" height=\"140\" bordercolor=\"#808080\">
<tr>
<td width=\"100%\" align=\"center\"><font face=\"Verdana\" color=\"black\" size=\"6\">
<script language=\"javascript\" type=\"text/javascript\">
InsertImageCode(\"file:///C:/Applications/Other Applications/Internet Filtering/STYLES/shield/cp.gif\", 266, 129, \"white\");
</script>
</font></td>
</tr>
</table>
<h1><font face=\"Verdana\" color=\"#FFFFFF\">Access Restricted</font></h1>
<p><font face=\"Verdana\" size=\"4\"><b>User Profile: &lt;Default&gt;</b></font></p>
<p><font face=\"Verdana\" size=\"4\"><b>Reason</b>: An internal error has occurred - Failed to load one or more CyberLISTs. This could be due to a corrupted CyberLIST file, please re-run setup to fix this.</font></p>
<p><font face=\"Verdana\" size=\"4\"><b>Category</b>: None</font></p>
<p><font face=\"Verdana\" size=\"4\"><!--OVERRIDE--></font></p>
<p><font face=\"Verdana\" size=\"2\"><b>To change any of the filter settings please speak to your CyberPatrol Headquarters’ Administrator.</b></font></p>
</CENTER>
</body>
</HTML>



I hope that provides some insight!

Richard
20) Questions and Answers : Windows : Error Message: No start tag in scheduler reply (Message 33239)
Posted 7 Apr 2008 by RichardRodd
Post:
Hi.

I just realised I forgot to reply to Thyme\'s query about the contents of the file sched_reply_climateprediction.net.xml, which might be illuminating in its brevity...!


The XML page cannot be displayed
Cannot view XML input using XSL style sheet. Please correct the error and then click the Refresh button, or try again later.


--------------------------------------------------------------------------------

End tag \'div\' does not match the start tag \'IMG\'. Error processing resource \'file:///C:/Program Files/BOINC/sched_reply_cli...

document.write(\'</div>\');
--------------------^


So am I right in thinking that the scheduler request (whatever that is) goes off okay, but the reply gets scrambled coming back in?

And if so, why?

Am I best off just trying a clean re-install? And if I do that, how do I keep the work done on the two units currently running (but not trickling!)?

Okay - thanks in anticipation for your help.

Richard


Next 20

©2024 climateprediction.net