Message boards : Number crunching : Purge function
Author | Message |
---|---|
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
I posted a request for a purge function over on the BOINC forum - I've requested it before and I think I recall FluffyChicken doing the same at some point, possibly on Rom Walton's blog... Anyway, Nicholas has posted back there saying it's already implemented in BOINC! I think this would be a really useful addition to R@H for two reasons: 1. Reduced processing of expired tasks 2. Ability to purge jobs that are causing problems after their release There must be thousands of computers out there running tasks that have expired. I know some of these results will still be of use, but I'm sure in most of the cases the cycles could be put to better use. There are lots of computers out there that are off for a while, then used heavily for a while, and off again - for example students who go through cycles of doing assignments. My mum went away for a week recently and when the computer was switched back on on her return, it would have run expired tasks for two days if I didn't mail her and ask her to remove the expired jobs... So to summarise, I think it'd boost the useful throughput of the project, and give a very useful tool to reduce the chances of fatal tasks being crunched. Any comments/thoughts? cheers Danny Edit: Here's a good example - my friend is on holiday and when he gets back his computer is going to run all of these expired jobs before it gets on to some useful work! |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
Thanks to Rattledagger on the BOINC forum: This is handled by 2 flags: I think it would be very wise to implement this before a batch of bad tasks is released rather than waiting until it happens, as it's almost inevitable that it will happen at some point. |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
Quote from Sekerob: WCG implemented it and created some of the server side code. Recently it send instruction to BOINC clients for a bad batch. It only works on >= 5.8 clients. Think D@H adopted their code. Theoretically servers could send an instruction to clients where work was deemed 'Too Late' or 'No Reply', but thing is, the client has to initiate a server contact. Not heard if the abort feature is being employed to do those. Helps efficiency and lessens frustration if inadvertently one gets completed and zero credit is awarded. It looks like the majority of the code is already available. Any comments?... |
FluffyChicken Send message Joined: 1 Nov 05 Posts: 1260 Credit: 369,635 RAC: 0 |
Quote from Sekerob: bumpalicious Team mauisun.org |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
bump! There's a 2.4GHz C2D here that's going to be crunching nothing but expired WUs! Surely adding this function would be equivalent to gaining a few thousand computers??? |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
the cancelled gp04 jobs are another reason for the purge function! There are probably still thousands of those jobs in the wild, some of which will cause some people problems! |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
1.16GB VM for a stalled rosetta thread! that's a record for me. I've lost 11hrs crunching on this machine and 22hrs on another, and now i've lost a remote cruncher permanently because one of these jobs was hogging memory! If the purge function was activated then these jobs could have been removed from people's queues before they caused problems. It's inevitable that a batch jobs are going to be released that cause major problems and being able to purge these after they've been downloaded to the clients could make a huge difference. On top of that, it'll give the project a big boost in processing power - far fewer expired jobs would be run. Can someone pls have a look at implementing this, or just comment on it - the code is already in BOINC! Danny |
Matt3223 Send message Joined: 15 Dec 05 Posts: 10 Credit: 58,569 RAC: 0 |
Sounds to be something worth at least looking into! I'll watch this thread to see when a reply shows up [edit] seems the purge idea has been in requested as far back as 2005!....apparently the priority isn't all that high... I remember using FADs purge function quite a bit........definitly made me feel like I wasn't wasting time crunching unnecessary units..[/edit] |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
I remember using FADs purge function quite a bit........definitly made me feel like I wasn't wasting time crunching unnecessary units..[/edit] yeah - imo it was one of the best additions they made at FaD ;) It's inevitable that it's gonna be needed at some point or the project risks losing lots of computers to either annoyed users or stuck jobs. (It's not a threat - I won't leave - just a concern!) :D |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
|
Message boards :
Number crunching :
Purge function
©2024 University of Washington
https://www.bakerlab.org