Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Previous · 1 . . . 282 · 283 · 284 · 285 · 286 · 287 · 288 . . . 315 · Next
Author | Message |
---|---|
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
I'm just in the final stages of clearing down all the excess WCG tasks Boinc brought down from the previous Rosetta outage and we're out of Rosetta tasks again. A relatively small number of tasks available - showing 230k on the front page 3hrs ago. Hopefully part of more, but may not be. It's something |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2025 Credit: 9,943,884 RAC: 6,777 |
A relatively small number of tasks available - showing 230k on the front page 3hrs ago. Hopefully part of more, but may not be. Snd seems a new kind of simulations: "testmpnn_hallucinated" and "testmpnn_diffusion" |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
A relatively small number of tasks available - showing 230k on the front page 3hrs ago. Hopefully part of more, but may not be. Yup - wonder what that's all about. A few more tasks becoming available too - still not a great amount. Showing 475k an hour ago on the front page. Every little bit helps |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2025 Credit: 9,943,884 RAC: 6,777 |
Snd seems a new kind of simulations: "testmpnn_hallucinated" and "testmpnn_diffusion" Maybe related to "message-passing neural networks" (mpnn), like this |
kotenok2000 Send message Joined: 22 Feb 11 Posts: 276 Credit: 513,050 RAC: 161 |
Graphics work with these tasks. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2025 Credit: 9,943,884 RAC: 6,777 |
I think it went to almost 500k, but I took a look at 20:35 UK time just as parts of boinc-process came back online and after a refresh it was all back Now the server are green, but there are over 18k wu pending validation. Increasing. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2025 Credit: 9,943,884 RAC: 6,777 |
Graphics work with these tasks. And also wus seems ok, no errors despite the name "test".... |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
Snd seems a new kind of simulations: "testmpnn_hallucinated" and "testmpnn_diffusion" Very likely. Thanks for the link - looks like good work. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
I think it went to almost 500k, but I took a look at 20:35 UK time just as parts of boinc-process came back online and after a refresh it was all back Now pink - boinc-process is down again and 56k awaiting validation. And not too many tasks left to come down either. We continue to be very hand-to-mouth atm |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
Some more work would be nice. It's been freezing the last few mornings here, and the system has been keeping the lounge room almost comfortable. Buit now it's out of work, and tomorrow morning if more work doesn't come along, it'll be almost as cold inside as it is outside (or an upgraded version over at Ralph & some new work there would be nice- either this or that, or even both would be nice). Grant Darwin NT |
G.L.I.S. Send message Joined: 25 Dec 08 Posts: 26 Credit: 2,450,252 RAC: 2,483 |
Still... 'completed awaiting validation'... More credits gone, along with electricity and time? |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
Some more work would be nice. Your post made me look for the first time exactly where Darwin is and, after checking on a weather site, discover your winter is still 2-4C higher than this English summer. My sympathies are therefore quite limited, as well as thinking it's a rather inefficient way to heat the house. While I understand and accept your reasoning for keeping a tight cache, I can only repeat my advice to change from setting a default runtime at Rosetta, which turns out to be only 3hrs, to making it explicitly 8hrs to match what Boinc thinks it is (at the point of download anyway). Not only would you get an extra 5hrs work, you would reduce your churn through tasks by almost two-thirds, marginally extending how long each batch of tasks will last, which is valuable when we see each batch run out before further tasks become available. To emphasise the difference between me and you, I keep a 0.5 plus 0.1 cache and set a 12hr runtime. So when I have 4-5hrs of tasks remaining, I already have 16 tasks (8C16T) cued up and another 16 can come down, which works out at 28-29hrs of work when Rosetta runs out. At an 8hr runtime, this would still be 20-21hrs. As compared to your maximum of 3hrs work while trying to gobble up tasks only at the last minute. The difference is huge for one host and, the more people who make the runtime change I suggest, the longer batches of tasks would last and the shorterfewer periods without any on the whole site. This is why I keep repeating myself. Everyone should do both yourselves and everyone else a favour imo, |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
Still... 'completed awaiting validation'... All credits do get caught up once the server is restarted. No time or energy lost. Just a hiccup in when they get awarded which might take a day or two at most (but might also be just a few hours) |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2025 Credit: 9,943,884 RAC: 6,777 |
Just a hiccup in when they get awarded which might take a day or two at most (but might also be just a few hours) Well, not so few hours... |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
Just a hiccup in when they get awarded which might take a day or two at most (but might also be just a few hours) I looked earlier today. I think it came back about 10hrs after your post, so between 1 & 2 days. I think I saw the whole site go down (again) a few hours before too. Everything seems so fragile. No new tasks yet, but I've picked up a few resends through the day |
Ace Casino Send message Joined: 16 Jul 07 Posts: 18 Credit: 14,827,983 RAC: 17,320 |
Try putting another shrimp on the barbie. |
Tom M Send message Joined: 20 Jun 17 Posts: 98 Credit: 17,749,090 RAC: 42,645 |
Try putting another shrimp on the barbie. And wined the Ken up? Help, my tagline is missing..... Help, my tagline is......... Help, m........ Hel..... |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
I wonder if they've got data centre issues? Server Status page shows, well, next to nothing (although Ralph still shows everything's ok). Grant Darwin NT |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 2025 Credit: 9,943,884 RAC: 6,777 |
I wonder if they've got data centre issues? Well, it's summer The servers went on vacation |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
'Project down for maintenance' messages being issued for over 24hrs While all tasks are pretty much completed this sounds like the best time Servers being randomly up and down over a considerable period, it does need a thorough going over Let's hope they find and resolve everything... ...and have a whole bunch of tasks waiting for us on completion I can dream |
Message boards :
Number crunching :
Problems and Technical Issues with Rosetta@home
©2025 University of Washington
https://www.bakerlab.org