Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Author | Message |
---|---|
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
Ooh, 360k tasks. We live to fight another day (or two). While I know most people will have finished up their outstanding tasks already, I managed to sneak in 4 extra returned tasks today, and now discover that the validators running under boinc-process are down again. Better now than at other times, I guess |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
That boinc-process server has developed a habit of regularly falling over; it was well past due for another crash. Grant Darwin NT |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"Ooh, 360k tasks. We live to fight another day (or two)" Or maybe not better now as 660k tasks newly available |
[VENETO] boboviz Joined: 1 Dec 05 Posts: 2025 Credit: 9,943,884 RAC: 6,777 |
"Or maybe not better now as 660k tasks newly available" 0 WUs and a lot of daemons are down... |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"Or maybe not better now as 660k tasks newly available" Yup. I would've expected 660k to last at least 2 days, but I'm not sure it lasted much more than 15hrs, unless tasks got pulled. Front page figures are borked, on top of the boinc-process server being borked. Edit: Actually, I'm now thinking tasks did get pulled. Unvalidated tasks were about 20k before the new batch arrived, now 160k. In-progress tasks were about 30k, now 112k. That implies 222k tasks were grabbed. But the front page is locked at 7am with 660k queued, so about 440k have gone missing, presumed pulled |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"Or maybe not better now as 660k tasks newly available" Still the same - now nudged. Edit while posting: site went down, back 5mins later, no apparent change yet but might be shortly (fingers crossed) |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
boinc-process server still dead, front page Server Status numbers still not updated (Last update, 07:04 UTC, yesterday). Grant Darwin NT |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"boinc-process server still dead, front page Server Status numbers still not updated (Last update, 07:04 UTC, yesterday)." Add it to the very long list of things I'm completely wrong about... <sigh> I've asked. We wait. |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
Just heard the fans in my system wind up. Checked BOINC and, lo and behold, Rosetta has work again. Now if they could just get that boinc-process server that's been dead for a while now up and running again, then all would be good. Grant Darwin NT |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"Just heard the fans in my system wind up." Both you and this PC were ahead of me. The rest, still just as you say. In a way, knowing if there are tasks or not, and whether they give credit or not, or how long they'll last, isn't massively different |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
Server Status on the front page is yet to update, but all the servers on the Server Status page are now green and work is still flowing. Grant Darwin NT |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
And now everything is finally back. Currently: As of 22 Jun 2024, 11:02:26 UTC [ Scheduler running ]: Total queued jobs: 1,336,930; In progress: 153,424; Successes last 24h: 91,239. And: Tasks ready to send: 4785; Tasks in progress: 153988; Workunits waiting for validation: 0; Workunits waiting for assimilation: 0 |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
"And now everything is finally back. Currently" At last! And plenty of work as well. Now things just need to stop falling over in the first place. Grant Darwin NT |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"Now things just need to stop falling over in the first place." Yes, but I'd also remind everyone of my view that Rosetta Beta 6.04 tasks wrongly default to 3hrs CPU runtime, while Rosetta v4.20 tasks rightly default to 8hrs. The more people make this change, the better for everyone, whether that boinc-process server goes down or not |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
boinc-process server is dead again, Validation backlog continues to grow. Grant Darwin NT |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
"boinc-process server is dead again, Validation backlog continues to grow." And it's back again. Grant Darwin NT |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"boinc-process server is dead again, Validation backlog continues to grow. And it's back again." This is getting like my home-life... "I've lost my xyz" "You could at least help to look" "Oh, there it is" Me: "What was that you said?" If I play dumb long enough before paying any attention, most things right themselves on their own. Edit: I just reached 40,000,000 on Rosetta. Edit2: And 100,000,000 for my team across all projects |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"Now things just need to stop falling over in the first place." Queued jobs down to 153k 3hrs ago, so another shout out for this. I'm estimating we only have another 12-13hrs of tasks unless more get queued up. |
Sid Celery Joined: 11 Feb 08 Posts: 2183 Credit: 41,726,991 RAC: 6,784 |
"Queued jobs down to 153k 3hrs ago, so another shout out for this." I think we had a few extra Rosetta 4.20 tasks, but not many, and we're out now anyway. Fingers crossed for another batch |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1751 Credit: 18,534,891 RAC: 857 |
boinc-process server has died, again. Grant Darwin NT |
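
For anyone wanting to re-run the task accounting from Sid Celery's "tasks did get pulled" post (unvalidated about 20k before the batch and 160k after, in progress about 30k before and 112k after, front page frozen at 660k queued), here is a minimal Python sketch. The function name and parameters are made up for illustration, the inputs are the rough figures quoted in the thread (in thousands), and it assumes every task that left the queue shows up as either unvalidated or in progress. It only checks the arithmetic, not whether any tasks were actually pulled.

```python
def account_for_tasks(unvalidated_before, unvalidated_now,
                      in_progress_before, in_progress_now,
                      queued_on_front_page):
    """Return (grabbed, unaccounted), in the same units as the inputs.

    Hypothetical helper: assumes every task that left the queue is now
    counted as either unvalidated or in progress.
    """
    # Growth in each visible bucket since the new batch arrived.
    grabbed = ((unvalidated_now - unvalidated_before)
               + (in_progress_now - in_progress_before))
    # Whatever the frozen front-page figure claims beyond that is unexplained.
    unaccounted = queued_on_front_page - grabbed
    return grabbed, unaccounted


# Approximate figures from the thread, in thousands of tasks.
grabbed, unaccounted = account_for_tasks(20, 160, 30, 112, 660)
print(f"grabbed: {grabbed}k, unaccounted: {unaccounted}k")
```

With the thread's figures this gives 222k grabbed and 438k unaccounted for, which matches the "222k" and roughly "440k" quoted in the post.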
©2025 University of Washington
https://www.bakerlab.org