Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 284 · 285 · 286 · 287 · 288 · 289 · 290 . . . 315 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1751
Credit: 18,534,891
RAC: 1,045
Message 109509 - Posted: 1 Aug 2024, 22:22:00 UTC - in response to Message 109506.  
Last modified: 1 Aug 2024, 22:30:13 UTC

Haven't received any work units for two or three weeks now , what's up with that?
18/07 was the last day of work being available. Late on the 23/07 several servers crashed/went MIA & they remain that way to this day (Although i have notice the amount of spam posts here is picking up.).
This is after one particular server was crashing pretty much weekly for hours (sometimes days) at a time before being brought back up.

There's been no signs of life at Ralph since 10/07. Still waiting on an updated application & data to test.
Grant
Darwin NT
ID: 109509 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CLIFF HANGER

Send message
Joined: 8 May 17
Posts: 2
Credit: 7,876,583
RAC: 9,342
Message 109510 - Posted: 1 Aug 2024, 23:52:40 UTC - in response to Message 109509.  

Thanks for the info....
ID: 109510 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hadron

Send message
Joined: 4 Sep 22
Posts: 69
Credit: 1,599,624
RAC: 1,050
Message 109511 - Posted: 2 Aug 2024, 11:19:17 UTC - in response to Message 109506.  

Haven't received any work units for two or three weeks now , what's up with that?

You haven't found the Boinc log file, or what?
If you would read that file, you would find this:

Fri 02 Aug 2024 04:53:12 AM | Rosetta@home | Scheduler request completed: got 0 new tasks
Fri 02 Aug 2024 04:53:12 AM | Rosetta@home | Project is temporarily shut down for maintenance

It will be back when it's back.
ID: 109511 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 109515 - Posted: 2 Aug 2024, 18:46:35 UTC - in response to Message 109509.  

I think Ralph is DOA.
As far as spam goes, there is no moderator to speak of here.
When was the last time any one (Dr. B or a staff member) or a staff member posted anything here regarding the project? Seems to me they just put this on the back burner and if there is something that they don't want to run on their fancy system, they send it here.

Server page hasn't changed in 48 hrs:

Upload server boinc.bakerlab.org Running
Scheduler bwsrv1 Not Running

Status of remote daemons is missing

Tasks ready to send 0 <-- has been for weeks
Tasks in progress 163 <-- hasn't changed in 48 hours
Workunits waiting for validation 0
Workunits waiting for assimilation 0
Workunits waiting for file deletion 2
Tasks waiting for file deletion 1
Transitioner backlog (hours) 239.27 <--- What is this device?
ID: 109515 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 109516 - Posted: 2 Aug 2024, 18:50:56 UTC

From Baker Lab Robetta page (this feed Rosetta): June 28, 2024 - We are currently having issues with high memory RoseTTAFold jobs (aa >= 700) and instability in the CM sequence alignment pipeline and are looking into possible causes. Sorry for any inconvenience.
ID: 109516 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 109517 - Posted: 2 Aug 2024, 19:09:14 UTC - in response to Message 109510.  

Thanks for the info....


Forgot about World Community Grid. They went through a rough patch after they moved, where they had massive technical problems and so on. That's probably fixed now. But as fast as each project having work, I think its a bit like here, hit and miss on big batches of work.
I just rejoined. Mapping Cancer Markers project is active.

They had COVID research back then, but not sure if that is still something they do or just haven't bothered removing it. Each "project" is its own thing. WCG is just the host.

It's worth signing on to for now. If they run out of work then the other projects I mentioned will keep your machine busy.

If your into Astrophysics join up with Einstein to comb through data for gravitational waves. They never run out of work. LHC is particle physics. I run ATLAS (high credit) and Theory tasks. ATLAS will take 4 cores for its work.

If you want to make your system work for its power, then you sign up with Moo!Wrapper. That takes 2 CPU and 2 GPU for one task.

All these different projects will keep your system busy while you wait for stuff to show up here.
ID: 109517 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 276
Credit: 513,050
RAC: 161
Message 109518 - Posted: 2 Aug 2024, 19:12:52 UTC - in response to Message 109517.  

It can run even 8 cores if you install CVMFS and runc on your linux machine and enable native tasks in settings.
Unfortunately documentation is all over the place.
ID: 109518 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1751
Credit: 18,534,891
RAC: 1,045
Message 109519 - Posted: 2 Aug 2024, 21:54:20 UTC - in response to Message 109515.  

Server page hasn't changed in 48 hrs:
Try a week and a half going on 2 weeks.


Transitioner backlog (hours) 239.27 <--- What is this device?
It cleans things up.
Once completed Tasks are returned and Validated, the transitioner moves the Validated results from the master database to the main science database.
Grant
Darwin NT
ID: 109519 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1751
Credit: 18,534,891
RAC: 1,045
Message 109520 - Posted: 2 Aug 2024, 21:59:45 UTC - in response to Message 109516.  

From Baker Lab Robetta page (this feed Rosetta): June 28, 2024 - We are currently having issues with high memory RoseTTAFold jobs (aa >= 700) and instability in the CM sequence alignment pipeline and are looking into possible causes. Sorry for any inconvenience.
That was back in June & there were 2 large releases of work after that date.
And it was about a week after the second of those releases of work that everything went down in a screaming heap.
Grant
Darwin NT
ID: 109520 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 201
Credit: 6,765,644
RAC: 7,452
Message 109521 - Posted: 3 Aug 2024, 3:39:40 UTC - in response to Message 109502.  

And I see one of our favourites, Universe@Home has also been down for a long time.

I dropped MilkyWay and Universe. I forget which was which. Both of them awarded way too much credit for the small amount of work done. And that was embarrassing. But that is not why I dropped them.

One of them does only GPU work now, and I refuse to run Boinc on my GPU. I think the sponsor of the other died, or something like that, and they no longer send out work.
ID: 109521 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bill Swisher

Send message
Joined: 10 Jun 13
Posts: 45
Credit: 35,820,158
RAC: 35,038
Message 109522 - Posted: 3 Aug 2024, 3:55:08 UTC - in response to Message 109517.  

If it wasn't for WCG I'd just turn the computers off. The folks at DENIS@Home have gone on vacation. I do have one really, really old laptop running Einstein simply because the computer won't die and it doesn't have enough horsepower to run anything else.
ID: 109522 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Bill F
Avatar

Send message
Joined: 29 Jan 08
Posts: 49
Credit: 1,656,004
RAC: 1,685
Message 109524 - Posted: 4 Aug 2024, 4:42:16 UTC - in response to Message 109521.  

Clarifications on Milkyway and Universe

Milkyway ended one application that did use either CPU's or GPU's and now the remaining application only runs CPU work

Universe is on pause while the University and the Family of the principle Lead on the project work out a path forward.

Both are worthwhile projects and are valid science efforts. Hopefully the Universe project will find a way to move forward soon.

Bill F

Milkyway since July 2009
Universe since June 2016
In October 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic;
There was no expiration date.

ID: 109524 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2025
Credit: 9,943,884
RAC: 7,659
Message 109526 - Posted: 5 Aug 2024, 9:43:08 UTC - in response to Message 109520.  

And it was about a week after the second of those releases of work that everything went down in a screaming heap.


Maybe they are busy with RosettaCON2024
ID: 109526 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 109527 - Posted: 5 Aug 2024, 16:21:09 UTC - in response to Message 109519.  

Server page hasn't changed in 48 hrs:
Try a week and a half going on 2 weeks.


Transitioner backlog (hours) 239.27 <--- What is this device?
It cleans things up.
Once completed Tasks are returned and Validated, the transitioner moves the Validated results from the master database to the main science database.



And it hasn't moved 239 hours and now 309 hours of work? Man that is slow.
And a 163 in progress? I saw that when I posted my comments and its still the same.
But on the other hand, everything is offline. So who knows whats going on.

It's almost like a shutdown of the project.
ID: 109527 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 109528 - Posted: 5 Aug 2024, 16:28:20 UTC - in response to Message 109524.  

Clarifications on Milkyway and Universe

Milkyway ended one application that did use either CPU's or GPU's and now the remaining application only runs CPU work

Universe is on pause while the University and the Family of the principle Lead on the project work out a path forward.

Both are worthwhile projects and are valid science efforts. Hopefully the Universe project will find a way to move forward soon.

Bill F

Milkyway since July 2009
Universe since June 2016


I've been here on ( when there was updates and mods) and off (after they migrated to inhouse) since May 2006
LHC is my next oldest since December 2008
Milkway since August 2009
Universe since November 2017 until they stopped.
ID: 109528 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Bill F
Avatar

Send message
Joined: 29 Jan 08
Posts: 49
Credit: 1,656,004
RAC: 1,685
Message 109530 - Posted: 6 Aug 2024, 0:03:53 UTC

While we are waiting for new Rosetta tasks another Science based project that supports Windows, Linux and Mac ... CPU's and / or GPU's is Asteroids@home

I have been running the project since January of 2013.

Respectfully
Bill F
In October 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic;
There was no expiration date.

ID: 109530 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1751
Credit: 18,534,891
RAC: 1,045
Message 109531 - Posted: 6 Aug 2024, 8:22:24 UTC - in response to Message 109527.  

And it hasn't moved 239 hours and now 309 hours of work? Man that is slow.
And a 163 in progress? I saw that when I posted my comments and its still the same.
But on the other hand, everything is offline. So who knows whats going on.

It's almost like a shutdown of the project.
As i posted previously, the servers have crashed, and nothing has been done about it.
For 2 weeks, as of today.
Grant
Darwin NT
ID: 109531 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tigers_Dave
Avatar

Send message
Joined: 9 Dec 05
Posts: 6
Credit: 111,799,236
RAC: 44,952
Message 109532 - Posted: 6 Aug 2024, 13:08:00 UTC - in response to Message 109530.  

While we are waiting for new Rosetta tasks another Science based project that supports Windows, Linux and Mac ... CPU's and / or GPU's is Asteroids@home

I have been running the project since January of 2013.

Respectfully
Bill F


Bill, thanks for sharing! Asteroids does not support GPUs on Intel-based Macs. On the other hand, Amicable Numbers <-https://sech.me/boinc/Amicable/> supports CPUs, NVIDIA GPUs, AMD GPUs, and Intel GPUs on Windows, Linux, and Intel-based Macs.

Tigers_Dave
ID: 109532 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 109533 - Posted: 6 Aug 2024, 20:55:22 UTC - in response to Message 109531.  

And it hasn't moved 239 hours and now 309 hours of work? Man that is slow.
And a 163 in progress? I saw that when I posted my comments and its still the same.
But on the other hand, everything is offline. So who knows whats going on.

It's almost like a shutdown of the project.
As i posted previously, the servers have crashed, and nothing has been done about it.
For 2 weeks, as of today.


Summer break and they will deal with it later if at all.
Crashed how? Got caught in the windows death spiral or something else?
ID: 109533 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 276
Credit: 513,050
RAC: 161
Message 109535 - Posted: 7 Aug 2024, 1:52:41 UTC

It is ubuntu so Kernel Panic spiral.
ID: 109535 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 284 · 285 · 286 · 287 · 288 · 289 · 290 . . . 315 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2025 University of Washington
https://www.bakerlab.org