Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 272 · 273 · 274 · 275 · 276 · 277 · 278 . . . 316 · Next

AuthorMessage
hadron

Send message
Joined: 4 Sep 22
Posts: 69
Credit: 1,643,537
RAC: 3,558
Message 109087 - Posted: 5 Apr 2024, 5:44:51 UTC - in response to Message 109071.  

Yep, The Validator is borked,

For me, anything returned from 3 Apr 2024, 22:02:46 UTC fails, and a quick look at th top computers shows the same thing- everything going back at present fails Validation.

If someone could get the Projects attention?

First one here (of 52 in total) was at 4:28:28 UTC 3 April; another task reported at the same time did validate.
Then 16 tasks in a row were validated, the last at 12:54:34 UTC, after which everything was marked invalid. (There are no Rosetta tasks running on my system since 4 Apr 2024, 21:05:07 UTC at the latest.)

The only thing that appears common among them all is that they all took approximately 10800 to 10900 seconds to run.
ID: 109087 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1758
Credit: 18,534,891
RAC: 388
Message 109089 - Posted: 5 Apr 2024, 6:48:01 UTC - in response to Message 109084.  

These INVALID results are a problem with the Rosetta BETA binary.
No, they are a result of the system outage, which when resolved came back with the Validator not working (even though the Server Staus page shows it as running).

I would expect the vast majority of all the Invalid Tasks are actually Valid. They just need to fix the Validator, then re-Validate all the Invalid Tasks.
That way every one will get Credit for the work they have done, except for those few Tasks that are actually Invalid.
Grant
Darwin NT
ID: 109089 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MJH333

Send message
Joined: 29 Jan 21
Posts: 18
Credit: 6,830,521
RAC: 3,720
Message 109090 - Posted: 5 Apr 2024, 10:03:43 UTC - in response to Message 109086.  

Looking like they're still fed up with me... no response & no change I can notice
Thank you for trying.

The Server Status info on the Rosetta home page has not updated since 4 Apr 2024, 11:03:04 UTC. I hope this will start updating again once the validator is fixed.
ID: 109090 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MJH333

Send message
Joined: 29 Jan 21
Posts: 18
Credit: 6,830,521
RAC: 3,720
Message 109091 - Posted: 5 Apr 2024, 10:03:47 UTC - in response to Message 109086.  
Last modified: 5 Apr 2024, 10:05:15 UTC

[Duplicate post deleted]
ID: 109091 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2198
Credit: 41,933,740
RAC: 17,353
Message 109093 - Posted: 6 Apr 2024, 5:49:28 UTC - in response to Message 109090.  

Looking like they're still fed up with me... no response & no change I can notice
Thank you for trying.

The Server Status info on the Rosetta home page has not updated since 4 Apr 2024, 11:03:04 UTC. I hope this will start updating again once the validator is fixed.

On the home page, yes. Somehow the Server Status page itself is being refreshed.
I've been scouting around for another appropriate email address and in the process discovered that the one I've been using has been removed - but I'm not getting any bounces back - so maybe I'm not getting ignored at all but my mails are going to dev/nul or something.
When I get home on Sunday I'm going to try a different more generic email address and see if I can get a message to whatever IT people they use rather than go via researchers.
ID: 109093 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MStenholm

Send message
Joined: 18 Apr 20
Posts: 19
Credit: 27,951,567
RAC: 58,470
Message 109094 - Posted: 6 Apr 2024, 5:52:57 UTC

I just had my first job validated when it was returned
Received 6 Apr 2024, 5:24:13 UTC
Server state Over
Outcome Success

Older returned jobs are still invalid
ID: 109094 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2198
Credit: 41,933,740
RAC: 17,353
Message 109095 - Posted: 6 Apr 2024, 7:42:14 UTC - in response to Message 109094.  

I just had my first job validated when it was returned
Received 6 Apr 2024, 5:24:13 UTC
Server state Over
Outcome Success

Older returned jobs are still invalid

You have indeed.
I was just coming back to correct myself that the date/time of the server info on the home page had finally been updated and your information indicates that was just a symptom of a wider correction.
As you also say, hopefully their next job is to revalidate our older tasks from April 3rd ~22:00 UTC to April 6th ~05:20
ID: 109095 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1758
Credit: 18,534,891
RAC: 388
Message 109096 - Posted: 6 Apr 2024, 8:28:07 UTC - in response to Message 109095.  

As you also say, hopefully their next job is to revalidate our older tasks from April 3rd ~22:00 UTC to April 6th ~05:20
*Fingers crossed*
Grant
Darwin NT
ID: 109096 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1758
Credit: 18,534,891
RAC: 388
Message 109097 - Posted: 7 Apr 2024, 6:34:08 UTC

Picked up a couple of resends, and they Validated OK.
Just need the Invalids re-validated & all will be good.
Grant
Darwin NT
ID: 109097 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2198
Credit: 41,933,740
RAC: 17,353
Message 109100 - Posted: 8 Apr 2024, 20:30:51 UTC - in response to Message 109068.  

And we're back...

Looks like the whole website went down for about 10hours today.
Couldn't even get to the Rosetta home page let alone upload results.
Everything going through fine now

And again, we're back.
Another 2-3hr outage of the entire website.

I did send another email - and mentioned how Validation didn't come back last time, so to double-check that.
And snuck in a request for revalidation of tasks from April 4-6 that all errored out, just in case they're in a good mood (I didn't mention it last time)
ID: 109100 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2030
Credit: 10,082,016
RAC: 12,014
Message 109101 - Posted: 9 Apr 2024, 15:59:06 UTC - in response to Message 109097.  

Just need the Invalids re-validated & all will be good.


No optimism here
ID: 109101 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2198
Credit: 41,933,740
RAC: 17,353
Message 109102 - Posted: 10 Apr 2024, 11:19:41 UTC - in response to Message 109101.  

Just need the Invalids re-validated & all will be good.

No optimism here

Nor here, if not done by now
ID: 109102 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2198
Credit: 41,933,740
RAC: 17,353
Message 109103 - Posted: 12 Apr 2024, 2:25:21 UTC - in response to Message 109100.  

And we're back...

Looks like the whole website went down for about 10hours today.
Couldn't even get to the Rosetta home page let alone upload results.
Everything going through fine now

And again, we're back.
Another 2-3hr outage of the entire website.

I did send another email - and mentioned how Validation didn't come back last time, so to double-check that.
And snuck in a request for revalidation of tasks from April 4-6 that all errored out, just in case they're in a good mood (I didn't mention it last time)

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently
ID: 109103 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2030
Credit: 10,082,016
RAC: 12,014
Message 109104 - Posted: 12 Apr 2024, 8:03:42 UTC - in response to Message 109103.  

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently


yes. Also Ralph, that is on other hw, went down for some hrs

And no work....
ID: 109104 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rilian
Avatar

Send message
Joined: 16 Jun 07
Posts: 28
Credit: 3,348,030
RAC: 14,098
Message 109105 - Posted: 12 Apr 2024, 16:03:45 UTC

I see there are about 7000 tasks in progress, and one of my computers got one resend
i crunch for Ukraine. Join our team forums about Rosetta@home
ID: 109105 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 202
Credit: 6,885,058
RAC: 10,922
Message 109106 - Posted: 12 Apr 2024, 20:17:08 UTC - in response to Message 109103.  
Last modified: 12 Apr 2024, 20:17:44 UTC

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently


Yes, but it is up right now and I just got a bunch of tasks -- Rosetta Beta 6.05.
ID: 109106 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 276
Credit: 523,512
RAC: 610
Message 109107 - Posted: 12 Apr 2024, 20:45:48 UTC

Graphics do not work with linux version of rosetta beta.

When i start graphical app it immediately closes and outputs this in stderrgfx.txt:

cat /var/lib/boinc/slots/6/stderrgfx.txt

ERROR: Unable to open file: /var/lib/boinc/projects/boinc.bakerlab.org_rosetta/../database/chemical/residue_type_sets/fa_standard/residue_types.txt

ERROR:: Exit from: src/core/chemical/GlobalResidueTypeSet.cc line: 145
23:25:39 (68987): called boinc_finish(0)


It should look for database at /var/lib/boinc/projects/boinc.bakerlab.org_rosetta/database_0f7f01a1b07/database , not /var/lib/boinc/projects/database
ID: 109107 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2198
Credit: 41,933,740
RAC: 17,353
Message 109108 - Posted: 13 Apr 2024, 0:08:39 UTC - in response to Message 109106.  

Anyone else notice the entire website went down again today?
For at least 6hrs when I was wondering why the backoff was up to several hrs
Not sure what's been going on recently

Yes, but it is up right now and I just got a bunch of tasks -- Rosetta Beta 6.05.

Looks like another million tasks got released.
I'm not going to say the site is running well recently, in a variety of ways, but what seems like a regular million tasks each week with a few blank days in between is the best we've had for a very long time.
If they can keep this going I won't be too unhappy, however much better it's been in the distant past.
ID: 109108 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 655
Credit: 11,874,794
RAC: 2,211
Message 109116 - Posted: 16 Apr 2024, 6:01:20 UTC

I've set no new tasks again. The current jobs have 8 hours as their runtime, but here, they are running for three times that, (4GHz i7), which is pushing my system into panic mode.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 109116 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1758
Credit: 18,534,891
RAC: 388
Message 109117 - Posted: 16 Apr 2024, 7:09:30 UTC - in response to Message 109116.  
Last modified: 16 Apr 2024, 7:18:04 UTC

I've set no new tasks again. The current jobs have 8 hours as their runtime, but here, they are running for three times that, (4GHz i7), which is pushing my system into panic mode.
And the same issue is happening with your other projects.
Asteroids- 2hrs Runtime,1hr CPU time.
SIdock- 31.5hrs Runtime, 27hrs 40min CPU time.
Denis- 3hr 40min Runtime, 1hr CPU time.
Got to love Denis, almost 4 times as much time spent to do a given amount of work. Even worse than your Seti times.


And you have been told repeatedly how to resolve the issue, yet you continue to ignore that advice.
So why bother even posting about it?
Grant
Darwin NT
ID: 109117 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 272 · 273 · 274 · 275 · 276 · 277 · 278 . . . 316 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2025 University of Washington
https://www.bakerlab.org