Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 290 · 291 · 292 · 293 · 294 · 295 · 296 . . . 313 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1743
Credit: 18,534,891
RAC: 3,108
Message 109735 - Posted: 17 Sep 2024, 5:05:18 UTC
Last modified: 17 Sep 2024, 5:06:14 UTC

Anyone getting errors with these Tasks, within a minute or so, with this in the Stderr output

<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.06_windows_x86_64.exe @srmpnn12_10_hallucinated_127_36_dldesign_0_cycle0.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3384584
Extracting in slot directory: minirosetta_database.zip
Using database: minirosetta_database
Cannot find database: minirosetta_database

</stderr_txt>
]]>


Try resetting the Project.
Once again, there is an issue with where things are, and where your existing installation actually has them (or not).
One of my systems started processing with no problems, the other producing just errors until resetting the project sorted it out.
Grant
Darwin NT
ID: 109735 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2166
Credit: 41,629,484
RAC: 5,494
Message 109736 - Posted: 17 Sep 2024, 16:26:37 UTC - in response to Message 109735.  

Anyone getting errors with these Tasks, within a minute or so, with this in the Stderr output

<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.06_windows_x86_64.exe @srmpnn12_10_hallucinated_127_36_dldesign_0_cycle0.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3384584
Extracting in slot directory: minirosetta_database.zip
Using database: minirosetta_database
Cannot find database: minirosetta_database

</stderr_txt>
]]>


Try resetting the Project.
Once again, there is an issue with where things are, and where your existing installation actually has them (or not).
One of my systems started processing with no problems, the other producing just errors until resetting the project sorted it out.

No. And I think the fact that one of your systems works fine and the other doesn't backs that up.
Why it should be happening with one and not the other, I have no idea.
ID: 109736 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1743
Credit: 18,534,891
RAC: 3,108
Message 109739 - Posted: 17 Sep 2024, 22:33:36 UTC - in response to Message 109736.  

Why it should be happening with one and not the other, I have no idea.
Neither do i, but it has been a recurring problem over at Ralph (when it's working, which it isn't again) and when it has work.
Several times it's been necessary to reset the project to stop errors occurring because the updated application doesn't have all the files it needs, or it's looking for them in the wrong place.

Both systems have the same hardware (CPU, motherboard) similar GPU (RTX 2060 & RTX 2060 super), same video driver, same AV software, same OS & updates, same version of BOINC, same projects, some configuration settings.
They are, the same. Yet weirdness continues to occur.
Grant
Darwin NT
ID: 109739 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 273
Credit: 511,834
RAC: 207
Message 109740 - Posted: 17 Sep 2024, 22:38:00 UTC - in response to Message 109739.  

Compare project directories then.
Copy both to usb hdd and then compare with winmerge.
ID: 109740 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1743
Credit: 18,534,891
RAC: 3,108
Message 109741 - Posted: 18 Sep 2024, 0:23:30 UTC - in response to Message 109740.  

Compare project directories then.
Copy both to usb hdd and then compare with winmerge.
Too late now, but something to think about if it occurs again on one system and not the other.
Grant
Darwin NT
ID: 109741 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2014
Credit: 9,842,981
RAC: 4,009
Message 109746 - Posted: 18 Sep 2024, 12:32:51 UTC - in response to Message 109684.  

Oh, what a surprise.
boinc-process is down again, so there's a Validation backlog once again that continues to grow.


Still.
And already 70k wus pending for validation...
ID: 109746 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kasdashdfjsah

Send message
Joined: 15 Jan 24
Posts: 10
Credit: 0
RAC: 0
Message 109747 - Posted: 18 Sep 2024, 16:47:36 UTC - in response to Message 109746.  

Yeah, but resetting the project worked for me at least, despite the server status page saying that no tasks are available, and clicking the update button over and over again didn't work, so this is very likely the only fix right now, and only works for some people.
ID: 109747 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
TPCBF

Send message
Joined: 29 Nov 10
Posts: 111
Credit: 5,145,661
RAC: 316
Message 109748 - Posted: 18 Sep 2024, 18:23:15 UTC - in response to Message 80630.  



I'm not sure why you aren't getting work units. The system seems ok now and clients should be getting jobs. My desktops are crunching and were able to get jobs recently. Can you try to detach and reattach and see if that helps?

Still getting new WUs (at least during last night it seems), and on Monday at least, some got validated, but at last since yesterday, all crunched WUs just end up "Validation pending", and a ton of servers on the server status page are shown not running... :(

Ralf
ID: 109748 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2166
Credit: 41,629,484
RAC: 5,494
Message 109749 - Posted: 18 Sep 2024, 18:37:34 UTC - in response to Message 109739.  

Why it should be happening with one and not the other, I have no idea.
Neither do i, but it has been a recurring problem over at Ralph (when it's working, which it isn't again) and when it has work.
Several times it's been necessary to reset the project to stop errors occurring because the updated application doesn't have all the files it needs, or it's looking for them in the wrong place.

Both systems have the same hardware (CPU, motherboard) similar GPU (RTX 2060 & RTX 2060 super), same video driver, same AV software, same OS & updates, same version of BOINC, same projects, some configuration settings.
They are, the same. Yet weirdness continues to occur.

Tbf I was looking at one of my other PCs a short time ago, which is offsite to where I am atm, and all its tasks crashed within about 300 seconds, so it does seem to be a bit of pot luck
ID: 109749 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2166
Credit: 41,629,484
RAC: 5,494
Message 109750 - Posted: 18 Sep 2024, 18:40:40 UTC - in response to Message 109746.  

Oh, what a surprise.
boinc-process is down again, so there's a Validation backlog once again that continues to grow.


Still.
And already 70k wus pending for validation...

Just back home and looking to load up with tasks before they run out and I'm too late.
Then discovered what you have about boinc-process going down again. 139k awaiting validation now

Just one easy day is all I ask. Will seemingly never happen...
ID: 109750 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1743
Credit: 18,534,891
RAC: 3,108
Message 109752 - Posted: 18 Sep 2024, 22:24:13 UTC

The boinc-process host is down again, so no Validation for work being returned at this time.
Grant
Darwin NT
ID: 109752 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2166
Credit: 41,629,484
RAC: 5,494
Message 109753 - Posted: 18 Sep 2024, 22:47:45 UTC - in response to Message 109752.  

The boinc-process host is down again, so no Validation for work being returned at this time.

Sometimes the server page doesn't report accurately, so when I see some parts of boinc-process are running (some assimilators) I'm not sure what to think.
Rosetta_beta and Rosetta_python validators were showing as running for a while, even when other parts weren't, but have now switched to not running again.
Whatever's really happening, it all comes across as very flaky <sigh>
ID: 109753 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
TPCBF

Send message
Joined: 29 Nov 10
Posts: 111
Credit: 5,145,661
RAC: 316
Message 109754 - Posted: 18 Sep 2024, 23:54:16 UTC - in response to Message 109753.  

Well, no new task during the day, nothing validated, still all assimilator/vaildators not running... :(
ID: 109754 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1743
Credit: 18,534,891
RAC: 3,108
Message 109759 - Posted: 19 Sep 2024, 22:31:27 UTC

The boinc-process host is back up again, although we now have a error message on the main page in the Server Status section
Notice: Undefined variable: stats in /projects/boinc/rosetta/html/user/index.php on line 81

Grant
Darwin NT
ID: 109759 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1743
Credit: 18,534,891
RAC: 3,108
Message 109760 - Posted: 20 Sep 2024, 3:53:47 UTC

Another 600k or so Tasks just released.

Hopefully things will stay up for a while.
Grant
Darwin NT
ID: 109760 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2166
Credit: 41,629,484
RAC: 5,494
Message 109764 - Posted: 20 Sep 2024, 12:08:51 UTC - in response to Message 109760.  

Another 600k or so Tasks just released.

Hopefully things will stay up for a while.

I arrived at my PC that crashed every task it grabbed from the last batch, like yours did, last night and saw boinc-process was back an hour or two before you posted.
It'd been back for some while already, going by how much the validation backlog had reduced.
Now that tasks are available, let's see if it handles this new batch any better.

I'm currently on another PC that crashed last Monday and missed the last batch altogether, but is rushing through its last few WCG tasks that are right up against their deadline, so I won't find out how this one goes until I get home tonight. Fingers crossed on them all.
ID: 109764 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1743
Credit: 18,534,891
RAC: 3,108
Message 109770 - Posted: 21 Sep 2024, 3:29:40 UTC

Server Status is showing all green, but a backlog is developing with the Assimilators.
Grant
Darwin NT
ID: 109770 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2166
Credit: 41,629,484
RAC: 5,494
Message 109773 - Posted: 21 Sep 2024, 10:33:08 UTC - in response to Message 109764.  
Last modified: 21 Sep 2024, 10:33:22 UTC

Another 600k or so Tasks just released.

Hopefully things will stay up for a while.

I arrived at my PC that crashed every task it grabbed from the last batch, like yours did, last night and saw boinc-process was back an hour or two before you posted.
It'd been back for some while already, going by how much the validation backlog had reduced.
Now that tasks are available, let's see if it handles this new batch any better.

I'm currently on another PC that crashed last Monday and missed the last batch altogether, but is rushing through its last few WCG tasks that are right up against their deadline, so I won't find out how this one goes until I get home tonight. Fingers crossed on them all.

Both running fine and running all tasks to completion.
Not sure what the previous blip was about
ID: 109773 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Bill F
Avatar

Send message
Joined: 29 Jan 08
Posts: 49
Credit: 1,632,964
RAC: 978
Message 109775 - Posted: 23 Sep 2024, 4:55:22 UTC

Trying to get attention that the Stat's export for RALPH has not been updated in over 42 days and that the Posting on the RALPH Message Board is getting no attention.

Respectfully
Bill F
In October 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic;
There was no expiration date.

ID: 109775 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2014
Credit: 9,842,981
RAC: 4,009
Message 109776 - Posted: 23 Sep 2024, 5:12:27 UTC - in response to Message 109775.  
Last modified: 23 Sep 2024, 5:13:37 UTC

Trying to get attention that the Stat's export for RALPH has not been updated in over 42 days and that the Posting on the RALPH Message Board is getting no attention.


After years on Ralph, i think it's a lost cause...
I write, sometimes, in their forums, but i have not a lot of hope

(not that the Rosetta forums are much better)
ID: 109776 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 290 · 291 · 292 · 293 · 294 · 295 · 296 . . . 313 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2025 University of Washington
https://www.bakerlab.org