Too many restarts with no progress.

Message boards : Number crunching : Too many restarts with no progress.

To post messages, you must log in.

AuthorMessage
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,821,902
RAC: 13,431
Message 39569 - Posted: 18 Apr 2007, 15:46:48 UTC

Hi

I've got a couple of low-end machines running rosie:
Machine A:
BOINC 5.4.11
http://boinc bakerlab.org/rosetta/show_host_detail.php?hostid=354306

Machine B:
BOINC 5.8.16
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=405599

I’m also using the BONICStats account manager (BAM) with A using host-specific settings and B using a standard venue that I created there.

The problem is that both machines are now giving the following error:
Too many restarts with no progress. Keep application in memory while preempted.

Keep application in memory is set to ‘Yes’ everywhere I can set it, including all four BIONC venues:
General, Home, School and Work, in all the venues I’ve created through the BAM and in the host-specific settings for machine A.

Anyone know why it’s erroring out on these? I don’t have easy access to these machines (I can Remote Desktop into them but only when their owners are around and free to let me in). Is this a problem with the ‘Do work while computer is in use’ setting as that’s set to ‘No’ on both machines. Neither is using the BOINC screensaver, but BOINC is installed as a service running under the system account and is allowed to intereact with the desktop on both (not ideal I know, but will do for now).

Any ideas?
Ta
Danny
ID: 39569 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 39578 - Posted: 18 Apr 2007, 21:03:00 UTC - in response to Message 39569.  

...Is this a problem with the ‘Do work while computer is in use’ setting as that’s set to ‘No’ on both machines.


As long as you leave in memory, then stopping while computer is in use shouldn't be a problem.

Is it possible that the PC was turned off, or BOINC was ended, before a checkpoint was reached? ... 5 times in a row? That SHOULD be the cause. And if you removed from memory, then it would be as simple as the user stepping in to use the computer before a checkpoint is reached 5 times in a row since you do not allow BOINC while computer in use.

Rosetta Moderator: Mod.Sense
ID: 39578 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile anders n

Send message
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 39594 - Posted: 19 Apr 2007, 5:03:54 UTC

Check the computers so they don't have something running that stops
Boinc from comming in when it should have.
Anders n
ID: 39594 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,821,902
RAC: 13,431
Message 39597 - Posted: 19 Apr 2007, 8:36:14 UTC

I VNC'd into Machine A last night and boinc was running but rosetta wasn't while the computer was in use, so i synchronised it with the BAM and a rosetta thread appeared shortly after. I left task manager open and rosetta started crunching after around 3 mins. I moved the mouse and rosetta stopped again... looks like that one was working ok. I'd have expected it to have returned a result overnight though seeing as it was left on running some updates, although it might have picked up my 8hr default run time, rather than using the 4hr one it's supposed to from the 'Home' venue here (BOINCStats BAM doesn't let you change project-specific stettings such as run-time unfortunately).
ID: 39597 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Too many restarts with no progress.



©2024 University of Washington
https://www.bakerlab.org