minirosetta 2.16

Message boards : Number crunching : minirosetta 2.16

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 68182 - Posted: 23 Oct 2010, 23:39:04 UTC

PCS_2RN2_atensor.frag_1-100_SAVE_ALL_OUT_22378_16_0 died @ 3.5 seconds

ERROR: ERROR: FragmentIO: could not open file boinc_aafrag_1-100_09_05.200_v1_3.gz
ERROR:: Exit from: ....srccorefragmentFragmentIO.cc line: 258
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish


ID: 68182 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
cleaner

Send message
Joined: 22 Aug 10
Posts: 6
Credit: 26,245
RAC: 0
Message 68189 - Posted: 25 Oct 2010, 7:42:45 UTC

The last two nights when Rosetta has been in screen saver for about an hour,the computer has either frozen or else rosetta does not respond and the process has to be terminated. Up until now 2.16 had been running fine.....
ID: 68189 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile MalX

Send message
Joined: 18 Jan 06
Posts: 1
Credit: 66,403
RAC: 0
Message 68194 - Posted: 25 Oct 2010, 15:46:09 UTC

Having the exact same problem. EVERY WU fails with computation error, and complains about an absent output file for the task.

I am using Gentoo hardened, and even by easing off with paxctl, I still cant get any WU's to crunch.

The paxctl command works fine with enigma but not Rosetta. Any ideas?
ID: 68194 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ross Parlette

Send message
Joined: 10 Nov 05
Posts: 32
Credit: 2,165,044
RAC: 0
Message 68226 - Posted: 28 Oct 2010, 4:08:39 UTC

I'm getting the exited with zero status a lot lately. For the most part, the task is restarted and completes correctly (?) and is uploaded are reported. Here follows an example:

10/25/2010 9:17:46 PM rosetta@home Task mem_widd_run02_Menv_B_round02_0013_SAVE_ALL_OUT_IGNORE_THE_REST_22363_5424_0 exited with zero status but no 'finished' file
10/25/2010 9:17:46 PM rosetta@home If this happens repeatedly you may need to reset the project.
10/25/2010 9:17:46 PM rosetta@home Restarting task mem_widd_run02_Menv_B_round02_0013_SAVE_ALL_OUT_IGNORE_THE_REST_22363_5424_0 using minirosetta version 216
10/26/2010 10:20:10 PM rosetta@home Computation for task mem_widd_run02_Menv_B_round02_0013_SAVE_ALL_OUT_IGNORE_THE_REST_22363_5424_0 finished

I have examined this task in my account. It is the one which was sent on 23 Oct 2010 6:39:24 UTC and returned on the 27th. According to the account, it was successfully completed.

I have been getting multiple examples of this. What should I do? Should I reset the project? Just what does that mean?

Thanks.

Ross
ID: 68226 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 68238 - Posted: 28 Oct 2010, 16:04:08 UTC

Ross, so long as other tasks are completing normally, I would not suggest taking any steps to try and resolve this. It sounds more like a problem in the task then on your machine, so there isn't much you'll be able to do about it. You might observe them as they run though and see if they are using excessive memory or anything like that, just so you can report additional symptoms.
Rosetta Moderator: Mod.Sense
ID: 68238 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 68245 - Posted: 29 Oct 2010, 3:05:38 UTC

Hi.

This tasks seemed to be stuck in a loop, the last checkpoint was at 51min the run

time was up to 4hrs 19mins, when i looked at the graphics it was at

STAGE: rb_CA_CA_07 if that helps. And had 205 models at STEP: 5800 and not

moving, i stop and rebooted on restart it went back to 51mins and is now running

and moving i'll let it finish if it does!


celldivs_LL_1de2_2oqk_ProteinInterfaceDesign_26Oct2010_22394_16_0


https://boinc.bakerlab.org/rosetta/workunit.php?wuid=342826273

ID: 68245 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile SubNuke

Send message
Joined: 2 Aug 08
Posts: 1
Credit: 1,242,551
RAC: 0
Message 68266 - Posted: 30 Oct 2010, 16:19:57 UTC
Last modified: 30 Oct 2010, 16:42:47 UTC

I am also seeing tasks fail with computation error accompanied by message indicating an output file is absent.

This is on Core i7-920 systems [with 9800 GTX+'s] running 64-bit Fedora 13 and BOINC 6.10.45 packages installed from the Fedora 13 repository.

If this problem has already been resolved, please point me in the direction of the solution. If I can provide some bit of info that would help to diagnose and resolve the issue, please just ask.

Thanks!

ID: 68266 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 68267 - Posted: 30 Oct 2010, 17:49:11 UTC

Thank you SubNuke. The main thing that is helpful is if you can provide links to specific tasks that are failing, and if there is any pattern to the task names of those that fail vs. those that complete normally.
Rosetta Moderator: Mod.Sense
ID: 68267 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2

Message boards : Number crunching : minirosetta 2.16



©2024 University of Washington
https://www.bakerlab.org