Problems with Rosetta version 5.68

Message boards : Number crunching : Problems with Rosetta version 5.68

Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 41920 - Posted: 7 Jun 2007, 7:32:46 UTC
Last modified: 7 Jun 2007, 7:34:04 UTC

Has anyone noticed that the WUs with the 1hz6a protein (specifically the tree jump WUs) suffer scaling problems similar to those seen with 1gidA? In my case, when the energy reaches about -113, the values go off the scale. When it gets back to around 110 or so, it comes back into scale with the rest of the chart for accepted energy. Then when it reaches -113 or so again, it is off the scale once more.
ID: 41920 · Rating: 9.9920072216264E-15 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tim Kunz

Send message
Joined: 27 Dec 05
Posts: 9
Credit: 1,120,252
RAC: 0
Message 41925 - Posted: 7 Jun 2007, 14:46:54 UTC

Yet again I'm getting very low average granted credit for the amount of CPU time expended.
ID: 41925 · Rating: 9.9920072216264E-15 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 41929 - Posted: 7 Jun 2007, 16:25:57 UTC
Last modified: 7 Jun 2007, 16:26:42 UTC

It seems that average credit is off for me.
According to the graph I am down 3 points in 3 days, yet the only thing I see in the results for this period is a -1.05 points credit granted, followed by +0.91 and +0.28. Those two add up to +1.19, which gives a net difference of +0.14 points, so the scale should be unchanged at best. It looks like I was at about 212 average points on the 6th and then dropped to approximately 207 by the 7th. This does not make sense based on the information given on the results page. Yet my RAC here still shows 212. So what is it really?
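The arithmetic above can be checked with a short Python sketch. The second function is only an assumption about how RAC behaves: it models RAC as an exponentially decaying average with a one-week half-life, which is how BOINC's recent average credit is commonly described. The names `decay_rac` and `HALF_LIFE_DAYS` are illustrative and not taken from BOINC itself.

```python
import math

HALF_LIFE_DAYS = 7.0  # assumption: RAC decays with a one-week half-life


def decay_rac(rac, idle_days, half_life_days=HALF_LIFE_DAYS):
    """RAC after idle_days with no new credit, under pure exponential decay."""
    return rac * math.exp(-idle_days * math.log(2) / half_life_days)


# Credit changes quoted in the post: one negative, two positive.
changes = [-1.05, 0.91, 0.28]
net_change = round(sum(changes), 2)  # +0.14, matching the post's arithmetic

# Under the decay assumption, how long without new credit takes 212 down to 207?
idle_days_for_drop = HALF_LIFE_DAYS * math.log2(212 / 207)
```

If the decay assumption holds, a drop from 212 to 207 needs only a fraction of a day without newly granted credit, so a graph sampled at a different moment than the account page could plausibly show a different value. This is one possible reading, not something confirmed in the thread.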
ID: 41929 · Rating: -0.99999999999999 · rate: Rate + / Rate - Report as offensive    Reply Quote
M.L.

Send message
Joined: 21 Nov 06
Posts: 182
Credit: 180,462
RAC: 0
Message 41994 - Posted: 9 Jun 2007, 17:57:56 UTC - in response to Message 41877.  
Last modified: 9 Jun 2007, 18:01:20 UTC


Sorry, my mistake.


ID: 41994 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Odysseus

Send message
Joined: 3 May 07
Posts: 14
Credit: 241,831
RAC: 0
Message 42057 - Posted: 11 Jun 2007, 10:35:41 UTC

My Core2 Duo iMac got an error on 1gidA_BOINC_MG_SASAPAIR_EVENRES_RNA_ABINITIO_SAVE_ALL_OUT_BARCODE_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_1760_57723 after just a few seconds of crunching; the salient part of the output appears to be
trouble finding jump_templates_RNA_basepairs_v2.dat
ERROR:: Exit from: read_paths.cc line: 360 
A WinXP/AMD system got a similar error on the WU; one other host appeared to finish it OK but got a Validate error.
ID: 42057 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Keith T.
Avatar

Send message
Joined: 1 Mar 07
Posts: 58
Credit: 34,135
RAC: 0
Message 42062 - Posted: 11 Jun 2007, 13:10:51 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=85538633 CNTRL_01ABRELAX_SAVE_ALL_OUT_-1pgx_-_filters_1782_4616_0

Task froze. I noticed it when there was no progress for about 30 minutes. I exited BOINC. When the task resumed it uploaded and reported as Success.

Athlon XP 2200+, Win XP Pro, 256MB RAM
ID: 42062 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Fynjy

Send message
Joined: 18 Sep 06
Posts: 8
Credit: 10,762,260
RAC: 0
Message 42139 - Posted: 13 Jun 2007, 13:52:52 UTC

Hi! My teammate has a problem with a new computer: https://boinc.bakerlab.org/rosetta/results.php?hostid=507544 All new WUs end with errors like this:

<core_client_version>5.9.7</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>rosetta_5.68_windows_intelx86.exe</file_name>
<error_code>-200</error_code>
</file_xfer_error>

</message>
]]>

Any opinions?


Help people! Join TSC!Russia!
ID: 42139 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 42152 - Posted: 13 Jun 2007, 20:19:01 UTC

Fynjy, perhaps your teammate is running from behind a corporate firewall that blocks the download of .exe files?

See similar discussion here
Rosetta Moderator: Mod.Sense
ID: 42152 · Rating: 9.9920072216264E-15 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 12,120,035
RAC: 0
Message 42189 - Posted: 14 Jun 2007, 19:33:32 UTC

Workunit 78076946 stuck at 0% for several hours.


ID: 42189 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,821,902
RAC: 15,180
Message 42196 - Posted: 15 Jun 2007, 12:29:11 UTC - in response to Message 42152.  

Fynjy, perhaps your teammate is running from behind a corporate firewall that blocks the download of .exe files?

See similar discussion here

If so, you can put the files in the rosetta folder manually and it'll skip trying to download them.

HTH
Danny
ID: 42196 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,821,902
RAC: 15,180
Message 42278 - Posted: 17 Jun 2007, 21:32:11 UTC
Last modified: 17 Jun 2007, 21:33:06 UTC

Is this an error? The run-time was very short:

DONE :: 1 starting structures 8744.13 cpu seconds
This process generated 10241508 decoys from -1100697074 attempts

Is that attempts number negative because it's wrapped because it's too big?

I assume these are supposed to have lots of decoys anyway?:
1dhn__BOINC_1000DECOYS_ALLFEATURES_CORRECTION_ABRELAX_SAVE_ALL_OUT_BARCODE-1dhn_-frags83__1791_860_0
ID: 42278 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 42285 - Posted: 18 Jun 2007, 13:30:02 UTC
Last modified: 18 Jun 2007, 13:34:16 UTC

dcdc, did you perhaps link the wrong task? That one looks normal and doesn't have the CPU seconds you mentioned.

Oh here it is. Yes, certainly looks like some kind of overflow. Wonder how it figured out to grant credits.
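A minimal sketch of the wraparound being discussed, assuming the attempts counter is stored as a signed 32-bit integer (the thread does not say what type Rosetta actually uses). The helper names `wrap_int32` and `unwrap_once` are hypothetical.

```python
def wrap_int32(n):
    """Interpret an arbitrary integer as a signed 32-bit two's-complement value."""
    n &= 0xFFFFFFFF
    return n - 0x100000000 if n >= 0x80000000 else n


def unwrap_once(n):
    """Smallest non-negative count that wraps to the given signed 32-bit value."""
    return n & 0xFFFFFFFF


reported = -1100697074             # the "attempts" figure from the task output
candidate = unwrap_once(reported)  # 3194270222 if the counter wrapped exactly once
```

Under that assumption the true count would be 3,194,270,222 plus some multiple of 2^32; the output alone cannot tell how many times the counter wrapped.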
Rosetta Moderator: Mod.Sense
ID: 42285 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Fynjy

Send message
Joined: 18 Sep 06
Posts: 8
Credit: 10,762,260
RAC: 0
Message 42354 - Posted: 20 Jun 2007, 8:59:30 UTC - in response to Message 42152.  

dcdc
That was the first thing he did. It didn't help.
Mod.Sense
https://boinc.bakerlab.org/rosetta/results.php?hostid=519033 here it is. We can't solve this trouble yet. Any other ideas?
Help people! Join TSC!Russia!
ID: 42354 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 42364 - Posted: 20 Jun 2007, 16:32:23 UTC

Fynjy, I suggest your teammate go to the download page and see if they can just save the .exe from there. If not, they have a firewall blocking download of .exe files and the thread I linked previously has the details. Here is the link for the download page:
https://boinc.bakerlab.org/rosetta/download
Rosetta Moderator: Mod.Sense
ID: 42364 · Rating: -0.99999999999999 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sam Miorelli

Send message
Joined: 16 Feb 06
Posts: 7
Credit: 1,303,044
RAC: 0
Message 42400 - Posted: 21 Jun 2007, 12:10:51 UTC

I just had a workunit crash on a Prescott-based machine. This thing has run SETI, Einstein, LHC and Rosetta on and off for the past 2 years. I've never had it crash any WUs other than Rosetta (it had been the screensaver crash most of the time in the past).

This one crashed overnight and gave memory-address error messages in Windows (RAM was pre-tested and burned in so is OK) and reported in BOINC Manager this morning as I logged back in.

6/21/2007 8:04:47 AM|rosetta@home|Unrecoverable error for result CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ig5A-_filters_1782_108276_0 ( - exit code -1073741819 (0xc0000005))

Here's the link to the computer: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=220976
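The two forms of the exit code in the BOINC message above are the same 32-bit value, once shown as a signed decimal and once as hex. A small Python sketch of the conversion follows; the function name and the one-entry lookup table are illustrative. 0xC0000005 is the Windows NTSTATUS code for an access violation, which fits the memory-address error messages described.

```python
# Illustrative lookup; only the code that appears in this post is listed.
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
}


def exit_code_to_hex(code):
    """Render a signed 32-bit exit code the way Windows shows it in hex."""
    return f"0x{code & 0xFFFFFFFF:08x}"


def describe_exit_code(code):
    """Return the hex form plus a name when the code is in the lookup table."""
    unsigned = code & 0xFFFFFFFF
    name = KNOWN_NTSTATUS.get(unsigned, "unknown")
    return f"{exit_code_to_hex(code)} ({name})"


summary = describe_exit_code(-1073741819)
```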

ID: 42400 · Rating: -0.99999999999999 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ken Starnes

Send message
Joined: 7 Jan 06
Posts: 10
Credit: 2,716,317
RAC: 1,281
Message 42436 - Posted: 22 Jun 2007, 11:55:52 UTC

Since downloading the latest version of BOINC it doesn't seem to swap tasks every 60 minutes as set in my General Preferences. (I'm running about 5 different projects/tasks - if work units are made available)

Is this an issue for anyone else or is there another BOINC forum I should ask this question in?

Thanks

Ken
ID: 42436 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 42443 - Posted: 22 Jun 2007, 14:31:52 UTC

Ken, if this topic gets several posts I'll start a new thread for it, as it is not specific to Rosetta 5.68.

People often think that by setting this preference to a given time they are forcing BOINC to switch between projects every 60 minutes (or whatever the time frame is). In fact, you are simply asking BOINC to reconsider which project to run every 60 minutes. It is not uncommon for BOINC to decide to keep crunching the same project it was on for the prior hour. BOINC makes this decision based on the relative debt levels of the different projects. Once the debts balance out, you will see more switching between projects.
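The behaviour described above can be illustrated with a toy model in Python. This is not BOINC's scheduler: the debt bookkeeping, the function names and the numbers are all assumptions chosen only to show why a project that is owed a lot of CPU time keeps running across several 60-minute decision points.

```python
def pick_project(debts):
    """Choose the project that is currently owed the most CPU time."""
    return max(debts, key=debts.get)


def simulate(shares, initial_debts, intervals, interval_minutes=60):
    """Toy debt model: at each interval, re-decide which project runs."""
    debts = dict(initial_debts)
    total_share = sum(shares.values())
    schedule = []
    for _ in range(intervals):
        running = pick_project(debts)
        schedule.append(running)
        # Every project is owed time in proportion to its resource share...
        for name, share in shares.items():
            debts[name] += interval_minutes * share / total_share
        # ...and the project that actually ran has received a full interval.
        debts[running] -= interval_minutes
    return schedule


# Hypothetical starting point: Rosetta is owed 150 minutes, WCG nothing.
schedule = simulate(
    shares={"rosetta": 1, "wcg": 1},
    initial_debts={"rosetta": 150, "wcg": 0},
    intervals=6,
)
```

In this toy run the first three hourly decisions all pick the same project, and regular switching starts only once the debts are close to balanced.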
Rosetta Moderator: Mod.Sense
ID: 42443 · Rating: -1 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Dean

Send message
Joined: 11 Feb 07
Posts: 4
Credit: 631,230
RAC: 0
Message 42470 - Posted: 22 Jun 2007, 21:09:26 UTC

With 5.68 on a Debian Linux 2.6x machine, most Rosetta tasks will run to about 84% completion, and then hang. The "CPU time" does not increment for the task, and the task will remain hung for as long as it is the executable task.

I am also running World Community Grid, and there is no problem with the WCG tasks. But, when WCG releases BOINC to Rosetta, the Rosetta tasks go nowhere. I have seen this on multiple tasks, and most recently with: CNTRL_01ABRELAX_SAVE_ALL_OUT_-1elwA-_filters_1782_11292_1 and CNTRL_01ABRELAX_SAVE_ALL_OUT_-1iibA-_filters_1782_128542_1.

I have paused the tasks and then resumed, restarted BOINC, reset the Rosetta project, left it to run for several days, all to no avail. A new Rosetta task will run to the 84% completion, and then hang. Once in a while, a task will actually complete, usually right after I reset Rosetta.

I am also running Rosetta on Windows XP and 2000 machines with no problems. Since I despise Microsoft products, I am very motivated to get this fixed on Linux ;)
ID: 42470 · Rating: 1 · rate: Rate + / Rate - Report as offensive    Reply Quote
ambi

Send message
Joined: 19 Mar 07
Posts: 1
Credit: 117,496
RAC: 0
Message 42587 - Posted: 24 Jun 2007, 22:48:39 UTC - in response to Message 42470.  

With 5.68 on a Debian Linux 2.6x machine, most Rosetta tasks will run to about 84% completion, and then hang. The "CPU time" does not increment for the task, and the task will remain hung for as long as it is the executable task.

I am also running World Community Grid, and there is no problem with the WCG tasks. But, when WCG releases BOINC to Rosetta, the Rosetta tasks go nowhere. I have seen this on multiple tasks, and most recently with: CNTRL_01ABRELAX_SAVE_ALL_OUT_-1elwA-_filters_1782_11292_1 and CNTRL_01ABRELAX_SAVE_ALL_OUT_-1iibA-_filters_1782_128542_1.

I have paused the tasks and then resumed, restarted BOINC, reset the Rosetta project, left it to run for several days, all to no avail. A new Rosetta task will run to the 84% completion, and then hang. Once in a while, a task will actually complete, usually right after I reset Rosetta.

I am also running Rosetta on Windows XP and 2000 machines with no problems. Since I despise Microsoft products, I am very motivated to get this fixed on Linux ;)


I have similar problems on my various Linux boxes with SLES9, Fedora 7, and Debian Sarge. My Windows boxes run fine, but I have more boxes running Linux and I cannot log in to each of them every day to see if they are still crunching.

The Rosetta tasks just stop consuming CPU at a certain point and only killing them or restarting the client helps.
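For anyone who wants to check this without logging in by hand, here is a rough Python sketch for Linux that samples a process's CPU time twice and reports whether it advanced. It reads `/proc/<pid>/stat`, where fields 14 and 15 are user and system CPU ticks; the function names and the 60-second wait are arbitrary choices, and finding the Rosetta PID is left to the caller. Note that a task BOINC has deliberately suspended will also show no CPU increase, so this only suggests a hang for a task that is supposed to be running.

```python
import time


def parse_cpu_ticks(stat_line):
    """Return utime + stime (clock ticks) from the contents of /proc/<pid>/stat."""
    # The command name is in parentheses and may contain spaces, so split
    # after the last ')'. The remainder starts at field 3 (process state).
    rest = stat_line.rsplit(")", 1)[1].split()
    utime = int(rest[11])  # field 14
    stime = int(rest[12])  # field 15
    return utime + stime


def read_cpu_ticks(pid):
    """Read the current CPU tick count for a process from /proc."""
    with open(f"/proc/{pid}/stat") as f:
        return parse_cpu_ticks(f.read())


def is_stalled(pid, wait_seconds=60):
    """True if the process used no CPU time at all during wait_seconds."""
    before = read_cpu_ticks(pid)
    time.sleep(wait_seconds)
    return read_cpu_ticks(pid) == before
```

A cron job could call `is_stalled` for the Rosetta process and restart the client when it returns True, which is the workaround already described above.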
ID: 42587 · Rating: 1 · rate: Rate + / Rate - Report as offensive    Reply Quote
cyberschnook

Send message
Joined: 26 Oct 06
Posts: 2
Credit: 4,012
RAC: 0
Message 42653 - Posted: 26 Jun 2007, 13:27:48 UTC

Messages showing in BOINC:

06/26/2007 09:09:45|rosetta@home|Deferring communication for 1 min 0 sec
06/26/2007 09:09:45|rosetta@home|Reason: Unrecoverable error for result BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1 (The system cannot find the path specified. (0x3) - exit code 3 (0x3))
06/26/2007 09:09:45|rosetta@home|Computation for task BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1 finished
06/26/2007 09:09:45|rosetta@home|Output file BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1_0 for task BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1 absent
=========
Contents of debc_accompat.txt (MS error reporting; sent to MS as well):

<?xml version="1.0" encoding="UTF-16"?>
<DATABASE>
<EXE NAME="rosetta_5.68_windows_intelx86.exe" FILTER="GRABMI_FILTER_PRIVACY">
<MATCHING_FILE NAME="rosetta_5.68_windows_intelx86.exe" SIZE="2587136" CHECKSUM="0x7044EED1" MODULE_TYPE="WIN32" PE_CHECKSUM="0x0" LINKER_VERSION="0x0" LINK_DATE="06/02/2007 00:15:11" UPTO_LINK_DATE="06/02/2007 00:15:11" />
</EXE>
<EXE NAME="kernel32.dll" FILTER="GRABMI_FILTER_THISFILEONLY">
<MATCHING_FILE NAME="kernel32.dll" SIZE="984064" CHECKSUM="0xF12E1D4A" BIN_FILE_VERSION="5.1.2600.2945" BIN_PRODUCT_VERSION="5.1.2600.2945" PRODUCT_VERSION="5.1.2600.2945" FILE_DESCRIPTION="Windows NT BASE API Client DLL" COMPANY_NAME="Microsoft Corporation" PRODUCT_NAME="Microsoft® Windows® Operating System" FILE_VERSION="5.1.2600.2945 (xpsp_sp2_gdr.060704-2349)" ORIGINAL_FILENAME="kernel32" INTERNAL_NAME="kernel32" LEGAL_COPYRIGHT="© Microsoft Corporation. All rights reserved." VERFILEDATEHI="0x0" VERFILEDATELO="0x0" VERFILEOS="0x40004" VERFILETYPE="0x2" MODULE_TYPE="WIN32" PE_CHECKSUM="0xF724D" LINKER_VERSION="0x50001" UPTO_BIN_FILE_VERSION="5.1.2600.2945" UPTO_BIN_PRODUCT_VERSION="5.1.2600.2945" LINK_DATE="07/05/2006 10:55:00" UPTO_LINK_DATE="07/05/2006 10:55:00" VER_LANGUAGE="English (United States) [0x409]" />
</EXE>
</DATABASE>
=========
Application Error from Event log:

Faulting application rosetta_5.68_windows_intelx86.exe, version 0.0.0.0, faulting module rosetta_5.68_windows_intelx86.exe, version 0.0.0.0, fault address 0x00799f42.

0000: 41 70 70 6c 69 63 61 74 Applicat
0008: 69 6f 6e 20 46 61 69 6c ion Fail
0010: 75 72 65 20 20 72 6f 73 ure ros
0018: 65 74 74 61 5f 35 2e 36 etta_5.6
0020: 38 5f 77 69 6e 64 6f 77 8_window
0028: 73 5f 69 6e 74 65 6c 78 s_intelx
0030: 38 36 2e 65 78 65 20 30 86.exe 0
0038: 2e 30 2e 30 2e 30 20 69 .0.0.0 i
0040: 6e 20 72 6f 73 65 74 74 n rosett
0048: 61 5f 35 2e 36 38 5f 77 a_5.68_w
0050: 69 6e 64 6f 77 73 5f 69 indows_i
0058: 6e 74 65 6c 78 38 36 2e ntelx86.
0060: 65 78 65 20 30 2e 30 2e exe 0.0.
0068: 30 2e 30 20 61 74 20 6f 0.0 at o
0070: 66 66 73 65 74 20 30 30 ffset 00
0078: 37 39 39 66 34 32 0d 0a 799f42..
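The event-log dump above is plain ASCII, which a few lines of Python can decode back into the message text. This is only a convenience sketch: the function name is made up, and it assumes each row has an offset, a colon, and then eight hex bytes followed by the printable preview.

```python
def decode_dump(lines):
    """Join the hex bytes of an 8-bytes-per-row dump and decode them as ASCII."""
    out = bytearray()
    for line in lines:
        _, _, rest = line.partition(":")
        hex_tokens = rest.split()[:8]  # ignore the trailing text preview
        out.extend(bytes.fromhex("".join(hex_tokens)))
    return out.decode("ascii")


# First three rows of the dump from this post.
rows = [
    "0000: 41 70 70 6c 69 63 61 74 Applicat",
    "0008: 69 6f 6e 20 46 61 69 6c ion Fail",
    "0010: 75 72 65 20 20 72 6f 73 ure ros",
]
message_start = decode_dump(rows)
```

Decoding all rows reproduces the "Faulting application" summary in binary form, ending with the same offset 00799f42 and a CR/LF pair.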



ID: 42653 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote



©2024 University of Washington
https://www.bakerlab.org