Message boards : Number crunching : Problems with Rosetta version 5.68
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Has anyone noticed that the WU's with 1hz6a protein (specificly the tree jump wu's) suffer similar scaling problems to what was seen with 1gidA? The values when they reach in my case -113 in energy are off the scale. When it gets back around say 110 or so then it comes back into scale with the rest of the charting for accepted energy. Then when it reaches -113 or so again it is off the scale. |
Tim Kunz Send message Joined: 27 Dec 05 Posts: 9 Credit: 1,120,252 RAC: 0 |
Yet again I'm getting very low average granted credit for the amount of CPU time expended. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
It seems that average credit is off for me. According to the graph I am down 3 points in 3 days and the only thing I see in results for this period is a -1.05 points credit granted followed by +.91 and +.28 which equals a +1.19 which gives a difference of +.14 points. So the scale should be unchanged at best. It looks like on 6th I was at about 212 average points and then by the 7th I take a drop of 207 approximately. This does not make sense based on the information given in the results page. Yet my RAC states here its still 212. So what is it really? |
M.L. Send message Joined: 21 Nov 06 Posts: 182 Credit: 180,462 RAC: 0 |
Sorry, my mistake. |
Odysseus Send message Joined: 3 May 07 Posts: 14 Credit: 241,831 RAC: 0 |
My Core2 Duo iMac got an error on 1gidA_BOINC_MG_SASAPAIR_EVENRES_RNA_ABINITIO_SAVE_ALL_OUT_BARCODE_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_1760_57723 after just a few seconds of crunching; the salient part of the output appears to be trouble finding jump_templates_RNA_basepairs_v2.dat ERROR:: Exit from: read_paths.cc line: 360A WinXP/AMD system got a similar error on the WU; one other host appeared to finish it OK but got a Validate error. |
Keith T. Send message Joined: 1 Mar 07 Posts: 58 Credit: 34,135 RAC: 0 |
https://boinc.bakerlab.org/rosetta/result.php?resultid=85538633 CNTRL_01ABRELAX_SAVE_ALL_OUT_-1pgx_-_filters_1782_4616_0 Task froze. I noticed it when there was no progress for about 30 minutes. I exited BOINC. When the task resumed it uploaded and reported as Success. Athlon XP 2200+, Win XP Pro, 256MB RAM |
Fynjy Send message Joined: 18 Sep 06 Posts: 8 Credit: 10,762,260 RAC: 0 |
Hi! My teammate have problem with new computer https://boinc.bakerlab.org/rosetta/results.php?hostid=507544 All new WU ends with errors like this <core_client_version>5.9.7</core_client_version> <![CDATA[ <message> app_version download error: couldn't get input files: <file_xfer_error> <file_name>rosetta_5.68_windows_intelx86.exe</file_name> <error_code>-200</error_code> </file_xfer_error> </message> ]]> Any opinions? Help people! Join TSC!Russia! |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Fynjy, perhaps your teammate is running from behind a corporate firewall that blocks the download of .exe files? See similar discussion here Rosetta Moderator: Mod.Sense |
svincent Send message Joined: 30 Dec 05 Posts: 219 Credit: 12,120,035 RAC: 0 |
Workunit 78076946 stuck at 0% for several hours. |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
Fynjy, perhaps your teammate is running from behind a corporate firewall that blocks the download of .exe files? If so, you can put the files in the rosetta folder manually and it'll skip trying to download them. HTH Danny |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,821,902 RAC: 15,180 |
Is this an error? The run-time was very short: DONE :: 1 starting structures 8744.13 cpu seconds This process generated 10241508 decoys from -1100697074 attempts Is that attempts number negative because it's wrapped because it's too big? I assume these are supposed to have lots of decoys anyway?: 1dhn__BOINC_1000DECOYS_ALLFEATURES_CORRECTION_ABRELAX_SAVE_ALL_OUT_BARCODE-1dhn_-frags83__1791_860_0 |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
dcdc, did you perhaps link the wrong task? That one looks normal and doesn't have the CPU seconds you mentioned. Oh here it is. Yes, certainly looks like some kind of overflow. Wonder how it figured out to grant credits. Rosetta Moderator: Mod.Sense |
Fynjy Send message Joined: 18 Sep 06 Posts: 8 Credit: 10,762,260 RAC: 0 |
dcdc First, what he did. It didn't help. Mod.Sense https://boinc.bakerlab.org/rosetta/results.php?hostid=519033 here it is. We can't solve this truble yet. Any other ideas? Help people! Join TSC!Russia! |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Fynjy, I suggest your teammate go to the download page and see if they can just save the .exe from there. If not, they have a firewall blocking download of .exe files and the thread I linked previously has the details. Here is the link for the download page: https://boinc.bakerlab.org/rosetta/download Rosetta Moderator: Mod.Sense |
Sam Miorelli Send message Joined: 16 Feb 06 Posts: 7 Credit: 1,303,044 RAC: 0 |
I just had a workunit crash on a Prescott-based machine. This thing has run SETI, Einstein, LHC and Rosetta on and off for the past 2 years. I've never had it crash any WUs other that Rosetta (it had been the screensaver crash most of the time in the past). This one crashed overnight and gave memory-address error messages in Windows (RAM was pre-tested and burned in so is OK) and reported in BOINC Manager this morning as I logged back in. 6/21/2007 8:04:47 AM|rosetta@home|Unrecoverable error for result CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ig5A-_filters_1782_108276_0 ( - exit code -1073741819 (0xc0000005)) Here's the link to the computer: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=220976 |
Ken Starnes Send message Joined: 7 Jan 06 Posts: 10 Credit: 2,716,317 RAC: 1,281 |
Since downloading the latest version of BOINC it doesnt seem to swap tasks every 60 minutes as set in my General Preferences. (I'm running about 5 different projects/tasks - if work units are made available) Is this an issue for anyone else or is there another BOINC forum I should ask this question in? Thanks Ken |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Ken if this topic gets several posts I'll start a new thread for it, as it is not specific to Rosetta 5.68. Often times people think that by setting the preference to a given time that they are forcing BOINC to switch between projects every 60min (or whatever time frame). In fact you are simply asking BOINC to reconsider which project to be running every 60min. It is not uncommon for BOINC to decide to continue crunching the same project it was on for the prior hour. BOINC makes this decision based on the reletive debt levels of the different projects. Once the debts balance out you will see more switching between projects. Rosetta Moderator: Mod.Sense |
Dean Send message Joined: 11 Feb 07 Posts: 4 Credit: 631,230 RAC: 0 |
With 5.68 on a Debian Linux 2.6x machine, most Rosetta tasks will run to about 84% completion, and then hang. The "CPU time" does not increment for the task, and the task will remain hung for as long as it is the executable task. I am also running World Communit Grid, and there is no problem with the WCG tasks. But, when WCG releases BOINC to Rosetta, the Rosetta tasks go nowhere. I have seen this on multiple tasks, and most recently with:CNTRL_01ABRELAX_SAVE_ALL_OUT_-1elwA-_filters_1782_11292_1 and CNTRL_01ABRELAX_SAVE_ALL_OUT_-1iibA-_filters_1782_128542_1. I have paused the tasks and then resumed, restarted BOINC, reset the Rosetta project, left it to run for several days, all to no avail. A new Rosetta task will run to the 84% completion, and then hang. Once in a while, a task will actually complete, usually right after I reset Rosetta. I am also running Rosetta on Windows XP and 2000 machines with no problems. Since I despise Microsoft products, I am very motivated to get this fixed on Linux ;) |
ambi Send message Joined: 19 Mar 07 Posts: 1 Credit: 117,496 RAC: 0 |
With 5.68 on a Debian Linux 2.6x machine, most Rosetta tasks will run to about 84% completion, and then hang. The "CPU time" does not increment for the task, and the task will remain hung for as long as it is the executable task. I have similar problems on my various Linux boxes with SLES9, Fedora 7, Debian Sarge. My windows boxes run fine, but I have more boxes running Linux and I cannot login to each of them every day to see if they are still crunching. The Rosetta tasks just stop consuming CPU at a certain point and only killing them or restarting the client helps. |
cyberschnook Send message Joined: 26 Oct 06 Posts: 2 Credit: 4,012 RAC: 0 |
Messages showing in BOINC: 06/26/2007 09:09:45|rosetta@home|Deferring communication for 1 min 0 sec 06/26/2007 09:09:45|rosetta@home|Reason: Unrecoverable error for result BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1 (The system cannot find the path specified. (0x3) - exit code 3 (0x3)) 06/26/2007 09:09:45|rosetta@home|Computation for task BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1 finished 06/26/2007 09:09:45|rosetta@home|Output file BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1_0 for task BENCH_051207_ABRELAX_SAVE_ALL_OUT_-1cc8A-_BARCODE_R29_R67_filters_1804_2237_1 absent ========= Contents of debc_accompat.txt (MS error reporting; sent to MS as well): <?xml version="1.0" encoding="UTF-16"?> <DATABASE> <EXE NAME="rosetta_5.68_windows_intelx86.exe" FILTER="GRABMI_FILTER_PRIVACY"> <MATCHING_FILE NAME="rosetta_5.68_windows_intelx86.exe" SIZE="2587136" CHECKSUM="0x7044EED1" MODULE_TYPE="WIN32" PE_CHECKSUM="0x0" LINKER_VERSION="0x0" LINK_DATE="06/02/2007 00:15:11" UPTO_LINK_DATE="06/02/2007 00:15:11" /> </EXE> <EXE NAME="kernel32.dll" FILTER="GRABMI_FILTER_THISFILEONLY"> <MATCHING_FILE NAME="kernel32.dll" SIZE="984064" CHECKSUM="0xF12E1D4A" BIN_FILE_VERSION="5.1.2600.2945" BIN_PRODUCT_VERSION="5.1.2600.2945" PRODUCT_VERSION="5.1.2600.2945" FILE_DESCRIPTION="Windows NT BASE API Client DLL" COMPANY_NAME="Microsoft Corporation" PRODUCT_NAME="Microsoft® Windows® Operating System" FILE_VERSION="5.1.2600.2945 (xpsp_sp2_gdr.060704-2349)" ORIGINAL_FILENAME="kernel32" INTERNAL_NAME="kernel32" LEGAL_COPYRIGHT="© Microsoft Corporation. All rights reserved." VERFILEDATEHI="0x0" VERFILEDATELO="0x0" VERFILEOS="0x40004" VERFILETYPE="0x2" MODULE_TYPE="WIN32" PE_CHECKSUM="0xF724D" LINKER_VERSION="0x50001" UPTO_BIN_FILE_VERSION="5.1.2600.2945" UPTO_BIN_PRODUCT_VERSION="5.1.2600.2945" LINK_DATE="07/05/2006 10:55:00" UPTO_LINK_DATE="07/05/2006 10:55:00" VER_LANGUAGE="English (United States) [0x409]" /> </EXE> </DATABASE> ========= Application Error from Event log: Faulting application rosetta_5.68_windows_intelx86.exe, version 0.0.0.0, faulting module rosetta_5.68_windows_intelx86.exe, version 0.0.0.0, fault address 0x00799f42. 0000: 41 70 70 6c 69 63 61 74 Applicat 0008: 69 6f 6e 20 46 61 69 6c ion Fail 0010: 75 72 65 20 20 72 6f 73 ure ros 0018: 65 74 74 61 5f 35 2e 36 etta_5.6 0020: 38 5f 77 69 6e 64 6f 77 8_window 0028: 73 5f 69 6e 74 65 6c 78 s_intelx 0030: 38 36 2e 65 78 65 20 30 86.exe 0 0038: 2e 30 2e 30 2e 30 20 69 .0.0.0 i 0040: 6e 20 72 6f 73 65 74 74 n rosett 0048: 61 5f 35 2e 36 38 5f 77 a_5.68_w 0050: 69 6e 64 6f 77 73 5f 69 indows_i 0058: 6e 74 65 6c 78 38 36 2e ntelx86. 0060: 65 78 65 20 30 2e 30 2e exe 0.0. 0068: 30 2e 30 20 61 74 20 6f 0.0 at o 0070: 66 66 73 65 74 20 30 30 ffset 00 0078: 37 39 39 66 34 32 0d 0a 799f42.. |
Message boards :
Number crunching :
Problems with Rosetta version 5.68
©2024 University of Washington
https://www.bakerlab.org