Minirosetta v1.34 bug thread

James Thompson

Joined: 13 Oct 05
Posts: 46
Credit: 186,109
RAC: 0
Message 55711 - Posted: 12 Sep 2008, 5:20:16 UTC

Please post bugs/issues with minirosetta v1.34 here. This has several new scientific updates that David has mentioned in his journal. The basic idea is that we ran new code within the lab during CASP8, and we'd like to take the successful approaches we've found and port them to Rosetta@Home.

See this thread for more information on what we're trying to do. Thanks!

BrnmccO1
Joined: 26 Jun 07
Posts: 17
Credit: 578,825
RAC: 0
Message 55725 - Posted: 12 Sep 2008, 18:14:56 UTC
Last modified: 12 Sep 2008, 18:15:22 UTC

Still getting the "needs psipred_ss2 to run filters" messages just like in 1.32...

191557749

What gives with the filters?

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 55727 - Posted: 12 Sep 2008, 19:13:19 UTC
Last modified: 12 Sep 2008, 19:14:09 UTC

Looking back at the 1.32 results in my list, it seems that the tasks with abinitio_homfrag at the front of the name have the error you are describing. If you look at all your old tasks that have abinitio_homfrag in their title, I bet you will find that error, and I bet that all the 1.34 tasks with this same name will have the same error. It's not fatal, just annoying.

PJCN88
Joined: 11 Aug 07
Posts: 2
Credit: 149,276
RAC: 0
Message 55732 - Posted: 13 Sep 2008, 8:32:56 UTC

Hi,

I'm also having problems with 1.34.
See some results (only 1 success in this list):

Task ID | Work unit ID | Sent | Time reported | Server state | Outcome | Client state | CPU time (sec) | Claimed credit | Granted credit
191764927 | 175174794 | 13 Sep 2008 5:43:05 UTC | 13 Sep 2008 5:59:50 UTC | Over | Client error | Compute error | 507.67 | 3.13 | ---
191763944 | 175172930 | 13 Sep 2008 5:47:17 UTC | 13 Sep 2008 6:06:13 UTC | Over | Client error | Compute error | 374.48 | 2.31 | ---
191763450 | 175172707 | 13 Sep 2008 5:27:15 UTC | 13 Sep 2008 5:47:17 UTC | Over | Client error | Compute error | 259.81 | 1.60 | ---
191762838 | 175176972 | 13 Sep 2008 5:51:28 UTC | 13 Sep 2008 6:10:24 UTC | Over | Client error | Compute error | 189.06 | 1.17 | ---
191761188 | 175174225 | 13 Sep 2008 5:21:53 UTC | 13 Sep 2008 5:38:54 UTC | Over | Client error | Compute error | 10.13 | 0.06 | ---
191760861 | 175173573 | 13 Sep 2008 5:59:50 UTC | 13 Sep 2008 6:18:47 UTC | Over | Client error | Compute error | 544.75 | 3.36 | ---
191758470 | 175170003 | 13 Sep 2008 4:52:28 UTC | 13 Sep 2008 5:38:54 UTC | Over | Client error | Compute error | 446.63 | 2.76 | ---
191753392 | 175167017 | 13 Sep 2008 4:33:49 UTC | 13 Sep 2008 5:27:15 UTC | Over | Client error | Compute error | 259.30 | 1.60 | ---
191751755 | 174404418 | 13 Sep 2008 4:05:40 UTC | 13 Sep 2008 7:56:01 UTC | Over | Success | Done | 10,231.16 | 63.12 | 57.76
191750907 | 175163066 | 13 Sep 2008 4:01:29 UTC | 13 Sep 2008 4:52:28 UTC | Over | Client error | Compute error | 1,112.52 | 6.86 | ---
191749223 | 175164598 | 13 Sep 2008 3:57:17 UTC | 13 Sep 2008 4:33:49 UTC | Over | Client error | Compute error | 1,956.11 | 12.07 | ---
191747517 | 175162998 | 13 Sep 2008 3:52:59 UTC | 13 Sep 2008 4:01:29 UTC | Over | Client error | Compute error | 222.41 | 1.37 | ---
191745720 | 175159528 | 13 Sep 2008 3:48:48 UTC | 13 Sep 2008 3:57:17 UTC | Over | Client error | Compute error | 144.91 | 0.89 | ---
191743572 | 175156087 | 13 Sep 2008 3:25:42 UTC | 13 Sep 2008 3:48:48 UTC | Over | Client error | Compute error | 120.67 | 0.74 | ---
191743456 | 175155908 | 13 Sep 2008 3:16:15 UTC | 13 Sep 2008 3:44:36 UTC | Over | Client error | Compute error | 515.50 | 3.18 | ---
191741034 | 175152527 | 13 Sep 2008 3:12:04 UTC | 13 Sep 2008 3:35:59 UTC | Over | Client error | Compute error | 603.16 | 3.72 | ---
191740767 | 175156907 | 13 Sep 2008 3:35:59 UTC | 13 Sep 2008 3:48:48 UTC | Over | Client error | Compute error | 52.25 | 0.32 | ---
191740243 | 175155953 | 13 Sep 2008 3:07:52 UTC | 13 Sep 2008 3:25:42 UTC | Over | Client error | Compute error | 718.48 | 4.43 | ---
191738823 | 175153189 | 13 Sep 2008 3:44:36 UTC | 13 Sep 2008 3:57:17 UTC | Over | Client error | Compute error | 430.50 | 2.66 | ---
191738401 | 175152551 | 13 Sep 2008 3:03:42 UTC | 13 Sep 2008 3:16:15 UTC | Over | Client error | Compute error | 260.44 | 1.61 | ---

running :
12/09/08 23:53:58||Starting BOINC client version 6.2.18 for windows_x86_64
12/09/08 23:53:58||log flags: task, file_xfer, sched_ops
12/09/08 23:53:58||Libraries: libcurl/7.18.0 OpenSSL/0.9.8g zlib/1.2.3
12/09/08 23:53:58||Running as a daemon
12/09/08 23:53:58||Data directory: D:boincdata
12/09/08 23:53:58||Running under account boinc_master
12/09/08 23:53:58||Processor: 2 GenuineIntel Intel(R) Core(TM)2 Duo CPU E8200 @ 2.66GHz [Intel64 Family 6 Model 23 Stepping 6]
12/09/08 23:53:58||Processor features: fpu tsc pae nx sse sse2 pni
12/09/08 23:53:58||OS: Microsoft Windows Vista: Ultimate x64 Editon, Service Pack 1, (06.00.6001.00)
12/09/08 23:53:58||Memory: 4.00 GB physical, 8.17 GB virtual
12/09/08 23:53:58||Disk: 368.10 GB total, 351.77 GB free
12/09/08 23:53:58||Local time is UTC +2 hours
12/09/08 23:53:58||No coprocessors

Patrick

Path7
Joined: 25 Aug 07
Posts: 128
Credit: 61,751
RAC: 0
Message 55733 - Posted: 13 Sep 2008, 9:11:34 UTC - in response to Message 55727.  
Last modified: 13 Sep 2008, 9:36:42 UTC

Looking back at the 1.32 results in my list, it seems that the tasks with abinitio_homfrag at the front of the name have the error you are describing. If you look at all your old tasks that have abinitio_homfrag in their title, I bet you will find that error, and I bet that all the 1.34 tasks with this same name will have the same error. It's not fatal, just annoying.

Hello all,

I haven't had any errors running the 1.32 abinitio_homfrag WUs so far.

I received my first Minirosetta 1.34 (an abinitio_homfrag), which gave me my first -1073741819 (0xc0000005) - Unhandled Exception Detected... since I started crunching R@H on August 25, 2007.

abinitio_nohomfrag_70_A_1qgvA_4466_2075_0
Windows XP Home SP3, BOINC 5.10.45.

Edit: My second Minirosetta 1.34 (abinitio_nohomfrag_70_A_1wouA_4466_2622_0) ran fine.

Have a nice day,
Path7.

Sid Celery
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 55745 - Posted: 14 Sep 2008, 2:13:01 UTC

Task ID 191517268 Exit status -226 (0xffffff1e)
Task ID 191592185 Exit status -226 (0xffffff1e)
Task ID 191631273 Exit status -226 (0xffffff1e)
Task ID 191648953 Exit status -226 (0xffffff1e)
Task ID 191665453 Exit status -226 (0xffffff1e)
Task ID 191712784 Exit status -226 (0xffffff1e)
Task ID 191761187 Exit status -226 (0xffffff1e)
Task ID 191802254 Exit status -226 (0xffffff1e)
Task ID 191823750 Exit status -226 (0xffffff1e)
Task ID 191841097 Exit status -226 (0xffffff1e)
Task ID 191884904 Exit status -226 (0xffffff1e)

All the above produce the error:
too many exit(0)s
[...]
Can't acquire lockfile - exiting
[...many repeats]
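
The exit statuses in this thread are shown both as signed decimals and as hex, for example -226 (0xffffff1e) and -1073741819 (0xc0000005). The two forms are the same 32-bit value read as signed or unsigned. A minimal Python sketch of that conversion follows; the helper names `to_unsigned32` and `to_signed32` are made up for illustration and are not part of BOINC:

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed 32-bit exit code."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value is negative in two's complement.
    return value - 0x100000000 if value & 0x80000000 else value


# Exit statuses reported in this thread.
for code in (-226, -1073741819):
    print(f"{code} -> 0x{to_unsigned32(code):08x}")
```

This only shows that the decimal and hex forms agree; it says nothing about what caused the exit.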


I also had 19 successful runs with Mini 1.34 though - see all my results - 61% success, 39% failures. A little better than with 1.32

One odd error in there too - I've not seen it before.

Task ID 191862476 Exit status 1 (0x1)
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>

ERROR: unrecognized aa HOH
ERROR:: Exit from: ..\..\src\core\io\pdb\file_data.cc line: 468
called boinc_finish

</stderr_txt>
]]>


P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 55748 - Posted: 14 Sep 2008, 6:28:09 UTC

This is my first error with my new Ubuntu rig.

It ran for just over 14min then this.

Sun 14 Sep 2008 15:45:43 EST|rosetta@home|Output file abinitio_nohomfrag_70_A_1qgvA_4466_5826_0_0 for task abinitio_nohomfrag_70_A_1qgvA_4466_5826_0 absent

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=175322746

pete.

BrnmccO1
Joined: 26 Jun 07
Posts: 17
Credit: 578,825
RAC: 0
Message 55753 - Posted: 14 Sep 2008, 16:12:56 UTC
Last modified: 14 Sep 2008, 16:16:12 UTC

First unhandled exception error, access violation again:

192043183

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x00000008

Got one of these too, a day ago or so: WU

ERROR: unrecognized aa HOH
ERROR:: Exit from: ..\..\src\core\io\pdb\file_data.cc line: 468
called boinc_finish

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 55779 - Posted: 15 Sep 2008, 17:34:31 UTC

abinitio_nohomfrag_70_A_1a8oA_4466_250_1
ERROR: unrecognized aa HOH
ERROR:: Exit from: ..\..\src\core\io\pdb\file_data.cc line: 468
called boinc_finish
failed at 1.39 secs

abinitio_nohomfrag_70_A_1qgvA_4466_3796 failed on my system and another one - Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x00000008

died at 574 secs

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 55799 - Posted: 16 Sep 2008, 6:16:42 UTC
Last modified: 16 Sep 2008, 6:19:58 UTC

abinitio_nohomfrag_70_A_1qgvA_4466_3796_1

This ran 578 secs and hit the same access violation as below.

abinitio_nohomfrag_70_A_1qgvA_4466_8165_0
Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x00000008


ran 341 secs and died

AMD_is_logical
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 55824 - Posted: 17 Sep 2008, 3:13:04 UTC


Mike Francis
Joined: 24 Nov 05
Posts: 8
Credit: 623,519
RAC: 0
Message 55827 - Posted: 17 Sep 2008, 7:18:51 UTC

9/17/2008 2:14:07 AM|rosetta@home|Computation for task abinitio_nohomfrag_70_A_1a8oA_4466_10334_0 finished
9/17/2008 2:14:07 AM|rosetta@home|Output file abinitio_nohomfrag_70_A_1a8oA_4466_10334_0_0 for task abinitio_nohomfrag_70_A_1a8oA_4466_10334_0 absent


Sid Celery
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 55836 - Posted: 17 Sep 2008, 15:07:42 UTC - in response to Message 55745.  
Last modified: 17 Sep 2008, 15:15:15 UTC

All the above produce the error:
too many exit(0)s
[...]
Can't acquire lockfile - exiting
[...many repeats]


I also had 19 successful runs with Mini 1.34 though - see all my results - 61% success, 39% failures. A little better than with 1.32

Update on this. Of next 44 WUs:

19 success (43%), 25 fail (57%) (22 can't acquire lockfile, 3 Incorrect function. (0x1) - exit code 1 (0x1) ERROR: unrecognized aa HOH, ERROR:: Exit from: ..\..\src\core\io\pdb\file_data.cc line: 468)


As attempted before with 1.32, I reduced runtime from 3 hours to 2 hours in rosetta preferences with very good results.

Of the next 33 WUs, 24 success (73%), 9 failures (27%): 5 lockfile errors, 3 unrecognised aa HOH, and my first Unhandled Exception error (as reported by greg_be, Peter Leman and others above).
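
The success and failure percentages quoted in this post can be rechecked from the raw counts. A small sketch, assuming whole-number rounding; the failure count of 12 for the first report is inferred from the 11 lockfile tasks plus the one exit-status-1 task listed earlier, and the batch labels are descriptive only:

```python
def rates(successes, failures):
    """Return (success %, failure %) rounded to whole percentages."""
    total = successes + failures
    success_pct = round(100 * successes / total)
    return success_pct, 100 - success_pct


# (successes, failures) per batch, taken from the counts in this thread.
batches = {
    "first report (failure count inferred)": (19, 12),
    "next 44 WUs, before reducing runtime": (19, 25),
    "next 33 WUs, 2-hour runtime": (24, 9),
}

for label, (ok, bad) in batches.items():
    success_pct, failure_pct = rates(ok, bad)
    print(f"{label}: {success_pct}% success, {failure_pct}% failure")
```

The batches are small, so the difference between them may partly be chance rather than an effect of the shorter runtime.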

Task 192495818
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# cpu_run_time_pref: 7200


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x74767A39

Engaging BOINC Windows Runtime Debugger...


This was under the Vista64 version of BOINC Manager, which might explain why the read address in the Access Violation differs from the one in greg_be's error.
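
The access violation reports in this thread share the faulting instruction address 0x007D3863, while the address being read differs (0x00000008 in the other reports, 0x74767A39 here). A small sketch for grouping pasted stderr lines by instruction address, assuming the "Reason: Access Violation ..." line format quoted in this thread; the `write` alternative in the pattern is an assumption, since only `read` appears here, and `parse_violation` is a hypothetical helper name:

```python
import re
from collections import defaultdict

# Matches the "Reason:" line as quoted in this thread.
PATTERN = re.compile(
    r"Access Violation \((0x[0-9a-fA-F]+)\) at address (0x[0-9a-fA-F]+) "
    r"(read|write) attempt to address (0x[0-9a-fA-F]+)"
)


def parse_violation(line):
    """Extract the exception code, instruction address, access type and target address."""
    match = PATTERN.search(line)
    if match is None:
        return None
    code, instruction, access, target = match.groups()
    return {
        "code": int(code, 16),
        "instruction": int(instruction, 16),
        "access": access,
        "target": int(target, 16),
    }


reports = [
    "Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x00000008",
    "Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x74767A39",
]

# Group the target addresses seen for each faulting instruction address.
by_instruction = defaultdict(set)
for line in reports:
    info = parse_violation(line)
    if info is not None:
        by_instruction[info["instruction"]].add(info["target"])

for instruction, targets in by_instruction.items():
    print(hex(instruction), sorted(hex(t) for t in targets))
```

A read from a very low address such as 0x00000008 is what a null pointer dereference with a small field offset typically looks like, though the stderr output alone cannot confirm that.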

Saharak
Joined: 28 Apr 07
Posts: 7
Credit: 1,170,212
RAC: 0
Message 55837 - Posted: 17 Sep 2008, 17:04:35 UTC
Last modified: 17 Sep 2008, 17:08:26 UTC


Terrasapiens
Joined: 25 Apr 08
Posts: 15
Credit: 368,919
RAC: 0
Message 55851 - Posted: 18 Sep 2008, 4:24:02 UTC

I've had nothing but failures on the v1.34 WUs, just like I did with the v1.32 ones. There are only two projects running, RAH and Seti and RAH is set to run 2/3 of the time (the PC has been on almost 24/7). Yet since August 29th I've only received about 200 credits. I think all of those are from rosetta beta. There are too many failed WUs to list so here's the link to all my tasks:
https://boinc.bakerlab.org/rosetta/results.php?userid=254884

I have not had time to try the code that is supposed to block the mini WUs but I'm wondering if anyone has tried it and got it to work. I'd like to be processing RAH but the BOINC client keeps feeding me the mini WUs my machine doesn't like! Any suggestions/work-arounds???

The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 55853 - Posted: 18 Sep 2008, 4:57:05 UTC

Failed on 2 computers:

abinitio_nohomfrag_70_A_1qgvA_4466_4760_1

CPU time 2604.702
stderr out <core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>

</stderr_txt>
]]>


Validate state Invalid
Claimed credit 6.57750143226297
Granted credit 0
application version 1.34

The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 55854 - Posted: 18 Sep 2008, 5:00:33 UTC
Last modified: 18 Sep 2008, 5:00:59 UTC

Failed on 2 computers:

abinitio_nohomfrag_70_A_1qgvA_4466_3410_1

<core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x00000008

Engaging BOINC Windows Runtime Debugger...

The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 55855 - Posted: 18 Sep 2008, 5:02:44 UTC

Failed on 2 computers:

abinitio_nohomfrag_70_A_1a8oA_4466_1342

stderr out <core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>

ERROR: unrecognized aa HOH
ERROR:: Exit from: ..\..\src\core\io\pdb\file_data.cc line: 468
called boinc_finish

</stderr_txt>
]]>


The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 55856 - Posted: 18 Sep 2008, 5:06:31 UTC

Failed on 2 computers:

abinitio_nohomfrag_70_A_1a8oA_4466_3733_1

stderr out <core_client_version>5.10.13</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>

ERROR: unrecognized aa HOH
ERROR:: Exit from: ..\..\src\core\io\pdb\file_data.cc line: 468
called boinc_finish

</stderr_txt>
]]>


Odd Braathun
Joined: 2 Sep 08
Posts: 9
Credit: 16,125
RAC: 0
Message 55865 - Posted: 18 Sep 2008, 17:20:01 UTC

Hi. I am new to this, but today I had my second computation error with mini 1.34. abinitio_nohomfrag_70_A_1qgvA_4466_19414_1 stopped after 39:32. It also blocked my 'puter from computing other tasks for 6 hours. I have 1 finished task ready to report, but I am not receiving any new jobs. My first error was task ID 191724559, work unit 175139940. In both cases mini 1.34 asked for permission to access the internet, so I assumed that the error was reported back to the project. Last time I reset the project, but now I am not sure what to do.

Odd
