Project encountered internal error: shared memory

Profile Vincent JG

Send message
Joined: 17 Aug 07
Posts: 4
Credit: 2,555,848
RAC: 0
Message 49156 - Posted: 28 Nov 2007, 22:18:07 UTC

Started getting this error message a couple of hours ago:

11/28/2007 4:34:35 PM|rosetta@home|Message from server: Project encountered internal error: shared memory

WUs will upload, but they can't be reported. Also, I'm not getting any new WUs. Might this be from the minor update they did today for Linux?
ID: 49156 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
EvieSpain

Send message
Joined: 16 May 07
Posts: 1
Credit: 9,136
RAC: 0
Message 49158 - Posted: 28 Nov 2007, 23:01:20 UTC

28/11/2007 23:50:58|rosetta@home|Message from server: Project encountered internal error: shared memory

I know it has happened before to others, but is there something I am doing wrong, or is it the site?
Evie
ID: 49158 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 49160 - Posted: 28 Nov 2007, 23:08:56 UTC

The Rosetta servers are having a problem. Nothing on your end to change.
Rosetta Moderator: Mod.Sense
ID: 49160 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Path7

Send message
Joined: 25 Aug 07
Posts: 128
Credit: 61,751
RAC: 0
Message 49161 - Posted: 28 Nov 2007, 23:22:18 UTC - in response to Message 49160.  

The Rosetta servers are having a problem. Nothing on your end to change.

Thanks for your reaction Mod.Sense.
Hopefully the problems will be solved soon. :o)

Have a nice day,
Path7.
ID: 49161 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bill Hounslow

Send message
Joined: 25 Sep 05
Posts: 1
Credit: 50,090
RAC: 0
Message 49162 - Posted: 28 Nov 2007, 23:29:15 UTC

Getting this following manual update. All seems to be running normally, but is it significant?

ID: 49162 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Steve Dodd

Send message
Joined: 13 Dec 05
Posts: 7
Credit: 3,779,087
RAC: 562
Message 49163 - Posted: 28 Nov 2007, 23:30:30 UTC

There must be an epidemic of these 'shared memory' problems. MilkyWay@home had a similar problem that's still causing issues with new WUs and binaries of the new version.
ID: 49163 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
galea5

Send message
Joined: 27 Apr 07
Posts: 1
Credit: 18,333
RAC: 0
Message 49166 - Posted: 29 Nov 2007, 0:02:28 UTC

29/11/2007 10:57:46 AM|rosetta@home|Sending scheduler request: Requested by user
29/11/2007 10:57:46 AM|rosetta@home|Requesting 1613875 seconds of new work
29/11/2007 10:57:51 AM|rosetta@home|Scheduler RPC succeeded
29/11/2007 10:57:51 AM|rosetta@home|Message from server: Project encountered internal error: shared memory
29/11/2007 10:57:51 AM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
29/11/2007 10:57:51 AM|rosetta@home|Reason: project is down
ID: 49166 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
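For anyone who wants to check their own message log for this condition, here is a minimal sketch. It assumes the log lines have the pipe-separated `timestamp|project|message` layout seen in the excerpts posted in this thread; the function names `parse_line` and `server_errors` are made up for illustration and are not part of BOINC.

```python
def parse_line(line):
    """Split one log line into (timestamp, project, message).

    Assumes the 'timestamp|project|message' layout shown in this thread.
    Returns None for lines that do not match that layout.
    """
    parts = line.strip().split("|", 2)
    if len(parts) != 3:
        return None
    return tuple(parts)


def server_errors(lines, needle="internal error"):
    """Collect 'Message from server:' entries that mention `needle`."""
    hits = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        timestamp, project, message = parsed
        if message.startswith("Message from server:") and needle in message:
            hits.append((timestamp, project, message))
    return hits


# Sample lines copied from the log excerpt above.
sample = [
    "29/11/2007 10:57:46 AM|rosetta@home|Sending scheduler request: Requested by user",
    "29/11/2007 10:57:51 AM|rosetta@home|Scheduler RPC succeeded",
    "29/11/2007 10:57:51 AM|rosetta@home|Message from server: Project encountered internal error: shared memory",
    "29/11/2007 10:57:51 AM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec",
]

hits = server_errors(sample)
for timestamp, project, message in hits:
    print(f"{timestamp} [{project}] {message}")
```

Note that "Scheduler RPC succeeded" only means the request reached the server; the error arrives in the server's reply, which is why the client then defers communication.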
Nathaniel

Send message
Joined: 5 Apr 06
Posts: 2
Credit: 542,398
RAC: 0
Message 49168 - Posted: 29 Nov 2007, 0:55:44 UTC

So far I've run this client on two desktops. When I run it on my laptop, which has a dedicated graphics card with shared graphics memory, I get an internal server error from the client. I believe it's due to the shared memory, but I am unsure about this.
ID: 49168 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Leonzio
Avatar

Send message
Joined: 19 Nov 07
Posts: 8
Credit: 2,731
RAC: 0
Message 49169 - Posted: 29 Nov 2007, 1:35:38 UTC

I have a 64 bit Linux system; I write from Italy.
thu 29 nov 2007 00:52:48 CET|rosetta@home|Computation for task gb3_DC_BOINC_MFR_ABRELAX_PICKED_2342_18844_0 finished
thu 29 nov 2007 00:52:48 CET|rosetta@home|Output file gb3_DC_BOINC_MFR_ABRELAX_PICKED_2342_18844_0_0 for task gb3_DC_BOINC_MFR_ABRELAX_PICKED_2342_18844_0 absent
thu 29 nov 2007 01:21:30 CET|rosetta@home|Sending scheduler request: Requested by user
thu 29 nov 2007 01:21:30 CET|rosetta@home|Reporting 1 tasks
thu 29 nov 2007 01:21:35 CET|rosetta@home|Scheduler RPC succeeded

Afterwards:
thu 29 nov 2007 01:21:30 CET|rosetta@home|Sending scheduler request: Requested by user
thu 29 nov 2007 01:21:30 CET|rosetta@home|Reporting 1 tasks
thu 29 nov 2007 01:21:35 CET|rosetta@home|Scheduler RPC succeeded
thu 29 nov 2007 01:21:35 CET|rosetta@home|Message from server: Project encountered internal error: shared memory
thu 29 nov 2007 01:21:35 CET|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
thu 29 nov 2007 01:21:35 CET|rosetta@home|Reason: project is down


This has been going on for several days (this is "Rosetta Beta 5.85", but it was the same with "Rosetta 5.82").
I'll compute one more Rosetta work unit, but I think (I fear, though I hope not) that it will be the last...
:-(
ID: 49169 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 49170 - Posted: 29 Nov 2007, 1:59:34 UTC

I've been getting this all morning too. Still the same: the feeder is down.

11/29/2007 12:55:50 PM|rosetta@home|Message from server: Project encountered internal error: shared memory

Pete.


ID: 49170 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 0
Message 49171 - Posted: 29 Nov 2007, 2:48:36 UTC
Last modified: 29 Nov 2007, 3:30:03 UTC

This problem appears to require some extra onsite troubleshooting. I am hoping the admin folks are actually aware of it. I have not seen any note of it on the home page, and the 'time to repair' clock doesn't really start until the folks who need to take care of it are aware of the problem. (Correction: tracing back through this thread, I see one of the project admins did comment, so the project folks are aware there is a problem.)

Rosetta has had a long run of stable operation and solid 'on time' response to issues. That seems to have fallen off a bit of late (say, over the past month or so) for some reason. One example is that the technical notes haven't had any updates in over two months, and there have been some technically noteworthy events in the intervening time.

I've joined 8 projects over the years. Two are no longer running (BBC Climate stopped issuing new work this past spring, and Predictor has been essentially defunct since August). Of the remaining 6, Rosetta has been one of my best projects over the long haul. With the more recent issues, I've gone to my standard approach for projects: after an unannounced disconnect of several hours, I suspend the project on my various workstations. That shifts processing to 'live' projects and reduces the 'surge' effect of backlogged results being uploaded when a project wakes back up.

The other aspect of this for me is that, depending on how the project handles a problem (do they update their home page to alert folks, does an admin jump into ongoing problem-reporting threads, is there any indication of the length of the outage), I either retain or reduce the resource share. My thinking there is that I typically don't tweak resource shares for my farm, but I don't like major backlogs of results either.

Since this problem showed up in the morning and it is now night time, I figure we'll not see it resolved until sometime tomorrow, and perhaps later than that, so this outage fits in the 'reduce share' category. That isn't that big a deal. I typically have three or more projects on each workstation, so it just means that Rosetta's CPU share loss becomes a SETI, Climate, Einstein, Spinhenge or World Grid gain.

That being said, I would like to see some form of information about the problems, their resolution, and perhaps a follow up regarding information updating for the hoi polloi.




ID: 49171 · Rating: 1 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Michael T. Allison Retired Federal Goverment
Avatar

Send message
Joined: 26 Nov 07
Posts: 10
Credit: 11,591
RAC: 0
Message 49173 - Posted: 29 Nov 2007, 3:02:23 UTC

"Message from server: Project encountered internal error: shared memory". I get this message when trying to return finished tasks. What should I do? Is this a problem with my computer? (((HELP)))

ID: 49173 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Nathaniel

Send message
Joined: 5 Apr 06
Posts: 2
Credit: 542,398
RAC: 0
Message 49175 - Posted: 29 Nov 2007, 3:31:13 UTC

I just tried putting the client on my Turion X2 TL-50 with 4 GB of DDR2-667 and a GeForce 7600 Go card that pulls almost 1.25 GB of shared memory, and I'm getting the same error. From what I'm reading, though, it's server side, so I'll just have to wait until they fix the problem, I suppose. Or is there something I can do to alleviate this?
ID: 49175 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Beringse
Avatar

Send message
Joined: 10 Oct 06
Posts: 20
Credit: 401,284
RAC: 0
Message 49178 - Posted: 29 Nov 2007, 6:21:43 UTC

It's been going on since mid-morning here, same message. I'll let it run on this laptop and check the farm out when I get home tomorrow. Hopefully it'll be back up and running by then.
ID: 49178 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Beringse
Avatar

Send message
Joined: 10 Oct 06
Posts: 20
Credit: 401,284
RAC: 0
Message 49200 - Posted: 29 Nov 2007, 12:58:33 UTC

Still down. Latest message follows...

11/29/2007 4:52:41 AM|rosetta@home|Fetching scheduler list
11/29/2007 4:52:46 AM|rosetta@home|Master file download succeeded
11/29/2007 4:52:51 AM|rosetta@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 0 completed tasks
11/29/2007 4:52:56 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
11/29/2007 4:52:56 AM|rosetta@home|Message from server: Project encountered internal error: shared memory

ID: 49200 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 49204 - Posted: 29 Nov 2007, 13:28:28 UTC - in response to Message 49200.  

The feeder server is offline.
See the server status link in the top right of the home page.


ID: 49204 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 0
Message 49206 - Posted: 29 Nov 2007, 14:12:28 UTC - in response to Message 49204.  

Right, looks like someone needs to come into the lab and perhaps reboot that server. I've now got two projects offline in my collection -- Spinhenge went dead about 12 hours ago (they have zero connectivity - no home page, nothing). Rosetta and Spinhenge were just about my top two projects. I've suspended both, and CPU cycles are getting diverted to Climate, Einstein, SETI and World Grid.





ID: 49206 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Brent

Send message
Joined: 13 Nov 07
Posts: 1
Credit: 605,061
RAC: 0
Message 49207 - Posted: 29 Nov 2007, 14:13:09 UTC

ID: 49207 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile TheRiceKing

Send message
Joined: 23 Apr 07
Posts: 2
Credit: 619,489
RAC: 0
Message 49210 - Posted: 29 Nov 2007, 14:38:39 UTC

I'm having the same problem as you guys, but how do I set up Rosetta@home so it can keep me supplied with work in the meantime?
ID: 49210 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 49213 - Posted: 29 Nov 2007, 14:43:03 UTC - in response to Message 49210.  

I'm having the same problem as you guys, but how do I set up Rosetta@home so it can keep me supplied with work in the meantime?


Finally a question I can help with.

BOINC's General Preferences allow you to define how much work you would like to keep on your computer. If you had had a few days of work on hand at the time of this failure, you would still be crunching happily today.

Click the "Participants" link at the top of this message board and go into your General Preferences.

You can also change the setting for one specific machine by opening the BOINC Manager, going to preferences and clicking the Network Usage tab.
Rosetta Moderator: Mod.Sense
ID: 49213 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
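To put rough numbers on the advice above, here is a back-of-the-envelope sketch. It assumes the work buffer was full when the outage began and that the host works through it at the estimated rate. The helper name `survives_outage` is hypothetical, and the outage window is taken only from post timestamps in this thread (first report to the latest "still down" post), so the real outage may be longer.

```python
from datetime import datetime

# Post timestamps from this thread (UTC): first report and a later
# post confirming the project was still down.
first_report = datetime(2007, 11, 28, 22, 18, 7)
still_down = datetime(2007, 11, 29, 14, 12, 28)

# Length of the outage so far, in hours.
outage_hours = (still_down - first_report).total_seconds() / 3600


def survives_outage(buffer_days, outage_hours):
    """Return True if a full buffer of `buffer_days` outlasts the outage.

    Simplifying assumption: one day of buffered work keeps the host
    busy for 24 hours.
    """
    return buffer_days * 24.0 >= outage_hours


print(f"Outage so far: {outage_hours:.1f} hours")
for days in (0.25, 1.0, 3.0):
    status = "still crunching" if survives_outage(days, outage_hours) else "out of work"
    print(f"{days} day buffer: {status}")
```

Even with a large buffer, finished results cannot be reported until the server is back; the buffer only keeps the CPU busy in the meantime.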

©2024 University of Washington
https://www.bakerlab.org