project is down

Message boards : Number crunching : project is down

To post messages, you must log in.

AuthorMessage
Rob Jacob

Send message
Joined: 11 Aug 07
Posts: 9
Credit: 702,294
RAC: 0
Message 45842 - Posted: 9 Sep 2007, 16:28:53 UTC

Well...I guess progress is being made. The server seems to be up as I can see stats and the message baord again. I was getting a bunch of "can't open log file" errors, but those have gone away.

Now I am getting "project is down" messages:

9/9/2007 12:22:28 PM|rosetta@home|[file_xfer] Started upload of file Ly49A_BOINC_MFR_ABRELAX_PICKED_2065_14466_0_0
9/9/2007 12:22:28 PM|rosetta@home|[file_xfer] Started upload of file Ly49A_BOINC_MFR_ABRELAX_PICKED_2065_25484_0_0
9/9/2007 12:22:29 PM|rosetta@home|Sending scheduler request: To report completed tasks
9/9/2007 12:22:29 PM|rosetta@home|Requesting 30240 seconds of new work, and reporting 2 completed tasks
9/9/2007 12:22:31 PM|rosetta@home|[file_xfer] Finished upload of file Ly49A_BOINC_MFR_ABRELAX_PICKED_2065_25484_0_0
9/9/2007 12:22:31 PM|rosetta@home|[file_xfer] Throughput 18596 bytes/sec
9/9/2007 12:22:32 PM|rosetta@home|[file_xfer] Finished upload of file Ly49A_BOINC_MFR_ABRELAX_PICKED_2065_14466_0_0
9/9/2007 12:22:32 PM|rosetta@home|[file_xfer] Throughput 19676 bytes/sec
9/9/2007 12:22:34 PM|rosetta@home|Scheduler RPC succeeded
9/9/2007 12:22:34 PM|rosetta@home|Message from server: Project encountered internal error: shared memory
9/9/2007 12:22:34 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
9/9/2007 12:22:34 PM|rosetta@home|Reason: project is down
ID: 45842 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 0
Message 45844 - Posted: 9 Sep 2007, 16:32:14 UTC

I amseeing the same problem here trying to report results -- seems like there may be just one more recovery tweak still required.
ID: 45844 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rob Jacob

Send message
Joined: 11 Aug 07
Posts: 9
Credit: 702,294
RAC: 0
Message 45858 - Posted: 9 Sep 2007, 17:34:06 UTC

I was guessing a permissions problem for the log file. Like maybe they didn't open the permissions enough, or create a directory or something for the log file thing. But now it looks like they took the system down to fix it. Hopefully it comes back soon. I want to get cranking out results again!
ID: 45858 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 0
Message 45867 - Posted: 9 Sep 2007, 19:08:24 UTC - in response to Message 45858.  

Could well be -- with Rosetta being offline this week, I simply suspended it and let work reallocate to my other active projects. I've joined a total of 8 projects for my collection of PC's on BOINC over the years. Current status of my eight projects:

Climate BBC -- basically shut down, no new work since early this year.


Predictor -- dormant -- until they generate a batch of new work. I pick up 'returned for reprocessing' work units as they time out and get recycled, but nothing new, nor is there ANY indication of when new work will be available from the project team.


Rosetta -- well we know about this guy.


Einstein -- active, but since the new work units require fairly long cycles with no trickles and since they don't run nearly as efficiently on AMD systems, I've reduced resource share.

That leaves my current (today) most active four

SETI -- the original BOINC project for me -- they have been having problems generating new work over the past week.


Climate -- (not Climate BBC) -- the payoff here is that though their work units are VERY large, they handle trickles so one can get daily process credit.


World Grid -- different interface and structure from the other BOINC projects, but it has been very solid for me.

Spinhenge -- I added this one last month when Predictor went dormant. And I've added it to a number of my existing workstations once I get down to three active projects on a particular workstation. Spinhenge is nice for folks who want short work unit cycles (basically 10 credits each) with pretty quick validation and credit award cycle. The only downside here is that it is in Germany and most of the traffic in their newsgroups is in German.



I was guessing a permissions problem for the log file. Like maybe they didn't open the permissions enough, or create a directory or something for the log file thing. But now it looks like they took the system down to fix it. Hopefully it comes back soon. I want to get cranking out results again!


ID: 45867 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : project is down



©2024 University of Washington
https://www.bakerlab.org