WUs stuck at "Uploading"

Message boards : Number crunching : WUs stuck at "Uploading"

To post messages, you must log in.

AuthorMessage
Profile Cobra

Send message
Joined: 9 Nov 05
Posts: 7
Credit: 16,588,989
RAC: 2,264
Message 57394 - Posted: 1 Dec 2008, 13:30:32 UTC

I recognize that there was a Rosetta fileserver crash 11/30. However, both the home page and the technical news page seem to imply that the fileservers are back online, which makes me think that client/server communication should have been restored.

However, my machines all have a number of WUs stuck at the "Uploading" phase, and there are a number of "temporarily failed upload" lines under the BOINC Mgr Messages tab.

It's not clear to me from what's on the home page and tech news page whether I should expect client/server comm to be back to normal now, or if I should expect a few more days of difficulty before things get back on track.

Should I hang tight, or do I need to reset my project to clear out the stuck WUs?

(An extra sentence on the tech news page saying something like, "Clients may experience delays communicating with servers for the next few days, but backlogged WUs should eventually get delivered," or "Clients still showing WUs in 'Uploaded' state should be reset," would be greatly appreciated.)
ID: 57394 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1232
Credit: 14,281,662
RAC: 1,150
Message 57395 - Posted: 1 Dec 2008, 14:03:29 UTC - in response to Message 57394.  

I recognize that there was a Rosetta fileserver crash 11/30. However, both the home page and the technical news page seem to imply that the fileservers are back online, which makes me think that client/server communication should have been restored.

However, my machines all have a number of WUs stuck at the "Uploading" phase, and there are a number of "temporarily failed upload" lines under the BOINC Mgr Messages tab.

It's not clear to me from what's on the home page and tech news page whether I should expect client/server comm to be back to normal now, or if I should expect a few more days of difficulty before things get back on track.

Should I hang tight, or do I need to reset my project to clear out the stuck WUs?

(An extra sentence on the tech news page saying something like, "Clients may experience delays communicating with servers for the next few days, but backlogged WUs should eventually get delivered," or "Clients still showing WUs in 'Uploaded' state should be reset," would be greatly appreciated.)


I think something like that has already been added. The new fileserver is online, but not all the files have been transferred from the old one; that is likely to take a few more days. I doubt if all the servers will work normally before then.

If you reset before it allows you to upload the previous results, you'll lose those results.

Checking for lockfiles now seems worthwhile, though.

ID: 57395 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 57396 - Posted: 1 Dec 2008, 14:18:42 UTC - in response to Message 57395.  

I recognize that there was a Rosetta fileserver crash 11/30. However, both the home page and the technical news page seem to imply that the fileservers are back online, which makes me think that client/server communication should have been restored.

However, my machines all have a number of WUs stuck at the "Uploading" phase, and there are a number of "temporarily failed upload" lines under the BOINC Mgr Messages tab.

It's not clear to me from what's on the home page and tech news page whether I should expect client/server comm to be back to normal now, or if I should expect a few more days of difficulty before things get back on track.

Should I hang tight, or do I need to reset my project to clear out the stuck WUs?

(An extra sentence on the tech news page saying something like, "Clients may experience delays communicating with servers for the next few days, but backlogged WUs should eventually get delivered," or "Clients still showing WUs in 'Uploaded' state should be reset," would be greatly appreciated.)


I think something like that has already been added. The new fileserver is online, but not all the files have been transferred from the old one; that is likely to take a few more days. I doubt if all the servers will work normally before then.

If you reset before it allows you to upload the previous results, you'll lose those results.

Checking for lockfiles now seems worthwhile, though.



you should be able to deal with lockfiles without losing any data. the upload issue does not seem to be resolved yet. it has been a problem since i tried uploading at 10am european time. see other thread about unable to upload.
ID: 57396 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 57403 - Posted: 1 Dec 2008, 16:08:21 UTC

The upload server is trying to catch up. The data should eventually get uploaded.
ID: 57403 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : WUs stuck at "Uploading"



©2024 University of Washington
https://www.bakerlab.org