ENTRIES
Welcome to Eric Cheng's online journal! You are not logged in. [ Log in ]
«  :: index ::  »

RAID 5 drive failure

:: Thursday, May 25th, 2006 @ 11:18:49 am

:: Tags:

Maxtor Drive

Shit happens, and hard drives fail. I woke up this morning to a “corrupted directory” notice and beeping noises coming from the server area under my stage. As a reminder, my main server holds an 8-disk RAID 5 configuration + a system drive, and has been running smoothly for quite awhile now.

I’m in the process of rebuilding the array using the original drive (it is still spinning and accepts a rebuild option), but I’ve also ordered a replacement drive so I can shove the thing in if the RAID fails again. The rebuild will take 30 hours, so I have some time to wait for the replacement to arrive. :) The next time I build a RAID, I’m going to have a dedicated spare in the mix so I don’t have to stress out about sourcing a matching drive.

The failure orphaned hundreds of files, which were recovered and placed in a useless directory structure by CHKDSK. Chris Emura has always warned me about data loss in RAID 5 failure and luckily, I am always backed up (onto a ReadyNas 600 system). When the RAID finishes its rebuild, I’ll do a restore from the backups.

I’m on track for my failure rate of 1-2 drives a year (out of the 20 or so that I use).

Popularity: 9% | Oakland, CA | link | trackback | qrcode | May 25, 2006 11:18:49

:: View Comments (rss)

  1. posted by Victor A. on Thu, May 25, 2006 @ 11:54 am

    Eric – anymore drive failures and I’ll be suggesting one of these:

    http://www.apple.com/xserve/raid/

    since is it certifiied to work with Microsoft Windows 2003 (and 2000) :-)

    http://www.apple.com/xserve/ra.....tions.html

    Per GB, it is pretty cheap and you can keep a hot spare in the 7th bay on each side of the RAID.

  2. posted by echeng on Thu, May 25, 2006 @ 12:33 pm

    Victor – I’m not sure how an xserve would prevent drive failure! :) they sure are sexy, though.

  3. posted by Chester on Thu, May 25, 2006 @ 12:37 pm

    Yeah, but consider that Apple might end up introducing a widget that will attach to your iPod and which will wirelessly alert you when a drive goes down.

  4. posted by echeng on Thu, May 25, 2006 @ 12:49 pm

    You make good points, Chester. :)

  5. posted by Adam Steffes on Thu, May 25, 2006 @ 1:37 pm

    You may want to investigate a RAID controller which supports dual-parity ADG RAID-6, such as the HP Smart Array P600. :)

    http://h18004.www1.hp.com/prod.....index.html

  6. posted by echeng on Thu, May 25, 2006 @ 1:46 pm

    Adam – how am I going to get 2TB of online storage via SCSI? :) I think I’m going to have to stick with SATA. :) Performance isn’t important for me, since there are usually at most 2-3 machines accessing the data at once. It’s all over gigabit, so I’m capped at 26-28MB/s of transfer, anyway.

  7. posted by Victor A. on Thu, May 25, 2006 @ 2:31 pm

    Eric,

    Re: your comment to me. Sadly nothing can prevent drive failure (not even brushing your teeth 3 times a day, saying your prayers and eating your broccoli). What an enterprise-clase storage device would do would reduce the impact that drive failure has (RAID 50, hot spare, battery backup, etc..)

    With you being gone on multi-day/week trips, a system with 2+ hot spares that auto-rebuild the raid might mean you could just stop home to do laundry, swap dead drives, have a meal with Vienna and head back to the AirPort. :-)

    After seeing your setup, it almost seems that if there was a next (reinforcement) step up to be taken, a Xserve RAID (or similar) might be it. Plus, dang it, Apple’s RAID works for windows – it is a rare time I get to recommend hardware from that little fruit company that menas not buying a Mac.

    hehehe.

  8. posted by echeng on Thu, May 25, 2006 @ 3:47 pm

    Yeah, I understand your point, Victor. My system already does auto-rebuilds, but I need a swap drive, which I am not doing at the moment because I need all the space I have!

    During the next big upgrade (after I pass 2TB of data), I’ll be sure to configure a 7-drive RAID 5 + 1 drive swap.

  9. posted by Curtis Leo on Fri, May 26, 2006 @ 5:36 am

    Sorry to hear about bad drives. Since you’re never home, you might try a raid 5 + 2 online spares.

    Another method that I’ve used in building arrays, 10 Drive array, 2 x Raid 5 + 1 (4 drives + 1 online spare). You’ll have a “faster rebuild” time, thus lowering your unprotected state. Each in a protected group.

    When you do buy another set of drives, buy 2 extra shelf spares. At work, we run through drives like crazy and found a bunch of drives with different servo codes, firmware codes, sector size differences. The sector size difference is nasty during a rebuild process.

    Also unstead of having all the drives in the same case as your motherboard, go external enclosure. I think it’s time for a 42U rackmount frame! When drives are all spinning in a array, there is a vibration induced by drives. As you add more drives to the mix, each one is vibrating at a different frequency. Basically all the drives can vibrate each other to death.

    I just qualified the Hitachi Deskstar T7K500 500gb SATA 3.0gb/s drives in our arrays. :) 500gb x 10! :)

    I’m guessing that you already have UPSs but I have a bunch of APC SU700 UPS that I don’t need anymore. Think I have 6 more of them. Want them? They just need new batteries.

  10. posted by echeng on Fri, May 26, 2006 @ 10:25 am

    Thanks, Curtis. You’re the master of old gear. :) I have a big UPS under there and have been wanting to get another, since I have two devices that can shut themselves off via UPS control signal. Problem is that one of them requires USB, so a SU700 won’t work.

    I wonder if the serial port in my main server is working… ;)

  11. posted by Adam Steffes on Fri, May 26, 2006 @ 1:41 pm

    Eric, the same controller exists for SATA and SAS disks. I suppose you would have to invest in a SAS chassis that accepts SATA disks, though. Then buy a couple of these bad boys:

    http://www.seagate.com/cda/pro.....43,00.html

    Check out the P400 if you’re remotely interested in the RAID6 stuff: http://h18004.www1.hp.com/prod.....index.html

  12. posted by Adam Steffes on Fri, May 26, 2006 @ 1:43 pm

    Can I have a UPS, Curtis? :D

  13. posted by alexking.org: Blog > Around the web on Sun, May 28, 2006 @ 7:12 am

    [...] [ECHENG.COM] – RAID 5 drive failure – never let Eric anywhere near a HD you care about. [...]

:: leave a reply

Use: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

RECENT TWITTER ACTIVITY
ARCHIVES
Journal Home
Where is Eric? (password)
Stuff for Sale
March 2010 (8)
February 2010 (17)
January 2010 (29)
December 2009 (21)
November 2009 (23)
October 2009 (32)
September 2009 (19)
August 2009 (34)
July 2009 (21)
June 2009 (30)
May 2009 (23)
April 2009 (18)
March 2009 (6)
February 2009 (25)
January 2009 (5)
December 2008 (6)
November 2008 (22)
October 2008 (27)
September 2008 (25)
August 2008 (34)
July 2008 (34)
June 2008 (32)
May 2008 (26)
April 2008 (15)
March 2008 (19)
February 2008 (31)
January 2008 (43)
December 2007 (33)
November 2007 (29)
October 2007 (29)
September 2007 (9)
August 2007 (19)
July 2007 (10)
June 2007 (17)
May 2007 (26)
April 2007 (38)
March 2007 (39)
February 2007 (13)
January 2007 (35)
December 2006 (35)
November 2006 (14)
October 2006 (6)
September 2006 (20)
August 2006 (24)
July 2006 (32)
June 2006 (17)
May 2006 (23)
April 2006 (16)
March 2006 (16)
February 2006 (26)
January 2006 (34)
December 2005 (17)
November 2005 (21)
October 2005 (18)
September 2005 (17)
August 2005 (5)
July 2005 (15)
June 2005 (20)
May 2005 (25)
April 2005 (7)
March 2005 (22)
February 2005 (20)
January 2005 (38)
December 2004 (6)
November 2004 (24)
October 2004 (16)
September 2004 (22)
August 2004 (12)
July 2004 (17)
June 2004 (15)
May 2004 (11)
April 2004 (35)
March 2004 (40)
February 2004 (29)
January 2004 (36)
December 2003 (20)
November 2003 (18)
October 2003 (10)
September 2003 (18)
August 2003 (10)
July 2003 (34)
June 2003 (12)
May 2003 (49)
April 2003 (42)
March 2003 (42)
February 2003 (15)
January 2003 (7)
December 2002 (17)
November 2002 (19)
October 2002 (24)
September 2002 (22)
August 2002 (20)
July 2002 (21)
June 2002 (14)
May 2002 (15)
April 2002 (11)
March 2002 (13)
February 2002 (20)
January 2002 (17)
December 2001 (16)
Even Older Journal
Travel Journals

CATEGORIES / TAGS
(5) (2) (2) (2) (11) (4) (1) (1) (1) (4) (1) (1) (1) (5) (1) (1) (5) (394) (12) (1) (1) (12) (10) (1) (1) (26) (5) (2) (1) (4) (1) (31) (5) (2) (24) (1) (3) (4) (1) (2) (89) (2) (14) (1) (1) (9) (1) (11) (171) (1) (1) (1) (3) (1) (1) (1) (6) (419) (5) (1) (1) (1) (69) (1) (7) (1) (15) (3) (13) (2) (1) (1) (1) (1) (84) (8) (246) (50) (34) (1) (53) (1) (1) (1) (1) (1) (2) (1) (1) (15) (2) (4) (1)
VIENNA TENG
Support my friend and favorite singer-songwriter, Vienna Teng!

--- Next Show ---

Schedule coming soon.
[ discography ]

Eric Cheng's RSS Journal Journal RSS
Eric Cheng's RSS Journal Comments RSS

proudly powered by wordpress
script exec time: 1.01s
i hate computers.