Dying Hard Drive or PSU

From: ANT_THOMAS10 Jan 2014 19:40
To: ALL1 of 47
One of the hard drives in my server occasionally disappears. More so recently.

It's only ever one hard drive. A reboot doesn't usually sort it but a proper power cycle does.

First thought possibly the PSU not being able to cope with so many hard drives (9 :$) but it's been fine for about 2 years so starting to think hard drive.

Hard drive is out of warranty. The SMART stats claim one failure but I'm not sure I understand what the numbers actually mean. Do the numbers have to be above the threshold?

Code:

HD Tune: ST31000340AS Health
 
ID                               Current  Worst    ThresholdData       Status   
(01) Raw Read Error Rate         101      99       6        45405802   Ok       
(03) Spin Up Time                91       90       0        0          Ok       
(04) Start/Stop Count            100      100      20       215        Ok       
(05) Reallocated Sector Count    19       19       36       1665       Failed   
(07) Seek Error Rate             57       52       30       81935363   Ok       
(09) Power On Hours Count        47       47       0        47258      Ok       
(0A) Spin Retry Count            100      100      97       2          Ok       
(0C) Power Cycle Count           100      100      20       216        Ok       
(B8) (unknown attribute)         100      100      99       0          Ok       
(BB) (unknown attribute)         100      100      0        0          Ok       
(BC) (unknown attribute)         100      99       0        327685     Ok       
(BD) (unknown attribute)         100      100      0        0          Ok       
(BE) Airflow Temperature         52       24       45       823722032  Ok       
(C2) Temperature                 47       76       0        47         Ok       
(C3) Hardware ECC Recovered      46       10       0        45405802   Ok       
(C5) Current Pending Sector      100      100      0        382        Ok       
(C6) Offline Uncorrectable       100      100      0        382        Ok       
(C7) Ultra DMA CRC Error Count   200      200      0        0          Ok       
 
Power On Time         : 47258
Health Status         : Failed

 

From: ANT_THOMAS10 Jan 2014 19:40
To: ALL2 of 47
Well that fucked up. Looked fine in the editor.
From: ANT_THOMAS10 Jan 2014 19:41
To: ALL3 of 47
Code:

HD Tune: ST31000340AS Health
 
ID                               Current  Worst    ThresholdData       Status   
(01) Raw Read Error Rate         101      99       6        45405802   Ok       
(03) Spin Up Time                91       90       0        0          Ok       
(04) Start/Stop Count            100      100      20       215        Ok       
(05) Reallocated Sector Count    19       19       36       1665       Failed   
(07) Seek Error Rate             57       52       30       81935363   Ok       
(09) Power On Hours Count        47       47       0        47258      Ok       
(0A) Spin Retry Count            100      100      97       2          Ok       
(0C) Power Cycle Count           100      100      20       216        Ok       
(B8) (unknown attribute)         100      100      99       0          Ok       
(BB) (unknown attribute)         100      100      0        0          Ok       
(BC) (unknown attribute)         100      99       0        327685     Ok       
(BD) (unknown attribute)         100      100      0        0          Ok       
(BE) Airflow Temperature         52       24       45       823722032  Ok       
(C2) Temperature                 47       76       0        47         Ok       
(C3) Hardware ECC Recovered      46       10       0        45405802   Ok       
(C5) Current Pending Sector      100      100      0        382        Ok       
(C6) Offline Uncorrectable       100      100      0        382        Ok       
(C7) Ultra DMA CRC Error Count   200      200      0        0          Ok       
 
Power On Time         : 47258
Health Status         : Failed


 

From: ANT_THOMAS10 Jan 2014 19:42
To: ALL4 of 47
I really can't use the code tags :((
From: cynicoid10 Jan 2014 20:04
To: ANT_THOMAS 5 of 47
Have you tried it on a different power rail (swap with one of the drives you know is working).

I was getting random BSODs which turned out to be a drive failing because it was sharing a rail with a graphics card, the draw on the rail was too much even though the PSU could cope overall.
From: CHYRON (DSMITHHFX)11 Jan 2014 02:42
To: ANT_THOMAS 6 of 47
If SMART says fail, then that's pretty serious (although I have a drive that says fail in this OS and fine in that, so apparently subjective. Still the drive deffo had major problems and had to be replaced).

I would immediately try to back up the drive and replace it without dithering over whether there's a problem or not. The worst outcome is that a) it fails and b) can't be recovered.
EDITED: 11 Jan 2014 02:43 by DSMITHHFX
From: ANT_THOMAS11 Jan 2014 14:13
To: cynicoid CHYRON (DSMITHHFX) 7 of 47
I'll try the different rail idea but I think this gives me a good excuse to get another bigger hard drive.
From: ANT_THOMAS11 Jan 2014 20:33
To: ALL8 of 47
Well it's gone again and I can't be bothered going into the attic to power cycle.

Is there a manufacturer of choice these days? Or is the standard pretty similar these days? I assume my drives spindown since they're in a windows server so do I even need a NAS designed drive?

All my drives are Seagate and this is first possible failure I've had. Drive is about 6 years old and definitely out of Seagate warranty.

Seen a Toshiba drive that seems the best £/TB currently.

Also, do people still like ebuyer? I probably stupidly started reading comments on HUKD and there's a lot of negative feelings towards them at the moment. Mainly due to their return policy.
From: milko11 Jan 2014 23:19
To: ANT_THOMAS 9 of 47
I've never had to return anything to them I don't think, but I bought something off them for the first time in years the other day and it was all good, free next day delivery too. 
From: ANT_THOMAS12 Jan 2014 00:14
To: milko 10 of 47
My only issue in the past (or annoyance I guess) is that the free saver delivery was always next day delivery but dispatched about 5 days after you order. Didn't make sense apart from them trying to encourage/force people onto more expensive options.
From: koswix12 Jan 2014 01:06
To: ANT_THOMAS 11 of 47
I remember having a right ball ache returning stuff to them about 10 years ago or more. Beyond that, can't really help.
From: Lucy (X3N0PH0N)12 Jan 2014 19:23
To: ANT_THOMAS 12 of 47
I always tend to buy from them and I've not had any issues. Had to return my Nexus 7 and that all went fine too, not returned anything else. They're not as good as they used to be on price, but they're not bad.
From: ANT_THOMAS12 Jan 2014 20:02
To: ALL13 of 47
Same all round for me. I've generally bought from there or Scan, since I can actually go to the Scan shop/warehouse and buy it there and then. Never had any issues with eBuyer either, I guess it could well be a very vocal minority.
From: koswix13 Jan 2014 00:09
To: ANT_THOMAS 14 of 47
Their merry Xmas photo was quite good - with the leader board on the wall in the background showing which member of staff had successfully rejected the most returns. They quickly removed that from their Facebook page.
From: ANT_THOMAS13 Jan 2014 16:30
To: ALL15 of 47
Just bought a 3TB Seagate external drive that had a drive inside that (I think) costs about £10 more on its own. Put the 3TB in the server and maybe going to have the 1TB as some sort of spare external drive. Probably not an amazing idea as a backup drive if it's not reliable.

Is hard drives overheating a big issue? Because the 1TB was in the middle of a set of 5 hard drives.
From: ANT_THOMAS13 Jan 2014 16:40
To: ALL16 of 47
Should I spread the bottom 5 hard drives out a bit better?


From: CHYRON (DSMITHHFX)13 Jan 2014 16:52
To: ANT_THOMAS 17 of 47
Either that or boost ventilation. # of drives does seem excessive to me, and an invitation to heat/psu issues. Are you just trying to eek out some extra life from old, small drives?
From: ANT_THOMAS13 Jan 2014 17:08
To: CHYRON (DSMITHHFX) 18 of 47
Depends what you class as small.

160GB (OS drive)
400GB

10GB (old server OS drive that has now been pulled)

500GB
500GB
3TB (was 1TB position)
2TB
2TB

I might get another 5.25 to 3.5 bracket and pop the 3TB at the top.
From: CHYRON (DSMITHHFX)13 Jan 2014 20:04
To: ANT_THOMAS 19 of 47
In terms of what's on the store shelves currently, anything <1TB. I think they're cheap enough that you could replace the 4 small drives with one drive for an overall cooler and less power-hungry set up. Though you'd have to copy (and possibly reinstall) stuff, which is never fun. You could still use the small drives in usb for backups.
From: Ken (SHIELDSIT)15 Jan 2014 07:49
To: ANT_THOMAS 20 of 47
If I have to stack my drives like that I make sure to put a fan at the front so air can at least get pulled through and keep them somewhat cool.  You might be able to find some software that will monitor/report the temperature of the drives.