schorsch_76 Guru
Joined: 19 Jun 2012 Posts: 450
|
Posted: Wed Apr 22, 2015 7:01 pm Post subject: Would you trust this HDD? |
|
|
Hi,
I just got a new 4 TB hard drive. It is an external drive with USB 3 and a separate power supply. I put some backups and other stuff on it. When I mounted it again, the ext4 filesystem was severely damaged! The superblock reported a size bigger than the HDD.
Code: | EXT4-fs (dm-8): bad geometry: block count 629145600 exceeds size of device (367001600 blocks) |
I tried fsck -f and extundelete to recover the data, but I had no luck. No really important data was lost, just backups of machines that are still up and running...
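For a sense of scale, the two block counts in the kernel message can be converted to sizes. This is only a rough sketch: it assumes 4096-byte ext4 blocks (the usual choice for large volumes, but not stated in the post), and `BLOCK_SIZE` and `blocks_to_gib` are names made up for the illustration.

```python
# Convert the block counts from the "bad geometry" message into sizes.
# ASSUMPTION: 4096-byte ext4 blocks; the post does not state the block size.
BLOCK_SIZE = 4096


def blocks_to_gib(blocks: int, block_size: int = BLOCK_SIZE) -> float:
    """Return the size of `blocks` filesystem blocks in GiB."""
    return blocks * block_size / 2**30


fs_blocks = 629145600   # block count recorded in the superblock
dev_blocks = 367001600  # block count of the underlying device (dm-8)

print(f"filesystem claims: {blocks_to_gib(fs_blocks):.0f} GiB")
print(f"device provides:   {blocks_to_gib(dev_blocks):.0f} GiB")
print(f"shortfall:         {blocks_to_gib(fs_blocks - dev_blocks):.0f} GiB")
```

Under that assumption the superblock describes a 2400 GiB filesystem on a 1400 GiB device. The message names dm-8, a device-mapper volume, rather than /dev/sdb1, so it may be worth checking whether the mapping on top of the disk changed size rather than the disk itself.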
I ran badblocks on the device ("badblocks /dev/sdb"), which didn't find any errors. I looked at smartctl, which said the device is good. The short test completed without error.
The only signs that something is wrong are:
- "fsck.ext4 -f -b 32768 /dev/sdb1" failed with read/write errors
- the long offline test aborted
- my first data is lost ... but the device has only been running for about 100 hours. The data was lost at 93 hours of lifetime.
Here is the smartlog ...
Code: |
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.2-slim] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Device Model: TOSHIBA MD04ACA400
Serial Number: 74V3K67GFSAA
LU WWN Device Id: 5 000039 5ab805350
Firmware Version: FP2A
User Capacity: 4.000.787.030.016 bytes [4,00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Wed Apr 22 20:56:27 2015 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 249) Self-test routine in progress...
90% of test remaining.
Total time to complete Offline
data collection: ( 120) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 466) minutes.
SCT capabilities: (0x003d) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 050 Pre-fail Always - 0
2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0
3 Spin_Up_Time 0x0027 100 100 001 Pre-fail Always - 7090
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 92
5 Reallocated_Sector_Ct 0x0033 100 100 050 Pre-fail Always - 0
7 Seek_Error_Rate 0x000b 100 100 050 Pre-fail Always - 0
8 Seek_Time_Performance 0x0005 100 100 050 Pre-fail Offline - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 116
10 Spin_Retry_Count 0x0033 101 100 030 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 62
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 2
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 58
193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 92
194 Temperature_Celsius 0x0022 100 100 000 Old_age Always - 40 (Min/Max 17/59)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
220 Disk_Shift 0x0002 100 100 000 Old_age Always - 0
222 Loaded_Hours 0x0032 100 100 000 Old_age Always - 74
223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 0
224 Load_Friction 0x0022 100 100 000 Old_age Always - 0
226 Load-in_Time 0x0026 100 100 000 Old_age Always - 219
240 Head_Flying_Hours 0x0001 100 100 001 Pre-fail Offline - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 116 -
# 2 Extended offline Aborted by host 90% 103 -
# 3 Extended offline Aborted by host 90% 93 -
# 4 Short offline Completed without error 00% 92 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
|
I didn't abort these two Extended offline tests...
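To pick the unfinished tests out of a self-test log like the one above, something along these lines might do. It is a minimal sketch: the rows are copied from the log, the regular expression only knows the two test names that appear there, and `unfinished_tests` is a made-up helper, not part of smartmontools.

```python
import re

# Rows copied from the self-test log above.
SELFTEST_LOG = """\
# 1 Short offline Completed without error 00% 116 -
# 2 Extended offline Aborted by host 90% 103 -
# 3 Extended offline Aborted by host 90% 93 -
# 4 Short offline Completed without error 00% 92 -
"""

# ASSUMPTION: only "Short offline" and "Extended offline" rows occur,
# and columns are separated by single spaces as in the pasted log.
ROW = re.compile(
    r"^#\s*(?P<num>\d+)\s+(?P<test>Short offline|Extended offline)\s+"
    r"(?P<status>.+?)\s+(?P<remaining>\d+)%\s+(?P<hours>\d+)\s+(?P<lba>\S+)$"
)


def unfinished_tests(log: str):
    """Return (test, status, lifetime_hours) for tests that did not complete."""
    rows = []
    for line in log.splitlines():
        m = ROW.match(line.strip())
        if m and not m["status"].startswith("Completed"):
            rows.append((m["test"], m["status"], int(m["hours"])))
    return rows


for test, status, hours in unfinished_tests(SELFTEST_LOG):
    print(f"{test}: {status} at {hours} power-on hours")
```

Both extended tests show up, at 103 and 93 hours; the second matches the lifetime at which the data was lost.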
Now I have reformatted the drive and all seems good again, but why did the first partition get damaged? The drive is just sitting on the table near my machine... Any hints and opinions are welcome!
Thanks
schorsch_76 |
|
|
|
eccerr0r Watchman
Joined: 01 Jul 2004 Posts: 9679 Location: almost Mile High in the USA
|
Posted: Wed Apr 22, 2015 7:14 pm Post subject: |
|
|
Well, according to SMART the disk is fine, but I'd make sure I have a backup of that backup...
With more bytes on a disk, the risk of immense data loss gets higher and higher.
Just to be pedantic: you are using GPT partitioning on this, right? And you haven't used it with another system that doesn't understand GPT? _________________ Intel Core i7 2700K/Radeon R7 250/24GB DDR3/256GB SSD
What am I supposed watching? |
|
|
|
kernelOfTruth Watchman
Joined: 20 Dec 2005 Posts: 6111 Location: Vienna, Austria; Germany; hello world :)
|
|
|
|
schorsch_76 Guru
Joined: 19 Jun 2012 Posts: 450
|
Posted: Thu Apr 23, 2015 8:38 am Post subject: |
|
|
Yes, I use GPT on this HDD. I have been using ext4 as my filesystem for ages. |
|
|
|
madchaz l33t
Joined: 01 Jul 2003 Posts: 993 Location: Quebec, Canada
|
Posted: Sun May 17, 2015 7:17 pm Post subject: |
|
|
Stupid question, but no one else asked. Did you properly umount the disk before you unplugged it/turned it off?
Not doing that can cause the kind of issues you had. _________________ Someone asked me once if I suffered from mental illness. I told him I enjoyed every second of it.
www.madchaz.com A small candle of a website. As my lab specs on it. |
|
|
|
krinn Watchman
Joined: 02 May 2003 Posts: 7470
|
Posted: Mon May 18, 2015 2:34 am Post subject: |
|
|
For an HDD my answer is simple: if you have to start asking, then I wouldn't trust it. |
|
|
|
P.Kosunen Guru
Joined: 21 Nov 2005 Posts: 309 Location: Finland
|
Posted: Tue May 19, 2015 4:30 pm Post subject: |
|
|
Code: | 191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 2 |
Have you dropped it? |
|
|
|
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54232 Location: 56N 3W
|
Posted: Tue May 19, 2015 4:53 pm Post subject: |
|
|
P.Kosunen,
That's OK. VALUE and WORST are both > THRESH.
All the SMART data is normalised so that, provided the above is true, SMART passes.
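As a rough sketch of that check, the snippet below applies the VALUE/WORST > THRESH rule to a few rows copied from the log earlier in the thread. `passes` is a made-up helper, and the column positions are assumed to match the pasted smartctl table.

```python
# Attribute rows copied from the smartctl output earlier in the thread.
ROWS = """\
5 Reallocated_Sector_Ct 0x0033 100 100 050 Pre-fail Always - 0
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 2
194 Temperature_Celsius 0x0022 100 100 000 Old_age Always - 40 (Min/Max 17/59)
"""


def passes(row: str) -> bool:
    """True when both VALUE and WORST are above THRESH for one table row."""
    # ASSUMPTION: columns are ID, NAME, FLAG, VALUE, WORST, THRESH, ...
    fields = row.split(None, 9)  # the raw value may itself contain spaces
    value, worst, thresh = (int(f) for f in fields[3:6])
    return value > thresh and worst > thresh


for row in ROWS.splitlines():
    name = row.split()[1]
    print(f"{name}: {'ok' if passes(row) else 'FAILING'}")
```

The raw value in the last column plays no part in the comparison, which is why a raw 2 on attribute 191 does not fail anything.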
I have had a number of failed drives that pass the smart checks. _________________ Regards,
NeddySeagoon
Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail. |
|
|
|
P.Kosunen Guru
Joined: 21 Nov 2005 Posts: 309 Location: Finland
|
Posted: Wed May 20, 2015 4:35 pm Post subject: |
|
|
A raw value of two might explain the filesystem damage if drop protection activated in the middle of a write. Does the drive need a power cycle to start working again, or is it smart enough to continue writing from where it was before the protection kicked in? |
|
|
|
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54232 Location: 56N 3W
|
Posted: Wed May 20, 2015 7:00 pm Post subject: |
|
|
P.Kosunen,
Be very careful how you interpret raw values.
They are vendor-specific and are often packed fields. |
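As an illustration of what a packed field can look like, the sketch below splits a 48-bit raw value into three 16-bit words. The layout (current, minimum and maximum temperature in successive words, low word first) is purely hypothetical, chosen to reproduce the `40 (Min/Max 17/59)` reading from the log; real layouts are vendor-specific and this is not a statement about how this Toshiba drive stores anything.

```python
def unpack_words(raw48: int) -> tuple:
    """Split a 48-bit raw value into three 16-bit words, low word first."""
    return (raw48 & 0xFFFF, (raw48 >> 16) & 0xFFFF, (raw48 >> 32) & 0xFFFF)


# HYPOTHETICAL packing: word 0 = current, word 1 = min, word 2 = max.
raw = (59 << 32) | (17 << 16) | 40

current, lowest, highest = unpack_words(raw)
print(f"as one integer: {raw}")
print(f"unpacked:       {current} (Min/Max {lowest}/{highest})")
```

Read as a single integer the same field is a very large number that means nothing on its own, which is the reason for caution with raw values.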
|
|
|
schorsch_76 Guru
Joined: 19 Jun 2012 Posts: 450
|
Posted: Wed May 27, 2015 1:18 pm Post subject: |
|
|
I decided to trust this HDD again, but not yet fully: I set it up as a member of a RAID1. |
|
|
|
|