Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Questions around hard drive crash
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware
View previous topic :: View next topic  
Author Message
lothar
Tux's lil' helper
Tux's lil' helper


Joined: 10 Mar 2004
Posts: 87
Location: Norway

PostPosted: Thu May 10, 2007 1:02 pm    Post subject: Questions around hard drive crash Reply with quote

One week ago one of my IDE ATA hard drives started to act funny.
Dmesg reported unreadable LBAs and when running a smartctl -t short (or long) it reported errors while reading the disk.
I also ran a badblocks test that created a log file with 551 bad blocks in it.
However if I ran the badblocks test again on some of the large areas of bad blocks they had no errors.

So I thought the disk is about to die, and I bought a new one.
This disk was part of a LVM group so I just used PVMOVE to move the data to the new disk.
Dmesg only reported 3 or 4 read errors while doing this, so I havent lost any important data.

I still have warranty left on the faulty disk, so I want to return it for replacement.
When I do this I usually wipe the disk with dd
Code:

root@terra: [~/test] 1f 8.0Kb $ dd if=/dev/zero of=/dev/hdj bs=4096
dd: writing `/dev/hdj': No space left on device
48840247+0 records in
48840246+0 records out
200049647616 bytes (200 GB) copied, 50989.1 s, 3.9 MB/s


While doing this dmesg reported some read errors:
Code:

hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x53 { DriveReady SeekComplete Index Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=50480298, high=3, low=148650, sector=50480063
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 50480063
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=50480298, high=3, low=148650, sector=50480295
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 50480295
hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x51 { DriveReady SeekComplete Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78327743
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78327743
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328215
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328215
hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x51 { DriveReady SeekComplete Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=78328228, high=4, low=11219364, sector=78328223
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328223
hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x51 { DriveReady SeekComplete Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328226, high=4, low=11219362, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
 hdj:
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x58 { DriveReady SeekComplete DataRequest }
ide: failed opcode was: unknown
hdi: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hdi: DMA disabled
PDC202XX: Primary channel reset.
hdi: drive not ready for command
ide4: reset: success
hdi: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown



But now after the disk has been overwritten with zeroes the errors are gone.
If I run a S.M.A.R.T test now it shows no errors, but as you can see here I got errors before:
Code:

Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%        99         -
# 2  Short offline       Completed without error       00%        98         -
# 3  Short offline       Completed: read failure       90%        83         78328230
# 4  Extended offline    Completed: read failure       90%        83         78328230
# 5  Short offline       Completed: read failure       90%        83         78328230
# 6  Short offline       Completed: read failure       90%        79         50480298
# 7  Short offline       Completed: read failure       90%        79         50480298


So my question now is.. Is the disk OK? Why did the errors go away when I wiped it with dd? Should I still return it for replacement?
Is it possible that this is a problem with the IDE controller card? (Promise Technology, Inc. PDC20268 (Ultra100 TX2))
None of my other hard drives pops up in dmesg or gives me errors when running a S.M.A.R.T test.
Back to top
View user's profile Send private message
eccerr0r
Watchman
Watchman


Joined: 01 Jul 2004
Posts: 10016
Location: almost Mile High in the USA

PostPosted: Thu May 10, 2007 6:13 pm    Post subject: Reply with quote

Likely it simply reassigned the bad sectors when you rewrote the disk. Disk still has bad sectors. If a sector goes bad, it's usually impossible for a disk to remap it for you silently :-)

Depending on the manufacturer's discretion this current state might or might not allow for warranty replacement... likely if you were to start using that disk it might actually work fine again. For how long, who knows...

A SMART selftest should not show read errors in your dmesg/logs, please be more careful checking time stamps. Oops no timestamps?
_________________
Intel Core i7 2700K/Radeon R7 250/24GB DDR3/256GB SSD
What am I supposed watching?
Back to top
View user's profile Send private message
lothar
Tux's lil' helper
Tux's lil' helper


Joined: 10 Mar 2004
Posts: 87
Location: Norway

PostPosted: Thu May 10, 2007 7:07 pm    Post subject: Reply with quote

Hmm thats what I thought might've happened. It will probably create more bad sectors though. A badblocks test varies if I physsicaly place the hard drive vertical og horizontal, so its clear that its some problems with the seeker.

I guess I'll just have to call the webshop I bought it from and ask what they think.
After all they can see the errors I've had in the SMART log...

But one other thing. How can I get timestamped dmesg?
Back to top
View user's profile Send private message
linuxtuxhellsinki
l33t
l33t


Joined: 15 Nov 2004
Posts: 700
Location: Hellsinki

PostPosted: Thu May 10, 2007 8:59 pm    Post subject: Reply with quote

lothar wrote:

But one other thing. How can I get timestamped dmesg?

Code:
cat /var/log/messages


I also have drive with errors and it's still working fine (so smart is not always so smart, or absolutely true) :?
Quote:
Local Time is: Sat Mar 4 05:02:18 2006 EET
5 Reallocated_Sector_Ct 0x0033 001 001 063 Pre-fail Always FAILING_NOW 2556
.......
Local Time is: Thu May 10 23:57:26 2007 EEST
5 Reallocated_Sector_Ct 0x0033 002 001 063 Pre-fail Always FAILING_NOW 2544

_________________
1st use 'Search' & lastly add [Solved] to
the subject of your first post in the thread.
Back to top
View user's profile Send private message
sonicbhoc
Veteran
Veteran


Joined: 24 Oct 2005
Posts: 1805
Location: In front of the computer screen

PostPosted: Thu May 10, 2007 9:21 pm    Post subject: Reply with quote

lothar wrote:

But one other thing. How can I get timestamped dmesg?


Virtual Terminal 12 should have it. CTRL+ALT+F12
_________________
I'm too lazy to keep this stupid signature up to date, so here's something more interesting:
My friend Hetdegon can draw if you ask me.
Now using PClinuxOS on my laptop and Gentoo on my desktop and new laptop.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum