lothar Tux's lil' helper


Joined: 10 Mar 2004 Posts: 87 Location: Norway
|
Posted: Thu May 10, 2007 1:02 pm Post subject: Questions around hard drive crash |
|
|
One week ago one of my IDE ATA hard drives started to act funny.
dmesg reported unreadable LBAs, and when I ran smartctl -t short (or long) it reported errors while reading the disk.
I also ran a badblocks test that created a log file with 551 bad blocks in it.
However, when I ran badblocks again on some of the large areas of bad blocks, they showed no errors.
So I thought the disk was about to die, and I bought a new one.
This disk was part of an LVM volume group, so I just used pvmove to move the data to the new disk.
dmesg only reported 3 or 4 read errors while doing this, so I haven't lost any important data.
I still have warranty left on the faulty disk, so I want to return it for replacement.
When I do this I usually wipe the disk with dd:
Code: |
root@terra: [~/test] 1f 8.0Kb $ dd if=/dev/zero of=/dev/hdj bs=4096
dd: writing `/dev/hdj': No space left on device
48840247+0 records in
48840246+0 records out
200049647616 bytes (200 GB) copied, 50989.1 s, 3.9 MB/s
|
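As a sanity check, the dd summary is internally consistent. A minimal sketch of the arithmetic, using only the figures printed above (the variable names are illustrative, and dd's "MB" is taken to mean 10**6 bytes):

```python
BLOCK_SIZE = 4096               # bs=4096 from the dd command line
records_out = 48840246          # full blocks written, from dd's summary
bytes_reported = 200049647616   # byte count printed by dd
seconds = 50989.1               # elapsed time printed by dd

# Every record written is one full block, so the byte count should match.
bytes_written = records_out * BLOCK_SIZE

# Average throughput in MB/s (10**6 bytes) and total run time in hours.
rate_mb_s = bytes_written / seconds / 1e6
hours = seconds / 3600

print(bytes_written == bytes_reported)
print(round(rate_mb_s, 1), "MB/s over", round(hours, 1), "hours")
```

The one extra record in "records in" is the final block that dd read from /dev/zero but could not write once the device was full.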
While doing this dmesg reported some read errors:
Code: |
hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x53 { DriveReady SeekComplete Index Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=50480298, high=3, low=148650, sector=50480063
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 50480063
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=50480298, high=3, low=148650, sector=50480295
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 50480295
hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x51 { DriveReady SeekComplete Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78327743
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78327743
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328215
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328215
hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x51 { DriveReady SeekComplete Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=78328228, high=4, low=11219364, sector=78328223
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328223
hdj: dma_timer_expiry: dma status == 0x61
hdj: DMA timeout error
hdj: dma timeout error: status=0x51 { DriveReady SeekComplete Error }
hdj: dma timeout error: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328226, high=4, low=11219362, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
hdj: task_in_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
hdj: task_in_intr: error=0x40 { UncorrectableError }, LBAsect=78328230, high=4, low=11219366, sector=78328224
ide: failed opcode was: unknown
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
hdj:
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hdj: task_out_intr: status=0x58 { DriveReady SeekComplete DataRequest }
ide: failed opcode was: unknown
hdi: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hdi: DMA disabled
PDC202XX: Primary channel reset.
hdi: drive not ready for command
ide4: reset: success
hdi: task_out_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
|
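The kernel log above can be condensed into a list of failing sectors. The sketch below is one way to do that, not a standard tool: the regular expression is written against the `end_request` lines exactly as they appear in this log, the helper names are made up, and the 512-byte sector and 4096-byte block sizes are assumptions. Under those assumptions sector 78328224 falls in logical block 9791028, which agrees with the `Buffer I/O error` lines.

```python
import re
from collections import Counter

SECTOR_SIZE = 512   # bytes per sector (assumed)
BLOCK_SIZE = 4096   # bytes per buffer-cache block (assumed)

# Matches lines such as: end_request: I/O error, dev hdj, sector 78328224
IO_ERROR = re.compile(r"end_request: I/O error, dev (\w+), sector (\d+)")


def failing_sectors(log_text, device):
    """Count how often each sector of `device` appears in I/O error lines."""
    counts = Counter()
    for match in IO_ERROR.finditer(log_text):
        dev, sector = match.group(1), int(match.group(2))
        if dev == device:
            counts[sector] += 1
    return counts


def sector_to_block(sector):
    """Return the 4096-byte logical block that contains a 512-byte sector."""
    return sector * SECTOR_SIZE // BLOCK_SIZE


# A few lines copied from the log above.
sample = """\
end_request: I/O error, dev hdj, sector 50480063
end_request: I/O error, dev hdj, sector 78328224
Buffer I/O error on device hdj, logical block 9791028
end_request: I/O error, dev hdj, sector 78328224
"""

for sector, hits in sorted(failing_sectors(sample, "hdj").items()):
    print(sector, hits, sector_to_block(sector))
```

Feeding it the full log instead of the short sample shows how few distinct sectors are behind the long run of messages.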
But now, after the disk has been overwritten with zeroes, the errors are gone.
If I run a S.M.A.R.T. test now it shows no errors, but as you can see here I got errors before:
Code: |
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 99 -
# 2 Short offline Completed without error 00% 98 -
# 3 Short offline Completed: read failure 90% 83 78328230
# 4 Extended offline Completed: read failure 90% 83 78328230
# 5 Short offline Completed: read failure 90% 83 78328230
# 6 Short offline Completed: read failure 90% 79 50480298
# 7 Short offline Completed: read failure 90% 79 50480298
|
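The self-test log above names only two distinct failing LBAs, and they are the same values dmesg printed as LBAsect. A small sketch that groups the failed tests by LBA; it assumes the last three whitespace-separated fields of each row are Remaining, LifeTime and LBA_of_first_error, as in the listing above, and the function name is made up:

```python
from collections import defaultdict


def failed_lbas(selftest_log):
    """Map each LBA_of_first_error to the LifeTime hours of the tests that hit it."""
    failures = defaultdict(list)
    for line in selftest_log.splitlines():
        if not line.startswith("#"):
            continue  # skip the header row
        # Split off the last three fields: Remaining, LifeTime(hours), LBA.
        _head, _remaining, lifetime, lba = line.rsplit(None, 3)
        if lba != "-":  # "-" means the test found no failing LBA
            failures[int(lba)].append(int(lifetime))
    return dict(failures)


log = """\
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 99 -
# 2 Short offline Completed without error 00% 98 -
# 3 Short offline Completed: read failure 90% 83 78328230
# 4 Extended offline Completed: read failure 90% 83 78328230
# 5 Short offline Completed: read failure 90% 83 78328230
# 6 Short offline Completed: read failure 90% 79 50480298
# 7 Short offline Completed: read failure 90% 79 50480298
"""

for lba, hours in failed_lbas(log).items():
    print(lba, hours)
```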
So my questions now are: Is the disk OK? Why did the errors go away when I wiped it with dd? Should I still return it for replacement?
Is it possible that this is a problem with the IDE controller card (Promise Technology, Inc. PDC20268 (Ultra100 TX2))?
None of my other hard drives show up in dmesg or give me errors when running a S.M.A.R.T. test. |
eccerr0r Watchman

Joined: 01 Jul 2004 Posts: 10016 Location: almost Mile High in the USA
|
Posted: Thu May 10, 2007 6:13 pm Post subject: |
|
|
Likely the disk simply reassigned the bad sectors when you rewrote it. The disk still has bad sectors. When a sector goes bad, the disk usually cannot remap it silently on a read, because the data in it can no longer be recovered; the remap can only happen once the sector is written again.
Depending on the manufacturer's discretion, the disk in its current state might or might not qualify for warranty replacement... likely if you were to start using that disk it might actually work fine again. For how long, who knows...
A SMART self-test should not show read errors in your dmesg/logs; please be more careful checking time stamps. Oops, no timestamps? |
lothar Tux's lil' helper


Joined: 10 Mar 2004 Posts: 87 Location: Norway
|
Posted: Thu May 10, 2007 7:07 pm Post subject: |
|
|
Hmm, that's what I thought might have happened. It will probably create more bad sectors though. The badblocks results vary depending on whether I physically place the hard drive vertically or horizontally, so it's clear that there is some problem with the seek mechanism.
I guess I'll just have to call the webshop I bought it from and ask what they think.
After all they can see the errors I've had in the SMART log...
But one other thing. How can I get timestamped dmesg? |
linuxtuxhellsinki l33t


Joined: 15 Nov 2004 Posts: 700 Location: Hellsinki
|
Posted: Thu May 10, 2007 8:59 pm Post subject: |
|
|
lothar wrote: |
But one other thing. How can I get timestamped dmesg? |
Code: | cat /var/log/messages |
I also have a drive with errors and it's still working fine (so SMART is not always so smart, or absolutely true).
Quote: | Local Time is: Sat Mar 4 05:02:18 2006 EET
5 Reallocated_Sector_Ct 0x0033 001 001 063 Pre-fail Always FAILING_NOW 2556
.......
Local Time is: Thu May 10 23:57:26 2007 EEST
5 Reallocated_Sector_Ct 0x0033 002 001 063 Pre-fail Always FAILING_NOW 2544 |
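The two `Reallocated_Sector_Ct` lines quoted above can be read field by field. A rough sketch, assuming the usual `smartctl -A` column order (ID#, ATTRIBUTE_NAME, FLAG, VALUE, WORST, THRESH, TYPE, UPDATED, WHEN_FAILED, RAW_VALUE); the `SmartAttribute` type and the helper names are made up for illustration:

```python
from typing import NamedTuple


class SmartAttribute(NamedTuple):
    attr_id: int
    name: str
    value: int        # normalized value
    worst: int        # lowest normalized value seen so far
    thresh: int       # failure threshold for the normalized value
    when_failed: str
    raw: str          # vendor-specific raw value


def parse_attribute(line):
    """Parse one attribute row in the assumed `smartctl -A` column order."""
    fields = line.split(None, 9)
    return SmartAttribute(
        attr_id=int(fields[0]),
        name=fields[1],
        value=int(fields[3]),
        worst=int(fields[4]),
        thresh=int(fields[5]),
        when_failed=fields[8],
        raw=fields[9].strip(),
    )


def below_threshold(attr):
    """An attribute counts as failed when its normalized value <= threshold."""
    return attr.value <= attr.thresh


rows = [
    "5 Reallocated_Sector_Ct 0x0033 001 001 063 Pre-fail Always FAILING_NOW 2556",
    "5 Reallocated_Sector_Ct 0x0033 002 001 063 Pre-fail Always FAILING_NOW 2544",
]

for row in rows:
    attr = parse_attribute(row)
    print(attr.name, attr.value, attr.thresh, attr.raw, below_threshold(attr))
```

In both readings the normalized value (001, then 002) is at or below the threshold of 063, which is what the FAILING_NOW flag reports, even though the drive keeps working.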
sonicbhoc Veteran


Joined: 24 Oct 2005 Posts: 1805 Location: In front of the computer screen
|
Posted: Thu May 10, 2007 9:21 pm Post subject: |
|
|
lothar wrote: |
But one other thing. How can I get timestamped dmesg? |
Virtual Terminal 12 should have it. CTRL+ALT+F12 |