Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
SATA hard drive stops working...
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Off the Wall
View previous topic :: View next topic  
Author Message
Zepp
Veteran
Veteran


Joined: 15 Mar 2004
Posts: 1246
Location: Ontario, Canada

PostPosted: Sun Aug 03, 2008 8:23 pm    Post subject: SATA hard drive stops working... Reply with quote

This computer isn't running gentoo (arch) so I will post it here...

The sata hard disk became inaccessible all of a sudden.

This is what I found in dmesg:
Quote:
kjournald starting. Commit interval 5 seconds
EXT3 FS on sdb1, internal journal
EXT3-fs: mounted filesystem with ordered data mode.
warning: `smbd' uses 32-bit capabilities (legacy support in use)
EXT3-fs warning (device sdb1): empty_dir: bad directory (dir #60818152) - no `.' or `..'
hda-intel: Invalid position buffer, using LPIB read method instead.
ata3: exception Emask 0x10 SAct 0x0 SErr 0x90200 action 0xe frozen
ata3: irq_stat 0x00400000, PHY RDY changed
ata3: SError: { Persist PHYRdyChg 10B8B }
ata3: hard resetting link
ata3: SATA link down (SStatus 0 SControl 300)
ata3: failed to recover some devices, retrying in 5 secs
ata3: hard resetting link
ata3: SATA link down (SStatus 0 SControl 300)
ata3: failed to recover some devices, retrying in 5 secs
ata3: hard resetting link
ata3: SATA link down (SStatus 0 SControl 300)
ata3.00: disabled
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: [sdb] Result: hostbyte=0x01 driverbyte=0x00
end_request: I/O error, dev sdb, sector 304178327
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
end_request: I/O error, dev sdb, sector 304178343
Buffer I/O error on device sdb1, logical block 38022285
lost page write due to I/O error on sdb1
Aborting journal on device sdb1.
sd 2:0:0:0: rejecting I/O to offline device
Buffer I/O error on device sdb1, logical block 38011454
lost page write due to I/O error on sdb1
EXT3-fs error (device sdb1) in ext3_reserve_inode_write: Journal has aborted
sd 2:0:0:0: rejecting I/O to offline device
Buffer I/O error on device sdb1, logical block 0
lost page write due to I/O error on sdb1
EXT3-fs error (device sdb1) in ext3_dirty_inode: Journal has aborted
------------[ cut here ]------------
WARNING: at fs/buffer.c:1183 mark_buffer_dirty+0x78/0x90()
Modules linked in: ext2 parport_pc ppdev lp parport serio_raw ppp_generic pcspkr psmouse k8temp i2c_piix4 i2c_core shpchp pci_hotplug sg thermal processor fan evdev button battery ac snd_hda_intel snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_hwdep snd_pcm_oss snd_pcm snd_timer snd_page_alloc snd_mixer_oss snd soundcore slhc rtc_cmos rtc_core rtc_lib ext3 jbd mbcache dm_mod usbhid hid ff_memless ehci_hcd sd_mod ohci_hcd usbcore atl1 mii ahci ata_generic pata_atiixp pata_acpi libata scsi_mod dock
Pid: 3158, comm: SABnzbd.py Not tainted 2.6.25-ARCH #1

Call Trace:
[<ffffffff8023a5f4>] warn_on_slowpath+0x64/0xb0
[<ffffffff8022f3b9>] enqueue_task+0x59/0x70
[<ffffffff802b3200>] filldir+0x0/0xf0
[<ffffffff880f6faf>] :ext3:ext3_count_free_inodes+0x3f/0x70
[<ffffffff802ccb78>] mark_buffer_dirty+0x78/0x90
[<ffffffff881018f7>] :ext3:ext3_commit_super+0x57/0xa0
[<ffffffff802b3200>] filldir+0x0/0xf0
[<ffffffff881034d2>] :ext3:ext3_handle_error+0x52/0xd0
[<ffffffff881035a6>] :ext3:__ext3_std_error+0x56/0x70
[<ffffffff802b3200>] filldir+0x0/0xf0
[<ffffffff88104fce>] :ext3:__ext3_journal_stop+0x3e/0x60
[<ffffffff802c7864>] __mark_inode_dirty+0x34/0x1d0
[<ffffffff802b348c>] vfs_readdir+0xdc/0xe0
[<ffffffff802b360f>] sys_getdents+0x9f/0xe0
[<ffffffff8020c5aa>] system_call_after_swapgs+0x8a/0x8f

---[ end trace 8502b8f5b8e06205 ]---
sd 2:0:0:0: rejecting I/O to offline device
Buffer I/O error on device sdb1, logical block 0
lost page write due to I/O error on sdb1
ext3_abort called.
EXT3-fs error (device sdb1): ext3_journal_start_sb: Detected aborted journal
Remounting filesystem read-only
ata3: EH complete
ata3.00: detaching (SCSI 2:0:0:0)
Buffer I/O error on device sdb1, logical block 120061954
lost page write due to I/O error on sdb1
sd 2:0:0:0: [sdb] Synchronizing SCSI cache
sd 2:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
sd 2:0:0:0: [sdb] Stopping disk
sd 2:0:0:0: [sdb] START_STOP FAILED
sd 2:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
EXT3-fs error (device sdb1): ext3_find_entry: reading directory #60030977 offset 0
EXT3-fs error (device sdb1): ext3_find_entry: reading directory #60030977 offset 0
EXT3-fs error (device sdb1): ext3_find_entry: reading directory #2 offset 0
EXT3-fs error (device sdb1): ext3_find_entry: reading directory #2 offset 0
Buffer I/O error on device sdb1, logical block 38011454
lost page write due to I/O error on sdb1

This is twice that this has happened now. Before it was running for about 20 days before this happened. It has only been 2 days now since then, and it appears to have happen again. I have the earlier /var/log/messages.log as well if needed.

Is this possibly a kernel driver issue or is it hardware?

Here is the kernel version on the computer

Quote:
Linux the-library 2.6.25-ARCH #1 SMP PREEMPT Mon Jul 14 14:56:27 UTC 2008 x86_64 AMD Athlon(tm) 64 X2 Dual Core Processor 4600+ AuthenticAMD GNU/Linux
Back to top
View user's profile Send private message
Zepp
Veteran
Veteran


Joined: 15 Mar 2004
Posts: 1246
Location: Ontario, Canada

PostPosted: Sun Aug 03, 2008 8:27 pm    Post subject: Reply with quote

Hmm the /dev entry for the disk is gone now too.
Back to top
View user's profile Send private message
energyman76b
Advocate
Advocate


Joined: 26 Mar 2003
Posts: 2033
Location: Germany

PostPosted: Sun Aug 03, 2008 8:55 pm    Post subject: Reply with quote

lkml is there for a reason you know?
_________________
AidanJT wrote:

Libertardian denial of reality is wholly unimpressive and unconvincing, and simply serves to demonstrate what a bunch of delusional fools they all are.

Satan's got perfectly toned abs and rocks a c-cup.
Back to top
View user's profile Send private message
kernelOfTruth
Watchman
Watchman


Joined: 20 Dec 2005
Posts: 5769
Location: Vienna, Austria; Germany; hello world :)

PostPosted: Sun Aug 03, 2008 8:56 pm    Post subject: Reply with quote

SYD (save your data) :P

does it make strange click-noises ?

and

energyman76b wrote:
lkml is there for a reason you know?


++
_________________
https://github.com/kernelOfTruth/ZFS-for-SystemRescueCD/tree/ZFS-for-SysRescCD-4.3.0-r3
2.6.37.2_plus_v1: BFS, CFS,THP,compaction, zcache or TOI
Hardcore Linux user since 2004 :D
Back to top
View user's profile Send private message
Zepp
Veteran
Veteran


Joined: 15 Mar 2004
Posts: 1246
Location: Ontario, Canada

PostPosted: Sun Aug 03, 2008 9:01 pm    Post subject: Reply with quote

kernelOfTruth wrote:
SYD (save your data) :P

does it make strange click-noises ?

and

energyman76b wrote:
lkml is there for a reason you know?


++


No it doesn't make any strange noises, and it passes the S.M.A.R.T. check. I haven't tried a fool diagnostic using seatools or anything.

energyman76b wrote:
lkml is there for a reason you know?

how helpful :roll:. I didn't notice anything when I checked earlier, but I could of missed something.
Back to top
View user's profile Send private message
energyman76b
Advocate
Advocate


Joined: 26 Mar 2003
Posts: 2033
Location: Germany

PostPosted: Sun Aug 03, 2008 9:11 pm    Post subject: Reply with quote

Zepp wrote:
kernelOfTruth wrote:
SYD (save your data) :P

does it make strange click-noises ?

and

energyman76b wrote:
lkml is there for a reason you know?


++


No it doesn't make any strange noises, and it passes the S.M.A.R.T. check. I haven't tried a fool diagnostic using seatools or anything.

energyman76b wrote:
lkml is there for a reason you know?

how helpful :roll:. I didn't notice anything when I checked earlier, but I could of missed something.


it is pretty simply either your harddisk is dying or your controller - but both cases shouldn't make ext3 posting a backtrace IMHO of course - so you might want to send a mail to lkml. Or the driver is misbehaving, in that case you also want to send an mail to lkml. Either way, lkml is the right place. But before you do, check the hd with the manufactures tool - and save your data before you do so. Some like to erase all data ....
_________________
AidanJT wrote:

Libertardian denial of reality is wholly unimpressive and unconvincing, and simply serves to demonstrate what a bunch of delusional fools they all are.

Satan's got perfectly toned abs and rocks a c-cup.
Back to top
View user's profile Send private message
Zepp
Veteran
Veteran


Joined: 15 Mar 2004
Posts: 1246
Location: Ontario, Canada

PostPosted: Mon Aug 04, 2008 3:53 am    Post subject: Reply with quote

What are the odds it is the harddisk if the dev entry even disappears. I would think even if the drive was having issues it would still at least work enough for the OS to recognize it and keep the dev entry... why would it remove the entry?
Back to top
View user's profile Send private message
Akkara
Administrator
Administrator


Joined: 28 Mar 2006
Posts: 5401
Location: &akkara

PostPosted: Mon Aug 04, 2008 4:49 am    Post subject: Reply with quote

Sometimes a intermittent cable can do that, leading to gradual corruption of the filesystem until it is completely broken.
Back to top
View user's profile Send private message
pigeon768
l33t
l33t


Joined: 02 Jan 2006
Posts: 669

PostPosted: Mon Aug 04, 2008 4:52 am    Post subject: Reply with quote

Zepp wrote:
What are the odds it is the harddisk if the dev entry even disappears. I would think even if the drive was having issues it would still at least work enough for the OS to recognize it and keep the dev entry... why would it remove the entry?
If the problem is in the circuitry in the hard drive. The fact the /dev entry is gone indicates the issue is not with your physical platters. If you have really vital data on your disks, you can send it off to one of those recovery places that charge an exorbitant fee and they ought to be able to recover your data.

Obvious check number one: Hard disk is plugged in, and all connections are secure.
Obvious check number two: Kernel is set up correctly. This includes all the SCSI stuff and the ATA stuff.
Check three: Swap a different, known-working hard drive into that slot. Keep the same cables you're using now. If the drive works, your old drive is broke. Sorry.
4: If it doesn't work, swap out the SATA cable. If it works, your SATA cable is broke.
6: Use a different plug from the PSU to power your drive, preferably one of those four pin -> SATA power adapter things. If it works, your PSU is bad.
7: If none of the above work, your motherboard is bad. Sorry.

Keep going from there, of course. Try booting and accessing your data in Knoppix, or better yet, one of those system recovery disks that has all the fun disk-diagnostic tools or something. If it works there, blame Arch linux. Boot into Windows, see if it works. (if it works in Windows and not Linux, lern2config. Older versions of XP typically require you to load drivers. Make sure you do so, or use Vista) Plug the drive into different SATA ports on the motherboard, see if it works. Plug the hard drive into an external enclosure, if you have one.
_________________
My political bias.
Back to top
View user's profile Send private message
Zepp
Veteran
Veteran


Joined: 15 Mar 2004
Posts: 1246
Location: Ontario, Canada

PostPosted: Mon Aug 04, 2008 4:59 am    Post subject: Reply with quote

It's a separate headless machine, i just use it to store files, so I don't want to install windows ;). I will try the other things though once I have time.
Back to top
View user's profile Send private message
Zepp
Veteran
Veteran


Joined: 15 Mar 2004
Posts: 1246
Location: Ontario, Canada

PostPosted: Mon Aug 04, 2008 6:48 am    Post subject: Reply with quote

So i restarted a bunch today, halted a few, kept failing to detect the drive. So I decided to plug in my sata dvd burner and boot the gentoo livecd, see if it could see it, and of course now the drive appears to be working fine again. This is going to be really annoying if it keeps magically working and then failing again :P.

I wonder if maybe I should change my sata controller off ide mode and put it into ahci mode... it is on ide mode by default in the bios.
Back to top
View user's profile Send private message
Alcap
n00b
n00b


Joined: 01 Jan 2007
Posts: 25

PostPosted: Mon Aug 04, 2008 10:38 am    Post subject: Reply with quote

I would change my controller if I were you. By the way, you should use hiren's boot CD. It's great to run diagnostics on a disk because it has lots of tools for it. Not just hard drives, but also for ram, anti-virus, anti-spyware. It's not much (as the apps are from seagate, etc, but it's a place to start)
Back to top
View user's profile Send private message
jdmulloy
Tux's lil' helper
Tux's lil' helper


Joined: 24 Dec 2004
Posts: 139
Location: Massachusetts, USA

PostPosted: Tue Aug 05, 2008 12:05 am    Post subject: Reply with quote

If you can get to the Disk with a live CD i recommend backing everything up just in case. Actually you should be doing regular backups anyhow.
_________________
Joe Mulloy | http://twitter.com/jdmulloy | Ron Paul in 2012! | 5-1-07 | Unban Playfool | Fire your "Too big to fail" bank http://moveyourmoney.info
Back to top
View user's profile Send private message
Zepp
Veteran
Veteran


Joined: 15 Mar 2004
Posts: 1246
Location: Ontario, Canada

PostPosted: Tue Aug 05, 2008 12:26 am    Post subject: Reply with quote

The drive is working fine right now, it is mostly just ripped movies and music CDs. I could rip them all again if I had to, it would just be painful. The only other stuff on there is backups of my old desktop linux install but don't really care about those so much anymore. I might burn them to DVD anyway though.

I changed the sata controller from "IDE" mode to AHCI, that god rid of one warning when booting that I didn't notice before, but I don't know if it will help my problem at all. Guess I will wait and see.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Off the Wall All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum