2.6.25-gentoo-r9 is VERY slow [Solved]

bfdi533 · Tux's lil' helper Joined: 11 Jun 2003 Posts: 133

I just upgraded my kernel from 2.6.23-gentoo-r9 to 2.6.25-gentoo-r9.

Now that I have done this, every time a program starts, it has a 20-second pause before the program runs, longer if it is an X app.

What information do I need to share to help debug this slowdown?

Any ideas on why this would be would be GREATLY appreciated.

mgrela · Posted: Mon Dec 15, 2008 8:45 pm Post subject:

Run the slow starting program with "strace" like this:

NeddySeagoon · Posted: Mon Dec 15, 2008 9:43 pm Post subject:

bfdi533,

You are probably missing DMA for your hard drive.

Please report what hdparm /dev/... shows.
If it shows DMA is off, also post your lspci, so we can describe how to fix it
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

bfdi533 · Tux's lil' helper Joined: 11 Jun 2003 Posts: 133

mgrela, not really showing anything significant that I can tell with strace. top shows 80-95% id -- not sure what "id" is though.

NeddySeagoon, here is the data requested:

NeddySeagoon · Posted: Mon Dec 15, 2008 10:26 pm Post subject:

bfdi533,

id in top is idle.
You have two drive controller there:-
00:1f.1 IDE interface: Intel Corporation 82801EB/ER (ICH5/ICH5R) IDE Controller (rev 02)
00:1f.2 IDE interface: Intel Corporation 82801EB (ICH5) SATA Controller (rev 02)

With your hardware and that kernel, I would move to the libata driver, like this
Its not clear if you have two IDE drives on the IDE controller, in which case it looks to be ok or two SATA drives on the SATA controller running with the old depreciated IDE SATA driver.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

bfdi533 · Tux's lil' helper Joined: 11 Jun 2003 Posts: 133

eccerr0r · Posted: Wed Dec 17, 2008 6:19 pm Post subject:

Is your hard drive making strange noises or otherwise failing? Any SMART issues?

Are you -sure- there are no background tasks running, and does the old kernel exhibit proper behavior?

I'm having a hard time believing that any kernel change would cause a 9 second directory listing.
_________________
Intel Core i7 2700K/Radeon R7 250/24GB DDR3/256GB SSD
What am I supposed watching?

bfdi533 · Tux's lil' helper Joined: 11 Jun 2003 Posts: 133

NeddySeagoon · Posted: Wed Dec 17, 2008 7:30 pm Post subject:

bfdi533,

The two different times you posted are due to the kernel buffering disc reads incase the data is needed again.
Your first ls forces the kernel to read the drive, the second one only reads the in RAM cache.

I'm not sure what the data in the RAW_VALE fiels indicates but /sdb is clearly in a poor state.
Seek errors cause retries to read the data. A retry costs a single revolution of the disk at minimum, sometimes several.
If it also needs the head to be recalibrated, the retry process will take a lot longer.
Hardware_ECC_Recovered errors mean the data was recovered from the platter incorrectly but the drive electronics was subsequently able to correct the errors.

I would suggest that sdb is dying. Its been operating for 35623 hours, which is over 4 years nonstop. Its working hard to return valid data both with error correction and retries, What the SMART data does not tell is if the errors occur all over the drive surface, or if its a small part that is read repeatedly. I'm inclined to think its the former, as kernel caching should minimise the latter.

For a more thorough test, get the manufactuers test software from their website. However, it will need to write all over the drive so you will need to move your data off.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

bfdi533 · Tux's lil' helper Joined: 11 Jun 2003 Posts: 133

eccerr0r · Posted: Wed Dec 17, 2008 9:26 pm Post subject:

bfdi533 · Tux's lil' helper Joined: 11 Jun 2003 Posts: 133

eccerr0r · Posted: Thu Dec 18, 2008 12:15 am Post subject:

Run 'ps ax' and look for any processes whose STATe are "Z" or "D"...

Also cat /proc/interrupts and see if there are any interrupts that are "ringing off hook"? screwed up USB?
_________________
Intel Core i7 2700K/Radeon R7 250/24GB DDR3/256GB SSD
What am I supposed watching?

bfdi533 · Tux's lil' helper Joined: 11 Jun 2003 Posts: 133

It turned out to be just the hard drive. I copied all of the contents to a new drive and replaced it and the system is now zippy again. My guess is that about the time of one of the kernel builds and reboots, the hard drive started to have issues since I KNOW it was coincident with the new kernel and reboot.

Thanks for all for the helpful tips and insight.