Does anyone know of any good drive-level testing software that can be run without taking the whole machine offline? Preferably, I'd just like to take that device entirely out of the array using "mdadm /dev/mdX -r /dev/sdbX," and test it while the rest of the array is running.
Alternatively, I could take the server down for 5 mins to remove the defective drive, and then test it in another machine. I'm assuming the number next to the drive in /proc/mdstat is the physical drive connection? For example, [0] is SATA0 & [1] is SATA1? I'm paranoid about mistakenly removing the good drive and making things worse.
Code: Select all
md1 : active raid1 sda1[0]
40064 blocks [2/1] [U_]
md2 : active raid1 sdb2[1] sda2[0]
2008000 blocks [2/2] [UU]
md3 : active raid1 sda3[0]
154240000 blocks [2/1] [U_]TIA!

