wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Fri Jul 22, 2011 4:50 pm Post subject: md devices being assembled before multipath [SOLVED] |
I've recently installed a SAS2 disk array, and I'm having some issues bringing it up (correctly) on boot.
The root filesystem is on a RAID1 (motherboard SATA). The new storage array is in a JBOD attached to two LSI HBAs. multipath is used to map the two paths to one device, those multipath devices are assembled into four 9-disk RAID6 volumes which are part of an LVM2 volume group. A single logical volume exists in this volume group and is formatted with XFS.
This works fine when assembled manually but doesn't come up correctly when rebooting. The problem is that the md devices start before the multipath devices are created; AFAICT multipath fails because the devices are already in use by md.
I've turned off md autodetect in the kernel, and I edited the "before" line in /etc/init.d/multipath to include mdraid. The root device is assembled with a "md=127,/dev/sda1,/dev/sdb1" kernel parameter in grub.conf.
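The "before" edit itself isn't shown; on OpenRC it lives in the depend() function of the init script. A sketch of what the modified /etc/init.d/multipath block might look like (the stock script's depend() contents vary by package version, so treat this as illustrative rather than the actual file):

```shell
# /etc/init.d/multipath (sketch, not the stock file): depend() controls
# service ordering; adding mdraid to "before" makes multipath start first
depend() {
    before checkfs fsck mdraid
}
```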
You can see that the md devices are already assembled when /etc/init.d/mdraid is run. Here's an excerpt from /var/log/rc.log: Code: | rc boot logging started at Fri Jul 22 19:50:01 2011
* Setting system clock using the hardware clock [UTC] ... [ ok ]
* Loading module dm_multipath ... [ ok ]
* Autoloaded 1 module(s)
* Activating Multipath devices ... [ ok ]
* Starting up RAID devices ...
mdadm: /dev/md10 is already in use.
mdadm: /dev/md11 is already in use.
mdadm: /dev/md12 is already in use.
mdadm: /dev/md/126 is already in use.
[ !! ]
* Setting up the Logical Volume Manager ... [ ok ]
* Checking local filesystems ...
/sbin/fsck.xfs: XFS file system.
/sbin/fsck.xfs: UUID=a5c7abf9-d2bc-4d30-bf08-df08215c48c1 does not exist
/sbin/fsck.xfs: XFS file system.
* Operational error
[ !! ]
(...continues) |
In dmesg I can see "md: bind<sd*>" lines interspersed between the SCSI discoveries. Excerpt: Code: | [ 17.230502] scsi 6:0:26:0: Direct-Access SEAGATE ST32000444SS 0006 PQ: 0 ANSI: 5
[ 17.230508] scsi 6:0:26:0: SSP: handle(0x0025), sas_addr(0x5000c50033f6f6ce), phy(14), device_name(0x00c50050cef6f633)
[ 17.230511] scsi 6:0:26:0: SSP: enclosure_logical_id(0x500304800000007f), slot(6)
[ 17.230515] scsi 6:0:26:0: qdepth(254), tagged(1), simple(0), ordered(0), scsi_level(6), cmd_que(1)
[ 17.231283] sd 6:0:23:0: [sdax] Attached SCSI disk
[ 17.231454] sd 6:0:25:0: [sdaz] Write cache: enabled, read cache: enabled, supports DPO and FUA
[ 17.232243] sday: unknown partition table
[ 17.232835] sd 6:0:26:0: [sdba] 3907029168 512-byte logical blocks: (2.00 TB/1.81 TiB)
[ 17.233159] sd 6:0:26:0: Attached scsi generic sg54 type 0
[ 17.234782] sd 6:0:26:0: [sdba] Write Protect is off
[ 17.234785] sd 6:0:26:0: [sdba] Mode Sense: d7 00 10 08
[ 17.235817] md: bind<sdi>
|
Later in the dmesg output md assembles its devices despite having autoassemble turned off both in the kernel and on the kernel boot line. What gives?
Last edited by wildbug on Fri Mar 01, 2013 5:39 pm; edited 1 time in total |
wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Fri Jul 22, 2011 5:28 pm Post subject: |
udev's doing this, isn't it?
Code: | # rc-update show sysinit
devfs | sysinit
dmesg | sysinit
udev | sysinit
# find /lib64/udev -name "*md*"
/lib64/udev/rules.d/64-md-raid.rules |
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54236 Location: 56N 3W
Posted: Fri Jul 22, 2011 5:34 pm Post subject: |
wildbug,
Kernel auto-assembly works with RAID superblock version 0.90 only. That hasn't been the default for about six months now.
That change has caused issues for a lot of people who were expecting RAID auto-assembly to just work.
What version RAID superblocks do you have?
Try Code: | $ sudo /sbin/mdadm -E /dev/sda1
Password:
/dev/sda1:
Magic : a92b4efc
Version : 0.90.00
UUID : 9392926d:64086e7a:86638283:4138a597
Creation Time : Sat Apr 11 16:34:40 2009
Raid Level : raid1
[snip] | on one of the volumes that you donate to the RAID. I would expect your version to be 1.2, in which case your RAID sets must be being assembled by mdadm somewhere.
Kernel auto assembly looks like this in dmesg
Code: | [ 2.380529] md: Waiting for all devices to be available before autodetect
[ 2.380874] md: If you don't use raid, use raid=noautodetect
[ 2.381485] md: Autodetecting RAID arrays.
[ 2.503208] md: Scanned 12 and added 12 devices.
[ 2.504567] md: autorun ...
[snip] |
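If there are many member devices, the Version lines can be filtered out rather than reading one full report at a time. A sketch, shown here against a captured report so it is self-contained (on the real machine you would feed it the output of mdadm -E for each member; the device name and values are taken from the example above):

```shell
# Abridged 'mdadm -E' report for one member, captured as text;
# on a live system you would use: report="$(mdadm -E /dev/sda1)"
report='/dev/sda1:
          Magic : a92b4efc
        Version : 0.90.00
     Raid Level : raid1'

# Print only the superblock version
printf '%s\n' "$report" | awk -F' : ' '/Version/ {print $2}'
# prints: 0.90.00
```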
_________________ Regards,
NeddySeagoon
Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail. |
wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Fri Jul 22, 2011 6:03 pm Post subject: |
Neddy, thanks for the reply, but despite my erroneous reply (and subsequent retraction) in the other thread, I realize that autoassemble only works for 0.90 superblocks. In fact it was my experience described above that made me think that autoassemble was working despite non-0.90 superblocks. However, the root device DOES have v0.90 superblocks and has been autoassembling correctly for months now. But with the recent addition of devices that require multipath to be active before md, I've intentionally turned off autodetect and just added a kernel parameter to assemble the root device (as detailed in my OP).
I'm not trying to get arrays to autoassemble; I'm trying to STOP them from being assembled before multipath is activated. I turned off RAID autodetect in the kernel (and added the redundant "raid=noautodetect" kernel parameter, just in case); non-root arrays are being assembled between kernel boot and the boot runlevel, which is why I'm now wondering if udev is responsible.
(For the record, my root device is 0.90 and the other md devices are a mixture of 1.1 and 1.2. Not that it's relevant.)
Here's the root device being correctly assembled (from dmesg): Code: | [ 6.797329] md: Skipping autodetection of RAID arrays. (raid=autodetect will force)
[ 6.797975] md: Loading md127: /dev/sda1
[ 6.798894] md: bind<sda1>
[ 6.799383] md: bind<sdb1>
[ 6.800027] bio: create slab <bio-1> at 1
[ 6.800490] md/raid1:md127: active with 2 out of 2 mirrors
[ 6.800847] md127: detected capacity change from 0 to 224063848448
[ 6.801501] md127: unknown partition table
[ 6.803088] md127: unknown partition table
[ 6.829729] XFS (md127): Mounting Filesystem
[ 6.845067] usb 3-1: new low speed USB device number 2 using ohci_hcd
[ 6.859377] XFS (md127): Ending clean mount
[ 6.859741] VFS: Mounted root (xfs filesystem) readonly on device 9:127.
|
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54236 Location: 56N 3W
Posted: Fri Jul 22, 2011 7:15 pm Post subject: |
wildbug,
We have established that it's not the kernel auto-assembling your 1.1/1.2 RAID sets, which is a step in the right direction.
What does rc-update show produce?
mdadm should not be listed, or it will be started in its sequence by the startup scripts.
Just because a service is not listed in rc-update show does not mean it is not running. _________________ Regards,
NeddySeagoon
Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail. |
wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Fri Jul 22, 2011 7:43 pm Post subject: |
Code: | # rc-update show
bootmisc | boot
consolefont | boot
consolekit | default
cupsd | default
dbus | default
devfs | sysinit
device-mapper | boot
dmesg | sysinit
fsck | boot
hostname | boot
hwclock | boot
keymaps | boot
killprocs | shutdown
local | default nonetwork
localmount | boot
lvm | boot
mdadm | default
mdraid | boot
modules | boot
mount-ro | shutdown
mtab | boot
multipath | boot
multipathd | default
net.eth0 | default
net.eth1 | default
net.lo | boot
netmount | default
nfs | default
nfsmount | default
ntpd | boot
pbs_mom | default
pbs_sched | default
pbs_server | default
procfs | boot
root | boot
savecache | shutdown
sshd | default
swap | boot
sysctl | boot
termencoding | boot
udev | sysinit
udev-postmount | default
upsd | default
upsdrv | default
upsmon | default
urandom | boot
vgl | default
xdm | default
xinetd | default |
FYI, /etc/init.d/mdadm is the monitoring daemon; there is no assembly. /etc/init.d/mdraid is the one containing "mdadm -As".
But look at my OP again, specifically the first code block. That's output from the rc_logger for the boot runlevel. You can see the order of services being executed -- hwclock, modules, multipath, mdraid, etc. When it hits mdraid, it declares that the md devices are already assembled. That means that something is putting them together AFTER kernel autodetect and BEFORE mdraid.
When I first posted this, I didn't realize there was a sysinit runlevel before boot. There are three services in sysinit -- devfs, dmesg, and udev. It's possible that udev or devfs is triggering the md devices (which is what I was getting at in my second post). I'm not intimately familiar with either of those. |
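For what it's worth, the boot-runlevel ordering can be pulled straight out of the asterisk lines in rc.log, which confirms that multipath is scheduled before mdraid there and that the premature assembly must therefore happen earlier (i.e. in sysinit). A sketch over a captured fragment of the log quoted in the first post; on the real machine you would read /var/log/rc.log directly:

```shell
# Fragment of /var/log/rc.log as quoted earlier in the thread
rclog='* Setting system clock using the hardware clock [UTC] ... [ ok ]
* Loading module dm_multipath ... [ ok ]
* Activating Multipath devices ... [ ok ]
* Starting up RAID devices ...'

# Strip the "* " markers and trailing status to list the startup steps in order
printf '%s\n' "$rclog" | sed -e 's/^\* //' -e 's/ \.\.\..*//'
```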
wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Fri Jul 22, 2011 8:19 pm Post subject: |
Here's my complete dmesg: http://pastebin.com/raw.php?i=QGq7wit4
Here's an excerpt of the array assembly timeline: Code: | [ 6.778542] md/raid1:md127: active with 2 out of 2 mirrors
[ 7.618775] udev[2822]: starting version 164
[ 7.772809] md/raid1:md126: active with 2 out of 2 mirrors
[ 8.282715] md/raid:md10: raid level 6 active with 9 out of 9 devices, algorithm 2
[ 8.401377] md/raid:md11: raid level 6 active with 9 out of 9 devices, algorithm 2
[ 17.420690] md/raid:md125: raid level 5 active with 6 out of 6 devices, algorithm 2
[ 17.845425] md/raid:md12: raid level 6 active with 9 out of 9 devices, algorithm 2
[ 18.108825] md/raid:md13: raid level 6 active with 9 out of 9 devices, algorithm 2
[ 20.816985] device-mapper: multipath: version 1.3.0 loaded
[ 21.021549] device-mapper: table: 253:0: multipath: error getting device |
wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Mon Jul 25, 2011 8:05 pm Post subject: |
Yep, udev is the culprit. The rule /lib/udev/rules.d/64-md-raid.rules (supplied by sys-fs/mdadm) calls "mdadm --incremental" on the device. If I remove that file, md arrays are not automatically assembled.
Now I have to figure out how to fix this in an upgrade-friendly way. I'd like to make that rule ignore disks attached via the HBAs. Could I do that by creating a custom rule in /etc/udev/rules.d, without deleting/editing the "official" /lib/udev/rules.d/64-md-raid.rules? |
NeddySeagoon Administrator
Joined: 05 Jul 2003 Posts: 54236 Location: 56N 3W
Posted: Mon Jul 25, 2011 8:25 pm Post subject: |
wildbug,
If you create a rule in /etc/udev/rules.d in a file with a lower number than /lib/udev/rules.d/64-md-raid.rules,
say 03-md-raid.rules, that does nothing, it will be run before 64-md-raid.rules and will not be affected by updates either.
It must match the same thing(s) as 64-md-raid.rules does.
udev will trigger, execute your rule that does nothing, then you have full manual control. _________________ Regards,
NeddySeagoon
Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail. |
wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Mon Jul 25, 2011 10:26 pm Post subject: |
I think I've solved this. I can't reboot to test right now as I currently have users running some long simulations, but udevadm test seems to produce correct results. I'll mark the thread as solved once I can reboot and confirm.
This is what I did:
The server in question has two LSI 9200-8e HBAs and an onboard SAS controller with the same chipset (LSI2008). As I only want to include devices connected to the HBAs, I used "udevadm info" to find differences between the device trees of drives attached to the motherboard and the HBAs. At one level I found differences -- ATTRS{subsystem_vendor} and ATTRS{subsystem_device}. I could now identify the correct devices.
The next part was overriding the array assembly. I finally realized that there was only one line in 64-md-raid.rules that I had to circumvent:
Code: | ENV{ID_FS_TYPE}=="linux_raid_member", ACTION=="add", RUN+="/sbin/mdadm --incremental $env{DEVNAME}" |
If either ENV{ID_FS_TYPE} or ACTION fails to match, then mdadm isn't executed. ENV variables are settable, so I created my own rule to unset ENV{ID_FS_TYPE} just before 64-md-raid.rules is consulted. It also has to come after /lib/udev/rules.d/60-persistent-storage.rules, as that is where the variable is originally set (as identified from udevadm test output).
Code: |
# /etc/udev/rules.d/63-lsi-9200-8e.rules
KERNEL=="sd*", ACTION=="add", DRIVERS=="mpt2sas", ATTRS{subsystem_vendor}=="0x1000", ATTRS{subsystem_device}=="0x3080", ENV{ID_FS_TYPE}="" |
Using udevadm test on devices not attached to the HBAs, I can see the "mdadm --incremental" line; HBA-attached devices no longer include it. That should mean success. |
dmitryilyin n00b
Joined: 08 Apr 2008 Posts: 27 Location: Netherlands
Posted: Tue Jul 26, 2011 7:09 pm Post subject: |
You'll have more luck with a better server distribution (assuming you are working with a server), something Debian- or RedHat-like.
They use an advanced initramfs and are much better suited for production servers.
Gentoo is good for learning Linux, development and experimenting. |
wildbug n00b
Joined: 07 Oct 2007 Posts: 73
Posted: Fri Mar 01, 2013 5:38 pm Post subject: |
So I finally got around to rebooting...
I think I have this sorted out. Custom udev rules are not necessary.
What happens is that udev runs in the sysinit runlevel; /lib/udev/rules.d/64-md-raid.rules is part of this process, and it uses mdadm in incremental mode to attempt to automatically assemble RAID devices as components are discovered. However, it does respect /etc/mdadm.conf, so that file can be used to control behavior during this step. Whitelisting devices using a DEVICE line wasn't sufficient; it seems only ARRAY lines are affected by this. An AUTO line set to blacklist all arrays, in conjunction with selectively whitelisting arrays in ARRAY lines with "auto=yes", will work. Setting "devices=/dev/mapper/*" in the ARRAY lines was also necessary.
/etc/mdadm.conf
Code: | AUTO -all
DEVICE /dev/sd[ab]*
DEVICE /dev/mapper/*
ARRAY /dev/md10 UUID=xxxxxxxx:xxxxxxxx:xxxxxxxx:xxxxxxxx devices=/dev/mapper/* spares=1 spare-group=mp_spares
ARRAY /dev/md11 UUID=xxxxxxxx:xxxxxxxx:xxxxxxxx:xxxxxxxx devices=/dev/mapper/* spares=1 spare-group=mp_spares
ARRAY /dev/md/root UUID=xxxxxxxx:xxxxxxxx:xxxxxxxx:xxxxxxxx auto=yes |
The dm-multipath kernel module must be loaded (if you've built it as a module) or /etc/init.d/multipath will fail to start.
/etc/conf.d/modules
Code: | modules="dm_multipath" |
And mdraid, which will be used to bring up the arrays listed in mdadm.conf, needs to be started after multipath.
/etc/conf.d/mdraid
Code: | rc_need="multipath" |