in ext4_free_inode:363: Corrupt filesystem

Goverp · Advocate Joined: 07 Mar 2007 Posts: 2014

I see Nicias had similar messages last year, so this might be related.

My system suddenly threw the following messages into syslog:

NeddySeagoon · Posted: Tue Oct 02, 2018 5:34 pm Post subject:

Goverp,

smartctl -a for the affected drive .. or all the drives in the raid set may be useful.
What does /proc/mdstat say about the raid sets. Has a drive/partiton been dropped?
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

Goverp · Advocate Joined: 07 Mar 2007 Posts: 2014

Hi Neddy.

As above, I doubt smartctl is of interest, as there are 4 partitions on the 4 disks in the array, and only this partition shows the problem. I looked at smartctl -H for all the drives, all happy.
For more detail, I looked at smartctl -a for each drive. No reallocations, no errors logged, one raw read error. I think the disks are fine!

I ran checkarray; it found a number of mismatches. All were in the dodgy partition. I've been playing with debugfs; all the mismatch sectors map to inode <7>. This is apparently a reserved/hidden inode,
EXT2_RESIZE_INO. It seems to be 400Mb, which looks a lot, but may make sense to someone who knows what the heck ext4 uses it for. The whole disk is just 43G.

I suspect (a) I should mdadm repair the disk, then (b) fsck -fp it, and hope everything is tidy.

There's something a bit suspect in the partition table. The one causing problems is partiion 1, which runs from sector 4 for a lot of sectors in md127. I'm not sure if that's vulnerable to being overwritten by the RAID superblocks. The array uses V1 superblocks, not 0.90.
_________________
Greybeard

NeddySeagoon · Posted: Tue Oct 02, 2018 8:44 pm Post subject:

Goverp,

mdadm repair probably won't do anything. It checks the underlying raid components for consistency.
Here, the problem is with the filesystem on top of the raid. I suspect that the raid is self consistent.

The version 1 raid superblock is at the beginning of the volume. When you donate whole drives to a raid set, it starts where the MBR would be if you had one.
As long as you have not created a partition table on one of the drives belonging to the raid set, you should be good.
mdadm would not be able at include that drive in the raid set (I hope), so you would be in degraded mode but working properly at the filesystem level.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.

Jaglover · Posted: Tue Oct 02, 2018 9:14 pm Post subject:

FWIW, smartcontrol -a won't tell much without running a test beforehand. You could run ddrescue to read all sectors and make disk aware of errors, direct output to /dev/null.
_________________
My Gentoo installation notes.
Please learn how to denote units correctly!

Goverp · Advocate Joined: 07 Mar 2007 Posts: 2014

No, no mucking about creating partitions either on the real disks or in the RAID array.

I suspect the mismatches come from a problem earlier this year, which I eventually traces to a loose SATA cable to one drive. That drive dropped out; I sorted the cables and added the drive back into the array. IIUC that process uses the RAID bitmap, and as the inode <7> stuff isn't in normal use, presumably it wasn't processed. Anyway, I'll try repairing the array, which should clear the mismatches. If I get any further issues, I'll delete and recreate the partition from backup.

As I've just performed the checkarray, that will have read the entire surface of all the disks; no errors reported in SMART.
_________________
Greybeard