Gentoo Forums :: Gentoo Chat
Question about the safety of data
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Fri Mar 17, 2017 1:05 pm    Post subject: Question about the safety of data

Hello folks.

I've been watching some videos about ZFS/btrfs, and they raise a question in my mind that I can't answer on my own at this point. I've tried to figure it out alone, but I'm unsure. I hope this is the right forum for it, because here goes. Let me describe my setup:

2 x 4 TB drives. I've put them in a RAID1 array, formatted XFS, and that's all she wrote.

OK. According to the videos I've seen, this setup is vulnerable to data alteration over time. I don't really see how that's a thing. Isn't a filesystem self-consistent? The theory goes: if a place on a hard disk just decides to hold 1's instead of 0's in a particular sector (just because ONE of the disks decided on its own that a particular bit is a 1 and not a 0, while the other disk still has the 0), then in a RAID1 array with XFS on top, the end result of my data could somehow be a 0 and not a 1. This theory rests on the idea that hard drives return data other than what you put into them, and don't return an error for it.

I guess _some_ hard drives degrade over time. Flash does it all the time.

So, going down the food chain: since I haven't used degradable storage options, if a drive returns data other than what you actually put into it, would XFS know? I'm given to understand: no. If you have a file with the digits 10 in it, and the disk somehow changes the data inside the file without updating the rest of XFS, so the data just degrades on the drive and somehow ends up as 01, would XFS know? I'm given to understand: no. Assuming, of course, the drive doesn't complain to the BIOS, which again, I'm given to understand, most cheap drives don't.

Second question, going down the food chain: would md RAID1 know? I'm given to understand: no.

Third question: would btrfs know? I'm given to understand: yes, and the same for ZFS. Is that correct? Did I understand correctly?
NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 54237
Location: 56N 3W

Posted: Fri Mar 17, 2017 1:52 pm

axl,

axl wrote:
OK. According to the videos I've seen, this setup is vulnerable to data alteration over time. I don't really see how that's a thing. Isn't a filesystem self-consistent? The theory goes: if a place on a hard disk just decides to hold 1's instead of 0's in a particular sector (just because ONE of the disks decided on its own that a particular bit is a 1 and not a 0, while the other disk still has the 0), then in a RAID1 array with XFS on top, the end result of my data could somehow be a 0 and not a 1. This theory rests on the idea that hard drives return data other than what you put into them, and don't return an error for it.


It doesn't happen. The drive has error correction and checksums.
The probability that a one-bit error beats the error correction and checksums is
Douglas Adams wrote:
infinitely improbable

On a software RAID set, a much bigger problem is the write hole.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Fri Mar 17, 2017 3:03 pm

NeddySeagoon wrote:
It doesn't happen. The drive has error correction and checksums.
The probability that a one-bit error beats the error correction and checksums is
Douglas Adams wrote:
infinitely improbable


I assumed as much. But according to some people, some drives don't raise any alarm when data doesn't match its checksum. Wait, help me understand: how does a standard hard drive checksum? Where does it keep the checksums? On the drive?

I use md5sum to see if a file changed over time. I'm not entirely sure how a drive does it. Or XFS. Or md. The proponents of ZFS/btrfs make it sound like everything else CAN fail without the user, or worse, the kernel knowing it. Can that be possible?

Not probable. But possible?

Let's take my setup: I have 2 x 4 TB drives, in RAID1, with XFS on top. I boot a bare kernel without RAID support, don't initialize the RAID, don't mount anything. Now I go to random points on drive A and change random 1's into 0's. Some of them, I don't know how many. A few, let's say 20. I use a small C program: seek, change the data on drive A, not on drive B. Now reboot. Would the RAID know? Would the RAID know if just 20 bytes of data on drive A differed from drive B? Would XFS know? Would the kernel complain? How would it look to me as a user when I try to access that data, through NFS let's say?
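For illustration, a destructive sketch of that experiment in bash with dd instead of a C program; /dev/sdb is a placeholder for one mirror member, and the array must not be assembled while you do this:

Code:
#!/bin/bash
# DANGER: deliberately corrupts one RAID1 member. Scratch hardware only.
DEV=/dev/sdb   # placeholder: one mirror member, array NOT assembled

for i in $(seq 1 20); do
    # pseudo-random byte offset somewhere inside a 4 TB disk
    off=$(( ((RANDOM << 30) | (RANDOM << 15) | RANDOM) % 4000000000000 ))
    # overwrite a single byte in place and flush it out;
    # the drive recomputes its internal ECC on write, so the sector
    # reads back "clean" - it just no longer matches its mirror
    printf '\xaa' | dd of="$DEV" bs=1 count=1 seek="$off" \
        conv=notrunc,fsync status=none
done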

According to what I thought I knew, XFS should put zeros in that file, because it's unsure whether the file had ones or zeros. XFS always puts zeros when it's unsure, and has some sort of checksum verification.

md/RAID on the other hand... I'm not really sure how checksums are handled there, if at all. Any?! Really? I really have to check. I'll use an old system with two 250 GB PATA drives, change a single character on drive A, and see how long it takes before something catches on that something is wrong.

Quote:
On a software RAID set, a much bigger problem is the write hole.


Again, the proponents of ZFS/btrfs (as I understand it) say that ZFS/btrfs protects you when your drive remembers wrong. That's the best way I can put it: the drive is old, and it just remembers wrong. They also say ZFS/btrfs protects you against data loss when a power outage is involved... but that's not what I am worried about.

I am not concerned with power outages or incomplete writes. In my setup, the system always shuts down properly.

I am only concerned with the idea/possibility that a pair of drives with supposedly identical data could actually end up holding different data over time.

And it's not only an idea or a possibility. It has come to bite me on the ass quite a few times, to be honest. SD cards come to mind.

The whole reason I started thinking about this subject is that I was reading threads about btrfs on the Raspberry Pi: how it would cut the space in half but provide some sort of data error checking, and data recovery. XFS on flash, for Gentoo on a Raspberry Pi, is flash memory that will die fast. And in my experience, XFS doesn't say anything until it's too late.

Meanwhile, I remember plenty of files on XFS over the years where the kernel said nothing about them... but hey, this file has somehow degraded over time. It's not the same. It has zeros in it.

I know what anyone would say, what I would say: you were writing data while the server rebooted, while the power went off. No. That array mostly holds videos, and over time, once in a while, a file would get a series of zeros instead of real data. In any movie player it looks just like an MPEG artifact, an error. But that error wasn't always there.

We all assume that HDDs verify checksums. Against what exactly, I don't know. Themselves? The very disk that can go wrong is the very medium where the checksums are kept? No, can't be that. A memory of some kind? No... can't be that either. So... bullshit. Enterprise drives do, because of special components (SCSI/SAS). Not cheapware, not normal SATA.

Does RAID1 checksum? I THINK IT SHOULD, and I will be surprised if it doesn't. But in a RAID1 setup, when drive A says one thing and drive B says another, and no BIOS complains... what does the kernel do?

And XFS. Let's assume the RAID/md layer in the kernel doesn't checksum, and one time you read file X and it comes back as 1, then you read it again and it comes back as 0. md didn't catch it; it just served raw data.

How would XFS know which is right? We say filesystems are self-consistent (like a crazy dude checking his own md5 sums), but XFS isn't. Is it?

You're welcome to insult me for every stupid thing I said. I just want to understand.
Roman_Gruber
Advocate

Joined: 03 Oct 2006
Posts: 3846
Location: Austro Bavaria

Posted: Fri Mar 17, 2017 3:03 pm

A bit off-topic.

When you care about data safety, you should not consider exotic filesystems like btrfs, XFS, ZFS.

I use ext4 because I assume it is the most widely used filesystem right now. So I assume bugs are more likely to have been discovered and fixed than in other filesystems.

I would be more worried about data corruption, and that is more likely to happen with exotic filesystems. Point a finger at reiserfs.

Also, I use three different brands, models, and production years for my backups: 3 SSDs. I am slowly getting rid of HDDs, but the market for used HDDs has vanished these days.

Data safety: hardware vs. software vs. user.

Quote:
how does a standard hard drive checksum?


The firmware of the hard drive (or SSD) does this for you.

Quote:
go to random points on drive A and change random 1's into 0's.


A usual question for filesystems... I think there are theses, or papers, about those values:
how many bit errors can be corrected for a given data set (it depends on the structure).

Keyword: bit error correction in filesystems / hard drives

Quote:
flash memory that will die fast


Is this the same hoax as the guys telling you that an SSD dies quite fast?

How come my Samsung HDDs died earlier than my now exactly five-year-old Plextor SSD, which I have used for everything on my Gentoo notebook on a daily basis? Many years, with even the distfiles on the same SSD.

Those values are just estimates. Mathematical models.

I have had at least three broken 2.5" SATA HDDs (yay, Samsung is crap), compared with no broken USB sticks, SSDs, or SD cards from my camera, and no broken memory in my four-year-old smartphone.

Quote:
Again, the proponents of ZFS/btrfs (as I understand it) say that ZFS/btrfs protects you when your drive remembers wrong. That's the best way I can put it: the drive is old, and it just remembers wrong. They also say ZFS/btrfs protects you against data loss when a power outage is involved... but that's not what I am worried about.


Any filesystem tries to correct errors, or has tools for it. There is information stored with the data for this purpose => fsck.XY.

Data loss when power is lost => the ext4 journal works as it should.


Quote:
I know what anyone would say, what I would say: you were writing data while the server rebooted, while the power went off. No. That array mostly holds videos, and over time, once in a while, a file would get a series of zeros instead of real data. In any movie player it looks just like an MPEG artifact, an error. But that error wasn't always there.


Sorry, but that sounds like bullshit.

A recent example of why I think so:

I took photos and videos in 2008,
copied those "media" to a 320 GB HDD,
did the same to 1.5 TB 3.5" HDDs (several kinds),
had them on my 1 TB 2.5" notebook HDD,
and had them on 2.5" SSDs of 250 GB and 320 GB, with different kernels / versions of the ext4 tools.

I moved the pictures from my untouched SD cards (from 2008!) to my HDD with my notebook card reader.

I moved all the pictures to my HDD, then my SSD, and ran a duplicate file finder.

Every same-named picture from all those "backups" had the same checksum and the same visual content.

The same goes for the videos.

I did that quite recently => you can check my recent post where I asked why HandBrake dropped audio from my video, and why the duplicate file finder quite randomly does not generate a list of duplicate files.

--

Summary: the checksums across SD cards, ext3, ext4, 2.5" and 3.5" HDDs were all the same.


Only explanation: you got data corruption. And as I wrote earlier: do not use exotic filesystems.

Quote:
Does RAID1 checksum? I THINK IT SHOULD, and I will be surprised if it doesn't. But in a RAID1 setup, when drive A says one thing and drive B says another, and no BIOS complains... what does the kernel do?

+ rest of it ...



I want to talk only about LVM2 + ext4.

Usually you determine with tune2fs and fstab when and how often the filesystem should be checked.
I still remember the messages, and they still exist with OpenRC: filesystem should be checked... tune2fs cries...
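For reference, that per-filesystem check schedule is a tune2fs setting; a sketch, with /dev/sdXn as a placeholder ext4 partition:

Code:
# check the filesystem every 30 mounts or every 90 days, whichever comes first
tune2fs -c 30 -i 90d /dev/sdXn

# inspect the current mount count and check schedule
tune2fs -l /dev/sdXn | grep -iE 'mount count|check'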


Anyway, I want to refer you to the specs for those RAID levels.

I also want to refer you to the XFS spec sheet and code.

I would not assume that those software RAIDs do any checksumming and such. Hardware RAID controllers may; again, the specifications would reveal it. Same with your filesystem questions.

--

I use LVM2 so I can be independent of the hardware layer.
I use ext4 because it has, most of the time, fixed its errors when I forgot to plug in my notebook's power. That happened again a few hours ago.
steveL
Watchman

Joined: 13 Sep 2006
Posts: 5153
Location: The Peanut Gallery

Posted: Fri Mar 17, 2017 3:55 pm

Great site, Neddy.

Makes me think "TF for sysadmins".. ;-)

Although every first-line Gentoo user is an admin of their own machine(s), we all rely on the services of dedicated system administrators.
NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 54237
Location: 56N 3W

Posted: Fri Mar 17, 2017 4:02 pm

axl,

You cannot change user data on a HDD without changing the error correcting data and checksums too.
You send the drive a sector's worth of user data. The drive adds the error correcting data and checksum.
On read, the drive reads the sector, applies error correction and checks the checksum.
If all is well, you get your data back. If not, the drive retries. When it runs out of retries, you get a read error back and no data.
Any filesystem that invents data in the face of a read error is broken by design.

When you write random blocks on the drives in a two-spindle RAID1, the raid layer won't notice until you run a check.
Then it can tell that the mirrors differ, but it can't tell which, if either, is correct.
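That check is triggered through sysfs; a sketch, with /dev/md0 as a placeholder array:

Code:
# ask md to read every sector of both mirrors and compare them
echo check > /sys/block/md0/md/sync_action

# progress shows up alongside the normal resync status
cat /proc/mdstat

# afterwards: sectors found to differ between the mirrors (0 = clean)
cat /sys/block/md0/md/mismatch_cnt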

The sector data on the drive is a continuous stream of flux reversals:
user data, error correction data and checksum, differentiated only by context.
Much like with GRUB: the kernel and initrd files are just data to be read.
It's only when GRUB jumps to the kernel start address that the kernel becomes instructions to the CPU.
Instructions are different lengths too; it's still only context that separates instructions from data.

An HDD sector read either works in full or it doesn't.
If the checksum itself becomes corrupt, the check fails even when the user data is correct. You still get a read error.
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Fri Mar 17, 2017 4:51 pm

Cool, I like your reply. Let me answer point by point.

Roman_Gruber wrote:
A bit off-topic.

When you care about data safety, you should not consider exotic filesystems like btrfs, XFS, ZFS.



Let me stop you right there. I've been using XFS since 2.4. Back then you had to manually patch your kernel and recompile it, and that was on Slackware. In 2001 Gentoo already had XFS in gentoo-sources, which is one of the reasons I adopted Gentoo.

So going back and saying I won't use an experimental FS... would be kind of hypocritical at this point, for me. Many years of XFS. Only XFS.

(As a side note, I actually posted something about mfs.)

Quote:
I use ext4 because I assume it is the most widely used filesystem right now. So I assume bugs are more likely to have been discovered and fixed than in other filesystems.

I would be more worried about data corruption, and that is more likely to happen with exotic filesystems. Point a finger at reiserfs.


Again: 2001. Back then, I tried reiserfs for Cyrus. Cyrus IMAPd is sort of like Squid: tons of little files. Perfect for reiserfs, you would say... if you want to lose data.

reiserfs is great at small files... that you want to lose. Better than NTFS, which is the greatestestestest FS at deleting files. No kidding, benchmarks suggest NTFS is great at deleting files.

Quote:
Also, I use three different brands, models, and production years for my backups: 3 SSDs. I am slowly getting rid of HDDs, but the market for used HDDs has vanished these days.

Data safety: hardware vs. software vs. user.

Quote:
how does a standard hard drive checksum?


The firmware of the hard drive (or SSD) does this for you.

A usual question for filesystems... I think there are theses, or papers, about those values:
how many bit errors can be corrected for a given data set (it depends on the structure).

Keyword: bit error correction in filesystems / hard drives



I only have to suffer three Intel 750 Series drives and the 4 TB array I was talking about, which is on WD Reds. So... yeah, I can't complain.

But again, going back to the Raspberry Pi and those hateful flash cards, I CAN COMPLAIN A LOT.

Because I'm not talking about hardware; I'm assuming faulty hardware. I'm talking about software.

So the assumption is that the drive will checksum the data and report to the kernel/BIOS that the data is faulty.

Which, again, most drives don't. They don't even check. Against what? It's only an idea :D

It becomes painfully obvious with flash drives, where there is nothing to check against.

Quote:
Quote:
flash memory that will die fast


Is this the same hoax as the guys telling you that an SSD dies quite fast?

How come my Samsung HDDs died earlier than my now exactly five-year-old Plextor SSD, which I have used for everything on my Gentoo notebook on a daily basis? Many years, with even the distfiles on the same SSD.

Those values are just estimates. Mathematical models.

I have had at least three broken 2.5" SATA HDDs (yay, Samsung is crap), compared with no broken USB sticks, SSDs, or SD cards from my camera, and no broken memory in my four-year-old smartphone.


No, it's not the same thing, pay attention. It's about data integrity in ordinary filesystems and software RAID, and what ZFS/btrfs are actually supposed to be.

Quote:

Quote:
Again, the proponents of ZFS/btrfs (as I understand it) say that ZFS/btrfs protects you when your drive remembers wrong. That's the best way I can put it: the drive is old, and it just remembers wrong. They also say ZFS/btrfs protects you against data loss when a power outage is involved... but that's not what I am worried about.


Any filesystem tries to correct errors, or has tools for it. There is information stored with the data for this purpose => fsck.XY.

Data loss when power is lost => the ext4 journal works as it should.


But does it? I mean, XFS/JFS/ext3/4 and that handicapped reiser/NTFS are journaled filesystems, which were all the rage back around kernel 2.6, because the journal could be consistent with itself, which meant the journal was checksummed in some sort of way. But the drive holds 4 TB; you can't checksum all of that data at boot.


Quote:
Quote:
I know what anyone would say, what I would say: you were writing data while the server rebooted, while the power went off. No. That array mostly holds videos, and over time, once in a while, a file would get a series of zeros instead of real data. In any movie player it looks just like an MPEG artifact, an error. But that error wasn't always there.


Sorry, but that sounds like bullshit.

A recent example of why I think so:

I took photos and videos in 2008,
copied those "media" to a 320 GB HDD,
did the same to 1.5 TB 3.5" HDDs (several kinds),
had them on my 1 TB 2.5" notebook HDD,
and had them on 2.5" SSDs of 250 GB and 320 GB, with different kernels / versions of the ext4 tools.

I moved the pictures from my untouched SD cards (from 2008!) to my HDD with my notebook card reader.

I moved all the pictures to my HDD, then my SSD, and ran a duplicate file finder.

Every same-named picture from all those "backups" had the same checksum and the same visual content.

The same goes for the videos.

I did that quite recently => you can check my recent post where I asked why HandBrake dropped audio from my video, and why the duplicate file finder quite randomly does not generate a list of duplicate files.

--

Summary: the checksums across SD cards, ext3, ext4, 2.5" and 3.5" HDDs were all the same.


Only explanation: you got data corruption. And as I wrote earlier: do not use exotic filesystems.

Quote:
Does RAID1 checksum? I THINK IT SHOULD, and I will be surprised if it doesn't. But in a RAID1 setup, when drive A says one thing and drive B says another, and no BIOS complains... what does the kernel do?

+ rest of it ...





True, the only explanation is data corruption. So what do you do? Some people partitioned the flash in two and made it btrfs; others made a RAID1 from two partitions on the same disk. I'm pretty sure no one knows the answer to this question.


Quote:

I want to talk only about LVM2 + ext4.

Usually you determine with tune2fs and fstab when and how often the filesystem should be checked.
I still remember the messages, and they still exist with OpenRC: filesystem should be checked... tune2fs cries...


Anyway, I want to refer you to the specs for those RAID levels.

I also want to refer you to the XFS spec sheet and code.

I would not assume that those software RAIDs do any checksumming and such. Hardware RAID controllers may; again, the specifications would reveal it. Same with your filesystem questions.

--

I use LVM2 so I can be independent of the hardware layer.
I use ext4 because it has, most of the time, fixed its errors when I forgot to plug in my notebook's power. That happened again a few hours ago.


Uhm, XFS doesn't do any periodic fsck at all. It's not like Windows NTFS, where every 50 boots you have to fsck. XFS is the most versatile FS I've seen so far. Again, I'm not doubting the FS, just the drives. :)

Uhm, LVM is pretty cool, but I'm not sure it does checksums either. This is the big selling point of btrfs and ZFS: your data is exactly the same data you left there.

That's the feeling I'm getting.
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Fri Mar 17, 2017 5:06 pm

NeddySeagoon wrote:
axl,

You cannot change user data on a HDD without changing the error correcting data and checksums too.
You send the drive a sector's worth of user data. The drive adds the error correcting data and checksum.
On read, the drive reads the sector, applies error correction and checks the checksum.
If all is well, you get your data back. If not, the drive retries. When it runs out of retries, you get a read error back and no data.
Any filesystem that invents data in the face of a read error is broken by design.

When you write random blocks on the drives in a two-spindle RAID1, the raid layer won't notice until you run a check.
Then it can tell that the mirrors differ, but it can't tell which, if either, is correct.

The sector data on the drive is a continuous stream of flux reversals:
user data, error correction data and checksum, differentiated only by context.
Much like with GRUB: the kernel and initrd files are just data to be read.
It's only when GRUB jumps to the kernel start address that the kernel becomes instructions to the CPU.
Instructions are different lengths too; it's still only context that separates instructions from data.

An HDD sector read either works in full or it doesn't.
If the checksum itself becomes corrupt, the check fails even when the user data is correct. You still get a read error.


Where exactly is the checksum? And how does it relate to cheap SATA drives and cheap Chinese BIOSes?

Does it actually have a checksum? No, it doesn't. Most of your data doesn't. Buy ECC memory. Admin spotting :D

Just wrong assumptions.
NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 54237
Location: 56N 3W

Posted: Fri Mar 17, 2017 6:07 pm

axl,

Google is your friend. Read about Partial Response Maximum Likelihood encoding on magnetic media. (That's a PDF.)
Also read Data Layout. That page is mostly rubbish, but the diagram is correct.

What has the BIOS got to do with anything?
Roman_Gruber
Advocate

Joined: 03 Oct 2006
Posts: 3846
Location: Austro Bavaria

Posted: Sat Mar 18, 2017 12:18 pm

Thanks for all the replies.

Quote:
It becomes painfully obvious with flash drives, where there is nothing to check against.


Nope.

The software "stack" / "level" is always the same. When we assume it'S the same linux kernel and file system compiled for just another architecture!

I used XFS for a month, long enough to classify it as a low performer, and "crap". Sorry, that's my opinion. At that point XFS had fewer features than my ext3/ext4. Something was missing, which is the reason I switched back after a month of using XFS on /.

I do remember data corruption on my computers, but that was in the pre-Windows 95 / pre-Windows 2000 era, with those earlier filesystems and "Microsoft" platforms. I also remember the same on Red Hat (SuSE 6.2 era) with reiserfs.

--

I regularly lose power on my notebook. The reason is quite simple: I do not want to keep my second-hand notebook on power when I leave the house. So I unplug the AC cord when I leave and randomly forget to replug it when I come back. You could say that, on average, at least twice a month over 1.5 years I have lost power, and my ext4 + LVM2 + LUKS never made any fuss. (Note: I do not know what the previous owner of my notebook did with the hardware; unplugging is just a safety measure...)

I do remember those days when I was bugged with y/n questions after a power failure.

--

My previous example above with my "media" has hopefully shown you how mature ext3/ext4 are. My SD cards use FAT16 or FAT32 (the camera demands it). Across different drives/manufacturers/moved data.

--

Side note: when you want to use a Raspberry Pi, you should use F2FS, AFAIK developed by Samsung for flash-based smartphones. F2FS is faster than the "standard FS" (I forget which one) on my Nexus 4 / Nexus 7. And you should make regular backups: over the network, or just use a card reader to regularly dump the contents.

The flash memory in SD cards is most of the time very, very slow.

Quote:
reiser/NTFS are journaled filesystems, which were all the rage back around kernel 2.6, because the journal could be consistent with itself.


Nope. Back in my SuSE 6.2 days I emailed that reiser guy to tell him his filesystem had a bug. He asked me for money. I fixed the error myself and ditched that filesystem.

Why should I pay for his buggy filesystem in the first place!

That's the issue with reiserfs/NTFS: it is not as "durable" as ext4.

The only thing other filesystems may have over it is duplicate file recognition at the FS level, so a file is stored only once on the physical disk.
Ant P.
Watchman

Joined: 18 Apr 2009
Posts: 6920

Posted: Sat Mar 18, 2017 7:17 pm

Some timely reading material, since the Dunning-Kruger syndrome in this thread is approaching nauseating levels.
frostschutz
Advocate

Joined: 22 Feb 2005
Posts: 2977
Location: Germany

Posted: Sat Mar 18, 2017 7:45 pm

Quote:
This means there was a data error on the drive.


Or a bug in the filesystem, or a power outage, or... a non-silent drive error. Many possibilities; without a second opinion it's impossible to falsify.

I also check for bit flips on my RAID arrays; if a single disk developed silent corruption, there would be parity mismatches. But there are none, and there haven't been in the years since I started actively testing for them. And I'm using just about the cheapest SATA controllers out there (ASMedia) and the cheapest drives (WD Greens).

ZFS is not an option for me as it's not in the Linux kernel.

btrfs is not an option because every day in IRC I see someone who has lost data to it.

I hear XFS has plans to add checksums; we will see how it goes...
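Metadata checksums did in fact land in XFS as the v5 on-disk format; a sketch, noting they cover metadata only, not file contents, and /dev/sdX is a placeholder:

Code:
# create an XFS filesystem with v5 metadata CRCs (destroys existing data)
mkfs.xfs -m crc=1 /dev/sdX

# confirm the flag on a mounted filesystem
xfs_info /mount/point | grep crc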

The most common cause of corruption... some program fondling the files, some user error - no filesystem protects you against that.

I don't believe in silent bitrot. I do believe in rigorous disk monitoring, self-tests, and getting rid of hardware that is obviously broken (read failures, reallocated/pending/uncorrectable sectors).
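The monitoring and self-tests he means map onto smartmontools; a minimal sketch, with /dev/sda as a placeholder:

Code:
# start a long SMART self-test; the drive runs it in the background
smartctl -t long /dev/sda

# later: the attributes named above as grounds for retiring a drive
smartctl -A /dev/sda | \
    grep -E 'Reallocated_Sector|Current_Pending|Offline_Uncorrectable'

# overall verdict plus the self-test log
smartctl -H -l selftest /dev/sda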
NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 54237
Location: 56N 3W

Posted: Sat Mar 18, 2017 7:57 pm

Ant P.,

That illustration was not bitrot on the drive. The drive returned correct information, which was then corrupted in transmission over the interface, since changing the data cable fixed the problem.
Actually, the corruption was detected during a read operation; it could equally have occurred during the write. Write corruption would only be detected by the filesystem reading the data back and doing the checksum test, which sounds like it would halve the I/O capacity of the drive if it were done as part of every write.
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Sun Mar 19, 2017 12:20 am

Roman_Gruber wrote:
Side note: when you want to use a Raspberry Pi, you should use F2FS, AFAIK developed by Samsung for flash-based smartphones. F2FS is faster than the "standard FS" (I forget which one) on my Nexus 4 / Nexus 7. And you should make regular backups: over the network, or just use a card reader to regularly dump the contents.

The flash memory in SD cards is most of the time very, very slow.


I keep hearing about F2FS. I haven't had time to take a proper look at it. Thanks for the suggestion.
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Sun Mar 19, 2017 12:46 am

Yes, thank you: "bitrot".

Everything I was trying to say can be summed up with that word.

And given that I've finished migrating from the md array/XFS to btrfs, I can post some impressions.

It went smoothly, but took double what I expected. Copying all that data from drive A to drive B takes about 8 hours. At first the two drives were completely separate and I used rsync to keep them synced. Then I created an array out of drive A, copied all the data onto it, and added drive B to the array. You could watch, as drive B was added, the data pouring out of drive A onto drive B. Simple.

But with btrfs it was different. When you add another drive to a btrfs filesystem and rebalance, data is read from the source, then written to the source, then to the copy, and it takes double the time. A resync on an md array just means copy from A to B; on btrfs it means read from A, write to A, write to B.
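The conversion described corresponds roughly to the following; a sketch, with /dev/sdb and /mnt/array as placeholders:

Code:
# add the second drive to the existing single-device btrfs filesystem
btrfs device add /dev/sdb /mnt/array

# rebalance, converting data and metadata to the raid1 profile;
# every extent is rewritten, hence the read-A / write-A / write-B cost
btrfs balance start -dconvert=raid1 -mconvert=raid1 /mnt/array

# verify the resulting profiles
btrfs filesystem df /mnt/array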

Performance also took a big hit. Reading-wise, it used to read at 100-140 MB/s; now it's 80-100 MB/s. For writing I don't really have a frame of reference.

I also discovered seekwatcher while doing the migration, but didn't have BLK_DEV_IO_TRACE enabled in the kernel, so... I couldn't play with it. I'm considering switching back from btrfs to the old md/XFS design just to graph the migration. Going to stick with btrfs for a few days for now.

I don't _think_ the types of drives I have really develop "bitrot", or if they do, I doubt they develop a lot of it. I keep md5 sums of most of my files now. The good thing is that the files stored on those drives are just phat video files that don't change much; they just sit there. And that machine doesn't get power outages. AT ALL. It has a UPS connected to the serial port; it would shut down before the UPS runs out of power.
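Keeping md5 sums for a mostly static archive can be as simple as a manifest; a sketch, with placeholder paths:

Code:
# build a manifest of every file's md5 (store a copy off the array)
find /mnt/array -type f -print0 | xargs -0 md5sum > /root/array.md5

# later: re-verify, reporting only files whose contents changed
md5sum -c --quiet /root/array.md5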

Still, I'm not sure what to make of this. I only discovered the idea of bitrot a few months back; it hadn't even crossed my mind before. I'm not sure whether btrfs/ZFS are here to address a non-issue, or something we didn't even know we faced. More research required. What I DO want to try is this migration back and forth, and this time, graph it with seekwatcher.
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Sun Mar 19, 2017 1:26 am

Roman_Gruber wrote:
Thanks for all the replies.

Quote:
It becomes painfully obvious with flash drives, where there is nothing to check against.


Nope.

The software "stack"/"level" is always the same, assuming it's the same Linux kernel and filesystem, just compiled for another architecture!

I used XFS for a month, long enough to classify it as a low performer, and "crap". Sorry, that's my opinion. At that point XFS had fewer features than my ext3/ext4. Something was missing, which is the reason I switched back after a month of using XFS on /.

I do remember data corruption on my computers, but that was in the pre-Windows 95 / pre-Windows 2000 era, with those earlier filesystems and "Microsoft" platforms. I also remember the same on Red Hat (SuSE 6.2 era) with reiserfs.

--

I regularly lose power on my notebook. The reason is quite simple: I do not want to keep my second-hand notebook on power when I leave the house. So I unplug the AC cord when I leave and randomly forget to replug it when I come back. You could say that, on average, at least twice a month over 1.5 years I have lost power, and my ext4 + LVM2 + LUKS never made any fuss. (Note: I do not know what the previous owner of my notebook did with the hardware; unplugging is just a safety measure...)

I do remember those days when I was bugged with y/n questions after a power failure.

--

My previous example above with my "media" has hopefully shown you how mature ext3/ext4 are. My SD cards use FAT16 or FAT32 (the camera demands it). Across different drives/manufacturers/moved data.

Quote:
reiser/NTFS are journaled filesystems, which were all the rage back around kernel 2.6, because the journal could be consistent with itself.


Nope. Back in my SuSE 6.2 days I emailed that reiser guy to tell him his filesystem had a bug. He asked me for money. I fixed the error myself and ditched that filesystem.

Why should I pay for his buggy filesystem in the first place!

That's the issue with reiserfs/NTFS: it is not as "durable" as ext4.

The only thing other filesystems may have over it is duplicate file recognition at the FS level, so a file is stored only once on the physical disk.


I've been considering whether to reply to this bit or not. It's Gentoo Chat, and I kinda want to rant a bit. But... well, here goes.

As far as I remember, I didn't choose XFS. I had to go along with it, because at the time the only other choices were JFS and reiserfs. It was during the 2.3/2.4 kernel era. I had 8 drives of 250 GB each in a linear raid, and at that time XFS was the only one that could handle 2 TB and actually not lose the data. ext3 didn't exist; ext2 couldn't handle all that data. That's how I ended up with XFS.

We can both agree that reiserfs is a very fast, competent FS for small files, if you want to lose small files. Every experience I had with reiserfs was disastrous. I lost important files only twice, but that was enough not to try it again.

XFS, on the other hand, didn't lose important chunks of data (for me); just an occasional file here and there got zeroed out. ext2/3/4 are good. I gave them a try here and there, and sometimes use them for VMs or Pis or other exotic stuff, but the main horse (the array in question) was just fine on XFS. I mentioned the brand of the drives, the model, the FS. Uhm, I kept XFS because of performance: every benchmark I could possibly conceive shows XFS is faster than any incarnation of ext.

That's why I'm excited about finding seekwatcher. There are some scenarios I want to see. Actually see.

https://www.youtube.com/watch?v=UQ12PS5x53U

This is a complete 908-second readout of things being written to and read from an Intel 750 Series 400 GB drive: dd if=/dev/nvme0n1 of=/dev/null, while on other consoles dd if=/dev/zero of=/file bs=1G count=$RANDOM; rm /file was running.

This tool is awesome. Like I said before, I plan to switch back from btrfs to RAID/XFS and then to btrfs again and map out the migration, but I also plan to map out (in a VM environment) some tests between all the filesystems, just for my own curiosity. Seeing them in motion. I really love this tool. Give it a search: seekwatcher. It's in Portage :)
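For anyone wanting to reproduce this, the workflow is roughly blktrace plus seekwatcher; a sketch, assuming CONFIG_BLK_DEV_IO_TRACE is enabled, with device and file names as placeholders:

Code:
# record block-layer IO on the device while a workload runs elsewhere
blktrace -d /dev/nvme0n1 -o mytrace

# afterwards, render the trace as a seek graph
seekwatcher -t mytrace -o mytrace.png

# or let seekwatcher run the workload and trace it in one go
seekwatcher -t run.trace -o run.png -d /dev/nvme0n1 \
    -p 'dd if=/dev/nvme0n1 of=/dev/null bs=1M count=4096'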
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Sun Mar 19, 2017 1:46 am

frostschutz wrote:
Quote:
This means there was a data error on the drive.


Or a bug in the filesystem, or a power outage, or... a non-silent drive error. Many possibilities; without a second opinion it's impossible to falsify.

I also check for bit flips on my RAID arrays; if a single disk developed silent corruption, there would be parity mismatches. But there are none, and there haven't been in the years since I started actively testing for them. And I'm using just about the cheapest SATA controllers out there (ASMedia) and the cheapest drives (WD Greens).

ZFS is not an option for me as it's not in the Linux kernel.

btrfs is not an option because every day in IRC I see someone who has lost data to it.

I hear XFS has plans to add checksums; we will see how it goes...

The most common cause of corruption... some program fondling the files, some user error - no filesystem protects you against that.

I don't believe in silent bitrot. I do believe in rigorous disk monitoring, self-tests, and getting rid of hardware that is obviously broken (read failures, reallocated/pending/uncorrectable sectors).


I agree with your assessment of ZFS: it's not in the kernel, so it's not for me. I went through this with XFS; I don't want to do it again if btrfs is an alternative. On the other hand, Solaris has done it for years.

But now all its base belongs to ORACLE, which... no. Just no. I will not use that software, on general principle. I'd rather have my stuff rot than use Oracle software. (I love MariaDB, and I never used Java.)

I hope XFS implements that feature. SGI is a weird company that has done some amazing things, including XFS. I feel invested. I really do love SGI and XFS; it has never failed me once.

About your comment on WD Greens: I have WD Reds. No problems whatsoever, and I don't expect problems. The last 8 PATA drives, of 250 GB each, were retired because of obsolete speed, not because of failure. They were also WDs, in a linear raid; I mentioned that in another post.

Which is weird for me. Same raid: linear to RAID1, 2 TB to 4 TB.

This array was born in the '90s and went through so many changes. At first the concern was space; the size was never enough. After that it was speed, and only recently did it become about data integrity. Just goes to show how expectations grow over time.
Ant P.
Watchman

Joined: 18 Apr 2009
Posts: 6920

Posted: Sun Mar 19, 2017 6:09 pm

axl wrote:
I've been considering whether to reply to this bit or not. It's Gentoo Chat, and I kinda want to rant a bit. But... well, here goes.

As far as I remember, I didn't choose XFS. I had to go along with it, because at the time the only other choices were JFS and reiserfs. It was during the 2.3/2.4 kernel era. I had 8 drives of 250 GB each in a linear raid, and at that time XFS was the only one that could handle 2 TB and actually not lose the data. ext3 didn't exist; ext2 couldn't handle all that data. That's how I ended up with XFS.

Don't let anyone shame you for using XFS; it's the right choice if you want something that's lasted 20 years and will probably last another 20.
It's the only filesystem project that bothered to write a test suite; that says a lot.
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Sun Mar 19, 2017 9:11 pm

Ant P. wrote:
axl wrote:
I've been considering whether to reply to this bit or not. It's Gentoo Chat, and I kinda want to rant a bit. But... well, here goes.

As far as I remember, I didn't choose XFS. I had to go along with it, because at the time the only other choices were JFS and reiserfs. It was during the 2.3/2.4 kernel era. I had 8 drives of 250 GB each in a linear raid, and at that time XFS was the only one that could handle 2 TB and actually not lose the data. ext3 didn't exist; ext2 couldn't handle all that data. That's how I ended up with XFS.

Don't let anyone shame you for using XFS; it's the right choice if you want something that's lasted 20 years and will probably last another 20.
It's the only filesystem project that bothered to write a test suite; that says a lot.


I think at the end of the day, the stuff we talk about has to fit our needs. When you get used to something like Linux, or Gentoo, or XFS, it's because it has suited you for some time.

That's basically the reason I try my best to stay out of any kind of flame war. Everyone has their favorite thing that suits them. I've read opinions of people who would swear by JFS or ext, and it suits them.

And that's what you want: everybody happy.

Well, in other news, being excited about seekwatcher, I decided to start the process of moving back to RAID/XFS early. I've explained my setup at length.

SO, to put some constructive info into the chat: tomorrow I'll be able to post a video of how it looks when you remove a drive from a btrfs RAID1.

I went the long way. When you have a RAID1 array with btrfs and you want to remove one of the drives, you have two options: software or hardware. Or rather: software 1, software 2, and hardware.

The long way around is to convert the entire filesystem from the raid1 profile back to single, which takes a long time: for my setup, about 16-18 hours. And just for the sake of completeness, the other two ways are to tell the kernel the drive is defective and force it out of the array to reformat and reuse, or to just disconnect the drive physically and remount degraded, blah blah. I took the long way around.
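The long way around looks roughly like this; a sketch, with /mnt/array and /dev/sdb as placeholders:

Code:
# rewrite every extent from the raid1 profile back to single;
# this is the 16-18 hour part
btrfs balance start -dconvert=single -mconvert=single /mnt/array

# with no raid1 copies left on it, the second drive can be removed
btrfs device remove /dev/sdb /mnt/array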

And of this process of converting the drives to single, I'll be able to post a seekwatcher video tomorrow. I'm so excited. :) I'm 40, but I'm still a geek.

The fact that you can run a machine in just a console is amazing. Linux is amazing. Gentoo is amazing. But to be able to switch filesystems around, create raids, move data, destroy the old FS, start over, map it all out and upload it to YouTube... jeez, this is exciting.

How will it be in 20 years? Will people complain that CRC filesystems are just too slow? Do you remember those early-2000s years when filesystems were just fast?

Anyway, I am so excited about blktrace and seekwatcher. I've wondered for years whether such a tool actually exists. It does. I can't wait to see the animation tomorrow. Then I'll post two others: copying data from single btrfs to XFS, and RAID1 adding a new drive. I expect all three to be pretty linear.

Sorry, the inner geek just had to explode, and I _think_ I want to explode near my flock. For better or worse, you are my flock. Gentoo user since 2001. Sorry.
NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 54237
Location: 56N 3W

Posted: Sun Mar 19, 2017 11:24 pm

axl,

You are just a bairn :)
axl
Veteran

Joined: 11 Oct 2002
Posts: 1144
Location: Romania

Posted: Sun Mar 19, 2017 11:32 pm

I don't mind being called childish; it's a quality I hold dear at heart. But I have to ask: where on God's green earth did you first encounter that word, "bairn"? It's the first time I've come across it.
NeddySeagoon
Administrator

Joined: 05 Jul 2003
Posts: 54237
Location: 56N 3W

Posted: Mon Mar 20, 2017 12:00 am

axl,

It's a Scottish word.
It does not mean that you are childish; it means that you are young compared to the person using the phrase "just a bairn".
I'm 63.
steveL
Watchman

Joined: 13 Sep 2006
Posts: 5153
Location: The Peanut Gallery

Posted: Mon Mar 20, 2017 1:46 am

Ant P. wrote:
Some timely reading material, since the Dunning-Kruger syndrome in this thread is approaching nauseating levels.
Heh, interesting post.
I've just been looking into (no)delalloc for ext4, as I'm used to ext3 and wanted to see what the new things were about.

The ext4 default setup does not look good.
Ted Ts'o's article did not reassure, although at least setting nodelalloc provides better performance than ext3.
I don't trust auto_da_alloc, except on mounts for tmp directories, because "growing files at the end" is very common (O_APPEND, shell '>>').

If you view the latest comment here (Oct 21, 2016 by damnyoulinux), you'll see ext4's delalloc still causes issues.

So in summary, I've turned it off with tune2fs -o nodelalloc /dev/xyz for every filesystem with data I care about (and added auto_da_alloc for tmp disk-mounts in /etc/fstab).
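Concretely, that might look like the following; a sketch, with placeholder devices and mountpoints:

Code:
# on-disk default for filesystems with data I care about:
#   tune2fs -o nodelalloc /dev/vg0/home
# /etc/fstab: the tmp-style mount keeps delayed allocation but gets
# the rename/truncate safety net (auto_da_alloc)
/dev/vg0/home  /home     ext4  defaults                 0 2
/dev/vg0/tmp   /var/tmp  ext4  delalloc,auto_da_alloc   0 2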
Hu
Moderator

Joined: 06 Mar 2007
Posts: 21631

Posted: Mon Mar 20, 2017 3:35 am

That comment is an impressive wall of text, but failed to mention anything that could be used to confirm or refute the diagnosis. It failed to identify the kernel version or even the distribution. It failed to mention what application(s) are misbehaving, other than to assert that they are popular. It openly admitted users routinely power off the system improperly. It asserts corruption, but then describes it as truncated files. Truncated files are definitely a sign of lost data, but corruption to me implies that the file has wrong data, not absent data. I agree with the latter concern: an obvious authoritative reference would be helpful.
steveL
Watchman

Joined: 13 Sep 2006
Posts: 5153
Location: The Peanut Gallery

Posted: Mon Mar 20, 2017 5:50 pm

Well, Ts'o's article is pretty authoritative in that it comes from the upstream author of ext2/3/4, so the problem is definitely real.

It's fine to provide options that are only safe when you use a UPS. It is not fine to pretend to users that they are getting the same data=ordered treatment as ext3, while doing nothing of the sort in the default setup.

It is made even worse when you then a) blame the users, and b) blame everyone else (for your defaults, in your software, and the mess they caused.)

IMO delalloc should be off by default, and auto_da_alloc should switch on when delalloc is on, unless the user overrides.
Further, auto_da_alloc still needs work, as bleating about O_APPEND is simply dumb (especially for a FS coder.)

Sorry I haven't got more references, I'm in the middle of an install, and might be referring to other discussions or articles I read yesterday.

It just seems to me that the systemdbust mentality (blame the users, blame other programmers) is infecting other Linux developers, which is not good.
I preferred: "Don't break the user experience."
Page 1 of 2