Forums

Skip to content

Advanced search
  • Quick links
    • Unanswered topics
    • Active topics
    • Search
  • FAQ
  • Login
  • Register
  • Board index Assistance Other Things Gentoo
  • Search

Backup strategy - parity or other redundancy

Still need help with Gentoo, and your question doesn't fit in the above forums? Here is your last bastion of hope.
Post Reply
Advanced search
10 posts • Page 1 of 1
Author
Message
number_nine
Tux's lil' helper
Tux's lil' helper
Posts: 136
Joined: Thu May 05, 2005 4:12 am

Backup strategy - parity or other redundancy

  • Quote

Post by number_nine » Thu Jul 10, 2008 3:28 pm

I'm thinking about building a machine to act as a backup server. I have a bunch of static files I want to backup.

Of course, making a second copy is good... but what if the media starts to fail? E.g., a sector goes bad, I get a case of "bit rot", the media is scratched, etc?

So my thought was, short of making multiple redundant copies (which is too expensive), why not go "half way" and use something like parchive/par/par2 to add parity information to the data?

Just wondering if anyone else does anything like this, or has any other suggestions.

My goal is to protect against partial failures in the backup media. Say I have a collection of 10 files. And part of the disk (or whatever media) goes bad. I'd like to store some extra information with those 10 files to be able to recover all 10 files, even if the degraded media effectively deletes a couple files.

Thoughts?

Thanks!
Top
poly_poly-man
Advocate
Advocate
User avatar
Posts: 2477
Joined: Wed Dec 06, 2006 9:59 pm
Location: RIT, NY, US
Contact:
Contact poly_poly-man
Website

  • Quote

Post by poly_poly-man » Thu Jul 10, 2008 4:24 pm

two big hard drives in a RAID 1 array.

And fortnightly tape backups.

poly-p man
iVBORw0KGgoAAAANSUhEUgAAA

avatar: new version of logo - see [topic]838248[/topic]. Potentially still a WiP.
Top
halfgaar
l33t
l33t
User avatar
Posts: 781
Joined: Sun Feb 22, 2004 1:50 pm
Location: Netherlands
Contact:
Contact halfgaar
Website

  • Quote

Post by halfgaar » Thu Jul 10, 2008 10:41 pm

I do it like this:

- RAID1 array in my machine.
- daily backups of frequently changing dirs (/home, /etc, /var) with rsync to external drive.
- randomly scheduled backups with dar+parchive to external drive.

(external drive is encrypted with cryptsetup+LUKS. This not a backup feature, but merely protection against the user forgetting the backup disk in the train or something...)

Using parchive is actually a better idea than using multiple copies. Because if you have 100 backup disks, and 2 bytes are damaged on all of them, none of them are useful. Whereas when you use parity, let's say 5% on 1000MB, you can loose any 50 MB of data, scattered throughout either the backup files or the parity files. It's very robust. One wonders why they don't send parity information in communications in Star-Trek, to avoid the "part of the message has been lost captain" :)

I have automated the entire procedure. I simply have to run a script, and everything is taken care of.

I think you may be interested in my backup article. It considers a whole bunch of issues you need to watch out for.
Linux backups the right way.
Get surround sound working.
Top
Carnildo
Guru
Guru
Posts: 595
Joined: Thu Jun 17, 2004 6:27 am

  • Quote

Post by Carnildo » Thu Jul 10, 2008 11:31 pm

Hard drives use error-correcting codes, so it's reasonably safe to assume that if you can access the data, the data's correct. In order for the data to be corrupted undetectably, you would need at least three bit flips in a single error-correction unit -- an event so unlikely that, across all the hard drives in current use, it has probably never happened.
Top
halfgaar
l33t
l33t
User avatar
Posts: 781
Joined: Sun Feb 22, 2004 1:50 pm
Location: Netherlands
Contact:
Contact halfgaar
Website

  • Quote

Post by halfgaar » Sat Jul 12, 2008 11:18 am

But that doesn't guarantee that you can retrieve all of your data of any broken disk. That's the whole point of adding parity information to the dar archive. When copying, and certain area's of the disk are unreadable, it simply cannot retrieve that data. If the amount of data lost falls with the percentage of parity data you created, you can re-create it.
Linux backups the right way.
Get surround sound working.
Top
poly_poly-man
Advocate
Advocate
User avatar
Posts: 2477
Joined: Wed Dec 06, 2006 9:59 pm
Location: RIT, NY, US
Contact:
Contact poly_poly-man
Website

  • Quote

Post by poly_poly-man » Sat Jul 12, 2008 1:36 pm

Hmm... how about we just agree that no backup strategy is perfect :D

I'm still liking the idea of tape backups on top, tho.

poly-p man
iVBORw0KGgoAAAANSUhEUgAAA

avatar: new version of logo - see [topic]838248[/topic]. Potentially still a WiP.
Top
halfgaar
l33t
l33t
User avatar
Posts: 781
Joined: Sun Feb 22, 2004 1:50 pm
Location: Netherlands
Contact:
Contact halfgaar
Website

  • Quote

Post by halfgaar » Sat Jul 12, 2008 1:47 pm

poly_poly-man wrote:Hmm... how about we just agree that no backup strategy is perfect :D
You get no argument from me there :)
poly_poly-man wrote: I'm still liking the idea of tape backups on top, tho.
poly-p man
I consider tapes somewhat clumsy. Hard drives are cheap and big nowadays. I just have to turn on the disk, run a script, wait a couple of minutes, and the backup is up-to-date. The full-system backup I do with dar+parchive takes longer; I run it when I go away for the better part of a day, and it'll be done when I'm home.
Linux backups the right way.
Get surround sound working.
Top
twstd3bc
Apprentice
Apprentice
Posts: 289
Joined: Fri Feb 07, 2003 6:17 pm
Location: Los Angeles, USA

Re: Backup strategy - parity or other redundancy

  • Quote

Post by twstd3bc » Sun Jul 13, 2008 10:24 pm

number_nine wrote:I'm thinking about building a machine to act as a backup server. I have a bunch of static files I want to backup.

Of course, making a second copy is good... but what if the media starts to fail? E.g., a sector goes bad, I get a case of "bit rot", the media is scratched, etc?

So my thought was, short of making multiple redundant copies (which is too expensive), why not go "half way" and use something like parchive/par/par2 to add parity information to the data?

Just wondering if anyone else does anything like this, or has any other suggestions.

My goal is to protect against partial failures in the backup media. Say I have a collection of 10 files. And part of the disk (or whatever media) goes bad. I'd like to store some extra information with those 10 files to be able to recover all 10 files, even if the degraded media effectively deletes a couple files.

Thoughts?

Thanks!
I use Par2. More specifically, I tar up certain directories into, say, archive.tar, gpg encrypt it to a file archive.tar.gpg, split it up with the split command, make some pars from this, and mail them to one of my gmail accounts. For weekly backups I use a bunch of mini (and non-mini) DVD-RWs and keep them on a shelf at home. I also have a linode to back things up as well.
Top
halfgaar
l33t
l33t
User avatar
Posts: 781
Joined: Sun Feb 22, 2004 1:50 pm
Location: Netherlands
Contact:
Contact halfgaar
Website

  • Quote

Post by halfgaar » Sun Jul 13, 2008 10:58 pm

I use Par2. More specifically, I tar up certain directories into, say, archive.tar, gpg encrypt it to a file archive.tar.gpg, split it up with the split command, make some pars from this, and mail them to one of my gmail accounts. For weekly backups I use a bunch of mini (and non-mini) DVD-RWs and keep them on a shelf at home. I also have a linode to back things up as well.
Why don't you use dar? It has the ability to split automatically.
Linux backups the right way.
Get surround sound working.
Top
twstd3bc
Apprentice
Apprentice
Posts: 289
Joined: Fri Feb 07, 2003 6:17 pm
Location: Los Angeles, USA

  • Quote

Post by twstd3bc » Mon Jul 14, 2008 12:05 am

halfgaar wrote:
I use Par2. More specifically, I tar up certain directories into, say, archive.tar, gpg encrypt it to a file archive.tar.gpg, split it up with the split command, make some pars from this, and mail them to one of my gmail accounts. For weekly backups I use a bunch of mini (and non-mini) DVD-RWs and keep them on a shelf at home. I also have a linode to back things up as well.
Why don't you use dar? It has the ability to split automatically.
DAR looks nice, especially the split option and par support. I'll have to look into how it implements encryption, though I like that it uses blowfish, which is fast. Thanks!
Top
Post Reply

10 posts • Page 1 of 1

Return to “Other Things Gentoo”

Jump to
  • Assistance
  • ↳   News & Announcements
  • ↳   Frequently Asked Questions
  • ↳   Installing Gentoo
  • ↳   Multimedia
  • ↳   Desktop Environments
  • ↳   Networking & Security
  • ↳   Kernel & Hardware
  • ↳   Portage & Programming
  • ↳   Gamers & Players
  • ↳   Other Things Gentoo
  • ↳   Unsupported Software
  • Discussion & Documentation
  • ↳   Documentation, Tips & Tricks
  • ↳   Gentoo Chat
  • ↳   Gentoo Forums Feedback
  • ↳   Duplicate Threads
  • International Gentoo Users
  • ↳   中文 (Chinese)
  • ↳   Dutch
  • ↳   Finnish
  • ↳   French
  • ↳   Deutsches Forum (German)
  • ↳   Diskussionsforum
  • ↳   Deutsche Dokumentation
  • ↳   Greek
  • ↳   Forum italiano (Italian)
  • ↳   Forum di discussione italiano
  • ↳   Risorse italiane (documentazione e tools)
  • ↳   Polskie forum (Polish)
  • ↳   Instalacja i sprzęt
  • ↳   Polish OTW
  • ↳   Portuguese
  • ↳   Documentação, Ferramentas e Dicas
  • ↳   Russian
  • ↳   Scandinavian
  • ↳   Spanish
  • ↳   Other Languages
  • Architectures & Platforms
  • ↳   Gentoo on ARM
  • ↳   Gentoo on PPC
  • ↳   Gentoo on Sparc
  • ↳   Gentoo on Alternative Architectures
  • ↳   Gentoo on AMD64
  • ↳   Gentoo for Mac OS X (Portage for Mac OS X)
  • Board index
  • All times are UTC
  • Delete cookies

© 2001–2026 Gentoo Foundation, Inc.

Powered by phpBB® Forum Software © phpBB Limited

Privacy Policy

 

 

magic