Dunno, my Gentoo is doing fine. The vast majority of compilation happens in RAM anyway, and hitting swap once in a while is not a big deal. But I recall forming an opinion (some years ago) that SSD drives would not work well with a Gentoo system configured like mine, on account of SSDs having a very limited lifespan if written to regularly.
ipic wrote: As an off the wall follow-up question - what would people think of an SSD-mechanical pairing in a RAID-0 mirror? Stupid idea, or some merit for faster read times?

It should give you faster reads, but write performance will be limited by the HDD.
szatox wrote: Dunno, my Gentoo is doing fine. The vast majority of compilation happens in RAM anyway, and hitting swap once in a while is not a big deal. But I recall forming an opinion (some years ago) that SSD drives would not work well with a Gentoo system configured like mine, on account of SSDs having a very limited lifespan if written to regularly.

Sounds promising. Do you use the SSDs in a RAID?
szatox wrote: It should give you faster reads, but write performance will be limited by the HDD.

My main reason for thinking of this stupid setup is the way I replace drives in the RAID-0. Process is:
Depending on the internal optimizations, I can imagine this setup failing the slow HDD out of the RAID set during a big write (or at any time, really), so you'd have to really, really stress-test it.
Bcache and similar probably won't survive the backing device failing, even if the cache is big enough to keep all the data forever.
Maybe you could abuse DRBD in some creative way to flag the HDD as write-mostly, if you're -brave-enough- totally insane. I mean, this idea definitely sucks, but I am now curious whether it's possible!
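For what it's worth, Linux md RAID1 already has a native write-mostly flag that does roughly this without any DRBD contortions; a rough sketch, with placeholder device names (sda1 = SSD partition, sdb1 = HDD partition):

```shell
# Create a RAID1 pair where the HDD member is marked write-mostly,
# so reads are served from the SSD whenever possible
mdadm --create /dev/md0 --level=1 --raid-devices=2 \
    /dev/sda1 --write-mostly /dev/sdb1

# The flag can also be toggled on an existing member via sysfs
echo writemostly > /sys/block/md0/md/dev-sdb1/state
```

Writes still have to hit both members, so write throughput remains bounded by the HDD; only reads get the SSD speed-up.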
One thing that did actually work well when mixing HDD and SSD was LVM snapshots. Full snapshots on a single HDD degrade performance something like 10 times. LVM allows you to specify the physical volume where a logical volume should be created. Having both an HDD and an SSD in one volume group allows you to put an LV on the HDD and then snapshot it to the SSD, which avoids the write amplification penalty.
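A sketch of that layout, assuming a volume group vg0 that contains both an HDD PV (/dev/sdb1) and an SSD PV (/dev/sdc1) - all names here are placeholders:

```shell
# Put the logical volume itself on the HDD physical volume...
lvcreate -L 500G -n data vg0 /dev/sdb1

# ...and place the snapshot's copy-on-write area on the SSD PV,
# so snapshot bookkeeping writes don't double the load on the HDD
lvcreate -s -L 50G -n data_snap vg0/data /dev/sdc1
```

lvcreate takes an optional PV list at the end of the command line, which is what pins each LV to a particular device.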
Code: Select all
# Replicate the partition table of /dev/sdX onto /dev/sdY
sgdisk /dev/sdX -R /dev/sdY
# Randomize the GUIDs on /dev/sdY so the copy doesn't clash with the original
sgdisk -G /dev/sdY
ipic wrote: How do people view this these days? Have the manufacturers mitigated the problem with high write rates to SSD drives?

My understanding is that writes are less of a problem these days, with wear levelling and so forth. Seagate claims a total write life of 1200 terabytes for their 'Barracuda' SSD range. I don't know if that's been independently verified but, even if it's a hundred times smaller in practice, it's still long enough for my purposes. Even if it's only 12 TB, that's still 6 GB per day, every day, for five years.
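The back-of-envelope arithmetic checks out (decimal units, integer division):

```shell
# 12 TB of write endurance spread over 5 years, in GB/day
echo "$(( 12000 / (5 * 365) )) GB/day"   # prints "6 GB/day"
```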
ipic wrote: I have four 2TB mechanical hard drives configured into two RAID-0 mirrors

Oh, before we continue... You meant RAID1, didn't you?
ipic wrote: Sounds promising. Do you use the SSDs in a RAID?

I used to have a bunch of Debian servers with SSDs in RAID1 in my care, including a pretty big and damn busy integration database. It worked.
ipic wrote: Your comment about the HDD being dropped because of slow writes *looks* like it will be OK in this, one, specific, scenario - but I guess it could fail immediately afterwards, before I have a chance to reverse the process?

I don't know how it's going to behave.
szatox wrote: I don't know how it's going to behave.

Thanks for the feedback, it's very useful. Good to know your RAIDs work with SSDs.
Depends on how defensively it's coded; you may be fine, or adding the new, fast device might kick the device with the data on it out of the RAID, leaving you in an unusable state.
Honestly, I'd rather take that machine down and copy the data manually with dd instead. 2 TB is not _that_ much; standard 7200 RPM disks should let you image-copy them within 2-3 hours.
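A minimal version of that offline copy from a rescue environment, where /dev/sdX is the source HDD and /dev/sdY the target SSD (placeholders - triple-check them, dd will happily overwrite the wrong disk):

```shell
# Image-copy the whole disk; bs=64M keeps both drives streaming,
# status=progress shows throughput, conv=fsync flushes before exit
dd if=/dev/sdX of=/dev/sdY bs=64M status=progress conv=fsync
```

At a typical ~200 MB/s sequential HDD read rate, 2 TB works out to just under three hours, matching the estimate above.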
lars_the_bear wrote: My understanding is that writes are less of a problem these days, with wear levelling and so forth. Seagate claims a total write life of 1200 terabytes for their 'Barracuda' SSD range. ....

Thanks for the feedback, appreciated.
These days, I think I would be quite happy to use decent-quality SSDs in RAID, in the same circumstances where I would use them singly. But the sizes I want are still not really affordable in SSD.
BR, Lars.

ipic wrote: I'll assume that the mean time to failure for other reasons (manufacturing process error etc) would be as good as or probably better than mechanical - since assembling very small moving parts carries its own risks.

Seagate claims an MTBF of 1.8 million hours, which is about 200 years. So, again, even if you're at the worst end of the practical range, the lifetime is still likely to be tens of years.
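The conversion is straightforward (8766 hours per year, integer division):

```shell
# 1.8 million hours MTBF expressed in years
echo "$(( 1800000 / 8766 )) years"   # prints "205 years"
```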
ipic wrote: So, a plan is:
- Boot with rescue image
- DD a HDD to one of the SSDs
- make a new mirror with the other SSD
- shutdown
- remove HDDs and replace with SSDs (to use original drive designations)
- reboot with SSDs now as the original mirror
- pray, then test

I think you're overcomplicating things.
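A hedged sketch of the plan above from a rescue image, assuming md RAID1 arrays; sdX (old HDD), sdY and sdZ (new SSDs) and md0 are all placeholder names:

```shell
# 1. Image one HDD of the old mirror onto the first SSD
dd if=/dev/sdX of=/dev/sdY bs=64M status=progress conv=fsync

# 2. Start the array degraded from the copied SSD alone,
#    then add the second SSD and let md resync onto it
mdadm --assemble /dev/md0 /dev/sdY1 --run
mdadm --manage /dev/md0 --add /dev/sdZ1

# 3. Watch the rebuild finish before shutting down and swapping drives
cat /proc/mdstat
```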
ipic wrote: Another thought - would using different drive manufacturers for the two mirror SSDs be a way to spread risk a bit?

Yes.
Code: Select all
#!/bin/bash
# Note: bash (not plain sh) is required because this script uses arrays.
DEVICE[0]="/dev/sda"
START[0]="2019-08-10"
TBW[0]="600"
DEVICE[1]="/dev/sdb"
START[1]="2023-09-02"
TBW[1]="600"

# Convert a byte count to a human-readable size (decimal units)
human_size()
{
    local INPUT=$1
    local SUFFIX="B"
    local FACTOR="1000"
    local LIMIT=$((FACTOR*20))
    if [ "$INPUT" -gt "$LIMIT" ]
    then
        INPUT=$((INPUT/FACTOR))
        SUFFIX="KB"
    fi
    if [ "$INPUT" -gt "$LIMIT" ]
    then
        INPUT=$((INPUT/FACTOR))
        SUFFIX="MB"
    fi
    if [ "$INPUT" -gt "$LIMIT" ]
    then
        INPUT=$((INPUT/FACTOR))
        SUFFIX="GB"
    fi
    if [ "$INPUT" -gt "$LIMIT" ]
    then
        INPUT=$((INPUT/FACTOR))
        SUFFIX="TB"
    fi
    echo "$INPUT $SUFFIX"
}

for COUNT in "${!DEVICE[@]}"
do
    # Get sectors written from the device (SMART attribute 241, Total_LBAs_Written)
    SECTORS=$(smartctl -A "${DEVICE[$COUNT]}" | grep "^241 " | awk '{ print $NF }')
    if [ -z "$SECTORS" ]
    then
        echo "${DEVICE[$COUNT]}: No write statistics available"
        continue
    fi
    # Writes are reported in sectors, so multiply by 512 to get bytes
    BW=$((SECTORS*512))
    # The guaranteed value is TBW, so multiply by 1 TB
    ESTBW=$((${TBW[$COUNT]}*1000000000000))
    # Calculate how many days the device has been in use
    STARTUNIX=$(date +'%s' -d "${START[$COUNT]}")
    NOWUNIX=$(date +'%s')
    DAYS=$(((NOWUNIX-STARTUNIX)/86400))
    if [ "$DAYS" -lt "1" ]
    then
        DAYS=1
    fi
    # Calculate the daily write rate
    DAYRATE=$((BW/DAYS))
    # Calculate the number of days left at the current rate
    ESTDAYS=$(((ESTBW/DAYRATE)-DAYS))
    ESTDATE=$(date +"%d-%m-%Y" -d "$ESTDAYS days")
    # Report
    echo "${DEVICE[$COUNT]}: Written $(human_size $BW) in $DAYS days, $(human_size $DAYRATE)/day, $ESTDAYS days left: $ESTDATE"
done
Code: Select all
/dev/sda: Written 6324 GB in 1826 days, 3463 MB/day, 171406 days left: 24-11-2493
/dev/sdb: Written 1400 GB in 342 days, 4096 MB/day, 146134 days left: 15-09-2424
Code: Select all
#!/bin/sh
# Set SCT Error Recovery Control to 7 seconds
# (smartctl takes the value in tenths of a second, so 70 = 7.0 s)
smartctl -l scterc,70,70 /dev/sda
smartctl -l scterc,70,70 /dev/sdb
pa4wdh wrote: I have a very similar setup to yours in my server. It used to be 2 mechanical drives, but I've replaced them with SSDs by swapping them one at a time and letting RAID do its magic.

Many thanks for the super useful post. I have now pulled the trigger and have started ordering hardware.
ipic wrote: Was there a time when you were using a RAID array with a HDD and an SSD active at the same time? Or was that just for the time while you converted the drives from HDD to SSD?

For about a week I've been running with 1 HDD and 1 SSD. The difference in speed was huge (1TB 2.5" 5400rpm disk vs. EVO 860), but that didn't affect the array; just don't expect a performance increase during that time.
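The one-at-a-time replacement pa4wdh describes is the textbook md procedure; a sketch with placeholder names (md0 is the mirror, sdb1 the old HDD partition, sdc1 the new SSD partition):

```shell
# Take the old HDD out of the mirror...
mdadm --manage /dev/md0 --fail /dev/sdb1
mdadm --manage /dev/md0 --remove /dev/sdb1

# ...swap the physical drive and partition it, then add the SSD
# and let the array resync onto it
mdadm --manage /dev/md0 --add /dev/sdc1
cat /proc/mdstat   # monitor the resync
```

Repeat for the second member once the first resync completes; the array stays degraded (no redundancy) during each rebuild.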

Code: Select all
# smartctl -x /dev/sdk
.....
Device Statistics (GP Log 0x04)
...
0x01 0x010 4 28202 --- Power-on Hours
0x01 0x018 6 71355063635 --- Logical Sectors Written
...

Code: Select all
Power on for: 3.2 years
/dev/sdk: WDC WD20EZBX-00AYRA0 Written: 36.53 TB
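The "Written" figure follows directly from the Logical Sectors Written counter at 512 bytes per logical sector (decimal TB, integer division):

```shell
# 71355063635 sectors x 512 bytes, expressed in whole TB
echo "$(( 71355063635 * 512 / 1000000000000 )) TB"   # prints "36 TB"
```

The report above keeps two decimals, giving 36.53 TB.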
NeddySeagoon wrote: ....
Stay away from Samsung SSDs. Their warranty process is horrible and I had both halves of a mirror fail with bad blocks just a few hours apart.
....

This is interesting. It seems Samsung reviews go either 'it's the greatest' or 'it failed'. Hard to form a view, but your experience makes me leery of them.

ipic wrote: This is interesting. It seems Samsung reviews go either 'it's the greatest' or 'it failed'. Hard to form a view, but your experience makes me leery of them.

The problem, I guess, is that no amount of anecdotal reports will settle this. They've been OK for me, but I've heard horror stories. I don't think anybody except a drive manufacturer can do the kind of large-scale testing that would be needed to establish a reliable MTBF.


ipic wrote: I decided to try a Samsung 870 EVO 2TB in one set of my RAID-1 arrays.

It's the Samsung Pro series, not the EVO, that has historically had the best reputation for longevity. Sadly, that report has become rather dated (it's from 2015). It would really be interesting to see it repeated with more modern hardware.
ipic wrote: I decided to try a Samsung 870 EVO 2TB in one set of my RAID-1 arrays.

Revised to be fair to the Samsung.
Disappointingly, after one month it started showing signs of failure, chief of which was that the BIOS refused to boot off it. Additionally, it occasionally dropped to 1.5 Gb/s reported SATA speed, and several smartctl stats relating to errors were well into the hundreds.
I was lucky enough to get a replacement and change it out before it totally failed. Chalk one more data point to the "avoid Samsung" I guess.
Code: Select all
/dev/sda: FIKWOT FS810 4TB Written: 10.27 TB Power on for: 6.74 months
/dev/sdb: CT4000MX500SSD1 Written: 10.78TB Percentage of TBW: 1.0783% Power on for: 6.83 months
/dev/sdc: CT2000MX500SSD1 Written: 2.41TB Percentage of TBW: 0.3442% Power on for: 1.68 months
/dev/sdd: CT2000MX500SSD1 Written: 2.57TB Percentage of TBW: 0.3676% Power on for: 2.81 months
/dev/sde: Samsung SSD 870 EVO 2TB Written: 4.23TB Percentage of TBW: 0.3523% Power on for: 1.23 months
Code: Select all
/dev/nvme0n1: WD_BLACK SN850X 4000GB Written: 2.89 TB Percentage of TBW: 0.1204% Power on for: 1.20 weeks