Pool Layout Question

Scharbag

Guru
Joined
Feb 1, 2012
Messages
620
Wow - SMR disks blow.

This pool now has no SMR disks (detached a CMR 4TB from backuptank and put it into bigtank):

[screenshot: bigtank pool status]


This pool has 2 SMR disks (the SMR 6TB disk I detached from bigtank is the one being resilvered into backuptank):

[screenshot: backuptank pool status, resilver in progress]


And I am getting this wonderful news from TrueNAS:

[screenshot: TrueNAS resilver status]


Happy I will be rid of all SMR disks soon. The 20TB drives are just starting their disk-burnin.sh journey... Should take a while (weeks??). The long SMART test alone will take ~24 hours. A big thank you to whoever invented TMUX!!!
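
For anyone curious, the tmux workflow for a multi-day burn-in is roughly this (session name and device are just examples):

tmux new -s burnin      # start a named session
./disk-burnin.sh da5    # run the burn-in inside it
# detach with Ctrl-b d; the script keeps running
tmux attach -t burnin   # reattach later to check on it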

Cheers,
 

ChrisRJ

Wizard
Joined
Oct 23, 2020
Messages
1,919
The resilver speed for your SMR drives is still relatively decent. I remember reading a test (I think on STH - Serve The Home) where a resilver with SMR drives took more than a week.
 

mav@

iXsystems
iXsystems
Joined
Sep 29, 2011
Messages
1,428
Just for a note, the new 13.0-U2 got a number of scrub improvements, some of which should make the process a bit more sequential, which may help SMR a bit, if that is possible at all. Though the main direction of the improvements was to reduce scrub CPU usage.
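
For anyone mid-scrub and curious, watching or pausing one is simple (pool name is an example):

zpool status tank    # shows scrub progress and estimated completion
zpool scrub -p tank  # pause a running scrub; "zpool scrub tank" resumes it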
 

HoneyBadger

actually does care
Administrator
Moderator
iXsystems
Joined
Feb 6, 2014
Messages
5,112
It's a bit late to do it now, but some newer (WD and Seagate) SMR disks understand TRIM commands to "refresh" their shingle layout. Blasting the drive with zeroes (manually or via ATA_SECURE_ERASE) seems to also do the trick for the older models.
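
For a drive that is already out of the pool, a rough FreeBSD sketch (da5 is an example device, and this is destructive, so triple-check the target):

trim -f /dev/da5                                    # whole-device TRIM, if the drive honors it
dd if=/dev/zero of=/dev/da5 bs=1M status=progress   # zero-fill, for the older models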

Put another one on my list of "things to do should I ever get the time" - try to tune enough ZFS tunables so that it never writes anything smaller than a 256MB full-zone allocation.
 

souporman

Explorer
Joined
Feb 3, 2015
Messages
57
I would have to agree that with drives larger than say 10TB, and certainly 20TB, the risk of unrecoverable read error during disk replacement starts to get high. Thus, 3 way mirrors or RAID-Z2.
Maybe I am wrong, but I've always thought UREs didn't affect mirrors like they do RAIDZ1 and RAIDZ2, because a mirror does not use a parity disk. I don't think a URE affects mirrors during the re-mirroring process. Isn't that one of the biggest selling points of mirrors, besides better random IO?
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
I don't think a URE affects mirrors during the re-mirroring process.
Of course it does--how could it not? If you have a two-disk mirror, one disk dies, you're trying to replace it, and you have a read error on the only remaining copy of your data, how would that not cause data loss?
 

Arwen

MVP
Joined
May 17, 2014
Messages
3,611
Maybe I am wrong, but I've always thought UREs didn't affect mirrors like they do RAIDZ1 and RAIDZ2, because a mirror does not use a parity disk. I don't think a URE affects mirrors during the re-mirroring process. Isn't that one of the biggest selling points of mirrors, besides better random IO?
Of course it does--how could it not? If you have a two-disk mirror, one disk dies, you're trying to replace it, and you have a read error on the only remaining copy of your data, how would that not cause data loss?
Yes, it is a matter of the disk's error rate. Nothing to do with RAID-5/6 or RAID-Z1/2/3 specifically.

If a disk's error rate is 1 in X bits, and you have to read X bits from a disk to recover from a different failed disk, then you have a real chance of hitting a URE. With newer disks exceeding X in size, the possibility of a URE is much higher.

The problem is compounded by RAID-5/6 or RAID-Z1/2/3 because at times you have to read much more than X, because of the disk stripe. Thus, people have been suggesting RAID-6 or RAID-Z2/3 as a way to overcome the problem.
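
To put rough numbers on it, assuming the common consumer datasheet rate of 1 URE per 10^14 bits read: a 20TB disk is about 1.6 x 10^14 bits, so reading the whole thing expects ~1.6 UREs, and the chance of hitting at least one is about 1 - e^-1.6, or roughly 80%. Drives rated at 1 per 10^15 bits drop that to roughly 15% for the same full read.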


It is possible to have hundreds, thousands, even millions of UREs during recovery and still get 100% data recovery. As long as there is enough redundancy in the individual stripes, ZFS can complete the resilver.

However, this is much less likely with 2-way mirrors. If a disk fails completely in a 2-way mirror, and the data has a single copy (aka ZFS "copies=1"), then any URE on a data block will result in failure of recovery for that block / file.

Note that ZFS by default keeps an extra copy of metadata, and even more copies of critical metadata. So if a directory entry hits a URE, another copy, ON THE SAME DISK, is available for recovery purposes. This is because metadata is in some respects more important than raw data: losing it can cause a much larger amount of data loss.
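
You can see, or tune, this behaviour through dataset properties; pool / dataset names below are just examples:

zfs get redundant_metadata tank/data   # "all" by default: ZFS keeps extra copies of metadata
zfs set copies=2 tank/data             # duplicate data blocks too; only affects newly written data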
 

Scharbag

Guru
Joined
Feb 1, 2012
Messages
620
Just for a note, the new 13.0-U2 got a number of scrub improvements, some of which should make the process a bit more sequential, which may help SMR a bit, if that is possible at all. Though the main direction of the improvements was to reduce scrub CPU usage.
Given I am mid-scrub, and mid-testing of the new drives, I will upgrade to 13.0-U2 once everything is done. I plan on getting rid of all of my SMR drives ASAFP.

:)
 

Scharbag

Guru
Joined
Feb 1, 2012
Messages
620
[screenshot: resilver progress]


Yeah, SMR can rot in the bowels of hell.
 

Scharbag

Guru
Joined
Feb 1, 2012
Messages
620
So, yeah, this is almost done...

[screenshot: disk burn-in progress]


Looks like a 20TB disk will take a little over 9 days to do a full burn-in, as there is still a 24-to-26-hour SMART extended test to do.
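
For reference, that last step is just (device name is an example):

smartctl -t long /dev/da5   # start the extended self-test
smartctl -a /dev/da5        # progress shows under "Self-test execution status"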

But confidence should be high that these manufacturer-recertified drives will last.

Then we move on to moving all of the data across...
 

Scharbag

Guru
Joined
Feb 1, 2012
Messages
620
Finally done at about 4pm today. So, yeah, 9+ days.

Here goes the copy :)
 