Scrub is repairing data although memory, SMART and checksums are OK ... is this OK?

Status
Not open for further replies.

Martin RR

Dabbler
Joined
Feb 26, 2016
Messages
15
Hello.

My pool needs repairs after scrubs, although the SMART tests of the WD Red HDDs are OK and the memory seems to be OK too (memtest with ECC).
Is this normal behavior?

DURING scrub

Code:
zpool status
  pool: raid-hdd
state: ONLINE
  scan: scrub in progress since Wed Mar 15 10:16:50 2017
	12.3T scanned out of 13.8T at 268M/s, 1h37m to go
	116K repaired, 89.19% done
config:

	NAME                                          STATE     READ WRITE CKSUM
	raid-hdd                                      ONLINE       0     0     0
	  raidz1-0                                    ONLINE       0     0     0
	    ata-WDC_WD60EFRX-68L0BN1_WD-WXB1HB4SXXX   ONLINE       0     0     0  (repairing)
	    ata-WDC_WD60EFRX-68L0BN1_WD-WX11D86HDYYY  ONLINE       0     0     0
	    ata-WDC_WD60EFRX-68L0BN1_WD-WX41DA5LVZZZ  ONLINE       0     0     0



AFTER scrub

Code:
zpool status
  pool: raid-hdd
state: ONLINE
  scan: scrub repaired 116K in 15h12m with 0 errors on Thu Mar 16 01:29:27 2017
config:

	NAME                                          STATE     READ WRITE CKSUM
	raid-hdd                                      ONLINE       0     0     0
	  raidz1-0                                    ONLINE       0     0     0
	    ata-WDC_WD60EFRX-68L0BN1_WD-WXB1HB4SXXX   ONLINE       0     0     0
	    ata-WDC_WD60EFRX-68L0BN1_WD-WX11D86HDYYY  ONLINE       0     0     0
	    ata-WDC_WD60EFRX-68L0BN1_WD-WX41DA5LVZZZ  ONLINE       0     0     0

errors: No known data errors
 
Joined
Jan 7, 2015
Messages
1,155
We will need a full hardware list to properly assess this. What I can tell you is that by running Z1 you are only asking for trouble. If this pool is not very full, seriously consider going with another parity drive.

Now, with that said, the scrub is at least doing its job. It's finding and fixing errors before they snowball out of control, so that's good at least.
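
For anyone curious how a scrub can repair data without any drive reporting an error, here is a rough Python sketch of the idea. It is a simplification and not ZFS's actual code: SHA-256 stands in for the pool's real block checksum, the layout is reduced to two data columns plus one XOR parity column (as on a 3-disk RAIDZ1), only damage to a data column is handled, and names like `scrub_block` are made up for illustration.

```python
import hashlib
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """XOR two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def parity(columns):
    """Single (XOR) parity over a list of equal-length columns."""
    return reduce(xor_bytes, columns)


def block_checksum(columns) -> str:
    """Stand-in checksum over the whole block (assumption: SHA-256)."""
    return hashlib.sha256(b"".join(columns)).hexdigest()


def scrub_block(columns, parity_col, expected_checksum):
    """Verify a block; if the checksum fails, try rebuilding each
    data column from parity until the checksum matches again.

    Returns (data_columns, index_of_repaired_column_or_None).
    """
    if block_checksum(columns) == expected_checksum:
        return columns, None  # block is fine, nothing to repair

    for i in range(len(columns)):
        others = columns[:i] + columns[i + 1:]
        # XOR of the remaining data and the parity yields the missing column.
        candidate = parity(others + [parity_col])
        trial = columns[:i] + [candidate] + columns[i + 1:]
        if block_checksum(trial) == expected_checksum:
            return trial, i  # silent corruption found and repaired

    raise ValueError("block is unrecoverable with single parity")
```

The point is that the checksum, not the drive, decides whether data is good. A disk can return wrong bytes without a read error, and the scrub still notices and rewrites them, which would match READ/WRITE/CKSUM counters staying at 0 while the scan line reports repaired data.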
 

Martin RR

Dabbler
Joined
Feb 26, 2016
Messages
15
Thank you.

Here is my hardware:
Asrock C236 WSI Mainboard
Intel(R) Xeon(R) CPU E3-1275 v5 @ 3.60GHz
32 GB ECC Memory
350 Watt PSU

Pool:
hddPool: RaidZ1 3x WD Red
ssdPool: 5x Samsung 850 Evo

I tested the memory for 48 hours, even with ECC error injection, and swapped the drives and cables.

My question is: is a repair during a scrub a normal thing, or should we be concerned? Checksums are OK and SMART values too. There are also no data errors after the scrub (see above).

Do you need more info?
Code:
lspci
00:00.0 Host bridge: Intel Corporation Skylake Host Bridge/DRAM Registers (rev 07)
00:01.0 PCI bridge: Intel Corporation Skylake PCIe Controller (x16) (rev 07)
00:02.0 VGA compatible controller: Intel Corporation HD Graphics P530 (rev 06)
00:14.0 USB controller: Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller (rev 31)
00:14.2 Signal processing controller: Intel Corporation Sunrise Point-H Thermal subsystem (rev 31)
00:16.0 Communication controller: Intel Corporation Sunrise Point-H CSME HECI #1 (rev 31)
00:16.3 Serial controller: Intel Corporation Sunrise Point-H KT Redirection (rev 31)
00:17.0 SATA controller: Intel Corporation Sunrise Point-H SATA controller [AHCI mode] (rev 31)
00:1c.0 PCI bridge: Intel Corporation Sunrise Point-H PCI Express Root Port #1 (rev f1)
00:1c.7 PCI bridge: Intel Corporation Sunrise Point-H PCI Express Root Port #8 (rev f1)
00:1f.0 ISA bridge: Intel Corporation Sunrise Point-H LPC Controller (rev 31)
00:1f.2 Memory controller: Intel Corporation Sunrise Point-H PMC (rev 31)
00:1f.3 Audio device: Intel Corporation Sunrise Point-H HD Audio (rev 31)
00:1f.4 SMBus: Intel Corporation Sunrise Point-H SMBus (rev 31)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection (2) I219-LM (rev 31)
01:00.0 VGA compatible controller: NVIDIA Corporation GP106 [GeForce GTX 1060 6GB] (rev a1)
01:00.1 Audio device: NVIDIA Corporation Device 10f1 (rev a1)
03:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
 
Joined
Jan 7, 2015
Messages
1,155
Nah, your system is up to par. I think as long as it isn't happening all the time, it's probably the scrub doing its job. If it were me, I wouldn't worry too much about it. Keep your eye on the SMART data. Keep up the scrub routine. Watch out for anything fishy in the system logs. My bet stands: ZFS doing what it was designed to do.

Consider adding more parity though. Z1 is dead to most people around here.

Good luck!
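
If you want to track whether repairs keep showing up from scrub to scrub, a small script can pull the numbers out of the `scan:` line. This is only a sketch: it assumes the line looks like the one posted above ("scrub repaired 116K in 15h12m with 0 errors"), which may differ between ZFS versions, and the function names are made up.

```python
import re

# Matches the finished-scrub summary as it appears in this thread's output.
SCAN_RE = re.compile(
    r"scrub repaired (?P<repaired>\S+) in (?P<duration>\S+) "
    r"with (?P<errors>\d+) errors"
)


def parse_scrub_result(status_text: str):
    """Return repaired amount, duration and error count from
    `zpool status` text, or None if no finished scrub is reported."""
    match = SCAN_RE.search(status_text)
    if match is None:
        return None
    return {
        "repaired": match.group("repaired"),
        "duration": match.group("duration"),
        "errors": int(match.group("errors")),
    }


def repaired_anything(result) -> bool:
    """True if the scrub rewrote any data (assumes '0' or '0B' means none)."""
    return result["repaired"] not in ("0", "0B")


if __name__ == "__main__":
    sample = (
        "  scan: scrub repaired 116K in 15h12m with 0 errors "
        "on Thu Mar 16 01:29:27 2017"
    )
    print(parse_scrub_result(sample))
```

Feed it the saved output of `zpool status` after each scrub and keep the results. An occasional small repair is one thing, a repaired amount that grows every time is what I would call fishy.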
 

rs225

Guru
Joined
Jun 28, 2014
Messages
878
I find Raidz1 to be very motivating; my backups are always recent.

It's the Raidz2 and Raidz3 guys who ought to worry; they never back up.
 

droeders

Contributor
Joined
Mar 21, 2016
Messages
179
rs225 said:
I find Raidz1 to be very motivating; my backups are always recent.

It's the Raidz2 and Raidz3 guys who ought to worry; they never back up.

A good backup system should never depend on the motivation of a human.

Validating and monitoring backups - that's a different story.

Coming from a Z2 guy...
 