Mike Bruns
Hi all,
Could someone help me interpret this zpool error after a scrub? The pool is functional but reports corruption in a non-critical file. It's a new FreeNAS install.
My config: Dell PowerEdge T110 II server, 16GB ECC RAM, 5x6TB drives in RAIDZ2, running the current 9.3.1 stable release.
Note: One of the drives shows a smartctl error "in the past" but appears fine now. I'm waiting on the RMA replacement and will swap out the questionable drive after burn-in. Is it better to run a RAIDZ2 with one questionable drive, or to remove the questionable drive and run a RAIDZ1?
Code:
########## ZPool status report summary for all pools ##########
+--------------+--------+------+------+------+----+--------+------+-----+
|Pool Name |Status |Read |Write |Cksum |Used|Scrub |Scrub |Last |
| | |Errors|Errors|Errors| |Repaired|Errors|Scrub|
| | | | | | |Bytes | |Age |
+--------------+--------+------+------+------+----+--------+------+-----+
|freenas-boot |ONLINE | 0| 0| 0| 7%| 0| 0| 1|
|fullvolume !|ONLINE | 0| 0| 28| 30%| 84K| 7| 0|
+--------------+--------+------+------+------+----+--------+------+-----+
########## ZPool status report for freenas-boot ##########
pool: freenas-boot
state: ONLINE
scan: scrub repaired 0 in 0h1m with 0 errors on Wed Dec 30 02:24:26 2015
config:
NAME STATE READ WRITE CKSUM
freenas-boot ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
gptid/c97814ea-a44e-11e5-b1d0-f8db88ffc155 ONLINE 0 0 0
gptid/c9a28995-a44e-11e5-b1d0-f8db88ffc155 ONLINE 0 0 0
errors: No known data errors
########## ZPool status report for fullvolume ##########
pool: fullvolume
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://illumos.org/msg/ZFS-8000-8A
scan: scrub repaired 84K in 6h47m with 7 errors on Wed Dec 30 08:48:23 2015
config:
NAME STATE READ WRITE CKSUM
fullvolume ONLINE 0 0 7
raidz2-0 ONLINE 0 0 14
gptid/89d13f2b-aba0-11e5-be4a-f8db88ffc155 ONLINE 0 0 1
gptid/8a83b206-aba0-11e5-be4a-f8db88ffc155 ONLINE 0 0 0
gptid/8b271ed3-aba0-11e5-be4a-f8db88ffc155 ONLINE 0 0 1
gptid/8bd76287-aba0-11e5-be4a-f8db88ffc155 ONLINE 0 0 3
gptid/8c8582df-aba0-11e5-be4a-f8db88ffc155 ONLINE 0 0 2
errors: Permanent errors have been detected in the following files:
/var/db/system/cores/python2.7.core
=========================
########## SMART status report summary for all drives ##########
+------+---------------+----+-----+-----+-----+-------+-------+--------+------+------+------+-------+----+
|Device|Serial |Temp|Power|Start|Spin |ReAlloc|Current|Offline |UDMA |Seek |High |Command|Last|
| | | |On |Stop |Retry|Sectors|Pending|Uncorrec|CRC |Errors|Fly |Timeout|Test|
| | | |Hours|Count|Count| |Sectors|Sectors |Errors| |Writes|Count |Age |
+------+---------------+----+-----+-----+-----+-------+-------+--------+------+------+------+-------+----+
|ada0 ?|WOL240326574 | 35 | 141| 8| 0| 0| 0| 0| 0| N/A| N/A| N/A| 5|
|ada1 ?|WOL240327198 | 35 | 284| 12| 0| 0| 0| 0| 0| N/A| N/A| N/A| 5|
|ada2 ?|WOL240327200 | 35 | 284| 12| 0| 0| 0| 0| 0| N/A| N/A| N/A| 5|
|ada3 ?|WOL240327207 | 34 | 284| 12| 0| 0| 0| 0| 0| N/A| N/A| N/A| 5|
|ada4 ?|WOL240327210 | 36 | 265| 14| 0| 0| 0| 0| 0| N/A| N/A| N/A| 5|
+------+---------------+----+-----+-----+-----+-------+-------+--------+------+------+------+-------+----+
########## SMART status report for ada0 drive (: WOL240326574) ##########
SMART overall-health self-assessment test result: PASSED
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 201 197 021 Pre-fail Always - 8941
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 8
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 141
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 8
16 Unknown_Attribute 0x0022 000 200 000 Old_age Always - 17197295533
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 4
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 3
194 Temperature_Celsius 0x0022 117 112 000 Old_age Always - 35
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged
Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
Extended offline Completed without error 00% 12 -
########## SMART status report for ada1 drive (: WOL240327198) ##########
SMART overall-health self-assessment test result: PASSED
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 211 207 021 Pre-fail Always - 8433
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 12
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 284
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 12
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 7
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 63
194 Temperature_Celsius 0x0022 117 111 000 Old_age Always - 35
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged
Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
Conveyance offline Completed without error 00% 164 -
########## SMART status report for ada2 drive (: WOL240327200) ##########
SMART overall-health self-assessment test result: PASSED
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 210 206 021 Pre-fail Always - 8483
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 12
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 284
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 12
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 7
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 61
194 Temperature_Celsius 0x0022 117 109 000 Old_age Always - 35
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged
Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
Conveyance offline Completed without error 00% 164 -
########## SMART status report for ada3 drive (: WOL240327207) ##########
SMART overall-health self-assessment test result: PASSED
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 209 206 021 Pre-fail Always - 8508
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 12
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 284
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 12
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 7
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 62
194 Temperature_Celsius 0x0022 118 112 000 Old_age Always - 34
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged
Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
Conveyance offline Completed without error 00% 164 -
########## SMART status report for ada4 drive (: WOL240327210) ##########
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 001 051 Pre-fail Always In_the_past 0
3 Spin_Up_Time 0x0027 209 206 021 Pre-fail Always - 8533
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 14
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 265
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 10
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 6
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 21
194 Temperature_Celsius 0x0022 116 110 000 Old_age Always - 36
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 198 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0
No Errors Logged
Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
Short offline Completed without error 00% 147 -
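
For anyone reading the SMART tables above: as far as I understand it, smartctl prints "In_the_past" in the WHEN_FAILED column when an attribute's WORST value has at some point dropped to or below its THRESH value, even though the current VALUE is back above it. Here is a rough Python sketch of that check. The function name and the sample rows (copied from the ada4 report) are just for illustration, and it assumes the usual ten-column attribute layout shown above.

```python
def flag_past_failures(attribute_lines):
    """Return (name, worst, thresh) for attributes whose WORST reached THRESH."""
    flagged = []
    for line in attribute_lines:
        fields = line.split()
        # Expected columns:
        # ID NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
        if len(fields) < 10 or not fields[0].isdigit():
            continue  # skip headers and anything that is not an attribute row
        name = fields[1]
        worst = int(fields[4])
        thresh = int(fields[5])
        # A threshold of 000 means the attribute is informational only,
        # so it is never treated as failed here.
        if thresh > 0 and worst <= thresh:
            flagged.append((name, worst, thresh))
    return flagged


# Three rows taken from the ada4 report above.
sample = [
    "1 Raw_Read_Error_Rate 0x002f 200 001 051 Pre-fail Always In_the_past 0",
    "5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0",
    "197 Current_Pending_Sector 0x0032 200 198 000 Old_age Always - 0",
]

print(flag_past_failures(sample))
```

On the ada4 rows this flags only Raw_Read_Error_Rate (WORST 001 against THRESH 051), which matches the one "In_the_past" entry in the report.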