Suddenly FreeNAS won't report temp for 1 drive

sremick

Patron
Joined
Sep 24, 2014
Messages
323
So originally when I upgraded to 11.2, all 6 of my drives were reporting temps.

But now, /dev/ada2 is showing "N/A" for temperature in the Dashboard. All the other drives show a temp correctly.

smartctl -a /dev/ada2 correctly reports a temp, so it's not that the drive isn't providing it.

What might be going on?

(I know of the read errors in the logs, but those are from weeks ago and the drive hasn't reported any new ones. I'm keeping an eye on it.)

Code:
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    xxxxxxxxxxxxxxxx
LU WWN Device Id: 5 0014ee 604fc6e6e
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Thu Jan  3 20:27:36 2019 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever
                    been run.
Total time to complete Offline
data collection:         (40680) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (   2) minutes.
Extended self-test routine
recommended polling time:      ( 408) minutes.
Conveyance self-test routine
recommended polling time:      (   5) minutes.
SCT capabilities:            (0x703d)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       79
  3 Spin_Up_Time            0x0027   177   176   021    Pre-fail  Always       -       6116
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       76
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   051   051   000    Old_age   Always       -       36356
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       76
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       46
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       396
194 Temperature_Celsius     0x0022   111   102   000    Old_age   Always       -       39
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       1
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       14

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%     36147         1433664792
# 2  Extended offline    Completed: read failure       90%     36053         1433664792
# 3  Short offline       Completed: read failure       10%     35980         1565560512
# 4  Extended offline    Completed: read failure       90%     35720         1433664793
# 5  Short offline       Completed: read failure       90%     35647         1433664793
# 6  Extended offline    Completed: read failure       10%     35508         1565560512
# 7  Short offline       Completed: read failure       90%     35500         1433664792
# 8  Short offline       Completed without error       00%     14332         -
# 9  Extended offline    Completed without error       00%     14244         -
#10  Short offline       Completed without error       00%     14164         -
#11  Short offline       Completed without error       00%     13996         -
#12  Extended offline    Completed without error       00%     13909         -
#13  Short offline       Completed without error       00%     13828         -
#14  Short offline       Completed without error       00%     13588         -
#15  Extended offline    Completed without error       00%     13501         -
#16  Short offline       Completed without error       00%     13420         -
#17  Short offline       Completed without error       00%     13252         -
#18  Extended offline    Completed without error       00%     13178         -
#19  Short offline       Completed without error       00%     13097         -
#20  Short offline       Completed without error       00%     12882         -
#21  Extended offline    Completed without error       00%     12794         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,080
It could easily be a bug, it is still a very new software release, so you might want to file a report.
 

sremick

Patron
Joined
Sep 24, 2014
Messages
323
Oddly, for a brief period last night it was reporting, but it's back to "N/A" which is what it seems to be at the vast majority of the time. The other 5 drives remain consistently dependable in their temp reporting.

I'll watch it for a bit longer to try and make some sense of it, then pursue a bug report.
 

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,080
Oddly, for a brief period last night it was reporting, but it's back to "N/A" which is what it seems to be at the vast majority of the time. The other 5 drives remain consistently dependable in their temp reporting.

I'll watch it for a bit longer to try and make some sense of it, then pursue a bug report.
Are they all connected to the same controller? Same model drives?
Code:
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%     36147         1433664792
# 2  Extended offline    Completed: read failure       90%     36053         1433664792
# 3  Short offline       Completed: read failure       10%     35980         1565560512
# 4  Extended offline    Completed: read failure       90%     35720         1433664793
# 5  Short offline       Completed: read failure       90%     35647         1433664793
# 6  Extended offline    Completed: read failure       10%     35508         1565560512
# 7  Short offline       Completed: read failure       90%     35500         1433664792
# 8  Short offline       Completed without error       00%     14332         -
Did you notice all these test failures?
It could be that there is a drive fault.
How do the other drives look?

I went back and I see that you commented on those. If a drive is failing to complete the self test, it is dead and time to be returned for warranty replacement or just replaced outright if it is out of warranty.
 
Last edited:

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,080
Code:
  9 Power_On_Hours          0x0032   051   051   000    Old_age   Always       -       36356
You have a spare drive on hand?

That is a little over four years of run time. It is not that unusual for a drive to fail after that long. I would suggest replacement of this one and you might want to start planning for replacement of the rest between now and the five year mark. Some drives can last longer than five years, but around five years is when they (statistically) start to have a higher failure rate.
 
Last edited:

sremick

Patron
Joined
Sep 24, 2014
Messages
323
Are they all connected to the same controller?
Yes, all 6 are on the on-board Intel C226.

Same model drives?
All except one, which was already just replaced w/ an 8TB.

How do the other drives look?
Not home at the moment, but they looked good if I recall. I'll pull the full data later tonight.

I went back and I see that you commented on those. If a drive is failing to complete the self test, it is dead and time to be returned for warranty replacement or just replaced outright if it is out of warranty.

You have a spare drive on hand?
I do, if truly necessary. I was hoping that since in this case it was just 1 "Currently unreadable" sector back in November that then cleared and no more alerts, that I could get more life out of ada2. But in looking at the smart info, the latest error entry in the log was at the 36147 hour mark, and we're now at hour 36356 so just a little over 8 days ago. I certainly didn't get any alerts in FreeNAS about ada2 since November.

So based on this, I suppose a replacement of ada2 is due, but this raises some questions:

1) Due to all the errors in the logs, why would the drive SMART health be "PASSED", and "The previous self-test routine completed without error"?
2) Why hasn't FreeNAS notified me through one mechanism or another about the deteriorating health of this drive, since the drive seems to be aware itself, tests have failed, and there have been multiple read errors? I only got that one back in November which then cleared after I tried re-seating the cable.

I fear there's something fundamental I'm misunderstanding about SMART reporting, and FreeNAS drive health alerts. I want to ensure I have everything configured properly.
 

sremick

Patron
Joined
Sep 24, 2014
Messages
323
Here are the stats for the other 3TB drives:

Code:
# smartctl -a /dev/ada0
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    xxxxxxxxxxxxxxx
LU WWN Device Id: 5 0014ee 65a512aec
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Jan  4 23:21:42 2019 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever
                    been run.
Total time to complete Offline
data collection:         (39120) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (   2) minutes.
Extended self-test routine
recommended polling time:      ( 393) minutes.
Conveyance self-test routine
recommended polling time:      (   5) minutes.
SCT capabilities:            (0x703d)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   178   177   021    Pre-fail  Always       -       6083
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       78
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   051   051   000    Old_age   Always       -       36383
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       78
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       47
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       377
194 Temperature_Celsius     0x0022   115   105   000    Old_age   Always       -       35
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     36147         -
# 2  Extended offline    Completed without error       00%     36060         -
# 3  Short offline       Completed without error       00%     35979         -
# 4  Extended offline    Interrupted (host reset)      90%     35882         -
# 5  Extended offline    Interrupted (host reset)      90%     35882         -
# 6  Short offline       Completed without error       00%     35882         -
# 7  Short offline       Completed without error       00%     35815         -
# 8  Extended offline    Completed without error       00%     35727         -
# 9  Short offline       Completed without error       00%     35647         -
#10  Extended offline    Completed without error       00%     35507         -
#11  Short offline       Completed without error       00%     35499         -
#12  Short offline       Completed without error       00%     14331         -
#13  Extended offline    Completed without error       00%     14244         -
#14  Short offline       Completed without error       00%     14164         -
#15  Short offline       Completed without error       00%     13996         -
#16  Extended offline    Completed without error       00%     13908         -
#17  Short offline       Completed without error       00%     13828         -
#18  Short offline       Completed without error       00%     13588         -
#19  Extended offline    Completed without error       00%     13501         -
#20  Short offline       Completed without error       00%     13420         -
#21  Short offline       Completed without error       00%     13252         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


Code:
# smartctl -a /dev/ada1
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    xxxxxxxxxxxxxxxxxxx
LU WWN Device Id: 5 0014ee 65a518afb
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Jan  4 23:22:23 2019 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever
                    been run.
Total time to complete Offline
data collection:         (38880) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (   2) minutes.
Extended self-test routine
recommended polling time:      ( 390) minutes.
Conveyance self-test routine
recommended polling time:      (   5) minutes.
SCT capabilities:            (0x703d)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   182   181   021    Pre-fail  Always       -       5858
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       76
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   051   051   000    Old_age   Always       -       36383
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       76
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       46
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       396
194 Temperature_Celsius     0x0022   112   103   000    Old_age   Always       -       38
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     36147         -
# 2  Extended offline    Completed without error       00%     36060         -
# 3  Short offline       Completed without error       00%     35979         -
# 4  Short offline       Completed without error       00%     35814         -
# 5  Extended offline    Completed without error       00%     35727         -
# 6  Short offline       Completed without error       00%     35647         -
# 7  Extended offline    Completed without error       00%     35507         -
# 8  Short offline       Completed without error       00%     35499         -
# 9  Short offline       Completed without error       00%     14331         -
#10  Extended offline    Completed without error       00%     14244         -
#11  Short offline       Completed without error       00%     14164         -
#12  Short offline       Completed without error       00%     13996         -
#13  Extended offline    Completed without error       00%     13908         -
#14  Short offline       Completed without error       00%     13828         -
#15  Short offline       Completed without error       00%     13588         -
#16  Extended offline    Completed without error       00%     13501         -
#17  Short offline       Completed without error       00%     13420         -
#18  Short offline       Completed without error       00%     13252         -
#19  Extended offline    Completed without error       00%     13177         -
#20  Short offline       Completed without error       00%     13097         -
#21  Short offline       Completed without error       00%     12881         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


Code:
# smartctl -a /dev/ada4
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    xxxxxxxxxxxxxxxxxx
LU WWN Device Id: 5 0014ee 6afa6e2ab
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Jan  4 23:23:50 2019 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever
                    been run.
Total time to complete Offline
data collection:         (40860) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (   2) minutes.
Extended self-test routine
recommended polling time:      ( 410) minutes.
Conveyance self-test routine
recommended polling time:      (   5) minutes.
SCT capabilities:            (0x703d)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   180   178   021    Pre-fail  Always       -       5991
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       76
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   051   051   000    Old_age   Always       -       36383
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       76
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       46
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       371
194 Temperature_Celsius     0x0022   113   100   000    Old_age   Always       -       37
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     36147         -
# 2  Extended offline    Completed without error       00%     36060         -
# 3  Short offline       Completed without error       00%     35980         -
# 4  Short offline       Completed without error       00%     35815         -
# 5  Extended offline    Completed without error       00%     35728         -
# 6  Short offline       Completed without error       00%     35647         -
# 7  Extended offline    Completed without error       00%     35508         -
# 8  Short offline       Completed without error       00%     35500         -
# 9  Short offline       Completed without error       00%     14332         -
#10  Extended offline    Completed without error       00%     14245         -
#11  Short offline       Completed without error       00%     14164         -
#12  Short offline       Completed without error       00%     13996         -
#13  Extended offline    Completed without error       00%     13909         -
#14  Short offline       Completed without error       00%     13828         -
#15  Short offline       Completed without error       00%     13588         -
#16  Extended offline    Completed without error       00%     13501         -
#17  Short offline       Completed without error       00%     13420         -
#18  Short offline       Completed without error       00%     13252         -
#19  Extended offline    Completed without error       00%     13178         -
#20  Short offline       Completed without error       00%     13097         -
#21  Short offline       Completed without error       00%     12882         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


Code:
# smartctl -a /dev/ada5
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    xxxxxxxxxxxxxxxxxxxx
LU WWN Device Id: 5 0014ee 6afa6e3cc
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Jan  4 23:24:32 2019 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever
                    been run.
Total time to complete Offline
data collection:         (39720) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (   2) minutes.
Extended self-test routine
recommended polling time:      ( 399) minutes.
Conveyance self-test routine
recommended polling time:      (   5) minutes.
SCT capabilities:            (0x703d)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   184   181   021    Pre-fail  Always       -       5800
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       77
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   051   051   000    Old_age   Always       -       36383
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       77
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       47
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       374
194 Temperature_Celsius     0x0022   114   100   000    Old_age   Always       -       36
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       1

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     36147         -
# 2  Extended offline    Completed without error       00%     36060         -
# 3  Short offline       Completed without error       00%     35980         -
# 4  Short offline       Completed without error       00%     35815         -
# 5  Extended offline    Completed without error       00%     35727         -
# 6  Short offline       Completed without error       00%     35647         -
# 7  Extended offline    Completed without error       00%     35508         -
# 8  Short offline       Completed without error       00%     35500         -
# 9  Short offline       Completed without error       00%     14332         -
#10  Extended offline    Completed without error       00%     14244         -
#11  Short offline       Completed without error       00%     14164         -
#12  Short offline       Completed without error       00%     13996         -
#13  Extended offline    Completed without error       00%     13909         -
#14  Short offline       Completed without error       00%     13828         -
#15  Short offline       Completed without error       00%     13588         -
#16  Extended offline    Completed without error       00%     13501         -
#17  Short offline       Completed without error       00%     13420         -
#18  Short offline       Completed without error       00%     13253         -
#19  Extended offline    Completed without error       00%     13178         -
#20  Short offline       Completed without error       00%     13097         -
#21  Short offline       Completed without error       00%     12882         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

telgordo

Cadet
Joined
Feb 4, 2016
Messages
7
Having some fun myself with a homebrew server the 8 Western digital Red 3TB drives are all working okay, they are connected via an Avago/LSI 9211-8i HBA, just assembled this and all was well until Freenas reported that drive mounted at 0 was running at 238 degrees celcius, clearly nonsense, the drive was no hotter than the others and there was no silver puddle in the case, I am currently working on the assumption that this is either a bug or a faulty sensor, but a second opinion would be nice
 

melloa

Wizard
Joined
May 22, 2016
Messages
1,749
Having some fun myself with a homebrew server the 8 Western digital Red 3TB drives are all working okay, they are connected via an Avago/LSI 9211-8i HBA, just assembled this and all was well until Freenas reported that drive mounted at 0 was running at 238 degrees celcius, clearly nonsense, the drive was no hotter than the others and there was no silver puddle in the case, I am currently working on the assumption that this is either a bug or a faulty sensor, but a second opinion would be nice

I'd move this to a new thread as it is a different issue. Also, when opening a new post, provide all you can about your hardware and smart report for the impacted drive.
 
Top