melloa
Wizard
- Joined
- May 22, 2016
- Messages
- 1,749
Interesting that one, and only one, of my HDDs is getting hot (da14 below). That is forcing the server to run at the heavy I/O or full fan setting.
It is at 38C, but I have the script set to go to full at 39C, so it is about to make noise :)
Resilver in progress.
I just can't find any explanation for why, other than a manufacturing defect or maybe a faulty sensor, but since the drive doesn't show any errors, I can't even RMA it.
Code:
[2018-09-17 09:05:00] monitor_hdd_temp: Start
[2018-09-17 09:05:00] monitor_hdd_temp: Getting list of drives
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da2: WDC WD40EFRX-68WT0N0 - WD-WCC4E0NUJ0C4 - temp 33C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da2: WDC WD40EFRX-68WT0N0 - WD-WCC4E0NUJ0C4 - temp 33C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da3: WDC WD40EFRX-68WT0N0 - WD-WCC4E0ARJR2R - temp 32C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da3: WDC WD40EFRX-68WT0N0 - WD-WCC4E0ARJR2R - temp 32C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da4: WDC WD40EFRX-68WT0N0 - WD-WCC4E4FR61KF - temp 33C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da4: WDC WD40EFRX-68WT0N0 - WD-WCC4E4FR61KF - temp 33C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da5: WDC WD40EFRX-68WT0N0 - WD-WCC4E6SEFY6R - temp 34C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da5: WDC WD40EFRX-68WT0N0 - WD-WCC4E6SEFY6R - temp 34C
[2018-09-17 09:05:02] monitor_hdd_temp: Processing: /dev/da6: WDC WD40EFRX-68WT0N0 - WD-WCC4E6UR30JF - temp 34C
[2018-09-17 09:05:03] monitor_hdd_temp: Processing: /dev/da6: WDC WD40EFRX-68WT0N0 - WD-WCC4E6UR30JF - temp 34C
[2018-09-17 09:05:03] monitor_hdd_temp: Processing: /dev/da7: WDC WD40EFRX-68WT0N0 - WD-WCC4E4PHJH37 - temp 33C
[2018-09-17 09:05:03] monitor_hdd_temp: Processing: /dev/da7: WDC WD40EFRX-68WT0N0 - WD-WCC4E4PHJH37 - temp 33C
[2018-09-17 09:05:04] monitor_hdd_temp: Processing: /dev/da8: WDC WD40EFRX-68WT0N0 - WD-WCC4E2XH9F5L - temp 34C
[2018-09-17 09:05:05] monitor_hdd_temp: Processing: /dev/da8: WDC WD40EFRX-68WT0N0 - WD-WCC4E2XH9F5L - temp 34C
[2018-09-17 09:05:05] monitor_hdd_temp: Processing: /dev/da9: WDC WD40EFRX-68N32N0 - WD-WCC7K3TTVV06 - temp 34C
[2018-09-17 09:05:06] monitor_hdd_temp: Processing: /dev/da9: WDC WD40EFRX-68N32N0 - WD-WCC7K3TTVV06 - temp 34C
[2018-09-17 09:05:06] monitor_hdd_temp: Processing: /dev/da10: WDC WD40EFRX-68WT0N0 - WD-WCC4E2DD5135 - temp 35C
[2018-09-17 09:05:06] monitor_hdd_temp: Processing: /dev/da10: WDC WD40EFRX-68WT0N0 - WD-WCC4E2DD5135 - temp 35C
[2018-09-17 09:05:07] monitor_hdd_temp: Processing: /dev/da11: WDC WD40EFRX-68WT0N0 - WD-WCC4E5RAAT4P - temp 34C
[2018-09-17 09:05:07] monitor_hdd_temp: Processing: /dev/da11: WDC WD40EFRX-68WT0N0 - WD-WCC4E5RAAT4P - temp 34C
[2018-09-17 09:05:08] monitor_hdd_temp: Processing: /dev/da12: WDC WD40EFRX-68WT0N0 - WD-WCC4E4DH1DNA - temp 32C
[2018-09-17 09:05:08] monitor_hdd_temp: Processing: /dev/da12: WDC WD40EFRX-68WT0N0 - WD-WCC4E4DH1DNA - temp 32C
[2018-09-17 09:05:08] monitor_hdd_temp: Processing: /dev/da13: WDC WD40EFRX-68N32N0 - WD-WCC7K5XZJTS7 - temp 32C
[2018-09-17 09:05:08] monitor_hdd_temp: Processing: /dev/da13: WDC WD40EFRX-68N32N0 - WD-WCC7K5XZJTS7 - temp 32C
[2018-09-17 09:05:09] monitor_hdd_temp: Processing: /dev/da14: WDC WD4002FFWX-68TZ4N0 - NHG3962K - temp 38C
[2018-09-17 09:05:09] monitor_hdd_temp: Drive /dev/da14 current temperature (38C), exceeded alert temperature (35C).
[2018-09-17 09:05:09] monitor_hdd_temp: Fancontrol: Keeping same fan setting >> HEAVY I/O
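For anyone following along, the behaviour in that log can be sketched roughly as below. This is only an illustration under assumptions, not the actual monitor_hdd_temp script: the function name `fan_setting` and the "STANDARD" label are made up for the example, and the thresholds are the 35C alert and 39C full values mentioned in this post. The alert is treated as strictly above 35C, since da10 at 35C did not trigger it in the log.

```python
ALERT_TEMP_C = 35  # alert temperature reported in the log above
FULL_TEMP_C = 39   # temperature at which the script is set to go to full fan


def fan_setting(drive_temps):
    """Pick a fan setting from a dict of {device: temperature in C}.

    Hypothetical helper: the hottest drive alone decides the setting.
    """
    hottest = max(drive_temps.values())
    if hottest >= FULL_TEMP_C:
        return "FULL"
    if hottest > ALERT_TEMP_C:
        return "HEAVY I/O"
    return "STANDARD"  # assumed name for the quiet setting


if __name__ == "__main__":
    # Readings taken from the log: only da14 is above the alert temperature.
    print(fan_setting({"da2": 33, "da10": 35, "da14": 38}))
```

With the readings from the log, one drive at 38C is enough to hold the whole chassis at the heavy I/O setting, which matches what the script reported.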
Code:
=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red Pro
Device Model:     WDC WD4002FFWX-68TZ4N0
Serial Number:    NHG3962K
LU WWN Device Id: 5 000cca 243c17fa2
Firmware Version: 83.H0A83
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Mon Sep 17 09:12:35 2018 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

  # ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   016    Pre-fail  Always   -           0
  2 Throughput_Performance  0x0005   134   134   054    Pre-fail  Offline  -           116
  3 Spin_Up_Time            0x0007   136   136   024    Pre-fail  Always   -           484 (Average 484)
  4 Start_Stop_Count        0x0012   100   100   000    Old_age   Always   -           99
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always   -           0
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always   -           0
  8 Seek_Time_Performance   0x0005   128   128   020    Pre-fail  Offline  -           18
  9 Power_On_Hours          0x0012   099   099   000    Old_age   Always   -           9299
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always   -           0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always   -           93
192 Power-Off_Retract_Count 0x0032   097   097   000    Old_age   Always   -           3875
193 Load_Cycle_Count        0x0012   097   097   000    Old_age   Always   -           3875
194 Temperature_Celsius     0x0002   153   153   000    Old_age   Always   -           39 (Min/Max 15/47)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always   -           0
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always   -           0
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline  -           0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always   -           0

SMART Error Log Version: 1
No Errors Logged
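In case it helps anyone cross-check the sensor reading, here is a rough sketch of pulling the current temperature out of attribute 194 in output like the above. It assumes the attribute table layout shown in this post (ID in the first column, raw value in the tenth); `parse_temperature` is a hypothetical helper name, and other drives may report temperature under a different attribute or format.

```python
def parse_temperature(smart_output):
    """Return the raw value of SMART attribute 194 as an int, or None.

    Assumes whitespace-separated columns as in the table above:
    ID, name, flag, value, worst, thresh, type, updated, when_failed, raw.
    """
    for line in smart_output.splitlines():
        fields = line.split()
        if len(fields) >= 10 and fields[0] == "194":
            return int(fields[9])
    return None


if __name__ == "__main__":
    sample = (
        "193 Load_Cycle_Count 0x0012 097 097 000 Old_age Always - 3875\n"
        "194 Temperature_Celsius 0x0002 153 153 000 Old_age Always - 39 (Min/Max 15/47)\n"
    )
    print(parse_temperature(sample))
```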