SOLVED smartd is not sending an alert/email

Status
Not open for further replies.

Juggernaut

Cadet
Joined
Dec 27, 2018
Messages
1
Hi guys,

I have an issue with smartd and I can't seem to find the solution myself so I am asking here.

I am running FreeNAS 11.2 Release
4x 4TB Seagate IronWolf


My issue is that after setting the critical HDD temperature in Services > SMART to a value lower than the current HDD temperature, I am not getting any email alerts, or any alerts on the FreeNAS dashboard, about the temperature being too high.

However, /var/log/daemon.log shows that smartd does detect the condition:

Code:
DEC 27 23:30:21 freenas smartd[77014]: Device: /dev/ada3, Temperature 26 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 23:30:21 freenas smartd[77014]: Device: /dev/ada2, Temperature 25 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 23:30:21 freenas smartd[77014]: Device: /dev/ada2, Temperature 25 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 23:30:21 freenas smartd[77014]: Device: /dev/ada1, Temperature 26 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 23:30:21 freenas smartd[77014]: Device: /dev/ada1, Temperature 26 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 23:30:21 freenas smartd[77014]: Device: /dev/ada0, Temperature 23 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 23:30:21 freenas smartd[77014]: Device: /dev/ada0, Temperature 24 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada3, Temperature 26 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada3, Temperature 26 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada2, Temperature 25 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada2, Temperature 25 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada1, Temperature 26 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada1, Temperature 26 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada0, Temperature 23 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)
DEC 27 00:00:21 freenas smartd[77014]: Device: /dev/ada0, Temperature 24 Celsius reached critical limit of 20 Celsius(Min/Max 26/26)


It looks like smartd is running every 30 minutes, as it is set to in Services > SMART, but it is not sending an alert or mail.

Now if I manually start smartd with
smartd -d -i 1800
Code:
smartd 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

Opened configuration file /usr/local/etc/smartd.conf
Configuration file /usr/local/etc/smartd.conf parsed.
Device: /dev/ada3, opened
Device: /dev/ada3, ST4000VN008-2DR166, S/N:ZGY3DQL1, WWN:5-000c50-0b39e95a4, FW:SC60, 4.00 TB
Device: /dev/ada3, found in smartd database: Seagate IronWolf
Device: /dev/ada3, is SMART capable. Adding to "monitor" list.
Device: /dev/ada3, opened
Device: /dev/ada3, ST4000VN008-2DR166, S/N:ZGY3DQL1, WWN:5-000c50-0b39e95a4, FW:SC60, 4.00 TB
Device: /dev/ada3, found in smartd database: Seagate IronWolf
Device: /dev/ada3, is SMART capable. Adding to "monitor" list.
Device: /dev/ada2, opened
Device: /dev/ada2, ST4000VN008-2DR166, S/N:ZDH5FBA0, WWN:5-000c50-0b3b73d88, FW:SC60, 4.00 TB
Device: /dev/ada2, found in smartd database: Seagate IronWolf
Device: /dev/ada2, is SMART capable. Adding to "monitor" list.
Device: /dev/ada2, opened
Device: /dev/ada2, ST4000VN008-2DR166, S/N:ZDH5FBA0, WWN:5-000c50-0b3b73d88, FW:SC60, 4.00 TB
Device: /dev/ada2, found in smartd database: Seagate IronWolf
Device: /dev/ada2, is SMART capable. Adding to "monitor" list.
Device: /dev/ada1, opened
Device: /dev/ada1, ST4000VN008-2DR166, S/N:ZDH5FBA5, WWN:5-000c50-0b3b737f7, FW:SC60, 4.00 TB
Device: /dev/ada1, found in smartd database: Seagate IronWolf
Device: /dev/ada1, is SMART capable. Adding to "monitor" list.
Device: /dev/ada1, opened
Device: /dev/ada1, ST4000VN008-2DR166, S/N:ZDH5FBA5, WWN:5-000c50-0b3b737f7, FW:SC60, 4.00 TB
Device: /dev/ada1, found in smartd database: Seagate IronWolf
Device: /dev/ada1, is SMART capable. Adding to "monitor" list.
Device: /dev/ada0, opened
Device: /dev/ada0, ST4000VN008-2DR166, S/N:ZDH5FHP4, WWN:5-000c50-0b3b72b1f, FW:SC60, 4.00 TB
Device: /dev/ada0, found in smartd database: Seagate IronWolf
Device: /dev/ada0, is SMART capable. Adding to "monitor" list.
Device: /dev/ada0, opened
Device: /dev/ada0, ST4000VN008-2DR166, S/N:ZDH5FHP4, WWN:5-000c50-0b3b72b1f, FW:SC60, 4.00 TB
Device: /dev/ada0, found in smartd database: Seagate IronWolf
Device: /dev/ada0, is SMART capable. Adding to "monitor" list.
Monitoring 8 ATA/SATA, 0 SCSI/SAS and 0 NVMe devices
Device: /dev/ada3, opened ATA device
Device: /dev/ada3, initial Temperature is 26 Celsius (Min/Max ??/26)
Device: /dev/ada3, Temperature 26 Celsius reached critical limit of 20 Celsius (Min/Max ??/26)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful
Device: /dev/ada3, opened ATA device
Device: /dev/ada3, initial Temperature is 26 Celsius (Min/Max ??/26)
Device: /dev/ada3, Temperature 26 Celsius reached critical limit of 20 Celsius (Min/Max ??/26)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful
Device: /dev/ada2, opened ATA device
Device: /dev/ada2, initial Temperature is 25 Celsius (Min/Max ??/25)
Device: /dev/ada2, Temperature 25 Celsius reached critical limit of 20 Celsius (Min/Max ??/25)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful
Device: /dev/ada2, opened ATA device
Device: /dev/ada2, initial Temperature is 25 Celsius (Min/Max ??/25)
Device: /dev/ada2, Temperature 25 Celsius reached critical limit of 20 Celsius (Min/Max ??/25)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful
Device: /dev/ada1, opened ATA device
Device: /dev/ada1, initial Temperature is 26 Celsius (Min/Max ??/26)
Device: /dev/ada1, Temperature 26 Celsius reached critical limit of 20 Celsius (Min/Max ??/26)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful
Device: /dev/ada1, opened ATA device
Device: /dev/ada1, initial Temperature is 26 Celsius (Min/Max ??/26)
Device: /dev/ada1, Temperature 26 Celsius reached critical limit of 20 Celsius (Min/Max ??/26)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful
Device: /dev/ada0, opened ATA device
Device: /dev/ada0, initial Temperature is 23 Celsius (Min/Max ??/23)
Device: /dev/ada0, Temperature 23 Celsius reached critical limit of 20 Celsius (Min/Max ??/23)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful
Device: /dev/ada0, opened ATA device
Device: /dev/ada0, initial Temperature is 23 Celsius (Min/Max ??/23)
Device: /dev/ada0, Temperature 23 Celsius reached critical limit of 20 Celsius (Min/Max ??/23)
Sending warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com ...
Warning via /usr/local/www/freenasUI/tools/smart_alert.py to MYMAIL@gmail.com: successful


According to this output it should work, but it doesn't.
Earlier today, after running this exact command, I got an alert for each HDD and a mail shortly after. Now, after running the command again to copy the output to my clipboard, I'm not getting a mail or a dashboard alert at all.

Mail alerts generally work; for example, I just got the mail that there is an update to 11.2 U1.
I'm not sure where to go from here to fix this.

EDIT: After looking at smartd.conf and reading the man page, I found this bit about the -W option:
"The warning email counter is reset if the temperature dropped below INFO or CRIT-5 if INFO is not specified."

My info value was set higher than the critical temperature. I didn't think this would stop smartd from sending an alert, but apparently it does.

I changed info to 0 and left crit at 20 for testing, and got an email a few minutes later, so I assume everything is now working as I want it to.
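
For reference, here is a rough sketch of what a smartd.conf entry with those test values could look like. The line FreeNAS actually generated is not shown in this thread, so the -a directive and the DIFF value of 0 are assumptions. The mail address and script path are taken from the smartd output above. The directive takes the form -W DIFF,INFO,CRIT, and a value of 0 disables that report.

Code:
# Hypothetical entry, not copied from the real /usr/local/etc/smartd.conf
# -W DIFF,INFO,CRIT : DIFF=0 and INFO=0 are disabled, CRIT=20 is the test limit
# -m / -M exec      : address and alert script as seen in the smartd output
/dev/ada0 -a -W 0,0,20 -m MYMAIL@gmail.com -M exec /usr/local/www/freenasUI/tools/smart_alert.py

With INFO left at 0, the man page line quoted above puts the reset point for the warning email counter at CRIT-5, which is 15 Celsius for this test value.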
 