Upgraded to 11.1-RC1 and still getting Overheating errors

Status
Not open for further replies.

Ray Milyard

Patron
Joined
Aug 8, 2014
Messages
262
System Information
Hostname freenas.local ● Edit
Build FreeNAS-11.1-RC1
Platform Intel(R) Xeon(R) CPU E31275 @ 3.40GHz
Memory 16070MB
System Time Thu, 23 Nov 2017 07:38:37 -0700
Uptime 7:38AM up 23:09, 0 users

I was getting these errors on 11 U4 after installing an Intel 520x 10Gb NIC. I was this error was a bug and fixed in RC1 and will be on new release. I installed above version yesterday and still getting these errors late at night.
Code:
Nov 23 06:27:05 freenas daemon[3900]:	 2017/11/23 06:27:05 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:27:17 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:27:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:27:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:28:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:28:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:28:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:29:15 freenas daemon[3900]:	 2017/11/23 06:29:15 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:29:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:29:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:30:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:30:07 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:30:18 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:30:18 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:30:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:30:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:30:48 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:30:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:31:18 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:31:23 freenas daemon[3900]:	 2017/11/23 06:31:23 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:31:48 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:32:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:32:28 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:32:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:33:07 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:33:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:33:33 freenas daemon[3900]:	 2017/11/23 06:33:33 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:33:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:33:58 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:34:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:34:07 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:34:18 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:34:18 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:34:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:34:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:35:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:35:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:35:44 freenas daemon[3900]:	 2017/11/23 06:35:44 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:37:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:37:38 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:37:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:37:55 freenas daemon[3900]:	 2017/11/23 06:37:55 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:38:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:38:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:38:17 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:38:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:38:38 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:38:58 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:39:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:40:05 freenas daemon[3900]:	 2017/11/23 06:40:05 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:40:18 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:40:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:40:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:41:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:41:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:42:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:42:16 freenas daemon[3900]:	 2017/11/23 06:42:16 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:42:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:43:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:43:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:43:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:44:26 freenas daemon[3900]:	 2017/11/23 06:44:26 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:44:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:45:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:45:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:45:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:45:47 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:45:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:46:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:46:37 freenas daemon[3900]:	 2017/11/23 06:46:37 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:46:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:46:48 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:46:58 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:47:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:47:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:47:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:47:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:48:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:48:48 freenas daemon[3900]:	 2017/11/23 06:48:48 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:48:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:49:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:49:27 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:49:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:50:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:50:18 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:50:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:50:58 freenas daemon[3900]:	 2017/11/23 06:50:58 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:51:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:51:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:51:38 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:51:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:51:47 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:52:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:52:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:52:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:53:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:53:08 freenas daemon[3900]:	 2017/11/23 06:53:08 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:53:18 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:53:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:53:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:54:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:54:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:55:19 freenas daemon[3900]:	 2017/11/23 06:55:19 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:55:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:55:48 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:55:48 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:56:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:56:08 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:56:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:56:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:56:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:57:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:57:33 freenas daemon[3900]:	 2017/11/23 06:57:33 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:57:48 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:58:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:58:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:58:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:58:37 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:58:48 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:58:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:58:57 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 06:59:07 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:59:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:59:27 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:59:46 freenas daemon[3900]:	 2017/11/23 06:59:46 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 06:59:48 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 06:59:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:00:08 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:00:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:00:47 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:00:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:01:17 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:01:28 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:01:38 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:01:47 freenas coretemp5: critical temperature detected, suggest system shutdown
Nov 23 07:01:56 freenas daemon[3900]:	 2017/11/23 07:01:56 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:01:57 freenas coretemp4: critical temperature detected, suggest system shutdown
Nov 23 07:04:00 freenas daemon[3900]:	 2017/11/23 07:04:00 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:06:03 freenas daemon[3900]:	 2017/11/23 07:06:03 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:08:07 freenas daemon[3900]:	 2017/11/23 07:08:07 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:10:11 freenas daemon[3900]:	 2017/11/23 07:10:11 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:12:15 freenas daemon[3900]:	 2017/11/23 07:12:15 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:14:18 freenas daemon[3900]:	 2017/11/23 07:14:18 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:16:22 freenas daemon[3900]:	 2017/11/23 07:16:22 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:18:26 freenas daemon[3900]:	 2017/11/23 07:18:26 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:20:29 freenas daemon[3900]:	 2017/11/23 07:20:29 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:22:33 freenas daemon[3900]:	 2017/11/23 07:22:33 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:24:37 freenas daemon[3900]:	 2017/11/23 07:24:37 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:26:40 freenas daemon[3900]:	 2017/11/23 07:26:40 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:27:42 freenas kernel: arp: 10.0.1.100 moved from 02:4f:50:00:07:0a to 90:e2:ba:00:b8:2c on epair1b
Nov 23 07:28:44 freenas daemon[3900]:	 2017/11/23 07:28:44 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:30:48 freenas daemon[3900]:	 2017/11/23 07:30:48 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:32:52 freenas daemon[3900]:	 2017/11/23 07:32:52 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:34:55 freenas daemon[3900]:	 2017/11/23 07:34:55 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:36:59 freenas daemon[3900]:	 2017/11/23 07:36:59 [WARN] agent: Check 'service:nas-health' is now warning
Nov 23 07:39:03 freenas daemon[3900]:	 2017/11/23 07:39:03 [WARN] agent: Check 'service:nas-health' is now warning
 
Last edited by a moderator:

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,994
I'd like to get some clarification if you don't mind.

1) You didn't have this problem until you installed the new NIC card?

2) Have you verified all fans are spinning properly?

3) What motherboard do you have and can you check the CPU temperature via the motherboard?

4) Have you checked the CPU temperature via sysctl -a | grep temper ?

5) Does the new NIC get really hot? If so maybe it is the cause and coretemp could be misreading it as the problem. Place a fan near your system to blow air across the NIC to cool it. This is only a troubleshooting step to try and isolate the issue.

6) If you remove the new NIC, does the problem go away?
 

Ray Milyard

Patron
Joined
Aug 8, 2014
Messages
262
1) You didn't have this problem until you installed the new NIC card? Correct when just using built in NICs or Intel PCI card with 2 NICs not having issue.

2) Have you verified all fans are spinning properly? Yes all fans are working fine.

3) What motherboard do you have and can you check the CPU temperature via the motherboard? Board is Intel S1200BTL.

4) Have you checked the CPU temperature via sysctl -a | grep temper ?
[root@freenas ~]# sysctl -a | grep temper hw.acpi.thermal.tz1.temperature: 29.9C hw.acpi.thermal.tz0.temperature: 27.9C dev.cpu.7.temperature: 59.0C dev.cpu.6.temperature: 59.0C dev.cpu.5.temperature: 63.0C dev.cpu.4.temperature: 63.0C dev.cpu.3.temperature: 56.0C dev.cpu.2.temperature: 57.0C dev.cpu.1.temperature: 59.0C dev.cpu.0.temperature: 59.0C [root@freenas ~]#

After had to reboot server

5) Does the new NIC get really hot? If so maybe it is the cause and coretemp could be misreading it as the problem. Place a fan near your system to blow air across the NIC to cool it. This is only a troubleshooting step to try and isolate the issue. Right now doesn't seem very hot. I haven't tried anything else as of yet. Issue only seems happen from 2-5am in mornings. Right now house windows are open and pretty cold in house so not thinking ambient heat issue.

Been running U4 for sometime then installed the 10gb NIC and issue started. Was told fixed in RC1 so installed yesterday hoping fixed it.

6) If you remove the new NIC, does the problem go away?[/QUOTE] Yes. If take card out seems fine.
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,994
Been running U4 for sometime then installed the 10gb NIC and issue started. Was told fixed in RC1 so installed yesterday hoping fixed it.
You might try switching to the FreeNAS-11-Nightlies, it has all the latest bug fixes. Someone could have mispoken when they said RC1.

ssue only seems happen from 2-5am in mornings.
Do you know what is going on? Doing a backup or something?

And you are really only at ~27C according to the motherboard thermal sensors. What I'm conserned about is you have something really using your CPU heavily maybe to crunch data before it can be transferred over your new NIC. Even if the nightlies make the alarm indication clear, double check the temp values using the sysctl command, ensure the CPU temps really are reporting properly.
 

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,080
The core temp reading that this system is complaining about could be the NIC. Those 10-gig NICs have big heat sinks on them but they're also intended to be used in servers that have high airflow. Probably put a fan near the NIS to cool it down and see if that makes the error go away.

Sent from my SAMSUNG-SGH-I537 using Tapatalk
 
Status
Not open for further replies.
Top