System randomly shuts down

TheThomen · Jan 10, 2023

So I migrated from core to scale on Saturday was working fine all the time until yesterday I decided to upgrade to 64gb DDR3 ECC fully registered memory.
I'm currently running MemTest86 and am awaiting completion, I have decided to attach my log files below I cant seem to find any issues that warrent a power down.

The issue started when i hopped in bed put plex on 10 mins in bam shutdown, no errors on starting back up and checking truenas it just states unscheduled reboot.

I'm hoping its the new ram as this would make life simpler but I have a feeling it wont be...

Log attached
System specs

R320 E5-2450L 64gb ram
Netapp Shelf with 12x 4tb drives.
connected to a UPS that helps clean power too

this has worked for 4 years fine not a single hiccup on core.

I greatly appreciate any help

sretalla · Jan 10, 2023

seems to report issues with sdu and sdq devices. does that make any sense to you?

TheThomen said:
this has worked for 4 years fine not a single hiccup on core.

unfortunately that makes absolutely no difference to the completely different OS and drivers you're running there... the FreeBSD drivers (although much narrower range of hardware supported) in CORE typically have a huge number more hours of testing in them.

TheThomen · Jan 10, 2023

sretalla said:
seems to report issues with sdu and squ devices. does that make any sense to you?

unfortunately that makes absolutely no difference to the completely different OS and drivers you're running there... the FreeBSD drivers (although much narrower range of hardware supported) in CORE typically have a huge number more hours of testing in them.

Hi,

Thanks for the reply. I dont understand what this indicates please can you explain like I'm five haha

sretalla · Jan 10, 2023

There are events indicating issues with 2 devices that look like disks as far as I can tell:

Code:

Jan  9 23:29:58 stratum kernel: sd 8:0:15:0: [sdu] tag#1609 Sense Key : Recovered Error [current]
Jan  9 23:29:58 stratum kernel: sd 8:0:15:0: [sdu] tag#1609 Add. Sense: No additional sense information
Jan  9 23:29:58 stratum kernel: sd 8:0:11:0: [sdq] tag#1035 Sense Key : Recovered Error [current]
Jan  9 23:29:58 stratum kernel: sd 8:0:11:0: [sdq] tag#1035 Add. Sense: No additional sense information

Those devices appear to be identified as sdu (looks like SCSI 8:0:15:0) and sdq (looks like SCSI 8:0:11:0), so maybe that's the hint to check those disks and their connections.

Note I corrected my incorrect typing of sdu...

TheThomen · Jan 10, 2023

I think those drives are duds i have yet to remove and are not in use. all my pools report healthy
Has to be something else i doubt drives could cause a system to turn off?

TheThomen · Jan 10, 2023

going to try and update bios etc see if this fixes it but seems to be mem related

sretalla · Jan 10, 2023

If the pool housing the system dataset has an issue, then there can be bad consequences for the system, so don't rule that out.

Are you saying that those disks aren't in any pool?

Also, those CPU voltage warnings don't look great...

Best of luck with the BIOS update.

Important Announcement for the TrueNAS Community.

System randomly shuts down

TheThomen

Dabbler

Attachments

sretalla

Powered by Neutrality

TheThomen

Dabbler

sretalla

Powered by Neutrality

TheThomen

Dabbler

TheThomen

Dabbler

sretalla

Powered by Neutrality

Similar threads

Important Announcement for the TrueNAS Community.

System randomly shuts down

TheThomen

Dabbler

Attachments

sretalla

Powered by Neutrality

TheThomen

Dabbler

sretalla

Powered by Neutrality

TheThomen

Dabbler

TheThomen

Dabbler

sretalla

Powered by Neutrality

Important Announcement for the TrueNAS Community.

Related topics on forums.truenas.com for thread: "System randomly shuts down"

Similar threads