System randomly shuts down

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18
So I migrated from core to scale on Saturday was working fine all the time until yesterday I decided to upgrade to 64gb DDR3 ECC fully registered memory.
I'm currently running MemTest86 and am awaiting completion, I have decided to attach my log files below I cant seem to find any issues that warrent a power down.

The issue started when i hopped in bed put plex on 10 mins in bam shutdown, no errors on starting back up and checking truenas it just states unscheduled reboot.

I'm hoping its the new ram as this would make life simpler but I have a feeling it wont be...


Log attached
System specs

R320 E5-2450L 64gb ram
Netapp Shelf with 12x 4tb drives.
connected to a UPS that helps clean power too

this has worked for 4 years fine not a single hiccup on core.

I greatly appreciate any help
 

Attachments

  • messages.txt
    1.7 MB · Views: 84

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
seems to report issues with sdu and sdq devices. does that make any sense to you?

this has worked for 4 years fine not a single hiccup on core.
unfortunately that makes absolutely no difference to the completely different OS and drivers you're running there... the FreeBSD drivers (although much narrower range of hardware supported) in CORE typically have a huge number more hours of testing in them.
 
Last edited:

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18
seems to report issues with sdu and squ devices. does that make any sense to you?


unfortunately that makes absolutely no difference to the completely different OS and drivers you're running there... the FreeBSD drivers (although much narrower range of hardware supported) in CORE typically have a huge number more hours of testing in them.
Hi,

Thanks for the reply. I dont understand what this indicates please can you explain like I'm five haha
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
There are events indicating issues with 2 devices that look like disks as far as I can tell:

Code:
Jan  9 23:29:58 stratum kernel: sd 8:0:15:0: [sdu] tag#1609 Sense Key : Recovered Error [current]
Jan  9 23:29:58 stratum kernel: sd 8:0:15:0: [sdu] tag#1609 Add. Sense: No additional sense information
Jan  9 23:29:58 stratum kernel: sd 8:0:11:0: [sdq] tag#1035 Sense Key : Recovered Error [current]
Jan  9 23:29:58 stratum kernel: sd 8:0:11:0: [sdq] tag#1035 Add. Sense: No additional sense information


Those devices appear to be identified as sdu (looks like SCSI 8:0:15:0) and sdq (looks like SCSI 8:0:11:0), so maybe that's the hint to check those disks and their connections.

Note I corrected my incorrect typing of sdu...
 

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18
I think those drives are duds i have yet to remove and are not in use. all my pools report healthy
Has to be something else i doubt drives could cause a system to turn off?
 

TheThomen

Dabbler
Joined
Jan 26, 2021
Messages
18
1673357685220.png


going to try and update bios etc see if this fixes it but seems to be mem related
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
If the pool housing the system dataset has an issue, then there can be bad consequences for the system, so don't rule that out.

Are you saying that those disks aren't in any pool?

Also, those CPU voltage warnings don't look great...

Best of luck with the BIOS update.
 
Top