Hello, I've recently acquired a FreeNAS server and am putting it through some tests before trusting it in the home lab. My first clue that something might be amiss was an apparent reboot during the first night: the next morning, the reported uptime started at ~5 AM. Searching online brought me to the forums and pointed me toward running SMART tests; more on those in a moment.
Questions:
1. Can the console output be found in any system logs that I can access for future diagnostics? Currently I only see this with a monitor plugged in.
2. I've run smartctl -x on all my drives and get different output from all the posts I've seen. I'm not seeing the output table titled "SMART Self-test log structure revision X" etc. I suspect smartctl may need a device type specified, but the LSI 9210 isn't listed in the smartctl manual. This is a wild hypothesis formulated in ignorance; suggestions welcome. What I do see is shown further below.

The terminal monitor shows the following output.
Code:
(da6:mps0:0:6:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered read error)
(da6:mps0:0:6:0): Info: 0x7640afc8
(da6:mps0:0:6:0): Field Replaceable Unit: 129
(da6:mps0:0:6:0): Actual Retry Count: 157
(da6:mps0:0:6:0): Error 5, Unretryable error
(da6:mps0:0:6:0): READ(10). CDB: 28 00 76 40 af f8 00 00 08 00
(da6:mps0:0:6:0): CAM status: SCSI Status Error
(da6:mps0:0:6:0): SCSI status: Check Condition
(da6:mps0:0:6:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered read error)
(da6:mps0:0:6:0): Info: 0x7640aff9
(da6:mps0:0:6:0): Field Replaceable Unit: 129
(da6:mps0:0:6:0): Actual Retry Count: 157
(da6:mps0:0:6:0): Error 5, Unretryable error
(da6:mps0:0:6:0): READ(10). CDB: 28 00 76 40 b0 10 00 00 08 00
(da6:mps0:0:6:0): CAM status: SCSI Status Error
(da6:mps0:0:6:0): SCSI status: Check Condition
(da6:mps0:0:6:0): SCSI sense: MEDIUM ERROR asc:18,5 (Recovered data - recommend reassignment)
(da6:mps0:0:6:0): Info: 0x7640b014
(da6:mps0:0:6:0): Field Replaceable Unit: 1
(da6:mps0:0:6:0): Actual Retry Count: 17
(da6:mps0:0:6:0): READ(10). CDB: 28 00 76 40 af f8 00 00 08 00
(da6:mps0:0:6:0): CAM status: SCSI Status Error
(da6:mps0:0:6:0): SCSI status: Check Condition
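For anyone trying to read the console messages above, here is a minimal Python sketch of how the READ(10) CDB bytes could be decoded. It relies on the standard READ(10) layout (opcode 0x28, bytes 2-5 the starting LBA, bytes 7-8 the transfer length, all big-endian). Treating the "Info:" value as the LBA of the failing block is my assumption about how the sense data is reported, and the function names `decode_read10` and `lba_in_request` are just made up for illustration.

```python
def decode_read10(cdb_hex):
    """Decode a SCSI READ(10) CDB given as space-separated hex bytes.

    Returns (starting_lba, transfer_length_in_blocks).
    """
    cdb = bytes.fromhex(cdb_hex.replace(" ", ""))
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a READ(10) CDB")
    lba = int.from_bytes(cdb[2:6], "big")     # bytes 2-5: starting LBA
    length = int.from_bytes(cdb[7:9], "big")  # bytes 7-8: block count
    return lba, length


def lba_in_request(cdb_hex, info_lba):
    """Check whether info_lba falls inside the range the CDB asked for.

    Assumption: the 'Info:' value on the console is the failing LBA.
    """
    lba, length = decode_read10(cdb_hex)
    return lba <= info_lba < lba + length


if __name__ == "__main__":
    # CDB and Info values copied from the console output in this post.
    cdb = "28 00 76 40 af f8 00 00 08 00"
    lba, length = decode_read10(cdb)
    print(f"start LBA {lba:#x}, {length} blocks")
    print("Info 0x7640aff9 inside request:", lba_in_request(cdb, 0x7640AFF9))
```

If that reading is right, the failing reads are 8-block requests (4 KiB with 512-byte logical blocks) clustered within a few dozen LBAs of each other on da6.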
Here's the output of smartctl -x /dev/da0.
Code:
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Vendor: SEAGATE
Product: ST32000445SS
Revision: MS02
User Capacity: 2,000,398,934,016 bytes [2.00 TB]
Logical block size: 512 bytes
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Logical Unit id: 0x5000c50034da11b7
Serial number: 9WM7CCZ40000920486P4
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Tue Apr 9 19:36:01 2019 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled
Read Cache is: Enabled
Writeback Cache is: Disabled
=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Current Drive Temperature: 35 C
Drive Trip Temperature: 68 C
Manufactured in week 33 of year 2011
Specified cycle count over device lifetime: 10000
Accumulated start-stop cycles: 44
Specified load-unload count over device lifetime: 300000
Accumulated load-unload cycles: 44
Elements in grown defect list: 0
Vendor (Seagate) cache information
Blocks sent to initiator = 4098019783
Blocks received from initiator = 786283223
Blocks read from cache and sent to initiator = 1061378457
Number of read and write commands whose size <= segment size = 132265888
Number of read and write commands whose size > segment size = 48163164
Vendor (Seagate/Hitachi) factory information
number of hours powered up = 17845.47
number of minutes until next internal SMART test = 22
Error counter log:
           Errors Corrected by            Total    Correction     Gigabytes    Total
               ECC          rereads/     errors    algorithm      processed    uncorrected
           fast | delayed   rewrites   corrected   invocations   [10^9 bytes]  errors
read:   3666938790        1         0  3666938791   3666938791     145034.698           0
write:           0        0         0           0            0       2628.822           0
verify:     134904        0         0      134904       134904          0.000           0
Non-medium error count: 26
No self-tests have been logged
Background scan results log
Status: scan is active
Accumulated power on time, hours:minutes 17845:28 [1070728 minutes]
Number of background scans performed: 115, scan progress: 72.55%
Number of background medium scans performed: 22288
# when lba(hex) [sk,asc,ascq] reassign_status
1 6432:27 00000000e8d09238 [1,17,1] Recovered via rewrite in-place
2 9211:57 000000009284b09e [1,17,1] Recovered via rewrite in-place
3 10126:32 00000000e8d097b2 [1,17,1] Recovered via rewrite in-place
snip
29 17673:56 00000000c8bacccd [1,17,1] Recovered via rewrite in-place
30 17836:43 000000001d54bcac [1,17,1] Recovered via rewrite in-place
Protocol Specific port log page for SAS SSP
relative target port id = 1
generation code = 0
number of phys = 1
phy identifier = 0
attached device type: SAS or SATA device
attached reason: power on
reason: power on
negotiated logical link rate: phy enabled; 6 Gbps
attached initiator port: ssp=1 stp=1 smp=1
attached target port: ssp=0 stp=0 smp=0
SAS address = 0x5000c50034da11b5
attached SAS address = 0x500605b0056c8fb0
attached phy identifier = 0
Invalid DWORD count = 0
Running disparity error count = 0
Loss of DWORD synchronization = 0
Phy reset problem = 0
Phy event descriptors:
Invalid word count: 0
Running disparity error count: 0
Loss of dword synchronization count: 0
Phy reset problem count: 0
relative target port id = 2
generation code = 0
number of phys = 1
phy identifier = 1
attached device type: no device attached
attached reason: unknown
reason: unknown
negotiated logical link rate: phy enabled; 1.5 Gbps
attached initiator port: ssp=0 stp=0 smp=0
attached target port: ssp=0 stp=0 smp=0
SAS address = 0x5000c50034da11b6
attached SAS address = 0x0
attached phy identifier = 0
Invalid DWORD count = 0
Running disparity error count = 0
Loss of DWORD synchronization = 0
Phy reset problem = 0
Phy event descriptors:
Invalid word count: 0
Running disparity error count: 0
Loss of dword synchronization count: 0
Phy reset problem count: 0
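To make the background scan results log in the smartctl output above easier to reason about, here is a rough Python sketch that parses the rows into numbers. It assumes the row layout is exactly as pasted (index, hours:minutes of power-on time, hex LBA, bracketed sense triple, status text). The name `parse_scan_rows` is hypothetical, and the sense triple is kept as raw strings since I'm not sure how it is encoded.

```python
import re

# Row layout assumed from the pasted output:
#   index  hours:minutes  lba(hex)  [sk,asc,ascq]  reassign_status
ROW = re.compile(
    r"^\s*(\d+)\s+(\d+):(\d+)\s+([0-9a-fA-F]{16})\s+"
    r"\[([^,\]]+),([^,\]]+),([^\]]+)\]\s+(.*)$"
)

# Rows copied from this post; the 'snip' line is skipped by the parser.
SAMPLE = """\
1 6432:27 00000000e8d09238 [1,17,1] Recovered via rewrite in-place
2 9211:57 000000009284b09e [1,17,1] Recovered via rewrite in-place
3 10126:32 00000000e8d097b2 [1,17,1] Recovered via rewrite in-place
snip
29 17673:56 00000000c8bacccd [1,17,1] Recovered via rewrite in-place
30 17836:43 000000001d54bcac [1,17,1] Recovered via rewrite in-place
"""


def parse_scan_rows(text):
    """Return one dict per background-scan row found in text."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line)
        if not m:
            continue  # header, 'snip', or unrelated line
        idx, hours, minutes, lba_hex, sk, asc, ascq, status = m.groups()
        rows.append({
            "index": int(idx),
            "power_on_minutes": int(hours) * 60 + int(minutes),
            "lba": int(lba_hex, 16),
            "sense": (sk, asc, ascq),  # kept as raw strings
            "status": status.strip(),
        })
    return rows


if __name__ == "__main__":
    now = 17845 * 60 + 28  # "Accumulated power on time" from the same log
    for row in parse_scan_rows(SAMPLE):
        age = now - row["power_on_minutes"]
        print(row["index"], hex(row["lba"]), f"{age} minutes ago", row["status"])
```

Comparing each row's power-on time with the accumulated power-on time shows how recently the drive's own background scan last had to rewrite a sector.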
I'm also running badblocks on all the drives, and that may be finished tomorrow.