Hello, I've recently acquired a FreeNAS server and am busy putting it through some tests before trusting it in the home lab. My first clue something may be amiss was an apparent reboot the first night since the uptime reported starting at ~5AM the morning after. Searching online brought me to the forums and pointed me to running smart tests, more on those in a moment.
Questions:
1. Can the console output be found in any system logs that I can access for future diagnostics? Currently I only see this with a monitor plugged in.
2. I've run
The terminal monitor shows the following output.
Here's the output of
I'm also running
Questions:
1. Can the console output be found in any system logs that I can access for future diagnostics? Currently I only see this with a monitor plugged in.
2. I've run
smartctl -x
on all my drives and have different output than all the posts I've seen. I'm not seeing the output table titled "SMART Self-test log structure revision X" etc. I suspect it could be smartctl needs a device specified but the LSI 9210 isn't listed in the smartctl manual. This is a wild hypothesis formulated in ignorance, suggestions welcome. What I do see is below:The terminal monitor shows the following output.
Code:
(da6:mps0:0:6:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered read error) (da6:mps0:0:6:0): Info: 0x7640afc8 (da6:mps0:0:6:0): Field Replaceable Unit: 129 (da6:mps0:0:6:0): Actual Retry Count: 157 (da6:mps0:0:6:0): Error 5, Unretryable error (da6:mps0:0:6:0): READ(10). CDB: 28 00 76 40 af f8 00 00 08 00 (da6:mps0:0:6:0): CAM status: SCSI Status Error (da6:mps0:0:6:0): SCSI status: Check Condition (da6:mps0:0:6:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered read error) (da6:mps0:0:6:0): Info: 0x7640aff9 (da6:mps0:0:6:0): Field Replaceable Unit: 129 (da6:mps0:0:6:0): Actual Retry Count: 157 (da6:mps0:0:6:0): Error 5, Unretryable error (da6:mps0:0:6:0): READ(10). CDB: 28 00 76 40 b0 10 00 00 08 00 (da6:mps0:0:6:0): CAM status: SCSI Status Error (da6:mps0:0:6:0): SCSI status: Check Condition (da6:mps0:0:6:0): SCSI sense: MEDIUM ERROR asc:18,5 (Recovered data - recommend reassignment) (da6:mps0:0:6:0): Info: 0x7640b014 (da6:mps0:0:6:0): Field Replaceable Unit: 1 (da6:mps0:0:6:0): Actual Retry Count: 17 (da6:mps0:0:6:0): READ(10). CDB: 28 00 76 40 af f8 00 00 08 00 (da6:mps0:0:6:0): CAM status: SCSI Status Error (da6:mps0:0:6:0): SCSI status: Check Condition
Here's the output of
smartctl -x /dev/da0
.Code:
smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.2-STABLE amd64] (local build) Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Vendor: SEAGATE Product: ST32000445SS Revision: MS02 User Capacity: 2,000,398,934,016 bytes [2.00 TB] Logical block size: 512 bytes Rotation Rate: 7200 rpm Form Factor: 3.5 inches Logical Unit id: 0x5000c50034da11b7 Serial number: 9WM7CCZ40000920486P4 Device type: disk Transport protocol: SAS (SPL-3) Local Time is: Tue Apr 9 19:36:01 2019 PDT SMART support is: Available - device has SMART capability. SMART support is: Enabled Temperature Warning: Enabled Read Cache is: Enabled Writeback Cache is: Disabled === START OF READ SMART DATA SECTION === SMART Health Status: OK Current Drive Temperature: 35 C Drive Trip Temperature: 68 C Manufactured in week 33 of year 2011 Specified cycle count over device lifetime: 10000 Accumulated start-stop cycles: 44 Specified load-unload count over device lifetime: 300000 Accumulated load-unload cycles: 44 Elements in grown defect list: 0 Vendor (Seagate) cache information Blocks sent to initiator = 4098019783 Blocks received from initiator = 786283223 Blocks read from cache and sent to initiator = 1061378457 Number of read and write commands whose size <= segment size = 132265888 Number of read and write commands whose size > segment size = 48163164 Vendor (Seagate/Hitachi) factory information number of hours powered up = 17845.47 number of minutes until next internal SMART test = 22 Error counter log: Errors Corrected by Total Correction Gigabytes Total ECC rereads/ errors algorithm processed uncorrected fast | delayed rewrites corrected invocations [10^9 bytes] errors read: 3666938790 1 0 3666938791 3666938791 145034.698 0 write: 0 0 0 0 0 2628.822 0 verify: 134904 0 0 134904 134904 0.000 0 Non-medium error count: 26 No self-tests have been logged Background scan results log Status: scan is active Accumulated power on time, hours:minutes 17845:28 [1070728 minutes] Number of background scans performed: 115, scan progress: 72.55% Number of background medium scans performed: 22288 # when lba(hex) [sk,asc,ascq] reassign_status 1 6432:27 00000000e8d09238 [1,17,1] Recovered via rewrite in-place 2 9211:57 000000009284b09e [1,17,1] Recovered via rewrite in-place 3 10126:32 00000000e8d097b2 [1,17,1] Recovered via rewrite in-place snip 29 17673:56 00000000c8bacccd [1,17,1] Recovered via rewrite in-place 30 17836:43 000000001d54bcac [1,17,1] Recovered via rewrite in-place Protocol Specific port log page for SAS SSP relative target port id = 1 generation code = 0 number of phys = 1 phy identifier = 0 attached device type: SAS or SATA device attached reason: power on reason: power on negotiated logical link rate: phy enabled; 6 Gbps attached initiator port: ssp=1 stp=1 smp=1 attached target port: ssp=0 stp=0 smp=0 SAS address = 0x5000c50034da11b5 attached SAS address = 0x500605b0056c8fb0 attached phy identifier = 0 Invalid DWORD count = 0 Running disparity error count = 0 Loss of DWORD synchronization = 0 Phy reset problem = 0 Phy event descriptors: Invalid word count: 0 Running disparity error count: 0 Loss of dword synchronization count: 0 Phy reset problem count: 0 relative target port id = 2 generation code = 0 number of phys = 1 phy identifier = 1 attached device type: no device attached attached reason: unknown reason: unknown negotiated logical link rate: phy enabled; 1.5 Gbps attached initiator port: ssp=0 stp=0 smp=0 attached target port: ssp=0 stp=0 smp=0 SAS address = 0x5000c50034da11b6 attached SAS address = 0x0 attached phy identifier = 0 Invalid DWORD count = 0 Running disparity error count = 0 Loss of DWORD synchronization = 0 Phy reset problem = 0 Phy event descriptors: Invalid word count: 0 Running disparity error count: 0 Loss of dword synchronization count: 0 Phy reset problem count: 0
I'm also running
badblocks
on all the drives and that may be finished tomorrow.