Alan W. Smtih
Explorer
- Joined
- Aug 30, 2014
- Messages
- 54
I haven't been able to find a concise "Here's how to burn-in a new drive and confirm it's safe as far as we can tell" checklist. (As in, there's already a FreeNAS box in production and you just want to spin up another temporary one to prep replacement disks.)
Between this forum (including this post, which you should read for a lot more detail) and the Wikipedia S.M.A.R.T. entry, I put together what I think is a core checklist:
(Assuming you're logged in as a user other than 'root', with one hypothetical drive at ada0 that has a stripe size of 4096.)
1. Set the GEOM debug flag that permits raw writes to the disk:
sudo sysctl kern.geom.debugflags=0x10
(If you reboot before the badblocks run completes, re-run this command.)
2. Run
sudo smartctl --scan
to get a list of drives (e.g. /dev/ada0).
3. Run
sudo diskinfo -v ada0
for the drive and note the number for stripesize, which is passed to -b in the next step.
4. Run
sudo badblocks -b 4096 -ws /dev/ada0
on the drive and allow it to complete (which can take a few days for disks that reach into TB). The -w option is a destructive write test, so only run it on a disk with no data you need.
5. Run
sudo smartctl -t long /dev/ada0
on the drive and allow it to complete (which takes several hours; you can track progress by running the command from the next step, sudo smartctl -x /dev/ada0 | less, and looking for the line with "% remaining").
6. Review the results of
sudo smartctl -x /dev/ada0 | less
and verify that each of the following items that appears (not all of them will) has a RAW_VALUE of 0 (zero).
Code:
| ID  | Name                                        |
|-----|---------------------------------------------|
| 5   | Reallocated Sectors Count                   |
| 10  | Spin Retry Count                            |
| 184 | End-to-End error / IOEDC                    |
| 187 | Reported Uncorrectable Errors               |
| 188 | Command Timeout                             |
| 196 | Reallocation Event Count                    |
| 197 | Current Pending Sector Count                |
| 198 | (Offline) Uncorrectable Sector Count        |
| 201 | Soft Read Error Rate or TA Counter Detected |
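For anyone who would rather not eyeball the attribute table, here is a minimal sketch of automating the last check. It assumes the smartctl output contains an attribute table whose header row starts with ID# and includes a RAW_VALUE column, and that the table ends at the first row that does not start with a number. The function name check_smart_zero is made up for this example, and it treats anything other than a literal 0 in the first RAW_VALUE token as a failure, which may be stricter than you want on some drives.

```shell
#!/bin/sh
# check_smart_zero (hypothetical helper): reads smartctl attribute output on
# stdin and prints each watched attribute whose RAW_VALUE is not exactly 0.
# Returns 0 if every watched attribute that appears is zero, 1 otherwise.
check_smart_zero() {
    awk '
    BEGIN {
        # Attribute IDs from the checklist table.
        n = split("5 10 184 187 188 196 197 198 201", ids, " ")
        for (i = 1; i <= n; i++) watch[ids[i]] = 1
        col = 0
        bad = 0
    }
    # The header row tells us which column holds RAW_VALUE.
    $1 == "ID#" {
        for (i = 1; i <= NF; i++) if ($i == "RAW_VALUE") col = i
        next
    }
    # Assumption: the first non-numeric row after the header ends the table.
    col && $1 !~ /^[0-9]+$/ { col = 0; next }
    # Inside the table: report watched attributes with a non-zero raw value.
    col && ($1 in watch) && $col != "0" {
        printf "FAIL %s %s raw=%s\n", $1, $2, $col
        bad = 1
    }
    END { exit bad }
    '
}
```

Usage would look something like: sudo smartctl -x /dev/ada0 | check_smart_zero && echo "watched attributes are all zero". It only checks the attributes that show up, so it is still worth reading the full output once by hand.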
Is there anything else that needs to be added?