philiplu
Explorer
- Joined
- Aug 10, 2014
- Messages
- 58
I've got 8 4TB WD Reds that have been spinning around 3 months now, running 9.2.1.7. They're all hooked up to a SuperMicro X10SL7-F. 6 of the drives are in RaidZ2 attached to the on-board LSI HBA. The other two are located in a SuperMicro CSE-M35T-1B 5-in-3 hot swap bay, with the bays communicating through the SATA 3Gbps chipset ports.
The 2 hot-swap drives have had their Load Cycle Counts increasing by roughly 100 a day (which works out to about every 15 minutes). The 6 LSI-connected drives have LCCs that stayed basically constant. I don't understand why the hot-swap drives are seeing 100 unloads a day, but I didn't worry about it too much.
But I just noticed that about 12 days ago, when the drives had about 1750 hours on them, the 6 LSI-connected drives started unloading 100 times a day as well. The system's been powered up 43 days, so there wasn't a reboot to explain the change in behavior. I checked /var/log/messages for that time period, and nothing out of the ordinary appears there. I also looked back through the Reporting graphs for Disk & CPU, and don't see anything weird there either. I don't recall anything strange happening on the system 12 days ago (wish I'd noticed sooner, when anything out of the ordinary would be easier to remember).
So does anyone have any idea why WD Reds sometimes go days, even weeks without unloading, other times unload regularly, and switch from one state to the other with no discernible reason?
I run a daily script that saves the output of smartctl -x for all the drives. I've attached three such logs. The one from 2014/11/04 shows the last log before the weird LCC increases started showing up everywhere. The 2014/11/05 log is the first one where the LCC increases appear on the LSI-connected drives. The 2014/11/16 is last night's log, just to show the LCC has been increasing consistently on all the WD Reds since it started. In the logs, da0 to da5 are the 6 LSI-connected drives, da6 & da7 are two mirrored SSDs, and ada0 and ada1 are the hot-swap bay drives which have had LCC increasing all along at the ~100/day rate.
The 2 hot-swap drives have had their Load Cycle Counts increasing by roughly 100 a day (which works out to about every 15 minutes). The 6 LSI-connected drives have LCCs that stayed basically constant. I don't understand why the hot-swap drives are seeing 100 unloads a day, but I didn't worry about it too much.
But I just noticed that about 12 days ago, when the drives had about 1750 hours on them, the 6 LSI-connected drives started unloading 100 times a day as well. The system's been powered up 43 days, so there wasn't a reboot to explain the change in behavior. I checked /var/log/messages for that time period, and nothing out of the ordinary appears there. I also looked back through the Reporting graphs for Disk & CPU, and don't see anything weird there either. I don't recall anything strange happening on the system 12 days ago (wish I'd noticed sooner, when anything out of the ordinary would be easier to remember).
So does anyone have any idea why WD Reds sometimes go days, even weeks without unloading, other times unload regularly, and switch from one state to the other with no discernible reason?
I run a daily script that saves the output of smartctl -x for all the drives. I've attached three such logs. The one from 2014/11/04 shows the last log before the weird LCC increases started showing up everywhere. The 2014/11/05 log is the first one where the LCC increases appear on the LSI-connected drives. The 2014/11/16 is last night's log, just to show the LCC has been increasing consistently on all the WD Reds since it started. In the logs, da0 to da5 are the 6 LSI-connected drives, da6 & da7 are two mirrored SSDs, and ada0 and ada1 are the hot-swap bay drives which have had LCC increasing all along at the ~100/day rate.