@pav - I feel like I'm hijacking your thread now. Memtest86+, great memory diagnostic tool (
www.memtest.org - NOT .COM). I use it on VMware ESXi servers with upwards of 128GB RAM, no issues. I usually let it run at least 72 hours on new hardware. Boot from a CD/USB and leave it alone, it auto starts. If it doesn't find something, it doesn't exist.
@noobsauce80 - Server board is Intel S3200SH, currently running Nov 29, 2010 BIOS update S3200X38.86B.00.00.0052. SMART tests run daily, both short and long via web GUI, odd and even drive IDs run at separate times. Never had a bad report or errors on POST. Wish I had the old version, this server is remote and I didn't think to have the person helping me through the upgrade write down the previous one. Although SATA cable is possible, I'll say unlikely as it was brand new. Scheduled cron reboots was my next step. Seems like error count might have been to blame along with what @joeschmuck says.
@joeschmuck - you're 100% correct, same drive every time and it took about 1 week to get there. It was always the last ID on the SATA controller. Camcontrol wouldn't report the drive anymore unless with verbose option and then it would leave the adaX number off the end of the line. HDD Model is WDC WD20EARX-32PASB0 51.0AB51 - 2 TB HDD. I'm striping two mirrored vdevs in my config, all the same model drive.
Example of camcontrol on the reportedly failed drive (note passX,adaX are missing from ()'s):
<WDC WD20EARX-32PASB0 51.0AB51> at scbus3 target 0 lun 0 ()
Other drives never reported the issue, although the OS would become unresponsive.