Well, I have a dilemma.
Recently I had a big problem with my WD60EFRX disks (WD Red-6TB).. I had a 9 disk array and replaced about 7 of them due to failures.. I tested every replacement with badblocks before confirming as safe..
Last two disks arrived from WDC Support ( which I had to send because local warranty was expired ).
I put them in my server and run badblocks for both of them.. One of them didn't even start counting. I thought this is bad... The other start counting from 0,01 and so on.. After about 20 hours I saw that one of them is finished first phase, and started second. The other one was still about 3 percent of first phase. When I checked the logs I saw there are lots of unretriable errors on that disk... After a long while I stopped the test on the bad(?) disk and checked and saw that the good disk has completed all the tests..
Then I thought maybe the disk is not bad ( it just came from WDC so it's new or certified to be good right ) but the cable could be the problem. So I switched the good disk's slot with bad disk's and started the test again on both disks.. Again the same result. The bad disk failed the test again while good disk completed the test.
I opened a case with WDC reporting my findings, and they suggested WD Disk Diag Tool on Windows first before sending the disk back again for replacement.
I connected the disk to Windows and started the tool first with quick test ( which I didn't think for a minute that there would be any errors ) and it passed. Then I started long test which lasted about 9,5 hours... In this test I thought it should find something at least but it also completed the test without any errors. I think these are same tests as "smartctl -t short" and "smartctl -t long".. I did start "smartctl -t long" on this disk in the NAS system when the badblocks failed but the test is interrupted on 70% due to some failure on the disk..
So as you can see I have a disk which refuses to run when connected to my NAS but works perfectly ( as far as smart test goes ) on Windows. Tonight I'll install a Linux system and test with Badblocks again on the desktop computer...
If it will pass the test like extended smart test what should I do?
In my NAS I'm using LSI 2008 controller and Intel expander to connect the disks, and there are 12 disks already working in the array while 7 disks are tested and currently on standby.. So I'm sure enough that there's nothing wrong with controller. I also checked the disk slot for errors as I wrote above and if a good disk passed in both slots then there should be nothing wrong with it.
Why do you think this disk refuses to run with my NAS system?!? :)
Recently I had a big problem with my WD60EFRX disks (WD Red-6TB).. I had a 9 disk array and replaced about 7 of them due to failures.. I tested every replacement with badblocks before confirming as safe..
Last two disks arrived from WDC Support ( which I had to send because local warranty was expired ).
I put them in my server and run badblocks for both of them.. One of them didn't even start counting. I thought this is bad... The other start counting from 0,01 and so on.. After about 20 hours I saw that one of them is finished first phase, and started second. The other one was still about 3 percent of first phase. When I checked the logs I saw there are lots of unretriable errors on that disk... After a long while I stopped the test on the bad(?) disk and checked and saw that the good disk has completed all the tests..
Then I thought maybe the disk is not bad ( it just came from WDC so it's new or certified to be good right ) but the cable could be the problem. So I switched the good disk's slot with bad disk's and started the test again on both disks.. Again the same result. The bad disk failed the test again while good disk completed the test.
I opened a case with WDC reporting my findings, and they suggested WD Disk Diag Tool on Windows first before sending the disk back again for replacement.
I connected the disk to Windows and started the tool first with quick test ( which I didn't think for a minute that there would be any errors ) and it passed. Then I started long test which lasted about 9,5 hours... In this test I thought it should find something at least but it also completed the test without any errors. I think these are same tests as "smartctl -t short" and "smartctl -t long".. I did start "smartctl -t long" on this disk in the NAS system when the badblocks failed but the test is interrupted on 70% due to some failure on the disk..
So as you can see I have a disk which refuses to run when connected to my NAS but works perfectly ( as far as smart test goes ) on Windows. Tonight I'll install a Linux system and test with Badblocks again on the desktop computer...
If it will pass the test like extended smart test what should I do?
In my NAS I'm using LSI 2008 controller and Intel expander to connect the disks, and there are 12 disks already working in the array while 7 disks are tested and currently on standby.. So I'm sure enough that there's nothing wrong with controller. I also checked the disk slot for errors as I wrote above and if a good disk passed in both slots then there should be nothing wrong with it.
Why do you think this disk refuses to run with my NAS system?!? :)