scurrier
I awoke this morning to a few SMART emails and an email saying my backup volume was degraded. I checked the logs and I am seeing a lot of this:
Code:
Jun 1 05:57:43 thumper kernel: (ada0:ahcich0:0:0:0): CAM status: Uncorrectable parity/CRC error
Jun 1 05:57:43 thumper kernel: (ada0:ahcich0:0:0:0): Retrying command
Jun 1 05:57:43 thumper kernel: (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 50 ec b4 40 64 00 00 01 00 00
Jun 1 05:57:43 thumper kernel: (ada0:ahcich0:0:0:0): CAM status: Uncorrectable parity/CRC error
These messages repeated for at least 6 minutes. Then...
Code:
Jun 1 06:04:23 thumper kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 1 06:04:53 thumper kernel: ahcich0: Timeout on slot 18 port 0
Jun 1 06:04:53 thumper kernel: ahcich0: is 00000000 cs 00040000 ss 00000000 rs 00040000 tfd 80 serr 00080000 cmd 0004d217
Jun 1 06:04:53 thumper kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Jun 1 06:04:53 thumper kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout
Jun 1 06:04:53 thumper kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
Jun 1 06:05:24 thumper kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 1 06:05:54 thumper kernel: ahcich0: Timeout on slot 18 port 0
Jun 1 06:05:54 thumper kernel: ahcich0: is 00000000 cs 00040000 ss 00000000 rs 00040000 tfd 80 serr 00080000 cmd 0004d217
Jun 1 06:05:54 thumper kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Jun 1 06:05:54 thumper kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout
Jun 1 06:05:54 thumper kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
Jun 1 06:05:54 thumper kernel: ada0 at ahcich0 bus 0 scbus1 target 0 lun 0
Jun 1 06:05:54 thumper kernel: ada0: <ST4000VN000-1H4168 SC43> s/n Z3015ENK detached
Jun 1 06:06:26 thumper kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 1 06:06:45 thumper kernel: ahcich0: Timeout on slot 18 port 0
Jun 1 06:06:45 thumper kernel: ahcich0: is 00000000 cs 00040000 ss 00000000 rs 00040000 tfd 80 serr 00080000 cmd 0004d217
Jun 1 06:06:45 thumper kernel: (ada0:ahcich0:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00
Jun 1 06:06:45 thumper kernel: (ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
Jun 1 06:06:45 thumper kernel: (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
Jun 1 06:06:47 thumper smartd[5540]: Device: /dev/ada0, failed to read SMART Attribute Data
Jun 1 06:07:26 thumper kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 1 06:07:38 thumper kernel: ahcich0: Poll timeout on slot 18 port 0
Jun 1 06:07:38 thumper kernel: ahcich0: is 00000000 cs 00040000 ss 00000000 rs 00040000 tfd 80 serr 00080000 cmd 0004d217
Jun 1 06:07:38 thumper kernel: (aprobe0:ahcich0:0:0:0): SOFT_RESET. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jun 1 06:07:38 thumper kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout
Jun 1 06:07:38 thumper kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retries exhausted
Jun 1 06:08:08 thumper kernel: ahcich0: Timeout on slot 18 port 0
Jun 1 06:08:08 thumper kernel: ahcich0: is 00000000 cs 000c0000 ss 000c0000 rs 000c0000 tfd 80 serr 00080000 cmd 0004d217
Jun 1 06:08:08 thumper kernel: (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 08 8b 55 40 69 00 00 01 00 00
Jun 1 06:08:08 thumper kernel: (ada0:ahcich0:0:0:0): CAM status: Command timeout
Jun 1 06:08:08 thumper kernel: (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
Jun 1 06:08:08 thumper kernel: (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 08 8c 55 40 69 00 00 01 00 00
Jun 1 06:08:08 thumper kernel: (ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
Jun 1 06:08:08 thumper kernel: (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
Jun 1 06:09:00 thumper kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 1 06:09:00 thumper kernel: ahcich0: Poll timeout on slot 19 port 0
Jun 1 06:09:00 thumper kernel: ahcich0: is 00000000 cs 00080000 ss 00000000 rs 00080000 tfd 80 serr 00080000 cmd 0004d317
Jun 1 06:09:00 thumper kernel: (aprobe0:ahcich0:0:0:0): SOFT_RESET. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jun 1 06:09:00 thumper kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout
Jun 1 06:09:00 thumper kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retries exhausted
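To get a feel for how the errors are distributed, the "CAM status" lines can be counted per device. This is only a rough sketch, assuming the log has one entry per line in the format shown above; the function name `tally_cam_status` and the sample list are made up for illustration and are not part of any FreeBSD tool.

```python
import re
from collections import Counter

# Matches e.g. "(ada0:ahcich0:0:0:0): CAM status: Command timeout"
# Assumption: the device tuple always has the form name:bus:n:n:n.
CAM_STATUS = re.compile(r"\((\w+):\w+:\d+:\d+:\d+\): CAM status: (.+)$")

def tally_cam_status(lines):
    """Count (device, status) pairs across an iterable of log lines."""
    counts = Counter()
    for line in lines:
        match = CAM_STATUS.search(line.rstrip())
        if match:
            counts[(match.group(1), match.group(2))] += 1
    return counts

# A few lines copied from the log above, used here as sample input.
sample = [
    "Jun 1 05:57:43 thumper kernel: (ada0:ahcich0:0:0:0): CAM status: Uncorrectable parity/CRC error",
    "Jun 1 05:57:43 thumper kernel: (ada0:ahcich0:0:0:0): Retrying command",
    "Jun 1 06:04:53 thumper kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout",
    "Jun 1 05:57:43 thumper kernel: (ada0:ahcich0:0:0:0): CAM status: Uncorrectable parity/CRC error",
]

for (device, status), n in tally_cam_status(sample).most_common():
    print(device, status, n)
```

On a real system the input would be the lines of the system log file rather than the hard-coded sample.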
So it looks like something was wrong with ada0. But both ada0 and ada1 (the other drive in the mirror) are missing from camcontrol devlist:
Code:
[root@thumper] ~# camcontrol devlist
<ATA ST4000VN000-1H41 SC43> at scbus0 target 0 lun 0 (da0,pass0)
<ATA ST4000VN000-1H41 SC43> at scbus0 target 1 lun 0 (da1,pass1)
<ATA ST4000VN000-1H41 SC43> at scbus0 target 2 lun 0 (da2,pass2)
<ATA ST4000VN000-1H41 SC43> at scbus0 target 3 lun 0 (da3,pass3)
<ATA WDC WD2500AAJS-7 1B02> at scbus0 target 4 lun 0 (da4,pass4)
<ATA Maxtor 6V300F0 1900> at scbus0 target 5 lun 0 (da5,pass5)
<ATA ST2000DL003-9VT1 CC32> at scbus0 target 6 lun 0 (da6,pass6)
<ATA ST4000VN000-1H41 SC43> at scbus0 target 7 lun 0 (da7,pass7)
<ST31500541AS CC35> at scbus3 target 0 lun 0 (ada2,pass10)
<ST31500541AS CC35> at scbus4 target 0 lun 0 (ada3,pass11)
<ST31500541AS CC35> at scbus5 target 0 lun 0 (ada4,pass12)
<ST31500541AS CC35> at scbus6 target 0 lun 0 (ada5,pass13)
<SanDisk Cruzer 1.27> at scbus8 target 0 lun 0 (pass14,da8)
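The same check can be scripted instead of eyeballed. Below is a minimal sketch, assuming each device entry in the `camcontrol devlist` output ends with a parenthesised pair such as `(ada2,pass10)`; the helper names `attached_devices` and `missing_devices` and the expected-device list are hypothetical, chosen only for this example.

```python
import re

def attached_devices(devlist_text):
    """Return the peripheral names (e.g. 'ada2', 'da0') found in the text,
    ignoring the passN passthrough entries."""
    names = set()
    # Each entry ends with something like "(ada2,pass10)" or "(pass14,da8)".
    for group in re.findall(r"\(([a-z]+\d+(?:,[a-z]+\d+)*)\)", devlist_text):
        for name in group.split(","):
            if not name.startswith("pass"):
                names.add(name)
    return names

def missing_devices(expected, devlist_text):
    """Return the expected device names that do not appear in the output."""
    return sorted(set(expected) - attached_devices(devlist_text))

# Two lines copied from the output above, used here as sample input.
sample = (
    "<ST31500541AS CC35> at scbus3 target 0 lun 0 (ada2,pass10)\n"
    "<SanDisk Cruzer 1.27> at scbus8 target 0 lun 0 (pass14,da8)\n"
)

print(missing_devices(["ada0", "ada1", "ada2"], sample))  # ['ada0', 'ada1']
```

Fed the full listing above, it would flag ada0 and ada1 in the same way, since neither name appears in any parenthesised pair.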
I suspect a bad drive, but I've had issues before that might also point to the cables. What is safe to do? Should I reboot? I've already backed up some critical data from my main array, which was previously backing up to this one.
Thanks.