Disconnected drive?

leonardorame

Contributor
Joined
Jun 30, 2018
Messages
106
Hi, I logged in to a TrueNAS-12.0-U4.1 and noticed a warning saying one of 3 disk RaidZ1 was REMOVED. I rebooted the system and the disk went back online, resilver and now everything looks ok.

After that I took a look at /var/log/messages and found this:

Code:
Jul 29 17:38:18 gauss ahcich2: Timeout on slot 22 port 0
Jul 29 17:38:18 gauss ahcich2: is 00000000 cs 00400000 ss 00000000 rs 00400000 tfd c0 serr 00000000 cmd 0000d617
Jul 29 17:38:18 gauss (ada1:ahcich2:0:0:0): FLUSHCACHE48. ACB: ea 00 00 00 00 40 00 00 00 00 00 00
Jul 29 17:38:18 gauss (ada1:ahcich2:0:0:0): CAM status: Command timeout
Jul 29 17:38:18 gauss (ada1:ahcich2:0:0:0): Retrying command, 0 more tries remain
Jul 29 17:38:48 gauss ahcich2: Timeout on slot 7 port 0
Jul 29 17:38:48 gauss ahcich2: is 00000000 cs 00000080 ss 00000000 rs 00000080 tfd c0 serr 00000000 cmd 0000c717
Jul 29 17:38:48 gauss (ada1:ahcich2:0:0:0): FLUSHCACHE48. ACB: ea 00 00 00 00 40 00 00 00 00 00 00
Jul 29 17:38:48 gauss (ada1:ahcich2:0:0:0): CAM status: Command timeout
Jul 29 17:38:48 gauss (ada1:ahcich2:0:0:0): Retrying command, 0 more tries remain
Jul 29 17:39:19 gauss ahcich2: Timeout on slot 18 port 0
Jul 29 17:39:19 gauss ahcich2: is 00000000 cs 00040000 ss 00000000 rs 00040000 tfd c0 serr 00000000 cmd 0000d217
Jul 29 17:39:19 gauss (ada1:ahcich2:0:0:0): FLUSHCACHE48. ACB: ea 00 00 00 00 40 00 00 00 00 00 00
Jul 29 17:39:19 gauss (ada1:ahcich2:0:0:0): CAM status: Command timeout
Jul 29 17:39:19 gauss (ada1:ahcich2:0:0:0): Retrying command, 0 more tries remain
Jul 29 17:39:51 gauss ahcich2: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 29 17:40:14 gauss 1 2021-07-29T17:40:14.617732-03:00 gauss.truenas.local smartd 1552 - - Device: /dev/ada1, failed to read SMART Attribute Data
Jul 29 17:40:14 gauss ada1 at ahcich2 bus 0 scbus2 target 0 lun 0
Jul 29 17:40:14 gauss ada1: <WDC WD20EFRX-68EUZN0 82.00A82> s/n WD-WCC4M3JPNPJJ detached
Jul 29 17:40:15 gauss (ada1:ahcich2:0:0:0): Periph destroyed
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): CAM status: ATA Status Error
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): Error 5, Retries exhausted
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): CAM status: ATA Status Error
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
Jul 29 17:40:19 gauss (aprobe0:ahcich2:0:0:0): Error 5, Retries exhausted
Jul 30 00:00:00 gauss syslog-ng[1288]: Configuration reload request received, reloading configuration;
Jul 30 00:00:00 gauss syslog-ng[1288]: Configuration reload finished;
Jul 31 00:00:00 gauss syslog-ng[1288]: Configuration reload request received, reloading configuration;
Jul 31 00:00:00 gauss syslog-ng[1288]: Configuration reload finished;


To me it looks like a cabling or power supply problem, but do you see anything else?.
 
Joined
Jan 7, 2015
Messages
1,150
Id be very leery of this disk with a Z1 pool. Keep an eye on it for sure. If TN kicks it from the pool, something is hokey.
 

leonardorame

Contributor
Joined
Jun 30, 2018
Messages
106
Just started a long Smart test on all disks, just to see if something is wrong in any of the disks.
 

HarambeLives

Contributor
Joined
Jul 19, 2021
Messages
153
From the output, it looks like this is a 2TB SATA Disk. For the cost of a 2TB disk, I'd just toss this one honestly

But, that's just me
 
Top