I just noticed this alert message which is similar to one that I got several months ago:
New alerts:
* Pool TANK state is ONLINE: One or more devices has experienced an
unrecoverable error. An attempt was made to correct the error.
Applications are unaffected.
It doesn't appear to be a bad spot on the disk because:
Drive Info
Model Family: Western Digital Red
Device Model: WDC WD60EFRX-68MYMN1
Serial Number: WD-WX31DAxxxxxx
LU WWN Device Id: 5 0014ee 20bxxxxxx
Firmware Version: 82.00A82
User Capacity: 6,001,175,126,016 bytes [6.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Information from Latest Error (Mar 5, 2022) Version: TrueNAS-12.0-U8
Information from Previous Error (Sept 24, 2021) Version: TrueNAS-12.0-U8
New alerts:
* Pool TANK state is ONLINE: One or more devices has experienced an
unrecoverable error. An attempt was made to correct the error.
Applications are unaffected.
It doesn't appear to be a bad spot on the disk because:
- The Error message indicates ILLEGAL REQUEST asc:21,0 (Logical block address out of range)
- The smart log doesn't show any pending/relocation events, and after the last error I ran a long smart test and scrubbed the pool and both operations were normal and without errors.
- Problems / Crash on a connected workstation. Has several read-only and one read-write NFS share. I had some funky stuff happening on my Linux workstation that eventually caused a crash.
- A Controller problem
- Memory Error (The machine does not have ECC memory.
- TrueNAS software bug?
Drive Info
Model Family: Western Digital Red
Device Model: WDC WD60EFRX-68MYMN1
Serial Number: WD-WX31DAxxxxxx
LU WWN Device Id: 5 0014ee 20bxxxxxx
Firmware Version: 82.00A82
User Capacity: 6,001,175,126,016 bytes [6.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Information from Latest Error (Mar 5, 2022) Version: TrueNAS-12.0-U8
Code:
--- From /var/log/messages Mar 5 01:08:58 freenas (da3:mps0:0:5:0): WRITE(16). CDB: 8a 00 00 00 00 02 46 d7 15 70 00 00 00 08 00 00 Mar 5 01:08:58 freenas (da3:mps0:0:5:0): CAM status: SCSI Status Error Mar 5 01:08:58 freenas (da3:mps0:0:5:0): SCSI status: Check Condition Mar 5 01:08:58 freenas (da3:mps0:0:5:0): SCSI sense: ILLEGAL REQUEST asc:21,0 (Logical block address out of range) Mar 5 01:08:58 freenas (da3:mps0:0:5:0): Info: 0x246d71570 Mar 5 01:08:58 freenas (da3:mps0:0:5:0): Error 22, Unretryable error --- From Drive Smart Error Log Error 7 [6] occurred at disk power-on lifetime: 54558 hours (2273 days + 6 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 10 -- 51 00 00 00 02 46 d7 15 70 40 00 Error: IDNF at LBA = 0x246d71570 = 9778435440 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 00 08 00 00 00 02 46 d7 15 70 40 00 13d+04:55:42.487 WRITE FPDMA QUEUED ea 00 00 00 00 00 00 00 00 00 00 40 00 13d+04:55:37.860 FLUSH CACHE EXT 61 00 30 00 00 00 02 46 d7 15 10 40 00 13d+04:55:37.860 WRITE FPDMA QUEUED ea 00 00 00 00 00 00 00 00 00 00 40 00 13d+04:55:37.847 FLUSH CACHE EXT 61 00 30 00 00 00 02 46 d7 14 e0 40 00 13d+04:55:37.847 WRITE FPDMA QUEUED
Information from Previous Error (Sept 24, 2021) Version: TrueNAS-12.0-U8
Code:
--- From /var/log/messages Sep 24 05:31:36 freenas (da3:mps0:0:5:0): WRITE(16). CDB: 8a 00 00 00 00 02 14 e5 9d c0 00 00 00 08 00 00 Sep 24 05:31:36 freenas (da3:mps0:0:5:0): CAM status: SCSI Status Error Sep 24 05:31:36 freenas (da3:mps0:0:5:0): SCSI status: Check Condition Sep 24 05:31:36 freenas (da3:mps0:0:5:0): SCSI sense: ILLEGAL REQUEST asc:21,0 (Logical block address out of range) Sep 24 05:31:36 freenas (da3:mps0:0:5:0): Info: 0x214e59dc0 Sep 24 05:31:36 freenas (da3:mps0:0:5:0): Error 22, Unretryable error --- From Drive Smart Error Log Error 5 [4] occurred at disk power-on lifetime: 50677 hours (2111 days + 13 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER -- ST COUNT LBA_48 LH LM LL DV DC -- -- -- == -- == == == -- -- -- -- -- 10 -- 51 00 00 00 02 14 e5 9d c0 40 00 Error: IDNF at LBA = 0x214e59dc0 = 8940527040 Commands leading to the command that caused the error were: CR FEATR COUNT LBA_48 LH LM LL DV DC Powered_Up_Time Command/Feature_Name -- == -- == -- == == == -- -- -- -- -- --------------- -------------------- 61 00 08 00 00 00 02 14 e5 9d c0 40 00 41d+02:52:12.844 WRITE FPDMA QUEUED ea 00 00 00 00 00 00 00 00 00 00 40 00 41d+02:51:52.870 FLUSH CACHE EXT 61 00 08 00 00 00 02 ba a0 f4 48 40 00 41d+02:51:52.869 WRITE FPDMA QUEUED 61 00 08 00 10 00 02 ba a0 f2 48 40 00 41d+02:51:52.869 WRITE FPDMA QUEUED 61 00 08 00 08 00 00 00 40 04 48 40 00 41d+02:51:52.869 WRITE FPDMA QUEUED