Need Help Determining Cause of Error

NASbox

Guru
Joined
May 8, 2012
Messages
650
I just noticed this alert message which is similar to one that I got several months ago:

New alerts:
* Pool TANK state is ONLINE: One or more devices has experienced an
unrecoverable error. An attempt was made to correct the error.
Applications are unaffected.


It doesn't appear to be a bad spot on the disk because:
  1. The Error message indicates ILLEGAL REQUEST asc:21,0 (Logical block address out of range)
  2. The smart log doesn't show any pending/relocation events, and after the last error I ran a long smart test and scrubbed the pool and both operations were normal and without errors.
I am wondering if this type of thing is most likely the result of:
  1. Problems / Crash on a connected workstation. Has several read-only and one read-write NFS share. I had some funky stuff happening on my Linux workstation that eventually caused a crash.
  2. A Controller problem
  3. Memory Error (The machine does not have ECC memory.
  4. TrueNAS software bug?
Any assistance would be much appreciated. The error information is below.

Drive Info
Model Family: Western Digital Red
Device Model: WDC WD60EFRX-68MYMN1
Serial Number: WD-WX31DAxxxxxx
LU WWN Device Id: 5 0014ee 20bxxxxxx
Firmware Version: 82.00A82
User Capacity: 6,001,175,126,016 bytes [6.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical


Information from Latest Error (Mar 5, 2022) Version: TrueNAS-12.0-U8
Code:
--- From /var/log/messages
Mar  5 01:08:58 freenas (da3:mps0:0:5:0): WRITE(16). CDB: 8a 00 00 00 00 02 46 d7 15 70 00 00 00 08 00 00
Mar  5 01:08:58 freenas (da3:mps0:0:5:0): CAM status: SCSI Status Error
Mar  5 01:08:58 freenas (da3:mps0:0:5:0): SCSI status: Check Condition
Mar  5 01:08:58 freenas (da3:mps0:0:5:0): SCSI sense: ILLEGAL REQUEST asc:21,0 (Logical block address out of range)
Mar  5 01:08:58 freenas (da3:mps0:0:5:0): Info: 0x246d71570
Mar  5 01:08:58 freenas (da3:mps0:0:5:0): Error 22, Unretryable error

--- From Drive Smart Error Log
Error 7 [6] occurred at disk power-on lifetime: 54558 hours (2273 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  10 -- 51 00 00 00 02 46 d7 15 70 40 00  Error: IDNF at LBA = 0x246d71570 = 9778435440

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  61 00 08 00 00 00 02 46 d7 15 70 40 00 13d+04:55:42.487  WRITE FPDMA QUEUED
  ea 00 00 00 00 00 00 00 00 00 00 40 00 13d+04:55:37.860  FLUSH CACHE EXT
  61 00 30 00 00 00 02 46 d7 15 10 40 00 13d+04:55:37.860  WRITE FPDMA QUEUED
  ea 00 00 00 00 00 00 00 00 00 00 40 00 13d+04:55:37.847  FLUSH CACHE EXT
  61 00 30 00 00 00 02 46 d7 14 e0 40 00 13d+04:55:37.847  WRITE FPDMA QUEUED


Information from Previous Error (Sept 24, 2021) Version: TrueNAS-12.0-U8
Code:
--- From /var/log/messages
Sep 24 05:31:36 freenas (da3:mps0:0:5:0): WRITE(16). CDB: 8a 00 00 00 00 02 14 e5 9d c0 00 00 00 08 00 00
Sep 24 05:31:36 freenas (da3:mps0:0:5:0): CAM status: SCSI Status Error
Sep 24 05:31:36 freenas (da3:mps0:0:5:0): SCSI status: Check Condition
Sep 24 05:31:36 freenas (da3:mps0:0:5:0): SCSI sense: ILLEGAL REQUEST asc:21,0 (Logical block address out of range)
Sep 24 05:31:36 freenas (da3:mps0:0:5:0): Info: 0x214e59dc0
Sep 24 05:31:36 freenas (da3:mps0:0:5:0): Error 22, Unretryable error

--- From Drive Smart Error Log
Error 5 [4] occurred at disk power-on lifetime: 50677 hours (2111 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  10 -- 51 00 00 00 02 14 e5 9d c0 40 00  Error: IDNF at LBA = 0x214e59dc0 = 8940527040

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  61 00 08 00 00 00 02 14 e5 9d c0 40 00 41d+02:52:12.844  WRITE FPDMA QUEUED
  ea 00 00 00 00 00 00 00 00 00 00 40 00 41d+02:51:52.870  FLUSH CACHE EXT
  61 00 08 00 00 00 02 ba a0 f4 48 40 00 41d+02:51:52.869  WRITE FPDMA QUEUED
  61 00 08 00 10 00 02 ba a0 f2 48 40 00 41d+02:51:52.869  WRITE FPDMA QUEUED
  61 00 08 00 08 00 00 00 40 04 48 40 00 41d+02:51:52.869  WRITE FPDMA QUEUED
 
Top