The volume state is unknown - HBA problem(?)

Toydoll

Dabbler
Joined
Sep 17, 2015
Messages
33
Hello

A couple of weeks ago I rebooted my server and was later met with a "The volume head state is unknown" ("head" being the name of the volume). I rebooted the server again and everything worked fine. I was a bit worried but quickly forgot about it. Now I rebooted it again and is met with the same message, but this time it won't go away. I would lie if I said I'm not a bit scared...

I'll post all the info I can think of below, please treat me like a five year old and let me know if I'm missing something super obvious.

The most important thing (I assume) to know is that the missing disks are connected to the HBA card. So I guess the main question is if the card is dead and if I can fix it and/or if my data is forever lost.

Hardware:
Moherboard - Supermicro MBD-X11SSM-F-O
CPU - Xeon E3-1220 V6
RAM - 2x8 Samsung ECC
HBA - Dell Perc H310
Disks - WD red for storage and two random ssd for boot

Freenas version:
FreeNAS-11.2-U3

Regarding the Dell Perc H310
This is a raid card but according to many forum posts it should work fine if it's flashed correctly. I followed a guide showing how to do so and confirm that it was done. Everything checked out.
When I enter "sas2flash -list" i get:
Code:
LSI Corporation SAS2 Flash Utility
Version 16.00.00.00 (2013.03.01)
Copyright (c) 2008-2013 LSI Corporation. All rights reserved

    Adapter Selected is a LSI SAS: SAS2008(B2)   

    Controller Number              : 0
    Controller                     : SAS2008(B2)   
    PCI Address                    : 00:02:00:00
    SAS Address                    : 590b11c-0-22e3-d200
    NVDATA Version (Default)       : 14.01.00.08
    NVDATA Version (Persistent)    : 14.01.00.08
    Firmware Product ID            : 0x2213 (IT)
    Firmware Version               : 20.00.07.00
    NVDATA Vendor                  : LSI
    NVDATA Product ID              : SAS9211-8i
    BIOS Version                   : N/A
    UEFI BSD Version               : N/A
    FCODE Version                  : N/A
    Board Name                     : 6Gbps SAS HBA
    Board Assembly                 : N/A
    Board Tracer Number            : N/A

    Finished Processing Commands Successfully.
    Exiting SAS2Flash.


Storage:
(Assuming that I have the terminology correct I have:)
One pool containing two vdevs with six disks each. Both vdevs are running raidz2

När jag skriver in "zpool import" får jag:
Code:
   pool: head
     id: 17323364340947240638
  state: UNAVAIL
 status: One or more devices are missing from the system.
 action: The pool cannot be imported. Attach the missing
    devices and try again.
   see: http://illumos.org/msg/ZFS-8000-3C
 config:

    head                                            UNAVAIL  insufficient replicas
      raidz2-0                                      UNAVAIL  insufficient replicas
        gptid/64cc2c30-5d2f-11e9-9b78-ac1f6b9743f0  ONLINE
        gptid/68574735-5d2f-11e9-9b78-ac1f6b9743f0  ONLINE
        9281326334773016186                         UNAVAIL  cannot open
        1905415357501583071                         UNAVAIL  cannot open
        16668410022386804535                        UNAVAIL  cannot open
        1451900473353901407                         UNAVAIL  cannot open
      raidz2-1                                      UNAVAIL  insufficient replicas
        gptid/3ed9b1a9-5d74-11e9-86e5-ac1f6b9743f0  ONLINE
        gptid/425b0efc-5d74-11e9-86e5-ac1f6b9743f0  ONLINE
        2755843918717659983                         UNAVAIL  cannot open
        5150504189328925119                         UNAVAIL  cannot open
        12362184591851198512                        UNAVAIL  cannot open
        13579243299936852929                        UNAVAIL  cannot open

Very important:
Every disk that is Unavailable is connected to the HBA controller.

It seems pretty clear to me that there's something wrong with the HBA card, but what? I'm hoping that it has just given up without destroying my data at the same time but then shouldn't it be completely invisible? It is clearly there when I type "sas2flash -list".

First question: Is there anything that indicates that my data is/might be forever lost? If I failed to flash it correctly for example.
Second question: Where do I start the troubleshooting/repairing?

I'm thankful for anything you have to say.
 
Joined
Jan 7, 2015
Messages
1,155
Id say the data is fine, but you have it likely pegged correctly. The HBA was dying, and now it might be dead. However, I would try a few things first before buying a new one. Re-Seat the HBA in a different pci port if possible. They can wiggle the slightest bit loose and cause goofy things. RAM sometimes need re-seated, this can be the same type of thing. Try that first. Another thing that could be making the HBA "show up" and the drives not, is power. Could these drives be on one rail, and that rail possibly has a power splitter along the way making intermittent connection? Is the rail burnt out? Its just a thought and easy enough to test. If that doesnt yield results, grab a different HBA and give that a try. They can be had on the secondary market for cheap all the time. After all these things, and it still doesnt work then you can worry.
 
Top