Hello
A couple of weeks ago I rebooted my server and was later met with the message "The volume head state is unknown" ("head" being the name of the volume). I rebooted the server again and everything worked fine. I was a bit worried but quickly forgot about it. Now I have rebooted it again and was met with the same message, but this time it won't go away. I would be lying if I said I'm not a bit scared...
I'll post all the info I can think of below. Please treat me like a five-year-old and let me know if I'm missing something super obvious.
The most important thing (I assume) to know is that the missing disks are connected to the HBA card. So I guess the main question is if the card is dead and if I can fix it and/or if my data is forever lost.
Hardware:
Motherboard - Supermicro MBD-X11SSM-F-O
CPU - Xeon E3-1220 V6
RAM - 2x8 GB Samsung ECC
HBA - Dell Perc H310
Disks - WD Red for storage and two random SSDs for boot
FreeNAS version:
FreeNAS-11.2-U3
Regarding the Dell Perc H310
This is a RAID card, but according to many forum posts it should work fine if it's flashed correctly. I followed a guide showing how to do so and confirmed that it was done. Everything checked out.
When I enter "sas2flash -list" I get:
Code:
LSI Corporation SAS2 Flash Utility
Version 16.00.00.00 (2013.03.01)
Copyright (c) 2008-2013 LSI Corporation. All rights reserved

    Adapter Selected is a LSI SAS: SAS2008(B2)

    Controller Number              : 0
    Controller                     : SAS2008(B2)
    PCI Address                    : 00:02:00:00
    SAS Address                    : 590b11c-0-22e3-d200
    NVDATA Version (Default)       : 14.01.00.08
    NVDATA Version (Persistent)    : 14.01.00.08
    Firmware Product ID            : 0x2213 (IT)
    Firmware Version               : 20.00.07.00
    NVDATA Vendor                  : LSI
    NVDATA Product ID              : SAS9211-8i
    BIOS Version                   : N/A
    UEFI BSD Version               : N/A
    FCODE Version                  : N/A
    Board Name                     : 6Gbps SAS HBA
    Board Assembly                 : N/A
    Board Tracer Number            : N/A

    Finished Processing Commands Successfully.
    Exiting SAS2Flash.
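For anyone who wants to check a listing like this mechanically rather than by eye, here is a minimal sketch in Python. It only pulls the "Key : Value" pairs out of the text and looks at whether the Firmware Product ID line ends in "(IT)", as it does in my output. The function names and the idea of pasting the listing into a string are my own assumptions for illustration; this is not part of sas2flash and it says nothing about whether the card itself is healthy.

```python
# Sketch: parse the "Key : Value" lines of a `sas2flash -list` listing.
# parse_sas2flash_list and is_it_firmware are hypothetical helper names.

SAMPLE_LISTING = """\
Adapter Selected is a LSI SAS: SAS2008(B2)
Controller Number              : 0
Controller                     : SAS2008(B2)
PCI Address                    : 00:02:00:00
Firmware Product ID            : 0x2213 (IT)
Firmware Version               : 20.00.07.00
NVDATA Product ID              : SAS9211-8i
BIOS Version                   : N/A
"""


def parse_sas2flash_list(text):
    """Return a dict of the 'Key : Value' fields found in the listing."""
    fields = {}
    for line in text.splitlines():
        # Split on the first " : " so values such as 00:02:00:00 stay whole.
        key, sep, value = line.partition(" : ")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def is_it_firmware(fields):
    """True if the Firmware Product ID field is tagged (IT), as in the listing."""
    return fields.get("Firmware Product ID", "").endswith("(IT)")


fields = parse_sas2flash_list(SAMPLE_LISTING)
print("Firmware version:", fields["Firmware Version"])
print("Reports IT firmware:", is_it_firmware(fields))
```

All this tells you is what the controller reports about its firmware. It does not show whether the disks behind the card are being detected.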
Storage:
(Assuming that I have the terminology correct, I have:)
One pool containing two vdevs with six disks each. Both vdevs are running raidz2.
When I enter "zpool import" I get:
Code:
   pool: head
     id: 17323364340947240638
  state: UNAVAIL
 status: One or more devices are missing from the system.
 action: The pool cannot be imported. Attach the missing devices and try again.
    see: http://illumos.org/msg/ZFS-8000-3C
 config:

    head                                            UNAVAIL  insufficient replicas
      raidz2-0                                      UNAVAIL  insufficient replicas
        gptid/64cc2c30-5d2f-11e9-9b78-ac1f6b9743f0  ONLINE
        gptid/68574735-5d2f-11e9-9b78-ac1f6b9743f0  ONLINE
        9281326334773016186                         UNAVAIL  cannot open
        1905415357501583071                         UNAVAIL  cannot open
        16668410022386804535                        UNAVAIL  cannot open
        1451900473353901407                         UNAVAIL  cannot open
      raidz2-1                                      UNAVAIL  insufficient replicas
        gptid/3ed9b1a9-5d74-11e9-86e5-ac1f6b9743f0  ONLINE
        gptid/425b0efc-5d74-11e9-86e5-ac1f6b9743f0  ONLINE
        2755843918717659983                         UNAVAIL  cannot open
        5150504189328925119                         UNAVAIL  cannot open
        12362184591851198512                        UNAVAIL  cannot open
        13579243299936852929                        UNAVAIL  cannot open
Very important:
Every disk that is Unavailable is connected to the HBA controller.
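To spell out why the pool reports "insufficient replicas": a raidz2 vdev can lose at most two disks, and each of my vdevs is currently missing four. Below is a small Python sketch that counts this from the config lines of "zpool import". The helper names and the simple whitespace-splitting are my own assumptions for illustration, and it only understands a flat layout like mine (a pool, raidz vdevs, disks directly under them).

```python
# Sketch: count missing disks per raidz vdev from `zpool import` config lines.
# count_missing and parity_of are hypothetical helper names.

RAIDZ_PARITY = {"raidz1": 1, "raidz2": 2, "raidz3": 3}

CONFIG_LINES = """\
head UNAVAIL insufficient replicas
raidz2-0 UNAVAIL insufficient replicas
gptid/64cc2c30-5d2f-11e9-9b78-ac1f6b9743f0 ONLINE
gptid/68574735-5d2f-11e9-9b78-ac1f6b9743f0 ONLINE
9281326334773016186 UNAVAIL cannot open
1905415357501583071 UNAVAIL cannot open
16668410022386804535 UNAVAIL cannot open
1451900473353901407 UNAVAIL cannot open
raidz2-1 UNAVAIL insufficient replicas
gptid/3ed9b1a9-5d74-11e9-86e5-ac1f6b9743f0 ONLINE
gptid/425b0efc-5d74-11e9-86e5-ac1f6b9743f0 ONLINE
2755843918717659983 UNAVAIL cannot open
5150504189328925119 UNAVAIL cannot open
12362184591851198512 UNAVAIL cannot open
13579243299936852929 UNAVAIL cannot open
""".splitlines()


def count_missing(config_lines):
    """Return {vdev_name: (missing_disks, total_disks)} for each raidz vdev."""
    counts = {}
    current = None
    for line in config_lines:
        parts = line.split()
        if len(parts) < 2:
            continue
        name, state = parts[0], parts[1]
        if name.startswith("raidz"):
            current = name
            counts[current] = [0, 0]
        elif current is not None:
            # Any line under a raidz vdev is treated as one of its disks.
            counts[current][1] += 1
            if state != "ONLINE":
                counts[current][0] += 1
    return {vdev: tuple(pair) for vdev, pair in counts.items()}


def parity_of(vdev_name):
    """Map e.g. 'raidz2-0' to the number of disks that vdev can lose (2)."""
    return RAIDZ_PARITY[vdev_name.split("-")[0]]


for vdev, (missing, total) in count_missing(CONFIG_LINES).items():
    ok = missing <= parity_of(vdev)
    print(f"{vdev}: {missing} of {total} disks missing, "
          f"tolerates {parity_of(vdev)}, importable: {ok}")
```

The numbers only say that the pool cannot be imported while those eight disks are not visible. They do not say that the data on the disks is damaged.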
It seems pretty clear to me that there's something wrong with the HBA card, but what? I'm hoping that it has just given up without destroying my data at the same time but then shouldn't it be completely invisible? It is clearly there when I type "sas2flash -list".
First question: Is there anything that indicates that my data is/might be forever lost, for example if I failed to flash the card correctly?
Second question: Where do I start the troubleshooting/repairing?
I'm thankful for anything you have to say.