Hello all,
I have a x64 Dell PowerEdge 2950 III Quad Core (PE2950 III)
with dual Intel Xeon E5450 3.0/12M/1333 4C 80W,
2GB PC5300 DDR2 ECC Fully Buffered x 8 = 16GB,
Western Digital Red NAS Hard Drive WD20EFRX 2TB IntelliPower 64MB Cache SATA 6.0Gb/s 3.5" x6
IBM ServeRAID M1015/LSI SAS9220-8i PCI-E SAS+SATA 46C8933 Half Height Low Pro flashed in IT mode with 9211_8i_Package_P17_IR_IT_Firmware_BIOS_for_MSDOS.
I have two systems called NAS1 and NAS2
NAS1 is production and replicates snapshots over a wireless bridge at 6:00 AM every day to NAS2. Both systems went online in September of 2013 with version 9.1.1. The systems were upgraded to 9.2.0 on January 8th 2014. Until yesterday there have been no problems. NAS1 crashed sometime Tuesday night into Wednesday morning. I simple reboot resolved the issue. It also spontaneously rebooted three more times on Wednesday. I decided to scrub the zpool to see if that would make it stable. It crashed sometime overnight presumably while scrubbing and now it hangs while trying to mount the zpool with the message:
Mounting local file systems :
At this point the console is unresponsive and input like CTRL-t, F1, etc... does not work. I can boot if I disconnect from the HBA.
The file system in question is a ZFS raidz2-0 called lun0 made up of 6 2TB drives.
The question is how do I proceed to get the box operational and discover the root cause of the crashes in the first place. All my data is available on NAS2 and else ware, it will just take a large amount of time to recover it to NAS1 if required. FYI, NAS2 appears to be functioning normally but never has any load placed directly on it like NAS1.
Thanks for any help,
Tim
I have a x64 Dell PowerEdge 2950 III Quad Core (PE2950 III)
with dual Intel Xeon E5450 3.0/12M/1333 4C 80W,
2GB PC5300 DDR2 ECC Fully Buffered x 8 = 16GB,
Western Digital Red NAS Hard Drive WD20EFRX 2TB IntelliPower 64MB Cache SATA 6.0Gb/s 3.5" x6
IBM ServeRAID M1015/LSI SAS9220-8i PCI-E SAS+SATA 46C8933 Half Height Low Pro flashed in IT mode with 9211_8i_Package_P17_IR_IT_Firmware_BIOS_for_MSDOS.
I have two systems called NAS1 and NAS2
NAS1 is production and replicates snapshots over a wireless bridge at 6:00 AM every day to NAS2. Both systems went online in September of 2013 with version 9.1.1. The systems were upgraded to 9.2.0 on January 8th 2014. Until yesterday there have been no problems. NAS1 crashed sometime Tuesday night into Wednesday morning. I simple reboot resolved the issue. It also spontaneously rebooted three more times on Wednesday. I decided to scrub the zpool to see if that would make it stable. It crashed sometime overnight presumably while scrubbing and now it hangs while trying to mount the zpool with the message:
Mounting local file systems :
At this point the console is unresponsive and input like CTRL-t, F1, etc... does not work. I can boot if I disconnect from the HBA.
The file system in question is a ZFS raidz2-0 called lun0 made up of 6 2TB drives.
The question is how do I proceed to get the box operational and discover the root cause of the crashes in the first place. All my data is available on NAS2 and else ware, it will just take a large amount of time to recover it to NAS1 if required. FYI, NAS2 appears to be functioning normally but never has any load placed directly on it like NAS1.
Thanks for any help,
Tim