DiscreteBanana
Cadet
- Joined
- Mar 9, 2024
- Messages
- 1
Hello! I'm not sure where to post my question as I am not running TrueNAS. I was "referred" here from this Youtube video discussing the benefits and awesomeness of zfs.
If I'm in the wrong place, or if this shouldn't even be posted, I sincerely apologize. I figured you all are experts to zfs, at least compared to the folks over on Ubuntu/Linux Mint forums.
I'm experiencing issues with my new Linux Mint 21.3 on a new desktop computer, which is based on Ubuntu 22. I'm not new to Linux; I've been using Linux for almost 20 years, using Ubuntu as my primary desktop for the majority of that time. However, I've always used ext3/4. When setting up Linux Mint, I saw the option to use zfs and did some research into it and found the benefits worth the uncertainty/unfamiliarity.
The OS ran fine for a few days. I attached a second hard drive to act as a mirror to the first using the instructions here. I was having screen related issues and was recommended to install a BIOS patch. After doing so, I was unable to boot:
After a few days of unsuccessful troubleshooting, I decided to [c]zfs send[/c] the snapshots and reinstall LM. Wanting to recreate the issue, I repatched my BIOS, but my install survived. So I applied my snapshots and continued to rebuild my desktop. A few days later, after a reboot, I encountered the same error.
During my troubleshooting in GRUB, it seems like GRUB is unable to load the EFI partition. I do have secure boot enabled and I've found that my motherboard (make and model below) makes it awfully difficult to disable it. During the first troubleshooting, I tried disabling SecureBoot to no avail.
I've tried running Boot Repair to no avail.
I've reverted bpool to the previous day's snapshot, to no avail.
I am fully prepared to reinstall again if needed. However, I'm convinced that the issue is with zfs. I've never had these boot issues before on ext4. When I drop to the GRUB command line, I got this message when trying to access the filesystem:
I am able to boot to a LiveUSB and mount both bpool and rpool without any issues. However, I did notice something odd. When inspecting the partition in Disks, partition 3 of /dev/nvme0n1 is showing as "Contents: unknown" whereas the mirrored drive /dev/nvme1n1 is showing as "Contents: Unknown (zfs_member 5000)" with the GUI properly labeling the second drive as "bpool".
When inspecting both partitions in parted, partition 3 of /dev/nvme0n1 is showing as File System "fat32" with an label of "EFI" whereas /dev/nvme1n1 is showing as File System "zfs" with a label of "bpool".
I'm convinced the issue here is either with how I set up mirroring or snapshots. After searching for this issue, it seems that zfs is stable enough to run a daily OS so I'm sure it's something I'm doing wrong. However, I am hesitant to reinstall LM a third time only to have this happen again.
I'm hoping those of you who are more familiar with zfs will be able to save me from a reinstall and having this happen again. Is there a way for me to rebuild bpool/boot partition without having to reinstall the entire OS?
My desktop specs:
If I'm in the wrong place, or if this shouldn't even be posted, I sincerely apologize. I figured you all are experts to zfs, at least compared to the folks over on Ubuntu/Linux Mint forums.
I'm experiencing issues with my new Linux Mint 21.3 on a new desktop computer, which is based on Ubuntu 22. I'm not new to Linux; I've been using Linux for almost 20 years, using Ubuntu as my primary desktop for the majority of that time. However, I've always used ext3/4. When setting up Linux Mint, I saw the option to use zfs and did some research into it and found the benefits worth the uncertainty/unfamiliarity.
The OS ran fine for a few days. I attached a second hard drive to act as a mirror to the first using the instructions here. I was having screen related issues and was recommended to install a BIOS patch. After doing so, I was unable to boot:
Code:
error: file `/BOOT/ubuntu_udd128@/vmlinuz-6.5.0.21-generic' not found. error: you need to load the kernel first.
After a few days of unsuccessful troubleshooting, I decided to [c]zfs send[/c] the snapshots and reinstall LM. Wanting to recreate the issue, I repatched my BIOS, but my install survived. So I applied my snapshots and continued to rebuild my desktop. A few days later, after a reboot, I encountered the same error.
During my troubleshooting in GRUB, it seems like GRUB is unable to load the EFI partition. I do have secure boot enabled and I've found that my motherboard (make and model below) makes it awfully difficult to disable it. During the first troubleshooting, I tried disabling SecureBoot to no avail.
I've tried running Boot Repair to no avail.
I've reverted bpool to the previous day's snapshot, to no avail.
I am fully prepared to reinstall again if needed. However, I'm convinced that the issue is with zfs. I've never had these boot issues before on ext4. When I drop to the GRUB command line, I got this message when trying to access the filesystem:
Code:
grub: Secure Boot forbids loading module from (hd3,gpt1)/grub/x86_64-efi/efs2.mod ... (long list of files)
I am able to boot to a LiveUSB and mount both bpool and rpool without any issues. However, I did notice something odd. When inspecting the partition in Disks, partition 3 of /dev/nvme0n1 is showing as "Contents: unknown" whereas the mirrored drive /dev/nvme1n1 is showing as "Contents: Unknown (zfs_member 5000)" with the GUI properly labeling the second drive as "bpool".
When inspecting both partitions in parted, partition 3 of /dev/nvme0n1 is showing as File System "fat32" with an label of "EFI" whereas /dev/nvme1n1 is showing as File System "zfs" with a label of "bpool".
I'm convinced the issue here is either with how I set up mirroring or snapshots. After searching for this issue, it seems that zfs is stable enough to run a daily OS so I'm sure it's something I'm doing wrong. However, I am hesitant to reinstall LM a third time only to have this happen again.
I'm hoping those of you who are more familiar with zfs will be able to save me from a reinstall and having this happen again. Is there a way for me to rebuild bpool/boot partition without having to reinstall the entire OS?
My desktop specs:
- Motherboard make and model: MSI PRO B760-VC WIFI
- CPU make and model: Intel® Core™ i7-13700F
- RAM quantity: 32 GB
- Hard drives, quantity, model numbers, and RAID configuration, including boot drives:
-
Code:
NAME FSTYPE LABEL MOUNTPOINT SIZE MODELloop0 squashfs /rofs 2.4G sda 1.8T WDC WD20EZBX-00AYRA0 ├─sda1 linux_raid_member starbuck:viper 1.8T └─sda9 8M sdb 3.6T WDC WD40EZAX-00C8UB0 └─sdb1 ext4 BACKUP 3.6T sdc iso9660 Linux Mint 21.3 Cinnamon 64-bit 7.2G STORE N GO ├─sdc1 iso9660 Linux Mint 21.3 Cinnamon 64-bit /cdrom 2.9G ├─sdc2 vfat 4.9M └─sdc3 ext4 writable /var/log 4.4G nvme1n1 953.9G KXG50ZNV1T02 TOSHIBA ├─nvme1n1p1 vfat 512M ├─nvme1n1p2 swap 2G ├─nvme1n1p3 zfs_member bpool 2G └─nvme1n1p4 zfs_member rpool 927G nvme0n1 931.5G MSI M450 1TB ├─nvme0n1p1 vfat 512M ├─nvme0n1p2 swap 2G ├─nvme0n1p3 2G └─nvme0n1p4 zfs_member rpool 927G
-
Code:
[*]mint@mint:~$ zpool list NAME SIZE ALLOC FREE CKPOINT EXPANDSZ FRAG CAP DEDUP HEALTH ALTROOT bpool 1.88G 648M 1.24G - - 3% 33% 1.00x ONLINE /bpool rpool 920G 19.2G 901G - - 0% 2% 1.00x ONLINE /rpool mint@mint:~$ zpool status pool: bpool state: ONLINE scan: resilvered 15.7M in 00:00:00 with 0 errors on Fri Mar 8 02:21:23 2024 config: NAME STATE READ WRITE CKSUM bpool ONLINE 0 0 0 mirror-0 ONLINE 0 0 0 nvme0n1p3 ONLINE 0 0 0 nvme1n1p3 ONLINE 0 0 0 errors: No known data errors pool: rpool state: ONLINE scan: resilvered 10.5G in 00:00:13 with 0 errors on Sun Mar 3 00:59:56 2024 config: NAME STATE READ WRITE CKSUM rpool ONLINE 0 0 0 mirror-0 ONLINE 0 0 0 nvme0n1p4 ONLINE 0 0 0 nvme1n1p4 ONLINE 0 0 0 errors: No known data errors
-
Code:
mint@mint:~$ zfs list NAME USED AVAIL REFER MOUNTPOINT bpool 648M 1.12G 96K /bpool/boot bpool/BOOT 643M 1.12G 96K none bpool/BOOT/ubuntu_uddl28 643M 1.12G 139M /bpool/boot rpool 19.2G 872G 96K /rpool rpool/ROOT 14.9G 872G 96K none rpool/ROOT/ubuntu_uddl28 14.9G 872G 9.16G /rpool rpool/ROOT/ubuntu_uddl28/srv 96K 872G 96K /rpool/srv rpool/ROOT/ubuntu_uddl28/usr 1.09M 872G 96K /rpool/usr rpool/ROOT/ubuntu_uddl28/usr/local 1M 872G 504K /rpool/usr/local rpool/ROOT/ubuntu_uddl28/var 3.43G 872G 96K /rpool/var rpool/ROOT/ubuntu_uddl28/var/games 96K 872G 96K /rpool/var/games rpool/ROOT/ubuntu_uddl28/var/lib 3.40G 872G 3.25G /rpool/var/lib rpool/ROOT/ubuntu_uddl28/var/lib/AccountsService 100K 872G 100K /rpool/var/lib/AccountsService rpool/ROOT/ubuntu_uddl28/var/lib/NetworkManager 156K 872G 156K /rpool/var/lib/NetworkManager rpool/ROOT/ubuntu_uddl28/var/lib/apt 96.1M 872G 96.1M /rpool/var/lib/apt rpool/ROOT/ubuntu_uddl28/var/lib/dpkg 58.7M 872G 58.7M /rpool/var/lib/dpkg rpool/ROOT/ubuntu_uddl28/var/log 27.3M 872G 27.3M /rpool/var/log rpool/ROOT/ubuntu_uddl28/var/mail 96K 872G 96K /rpool/var/mail rpool/ROOT/ubuntu_uddl28/var/snap 96K 872G 96K /rpool/var/snap rpool/ROOT/ubuntu_uddl28/var/spool 132K 872G 132K /rpool/var/spool rpool/ROOT/ubuntu_uddl28/var/www 96K 872G 96K /rpool/var/www rpool/USERDATA 4.27G 872G 96K /rpool rpool/USERDATA/user_xxx111 4.27G 872G 1.91G /rpool/home/user rpool/USERDATA/root_tpy969 292K 872G 196K /rpool/root
-
- Hard disk controllers: Not sure, chipset is Intel B760
- Network cards: Realtek RTL8125B