I'm just looking for some clarification on a replication task I have. I've set up my jails to have a 'fixed' data set, so that if I nuke a jail, the data for it remains intact (ie. my plex metadata), My jails are on a mirrored pool using SSDs. I was then rsyncing the 'data' dataset to my main spinner pool (tank) for redundancy purposes.
So, my structure is as follows
The 'data' dataset is 3.8GB in the GUI, the 'data' dataset on jails-backup is 5.9GB. I could not get the two amounts to match, even using the sparse files and hardlinks switches with rsync, so I instead moved to replication (which seems quicker and easier anyway).
Nonetheless, there is still a difference in size. I did an md5 check on the two plex directories, both of which came back with the same hash. I've also checked blocksize (128k) and ashift (12) which is the same on all datasets (except boot, where ashift=9).
I cannot figure out why the replicated dataset is 1.1GB bigger than the source. Has anyone run into this before? It's not a massive space issue at the moment, but I'm planning on having a large amount of data on the jails dataset shortly and will then be more concerned about preserving space.
Any ideas?
So, my structure is as follows
/mnt/tank/jails-backup/data
/mnt/jails/data
[root@freenas] ~# zpool list
NAME SIZE ALLOC FREE EXPANDSZ FRAG CAP DEDUP HEALTH ALTROOT
freenas-boot 111G 1.70G 109G - - 1% 1.00x ONLINE -
jails 254G 7.26G 247G - 7% 2% 1.00x ONLINE /mnt
scratch 3.62T 3.02T 621G - 51% 83% 1.00x ONLINE /mnt
tank 43.5T 26.4T 17.1T - 11% 60% 1.00x ONLINE /mnt
The 'data' dataset is 3.8GB in the GUI, the 'data' dataset on jails-backup is 5.9GB. I could not get the two amounts to match, even using the sparse files and hardlinks switches with rsync, so I instead moved to replication (which seems quicker and easier anyway).
Nonetheless, there is still a difference in size. I did an md5 check on the two plex directories, both of which came back with the same hash. I've also checked blocksize (128k) and ashift (12) which is the same on all datasets (except boot, where ashift=9).
I cannot figure out why the replicated dataset is 1.1GB bigger than the source. Has anyone run into this before? It's not a massive space issue at the moment, but I'm planning on having a large amount of data on the jails dataset shortly and will then be more concerned about preserving space.
Any ideas?