I have replication running over a VPN from one strong FreeNAS box at site A to another strong FreeNAS box at site B, 1000 miles away. I have more than 50 Mbps at each end. The VPN can and does handle 20-50 Mbps when I send test traffic. At those rates, the bottleneck is CPU utilization on the site B router. Latency inside the VPN is about 100 ms. I don't normally need to run much traffic across it. This is a purely personal setup. Once synced up, there will only be occasional new files: a new photo now and then (I'm not a photographer) and a few documents that change every few days to weeks.
At one time, when the source FreeNAS box was a FreeNAS-mini, replication was using about 6 Mbps, which was the artificial rate limit I had put on it with QoS on the router. I was getting a lot of failed sync task messages, probably because it was starting from scratch and needed to move about 100 GB. I didn't have the cycles to troubleshoot, so I just turned the sync process off.
After the FreeNAS-mini's main board died, I moved the disks to a duplicate of the FreeNAS box at site B. Both are 2009-model 2U Dell servers, with RAIDZ on 4 2TB WD Red drives at site A and 6 2TB Dell SAS drives at site B; the difference is due to the drives coming from the mini. I started up the sync process, and in two weeks I've gotten about 6 GB of datasets synced to site B.
OS Version: FreeNAS-11.2-U8 (Build Date: Feb 14, 2020 15:55)
Processor: Intel(R) Xeon(R) CPU X5660 @ 2.80GHz (24 cores)
Memory: 128 GiB
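To put those numbers in perspective, here is a rough sketch of the arithmetic. It assumes decimal units (1 GB = 10^9 bytes) and a perfectly sustained rate, and the function names are just for illustration:

```python
# Back-of-the-envelope numbers from this post.
# Assumptions: decimal units (1 GB = 1e9 bytes) and a perfectly sustained rate.
SECONDS_PER_DAY = 86_400


def average_mbps(gigabytes: float, days: float) -> float:
    """Average throughput in Mbps for moving `gigabytes` over `days`."""
    bits = gigabytes * 1e9 * 8
    return bits / (days * SECONDS_PER_DAY) / 1e6


def transfer_hours(gigabytes: float, mbps: float) -> float:
    """Hours needed to move `gigabytes` at a sustained `mbps`."""
    bits = gigabytes * 1e9 * 8
    return bits / (mbps * 1e6) / 3600


# What I actually got: about 6 GB in two weeks.
observed = average_mbps(6, 14)
# What the old 6 Mbps QoS cap would allow for the full ~100 GB.
hours_at_cap = transfer_hours(100, 6)

print(f"observed average: {observed:.3f} Mbps")
print(f"100 GB at 6 Mbps: {hours_at_cap:.1f} hours")
```

That works out to roughly 0.04 Mbps averaged over the two weeks, versus about 37 hours for the whole 100 GB at the old 6 Mbps cap, which suggests raw link speed is not what is holding this up.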
As far as I can tell, the sync and SSH processes are not CPU bound.
The replication process is set to replicate the entire pool. That's probably a bad idea. I'm still learning how to deal with multiple FreeNAS boxes.
It's set for recursive replication, with LZ4 compression, default encryption, and SSH public-key auth. The replication window begins at 00:00:00 and ends at 23:59:00.
What do I need to look at to further troubleshoot this?
This is an example of the failed sync messages:
Hello,
The replication failed for the local ZFS Pool1/iocage/releases/11.2-RELEASE/root while attempting to
apply incremental send of snapshot auto-20200419.0115-2m -> auto-20200420.0000-2w to 192.168.128.33