We're seeing frequent but intermittent failures to replicate between two arrays at distant sites. After the initial replication failure the retry always works. The errors include (but may not be limited to):
Are there any general debugging tips people are willing to offer to help resolve this?
- Failed: ssh: connect to host {ipaddr} port 22: Operation timed out.
- Connection to {ipaddr} closed by remote host. warning: cannot send 'pool/dataset@auto-2021-04-16_00-00': signal received..
- The replication failed for the local ZFS pool/dataset/dataset1 while attempting to apply incremental send of snapshot auto-20210415.0000-4w -> auto-20210416.0000-4w to {ipaddr}
Are there any general debugging tips people are willing to offer to help resolve this?