I would appreciate some advice on how to incorporate Truenas capabilities into my environment. I'm a newbie and waiting for a server to arrive to start experimenting with Truenas.
What I am hoping for is that Truenas will be able to give me:
- rollback on files
- resiliency of SSD/disk failures
- automate backup to offsite
without killing my file access performance
My hobby application (deep learning) has a few operating modes.
Mode 1: training: Ubuntu app reads about 1 TB of data across a million files as fast as possible continuously for a week, feeding them to multiple GPUs for training where GPU load is 100% for the week and room gets nice and hot. Files on nvme SSD helps performance. So does storing the most frequently access files to a large part of my 256G memory.
Mode 3: MySQL operation nvme SSD makes database operation nice and fast, not sure if it's necessary.
My thoughts are (but I have zero Truenas experience):
Question: How would a Truenas expert set this up?
What I am hoping for is that Truenas will be able to give me:
- rollback on files
- resiliency of SSD/disk failures
- automate backup to offsite
without killing my file access performance
My hobby application (deep learning) has a few operating modes.
Mode 1: training: Ubuntu app reads about 1 TB of data across a million files as fast as possible continuously for a week, feeding them to multiple GPUs for training where GPU load is 100% for the week and room gets nice and hot. Files on nvme SSD helps performance. So does storing the most frequently access files to a large part of my 256G memory.
- Deficiency #1: snapshots of this data is only 5, based on external devices and google drive backup. Would love more snapshots. Would love to have these snapshots available for perusing online. Right now its pull a disk from my safe, plug it in, look at the files, get next disk...
- Deficiency #2: dropbox and google drive is painfully slow and accomplished with sketchy apps. On the plus side, only modified files are uploaded.
Mode 3: MySQL operation nvme SSD makes database operation nice and fast, not sure if it's necessary.
- Deficiency is the failure of nvme would mean data loss up to the last dump.
- Would love a resilient volume. Not sure nvme necessary for SQL storage.
My thoughts are (but I have zero Truenas experience):
- mode 2, put 1st nvme on truenas and mirror to hard disk. Change mode 2 (file creation) to use the Truenas hosted data over the network.
- mode 1, have Truenas dump data to a 2nd nvme for training so it can directly use the 2nd nvme. Training only needs a read only view of this data.
- mode 3, move SQL disk to 1st nvme and access over the lan. SQL dumps onto a disk-based volume over the lan.
- proxmox truenas on a VM on same server as Mode 1,2,3 where truenas mirrors across 2 nvme's. I am assuming virtual lans between both VM will be infinite bw and not latency.
- An external truenas based on hard disks mirrors or backups the VM based Truenas.
- External truenas does dumping to offsite.
Question: How would a Truenas expert set this up?