Thanks for reading my post. To start with. 
My system
SuperMicro with 2x Xeon(R) CPU E5-2640 v3 and 96GB RAM running ESXi 7.0.
VM with 8 cores, 32GB RAM (locked), SAS2008 PCI-Express Fusion-MPT SAS-2 HBA card passed through
8 x Toshiba N300 4TB, 2x RAIDZ1 Vdevs with 4 X HDD per Vdev. 75% capacity used.
My issue
It has been running for over a year but now the write performance is very slow and appears to be the huge bottleneck. I have run the following Code and it would appear that the Pool performance is the bottleneck. The pool is set to Standard Sync, and LZ4 compression.
I have a basic-intermediate knowledge at best and am really keen to learn further and looking for some wisdom from the community. Is there any extra troubleshooting tips or next steps?
Many thanks,
Dan.
My system
SuperMicro with 2x Xeon(R) CPU E5-2640 v3 and 96GB RAM running ESXi 7.0.
VM with 8 cores, 32GB RAM (locked), SAS2008 PCI-Express Fusion-MPT SAS-2 HBA card passed through
8 x Toshiba N300 4TB, 2x RAIDZ1 Vdevs with 4 X HDD per Vdev. 75% capacity used.
My issue
It has been running for over a year but now the write performance is very slow and appears to be the huge bottleneck. I have run the following Code and it would appear that the Pool performance is the bottleneck. The pool is set to Standard Sync, and LZ4 compression.
Code:
fio --name=test --size=5g --rw=write --ioengine=posixaio --direct=1 --bs=1m fio --name=test --size=5g --rw=write --ioengine=posixaio --direct=1 --bs=1m test: (g=0): rw=write, bs=(R) 1024KiB-1024KiB, (W) 1024KiB-1024KiB, (T) 1024KiB- 1024KiB, ioengine=posixaio, iodepth=1 fio-3.27 Starting 1 process Jobs: 1 (f=1): [W(1)][98.2%][eta 00m:02s] test: (groupid=0, jobs=1): err= 0: pid=55519: Wed Sep 21 10:32:10 2022 write: IOPS=46, BW=46.8MiB/s (49.1MB/s)(5120MiB/109424msec); 0 zone resets slat (nsec): min=14626, max=80450, avg=32511.15, stdev=6826.22 clat (usec): min=271, max=4594.8k, avg=21338.39, stdev=289557.43 lat (usec): min=290, max=4594.9k, avg=21370.90, stdev=289557.10 clat percentiles (usec): | 1.00th=[ 281], 5.00th=[ 293], 10.00th=[ 306], | 20.00th=[ 330], 30.00th=[ 338], 40.00th=[ 347], | 50.00th=[ 359], 60.00th=[ 367], 70.00th=[ 375], | 80.00th=[ 396], 90.00th=[ 408], 95.00th=[ 424], | 99.00th=[ 449], 99.50th=[3170894], 99.90th=[4278191], | 99.95th=[4529849], 99.99th=[4596958] bw ( KiB/s): min=53067, max=405928, per=100.00%, avg=321392.74, stdev=92992.36, samples=31 iops : min= 51, max= 396, avg=313.39, stdev=90.92, samples=31 lat (usec) : 500=99.47% lat (msec) : >=2000=0.53% cpu : usr=0.11%, sys=0.07%, ctx=5122, majf=0, minf=1 IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% issued rwts: total=0,5120,0,0 short=0,0,0,0 dropped=0,0,0,0 latency : target=0, window=0, percentile=100.00%, depth=1 Run status group 0 (all jobs): WRITE: bw=46.8MiB/s (49.1MB/s), 46.8MiB/s-46.8MiB/s (49.1MB/s-49.1MB/s), io=5120MiB (5369MB), run=109424-109424msec root@truenas[~]#
I have a basic-intermediate knowledge at best and am really keen to learn further and looking for some wisdom from the community. Is there any extra troubleshooting tips or next steps?
Many thanks,
Dan.