I'm getting the following alert about weekly:
'Device /dev/gptid/<guid> is causing slow I/O on pool Datapool'
Just to be clear I am an absolute layman with this stuff, I just built a pc and put TrueNAS on it (sorry i know).
The alert will clear after a few hours then happen again after an hour then clear then happen again after an hour then clear for about a week. I've been having issues with my Plex streams where it will start buffering constantly (also when the alert is cleared) and I can't find anything else that could be causing that. It's only on disk ADA01. I'm running the following config:
CPU: Intel Core i5-11400
MOBO: Asrock B560M Steel Legend consumer board (I know)
RAM: 32Gb DDR4 3200MT/s
BOOT: 128Gb NVMe SSD
HDD: 5x Seagate 4TB Exos 7E8 ST4000NM000A (=CMR) in RAIDZ2
ARC: 500Gb Samsung 850 Evo
NIC: 10GbE Solarflare SFN6122F Dual-Port
My main purpose for this system is to use it as a Plex server and recently also as backup for other devices using SyncThing.
My main issue is that the error is too vague for a laymen. What am I supposed to do now? The drive works and is only three years old so I don't feel like replacing it yet as I'm not sure it's needed and don't feel like putting strain on the pool. Should I just leave it? Is the drive going to fail? Should I replace it? A tiny bit more info would be nice so I'm steered in the right direction.
My detailed issues with my plex streams are that I can be streaming 50mbit/s content that is not being transcoded (cpu=1%, no gpu) on a 1Gbit/s wired connection and have continuous buffering while this did not happen before, being able to handle even 150-200mbit/s content. Does not matter which high bitrate content is played or on which device, playback is horrible. Note: there are no other streams running in the background. Ever.
I've been looking on forums trying to find what I should do now but haven't found any answers. I did run some tests i saw on forums to try to see what was happening:
and
My gut says either of the following might also have something to do with it:
1. I have recently upgraded from 16Gb to 32Gb of RAM, which is plenty for me. Around this time the issues started but I can't remember if it was before or after.
2. I only recently started using the SyncThing plugin to backup data regularly to the system. I know this can use a lot of IO, but I can't begin to understand why this would also cause issues if the syncing devices are turned off.
Where do I go from here? What do I measure? Test? Attempt? I have no clue. Gratefull for any help and sorry if my issue is too vague, I simply don't know any of this stuff.
'Device /dev/gptid/<guid> is causing slow I/O on pool Datapool'
Just to be clear I am an absolute layman with this stuff, I just built a pc and put TrueNAS on it (sorry i know).
The alert will clear after a few hours then happen again after an hour then clear then happen again after an hour then clear for about a week. I've been having issues with my Plex streams where it will start buffering constantly (also when the alert is cleared) and I can't find anything else that could be causing that. It's only on disk ADA01. I'm running the following config:
CPU: Intel Core i5-11400
MOBO: Asrock B560M Steel Legend consumer board (I know)
RAM: 32Gb DDR4 3200MT/s
BOOT: 128Gb NVMe SSD
HDD: 5x Seagate 4TB Exos 7E8 ST4000NM000A (=CMR) in RAIDZ2
ARC: 500Gb Samsung 850 Evo
NIC: 10GbE Solarflare SFN6122F Dual-Port
My main purpose for this system is to use it as a Plex server and recently also as backup for other devices using SyncThing.
My main issue is that the error is too vague for a laymen. What am I supposed to do now? The drive works and is only three years old so I don't feel like replacing it yet as I'm not sure it's needed and don't feel like putting strain on the pool. Should I just leave it? Is the drive going to fail? Should I replace it? A tiny bit more info would be nice so I'm steered in the right direction.
My detailed issues with my plex streams are that I can be streaming 50mbit/s content that is not being transcoded (cpu=1%, no gpu) on a 1Gbit/s wired connection and have continuous buffering while this did not happen before, being able to handle even 150-200mbit/s content. Does not matter which high bitrate content is played or on which device, playback is horrible. Note: there are no other streams running in the background. Ever.
I've been looking on forums trying to find what I should do now but haven't found any answers. I did run some tests i saw on forums to try to see what was happening:
Code:
root@TrueNAS[~]# zpool status pool: Datapool state: ONLINE status: Some supported and requested features are not enabled on the pool. The pool can still be used, but some features are unavailable. action: Enable all features using 'zpool upgrade'. Once this is done, the pool may no longer be accessible by software that does not support the features. See zpool-features(7) for details. scan: scrub repaired 0B in 1 days 16:28:42 with 0 errors on Mon Jan 29 16:34:58 2024 config: NAME STATE READ WRITE CKSUM Datapool ONLINE 0 0 0 raidz2-0 ONLINE 0 0 0 gptid/372c160f-97de-11eb-acfd-a8a1596f8c29 ONLINE 0 0 0 gptid/37e90584-97de-11eb-acfd-a8a1596f8c29 ONLINE 0 0 0 gptid/37e2437d-97de-11eb-acfd-a8a1596f8c29 ONLINE 0 0 0 gptid/37f1d792-97de-11eb-acfd-a8a1596f8c29 ONLINE 0 0 0 gptid/3805bf00-97de-11eb-acfd-a8a1596f8c29 ONLINE 0 0 0 cache gptid/f7ece5ec-9e03-11eb-abc5-a8a1596f8c29 ONLINE 0 0 0 errors: No known data errors pool: boot-pool state: ONLINE status: Some supported and requested features are not enabled on the pool. The pool can still be used, but some features are unavailable. action: Enable all features using 'zpool upgrade'. Once this is done, the pool may no longer be accessible by software that does not support the features. See zpool-features(7) for details. scan: scrub repaired 0B in 00:00:10 with 0 errors on Sat Feb 3 03:45:10 2024 config: NAME STATE READ WRITE CKSUM boot-pool ONLINE 0 0 0 ada0p2 ONLINE 0 0 0
and
Code:
root@TrueNAS[~]# fio --ramp_time=5 --gtod_reduce=1 --numjobs=1 --bs=1M --size=100G --runtime=60s --readwrite=write --name=testfile testfile: (g=0): rw=write, bs=(R) 1024KiB-1024KiB, (W) 1024KiB-1024KiB, (T) 1024KiB-1024KiB, ioengine=psync, iodepth=1 fio-3.28 Starting 1 process testfile: Laying out IO file (1 file / 102400MiB) Jobs: 1 (f=1): [W(1)][8.1%][w=34.5MiB/s][w=34 IOPS][eta 12m:24s] testfile: (groupid=0, jobs=1): err= 0: pid=5084: Wed Feb 7 23:58:59 2024 write: IOPS=128, BW=128MiB/s (134MB/s)(7763MiB/60525msec); 0 zone resets bw ( KiB/s): min=33801, max=703875, per=100.00%, avg=131876.66, stdev=157535.72, samples=119 iops : min= 33, max= 687, avg=128.30, stdev=153.94, samples=119 cpu : usr=0.30%, sys=3.24%, ctx=38936, majf=0, minf=1 IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% issued rwts : total=0, 7763, 0, 0 short = 0, 0, 0, 0 dropped = 0, 0, 0, 0 latency : target = 0, window = 0, percentile = 100.00%, depth = 1 Run status group 0 (all jobs) : WRITE: bw = 128MiB/s (134MB/s), 128MiB/s-128MiB/s (134MB/s-134MB/s), io=7763MiB (8140MB), run=60525-60525msec
My gut says either of the following might also have something to do with it:
1. I have recently upgraded from 16Gb to 32Gb of RAM, which is plenty for me. Around this time the issues started but I can't remember if it was before or after.
2. I only recently started using the SyncThing plugin to backup data regularly to the system. I know this can use a lot of IO, but I can't begin to understand why this would also cause issues if the syncing devices are turned off.
Where do I go from here? What do I measure? Test? Attempt? I have no clue. Gratefull for any help and sorry if my issue is too vague, I simply don't know any of this stuff.