When accessing a share on my FreeNAS 8.0.0 box, the transfer will pause for 15-30 seconds, sometimes more than once. This happens via iSCSI and CIFS. While this is happening everything seems to be frozen including SSH login. However if I have an existing session, it works until I try to access the file system. ("touch some.file" works, but "cat some.file" will freeze the session until everything unpauses)
While watching "top" during an iSCSI transfer the cpu usage is around 25% when it's working, and 0% when paused.
Same for CIFS. All in all It seems like a file system issue to me.
I ran zpool status, and saw that I had a large number of write errors on one of my drives. I zpool cleared them, and then did a zpool scrub. A follow up zpool status after the scrub quoted me an estimated completion time of 400 hours and rising. (this is while the pausing is still happening intermittently). I put my desktop in sleep mode and went to bed. (effectively stopping all share traffic) This morning I ran zpool status and it said that it finished in just under one hour (around the same amount of time a full drive backup takes). There were no errors after the scrub, but it did have to resliver about 250MB.
I tried upgrading to 8.0.1 b4 (not realizing that RC1 was out) and this did not fix the problem either.
So long story short. iSCSI and CIFS transfers and directory listings can cause the pauses. The pauses block all disk IO (as best as I can tell), zpool scrub does not trigger the pauses.
Where do I proceed from here?
While watching "top" during an iSCSI transfer the cpu usage is around 25% when it's working, and 0% when paused.
Same for CIFS. All in all It seems like a file system issue to me.
I ran zpool status, and saw that I had a large number of write errors on one of my drives. I zpool cleared them, and then did a zpool scrub. A follow up zpool status after the scrub quoted me an estimated completion time of 400 hours and rising. (this is while the pausing is still happening intermittently). I put my desktop in sleep mode and went to bed. (effectively stopping all share traffic) This morning I ran zpool status and it said that it finished in just under one hour (around the same amount of time a full drive backup takes). There were no errors after the scrub, but it did have to resliver about 250MB.
I tried upgrading to 8.0.1 b4 (not realizing that RC1 was out) and this did not fix the problem either.
So long story short. iSCSI and CIFS transfers and directory listings can cause the pauses. The pauses block all disk IO (as best as I can tell), zpool scrub does not trigger the pauses.
Where do I proceed from here?