MalVeauX
Contributor
- Joined
- Aug 6, 2020
- Messages
- 110
Hrm,
I'm testing my ECC RAM at the moment. I've been running Memtest86 for a while now just to let it do its thing. So far I'm on pass #5 after 18+ hours or so, and it reports 0 (zero) errors. I had to run an older version (V4) because it's an old board (X8SIL-F) that doesn't support UEFI booting.
However, while poking around the motherboard's event log via IPMI, I found an event alert by chance:
I tried looking this up, and as far as I can tell, this is a real error that was not corrected by ECC. So my understanding is that this would have resulted in corrupted or lost data if that data were not already backed up, correct?
I'm confused about how Memtest didn't report an error, but the IPMI log shows an error (not corrected by ECC)?
DIMM4B: does that mean it's the DIMM 4 slot on the board, to help identify which stick it is?
How would ZFS handle this on a mirror pool of data? Would it have caught the mismatched checksum and healed it, or would this have been a total bust where the data was corrupted in RAM and then written out corrupted? I'm not sure how serious this is. It looks like I need different RAM though, and that worries me.
This server is going to be housing media (our family pictures, movies, etc.) to serve to a few client machines in the house. The data will be on mirrors, no parity stuff, strictly 1:1 mirrors. I'm not sure if the above means I shouldn't put data on this server yet, and whether I should figure out this error, replace it entirely, or what.
Thoughts?
Very best,