Hi folks,
My FreeNAS server has been throwing MCA errors that seem to be related to ECC memory. I've ordered a replacement DIMM, but I'm wondering if its possible to identify exactly which module I need to replace. My server is an ASRock C2550D4I w/ 16 gigs ECC memory and 2x5 TB & 2x3 TB WD Red NAS hard drives.
The recent errors are here. An example is:
Running dmidecode gives this data. What I believe is the relevant part is here:
So a few questions:
1) Is it likely that replacing a single DIMM will solve this problem, or can it be determined from these logs that the problem lies with the motherboard?
2) If it is indeed a single DIMM, which one should I swap out? Or do I need to just guess and check?
Thank you for any help!
My FreeNAS server has been throwing MCA errors that seem to be related to ECC memory. I've ordered a replacement DIMM, but I'm wondering if its possible to identify exactly which module I need to replace. My server is an ASRock C2550D4I w/ 16 gigs ECC memory and 2x5 TB & 2x3 TB WD Red NAS hard drives.
The recent errors are here. An example is:
Code:
Dec 1 23:28:06 freenas MCA: Global Cap 0x0000000000000806, Status 0x0000000000000000 Dec 1 23:28:06 freenas MCA: Vendor "GenuineIntel", ID 0x406d8, APIC ID 0 Dec 1 23:28:06 freenas MCA: CPU 0 COR OVER RD channel 0 memory error Dec 1 23:28:06 freenas MCA: Address 0x5bb5c258
Running dmidecode gives this data. What I believe is the relevant part is here:
Code:
Handle 0x0020, DMI type 17, 34 bytes
Memory Device
Array Handle: 0x001E
Error Information Handle: Not Provided
Total Width: 64 bits
Data Width: 64 bits
Size: 4096 MB
Form Factor: DIMM
Set: None
Locator: DIMM0
Bank Locator: BANK 0
Type: DDR3
Type Detail: Synchronous Unbuffered (Unregistered)
Speed: 1600 MT/s
Manufacturer: Micron
Serial Number: 12231724
Asset Tag: KBANK 0 DIMM0 AssetTag
Part Number: 18KSF51272AZ-1G6K
Rank: 2
Configured Memory Speed: 1600 MT/s
Handle 0x0021, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00000000000
Ending Address: 0x000FFFFFFFF
Range Size: 4 GB
Physical Device Handle: 0x0020
Memory Array Mapped Address Handle: 0x001F
Partition Row Position: 1
Handle 0x0022, DMI type 17, 34 bytes
Memory Device
Array Handle: 0x001E
Error Information Handle: Not Provided
Total Width: 64 bits
Data Width: 64 bits
Size: 4096 MB
Form Factor: DIMM
Set: None
Locator: DIMM0
Bank Locator: BANK 1
Type: DDR3
Type Detail: Synchronous Unbuffered (Unregistered)
Speed: 1600 MT/s
Manufacturer: Micron
Serial Number: 13106656
Asset Tag: KBANK 1 DIMM0 AssetTag
Part Number: 18KSF51272AZ-1G6K
Rank: 2
Configured Memory Speed: 1600 MT/s
Handle 0x0023, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00100000000
Ending Address: 0x001FFFFFFFF
Range Size: 4 GB
Physical Device Handle: 0x0022
Memory Array Mapped Address Handle: 0x001F
Partition Row Position: 1
Handle 0x0024, DMI type 17, 34 bytes
Memory Device
Array Handle: 0x001E
Error Information Handle: Not Provided
Total Width: 64 bits
Data Width: 64 bits
Size: 4096 MB
Form Factor: DIMM
Set: None
Locator: DIMM1
Bank Locator: BANK 0
Type: DDR3
Type Detail: Synchronous Unbuffered (Unregistered)
Speed: 1600 MT/s
Manufacturer: Micron
Serial Number: 12231069
Asset Tag: KBANK 0 DIMM1 AssetTag
Part Number: 18KSF51272AZ-1G6K
Rank: 2
Configured Memory Speed: 1600 MT/s
Handle 0x0025, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00200000000
Ending Address: 0x002FFFFFFFF
Range Size: 4 GB
Physical Device Handle: 0x0024
Memory Array Mapped Address Handle: 0x001F
Partition Row Position: 1
Handle 0x0026, DMI type 17, 34 bytes
Memory Device
Array Handle: 0x001E
Error Information Handle: Not Provided
Total Width: 64 bits
Data Width: 64 bits
Size: 4096 MB
Form Factor: DIMM
Set: None
Locator: DIMM1
Bank Locator: BANK 1
Type: DDR3
Type Detail: Synchronous Unbuffered (Unregistered)
Speed: 1600 MT/s
Manufacturer: Micron
Serial Number: 13106655
Asset Tag: KBANK 1 DIMM1 AssetTag
Part Number: 18KSF51272AZ-1G6K
Rank: 2
Configured Memory Speed: 1600 MT/s
Handle 0x0027, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00300000000
Ending Address: 0x003FFFFFFFF
Range Size: 4 GB
Physical Device Handle: 0x0026
Memory Array Mapped Address Handle: 0x001F
Partition Row Position: 1So a few questions:
1) Is it likely that replacing a single DIMM will solve this problem, or can it be determined from these logs that the problem lies with the motherboard?
2) If it is indeed a single DIMM, which one should I swap out? Or do I need to just guess and check?
Thank you for any help!