- Joined
- Feb 18, 2014
- Messages
- 2,925
In the last couple of weeks my 11.0-U4, 32GB, FreeNAS Mini (mainboard replaced a few months ago under warranty by iXsystems after failing) has reported 4 instances of memory error:
The last two were yesterday and today - in between them I reseated the 4 memory modules, generally checked the inside of the box, and mentally prepared for the next phase if there was a repeat event. So, here we are.
Dmidecode results today:
The ASRock manual doesn't have a memory map and my searching so far has not turned one up, so I'm not encouraged that the information above is very helpful (if at all) in physically locating the problem - assuming it is with a memory module, in which of the four slots it resides? A general Google search didn't give me much hope either. I'd be delighted if anyone can tell me how I can use that information or anything else that we can get from a functional (if flakey) FreeNAS install to nail down the ailing component(s).
I'm gearing myself up to pull the four drives and install them in spare bays in my Dell backup box - more on that in my other post in Storage - then run Memtest86+ on the box to see if I can find a bad module. I'm uncertain right now about the testing protocol - if I do all four simultaneously will Memtest allow me to identify a culprit or must I test each one alone? If anyone has any pointers on that topic I'd be delighted to have them (I only ever tested "good" memory before - no errors found).
In fact, any comments will be welcome - this is new territory for me.
Code:
MCA: Bank 5, Status 0x9400004000910091 MCA: Global Cap 0x0000000000000806, Status 0x0000000000000000 MCA: Vendor "GenuineIntel", ID 0x406d8, APIC ID 0 MCA: CPU 0 COR RD channel 1 memory error MCA: Address 0x818e45508 MCA: Bank 5, Status 0x9400004000910091 MCA: Global Cap 0x0000000000000806, Status 0x0000000000000000 MCA: Vendor "GenuineIntel", ID 0x406d8, APIC ID 0 MCA: CPU 0 COR RD channel 1 memory error MCA: Address 0x51bd011c0 MCA: Bank 5, Status 0x9400004000910091 MCA: Global Cap 0x0000000000000806, Status 0x0000000000000000 MCA: Vendor "GenuineIntel", ID 0x406d8, APIC ID 0 MCA: CPU 0 COR RD channel 1 memory error MCA: Address 0x80ae56130 MCA: Bank 5, Status 0x9400004000910091 MCA: Global Cap 0x0000000000000806, Status 0x0000000000000000 MCA: Vendor "GenuineIntel", ID 0x406d8, APIC ID 0 MCA: CPU 0 COR RD channel 1 memory error MCA: Address 0x4ec722100
The last two were yesterday and today - in between them I reseated the 4 memory modules, generally checked the inside of the box, and mentally prepared for the next phase if there was a repeat event. So, here we are.
Dmidecode results today:
Code:
root@freenasmini:~ # dmidecode # dmidecode 3.0 Scanning /dev/mem for entry point. SMBIOS 2.8 present. 25 structures occupying 1495 bytes. Table at 0xCF527000. Handle 0x0000, DMI type 0, 24 bytes BIOS Information Vendor: American Megatrends Inc. Version: P2.90 Release Date: 01/26/2016 Address: 0xF0000 Runtime Size: 64 kB ROM Size: 8192 kB Characteristics: PCI is supported BIOS is upgradeable BIOS shadowing is allowed Boot from CD is supported Selectable boot is supported BIOS ROM is socketed EDD is supported 5.25"/1.2 MB floppy services are supported (int 13h) 3.5"/720 kB floppy services are supported (int 13h) 3.5"/2.88 MB floppy services are supported (int 13h) Print screen service is supported (int 5h) 8042 keyboard services are supported (int 9h) Serial services are supported (int 14h) Printer services are supported (int 17h) ACPI is supported USB legacy is supported BIOS boot specification is supported Targeted content distribution is supported UEFI is supported BIOS Revision: 5.6 Handle 0x0001, DMI type 1, 27 bytes System Information Manufacturer: iXsystems Product Name: FREENAS-MINI-2.0 Version: To Be Filled By O.E.M. Serial Number: A1-37201 UUID: 03000200-0400-0500-0006-000700080009 Wake-up Type: Power Switch SKU Number: To Be Filled By O.E.M. Family: To Be Filled By O.E.M. Handle 0x0002, DMI type 2, 15 bytes Base Board Information Manufacturer: ASRock Product Name: C2750D4I Version: 1.02 Serial Number: 73S0X39I0073 Asset Tag: Features: Board is a hosting board Board is replaceable Location In Chassis: Chassis Handle: 0x0003 Type: Motherboard Contained Object Handles: 0 Handle 0x0003, DMI type 3, 25 bytes Chassis Information Manufacturer: To Be Filled By O.E.M. Type: Desktop Lock: Not Present Version: To Be Filled By O.E.M. Serial Number: To Be Filled By O.E.M. Asset Tag: To Be Filled By O.E.M. Boot-up State: Safe Power Supply State: Safe Thermal State: Safe Security Status: None OEM Information: 0x00000000 Height: Unspecified Number Of Power Cords: 1 Contained Elements: 1 Power Supply (1) SKU Number: To Be Filled By O.E.M. Handle 0x0008, DMI type 9, 17 bytes System Slot Information Designation: PCIE1 Type: x8 PCI Express Current Usage: In Use Length: Long ID: 17 Characteristics: 3.3 V is provided Opening is shared PME signal is supported Handle 0x0009, DMI type 11, 5 bytes OEM Strings String 1: To Be Filled By O.E.M. Handle 0x0014, DMI type 32, 20 bytes System Boot Information Status: No errors detected Handle 0x0015, DMI type 41, 11 bytes Onboard Device Reference Designation: Onboard IGD Type: Video Status: Enabled Type Instance: 1 Bus Address: 0000:00:02.0 Handle 0x0016, DMI type 41, 11 bytes Onboard Device Reference Designation: Onboard LAN Type: Ethernet Status: Enabled Type Instance: 1 Bus Address: 0000:00:19.0 Handle 0x0017, DMI type 41, 11 bytes Onboard Device Reference Designation: Onboard 1394 Type: Other Status: Enabled Type Instance: 1 Bus Address: 0000:03:1c.2 Handle 0x0018, DMI type 7, 19 bytes Cache Information Socket Designation: L1-Cache Configuration: Enabled, Not Socketed, Level 1 Operational Mode: Write Back Location: Internal Installed Size: 448 kB Maximum Size: 448 kB Supported SRAM Types: Synchronous Installed SRAM Type: Synchronous Speed: Unknown Error Correction Type: Single-bit ECC System Type: Instruction Associativity: 8-way Set-associative Handle 0x0019, DMI type 7, 19 bytes Cache Information Socket Designation: L2-Cache Configuration: Enabled, Not Socketed, Level 2 Operational Mode: Write Back Location: Internal Installed Size: 4096 kB Maximum Size: 4096 kB Supported SRAM Types: Synchronous Installed SRAM Type: Synchronous Speed: Unknown Error Correction Type: Single-bit ECC System Type: Unified Associativity: 16-way Set-associative Handle 0x001A, DMI type 4, 42 bytes Processor Information Socket Designation: CPUSocket Type: Central Processor Family: Atom Manufacturer: Intel(R) Corporation ID: D8 06 04 00 FF FB EB BF Signature: Type 0, Family 6, Model 77, Stepping 8 Flags: FPU (Floating-point unit on-chip) VME (Virtual mode extension) DE (Debugging extension) PSE (Page size extension) TSC (Time stamp counter) MSR (Model specific registers) PAE (Physical address extension) MCE (Machine check exception) CX8 (CMPXCHG8 instruction supported) APIC (On-chip APIC hardware supported) SEP (Fast system call) MTRR (Memory type range registers) PGE (Page global enable) MCA (Machine check architecture) CMOV (Conditional move instruction supported) PAT (Page attribute table) PSE-36 (36-bit page size extension) CLFSH (CLFLUSH instruction supported) DS (Debug store) ACPI (ACPI supported) MMX (MMX technology supported) FXSR (FXSAVE and FXSTOR instructions supported) SSE (Streaming SIMD extensions) SSE2 (Streaming SIMD extensions 2) SS (Self-snoop) HTT (Multi-threading) TM (Thermal monitor supported) PBE (Pending break enabled) Version: Intel(R) Atom(TM) CPU C2750 @ 2.40GHz Voltage: 1.6 V External Clock: 100 MHz Max Speed: 2600 MHz Current Speed: 2400 MHz Status: Populated, Enabled Upgrade: Other L1 Cache Handle: 0x0018 L2 Cache Handle: 0x0019 L3 Cache Handle: Not Provided Serial Number: Not Specified Asset Tag: ProcessorInfo_ASSET_TAG Part Number: Not Specified Core Count: 8 Core Enabled: 8 Thread Count: 8 Characteristics: 64-bit capable Handle 0x001D, DMI type 15, 73 bytes System Event Log Area Length: 65535 bytes Header Start Offset: 0x0000 Header Length: 16 bytes Data Start Offset: 0x0010 Access Method: Memory-mapped physical 32-bit address Access Address: 0xFFA21000 Status: Valid, Not Full Change Token: 0x000000DC Header Format: Type 1 Supported Log Type Descriptors: 25 Descriptor 1: Single-bit ECC memory error Data Format 1: Multiple-event handle Descriptor 2: Multi-bit ECC memory error Data Format 2: Multiple-event handle Descriptor 3: Parity memory error Data Format 3: None Descriptor 4: Bus timeout Data Format 4: None Descriptor 5: I/O channel block Data Format 5: None Descriptor 6: Software NMI Data Format 6: None Descriptor 7: POST memory resize Data Format 7: None Descriptor 8: POST error Data Format 8: POST results bitmap Descriptor 9: PCI parity error Data Format 9: Multiple-event handle Descriptor 10: PCI system error Data Format 10: Multiple-event handle Descriptor 11: CPU failure Data Format 11: None Descriptor 12: EISA failsafe timer timeout Data Format 12: None Descriptor 13: Correctable memory log disabled Data Format 13: None Descriptor 14: Logging disabled Data Format 14: None Descriptor 15: System limit exceeded Data Format 15: None Descriptor 16: Asynchronous hardware timer expired Data Format 16: None Descriptor 17: System configuration information Data Format 17: None Descriptor 18: Hard disk information Data Format 18: None Descriptor 19: System reconfigured Data Format 19: None Descriptor 20: Uncorrectable CPU-complex error Data Format 20: None Descriptor 21: Log area reset/cleared Data Format 21: None Descriptor 22: System boot Data Format 22: None Descriptor 23: End of log Data Format 23: None Descriptor 24: OEM-specific Data Format 24: OEM-specific Descriptor 25: OEM-specific Data Format 25: OEM-specific Handle 0x001E, DMI type 16, 23 bytes Physical Memory Array Location: System Board Or Motherboard Use: System Memory Error Correction Type: Single-bit ECC Maximum Capacity: 64 GB Error Information Handle: Not Provided Number Of Devices: 4 Handle 0x001F, DMI type 19, 31 bytes Memory Array Mapped Address Starting Address: 0x00000000000 Ending Address: 0x007FFFFFFFF Range Size: 32 GB Physical Array Handle: 0x001E Partition Width: 1 Handle 0x0020, DMI type 17, 34 bytes Memory Device Array Handle: 0x001E Error Information Handle: Not Provided Total Width: 64 bits Data Width: 64 bits Size: 8192 MB Form Factor: DIMM Set: None Locator: DIMM0 Bank Locator: BANK 0 Type: DDR3 Type Detail: Synchronous Unbuffered (Unregistered) Speed: 1600 MHz Manufacturer: Samsung Serial Number: 20716568 Asset Tag: BANK 0 DIMM0 AssetTag Part Number: M391B1G73QH0-YK0 Rank: 2 Configured Clock Speed: 1600 MHz Handle 0x0021, DMI type 20, 35 bytes Memory Device Mapped Address Starting Address: 0x00000000000 Ending Address: 0x001FFFFFFFF Range Size: 8 GB Physical Device Handle: 0x0020 Memory Array Mapped Address Handle: 0x001F Partition Row Position: 1 Handle 0x0022, DMI type 17, 34 bytes Memory Device Array Handle: 0x001E Error Information Handle: Not Provided Total Width: 64 bits Data Width: 64 bits Size: 8192 MB Form Factor: DIMM Set: None Locator: DIMM0 Bank Locator: BANK 1 Type: DDR3 Type Detail: Synchronous Unbuffered (Unregistered) Speed: 1600 MHz Manufacturer: Samsung Serial Number: 20805821 Asset Tag: BANK 1 DIMM0 AssetTag Part Number: M391B1G73QH0-YK0 Rank: 2 Configured Clock Speed: 1600 MHz Handle 0x0023, DMI type 20, 35 bytes Memory Device Mapped Address Starting Address: 0x00200000000 Ending Address: 0x003FFFFFFFF Range Size: 8 GB Physical Device Handle: 0x0022 Memory Array Mapped Address Handle: 0x001F Partition Row Position: 1 Handle 0x0024, DMI type 17, 34 bytes Memory Device Array Handle: 0x001E Error Information Handle: Not Provided Total Width: 64 bits Data Width: 64 bits Size: 8192 MB Form Factor: DIMM Set: None Locator: DIMM1 Bank Locator: BANK 0 Type: DDR3 Type Detail: Synchronous Unbuffered (Unregistered) Speed: 1600 MHz Manufacturer: Samsung Serial Number: 20716567 Asset Tag: BANK 0 DIMM1 AssetTag Part Number: M391B1G73QH0-YK0 Rank: 2 Configured Clock Speed: 1600 MHz Handle 0x0025, DMI type 20, 35 bytes Memory Device Mapped Address Starting Address: 0x00400000000 Ending Address: 0x005FFFFFFFF Range Size: 8 GB Physical Device Handle: 0x0024 Memory Array Mapped Address Handle: 0x001F Partition Row Position: 1 Handle 0x0026, DMI type 17, 34 bytes Memory Device Array Handle: 0x001E Error Information Handle: Not Provided Total Width: 64 bits Data Width: 64 bits Size: 8192 MB Form Factor: DIMM Set: None Locator: DIMM1 Bank Locator: BANK 1 Type: DDR3 Type Detail: Synchronous Unbuffered (Unregistered) Speed: 1600 MHz Manufacturer: Samsung Serial Number: 20805822 Asset Tag: BANK 1 DIMM1 AssetTag Part Number: M391B1G73QH0-YK0 Rank: 2 Configured Clock Speed: 1600 MHz Handle 0x0027, DMI type 20, 35 bytes Memory Device Mapped Address Starting Address: 0x00600000000 Ending Address: 0x007FFFFFFFF Range Size: 8 GB Physical Device Handle: 0x0026 Memory Array Mapped Address Handle: 0x001F Partition Row Position: 1 Handle 0x0028, DMI type 127, 4 bytes End Of Table
The ASRock manual doesn't have a memory map and my searching so far has not turned one up, so I'm not encouraged that the information above is very helpful (if at all) in physically locating the problem - assuming it is with a memory module, in which of the four slots it resides? A general Google search didn't give me much hope either. I'd be delighted if anyone can tell me how I can use that information or anything else that we can get from a functional (if flakey) FreeNAS install to nail down the ailing component(s).
I'm gearing myself up to pull the four drives and install them in spare bays in my Dell backup box - more on that in my other post in Storage - then run Memtest86+ on the box to see if I can find a bad module. I'm uncertain right now about the testing protocol - if I do all four simultaneously will Memtest allow me to identify a culprit or must I test each one alone? If anyone has any pointers on that topic I'd be delighted to have them (I only ever tested "good" memory before - no errors found).
In fact, any comments will be welcome - this is new territory for me.