Hi guys,
Thanks in advance for any help you can give. I have been running my FreeNAS box quite successfully for the last 18 months or there abouts, with minor hiccups along the way, but nothing I haven't been able to figure out with a bit of googling. The crashes I have been experiencing on and off last few days has got me stumped though. As of this morning the system now won't stay up for more than 10 or 15 minutes without crashing. The system consists of a Supermicro X10SLH-F with a Xeon 1230v3 and 32GB of Supermicro certified Samsung ECC RAM. The storage was until last week 2 striped raidz2 vdevs of 6x3TB and 6x2TB drives. Over the last week I have one at a time replaced the 2TB drives with 3TB's to expand the storage which all seemed to go smoothly although maybe this was a catalyst for the crashes? The drives are attached to a M1015 (x8) card and the motherboard (x4) itself:
Anyway this is a sample of what comes up in the console footer just prior to a crash:
Trying to rule out the recent update I booted into the previous build from the bootloader but here's what I got after 13mins:
Once these messages pop up the system freezes for a few minutes and then the IPMI console starts to spit out tons of random text so fast it's unreadable. I managed a quick screen grab though here :
Also from time to time the reboot fails halfway, the system locks up and tries again but usually comes good after that, until the next time. Lol.
It appears it is all the drives attached to the M1015 that are crashing(da0 - da7) so could this be the cause? Is there anything I can do to confirm this before I try to get another one? If you need any more info please don't hesitate to ask.
Thanks again for your assistance.
Thanks in advance for any help you can give. I have been running my FreeNAS box quite successfully for the last 18 months or there abouts, with minor hiccups along the way, but nothing I haven't been able to figure out with a bit of googling. The crashes I have been experiencing on and off last few days has got me stumped though. As of this morning the system now won't stay up for more than 10 or 15 minutes without crashing. The system consists of a Supermicro X10SLH-F with a Xeon 1230v3 and 32GB of Supermicro certified Samsung ECC RAM. The storage was until last week 2 striped raidz2 vdevs of 6x3TB and 6x2TB drives. Over the last week I have one at a time replaced the 2TB drives with 3TB's to expand the storage which all seemed to go smoothly although maybe this was a catalyst for the crashes? The drives are attached to a M1015 (x8) card and the motherboard (x4) itself:
Anyway this is a sample of what comes up in the console footer just prior to a crash:
Code:
Dec 21 08:18:28 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 69 27 3b e8 00 00 08 00 length 4096 SMID 263 command timeout cm 0xfffffe0000b090f8 ccb 0xfffff80038203800 Dec 21 08:18:28 freenas (noperiph:mps0:0:4294967295:0): SMID 1 Aborting command 0xfffffe0000b090f8 Dec 21 08:18:28 freenas mps0: Sending reset from mpssas_send_abort for target ID 6 Dec 21 08:18:28 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 69 27 4d 90 00 00 08 00 length 4096 SMID 324 command timeout cm 0xfffffe0000b0df20 ccb 0xfffff8000bb28800 Dec 21 08:18:28 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 69 27 5f 60 00 00 08 00 length 4096 SMID 484 command timeout cm 0xfffffe0000b1ac20 ccb 0xfffff800344aa000 Dec 21 08:18:28 freenas (da0:mps0:0:0:0): WRITE(10). CDB: 2a 00 69 26 6b 90 00 00 40 00 length 32768 SMID 229 command timeout cm 0xfffffe0000b06568 ccb 0xfffff80034537000 Dec 21 08:18:28 freenas (noperiph:mps0:0:4294967295:0): SMID 2 Aborting command 0xfffffe0000b06568 Dec 21 08:18:28 freenas mps0: Sending reset from mpssas_send_abort for target ID 0 Dec 21 08:18:28 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 69 26 75 80 00 01 00 00 length 131072 SMID 473 command timeout cm 0xfffffe0000b19e08 ccb 0xfffff8021da6f000 Dec 21 08:18:28 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 69 26 76 80 00 01 00 00 length 131072 SMID 950 command timeout cm 0xfffffe0000b40130 ccb 0xfffff8021da6c800 Dec 21 08:18:28 freenas (da3:mps0:0:3:0): READ(10). CDB: 28 00 69 26 78 c0 00 01 00 00 length 131072 SMID 483 command timeout cm 0xfffffe0000b1aad8 ccb 0xfffff80038999000 Dec 21 08:18:28 freenas (noperiph:mps0:0:4294967295:0): SMID 3 Aborting command 0xfffffe0000b1aad8 Dec 21 08:18:28 freenas mps0: Sending reset from mpssas_send_abort for target ID 3 Dec 21 08:18:28 freenas (da5:mps0:0:7:0): READ(10). CDB: 28 00 69 26 77 c0 00 01 00 00 length 131072 SMID 490 command timeout cm 0xfffffe0000b1b3d0 ccb 0xfffff80038221000 Dec 21 08:18:28 freenas (noperiph:mps0:0:4294967295:0): SMID 4 Aborting command 0xfffffe0000b1b3d0 Dec 21 08:18:28 freenas mps0: Sending reset from mpssas_send_abort for target ID 7 Dec 21 08:18:28 freenas (da3:mps0:0:3:0): READ(10). CDB: 28 00 69 26 79 c0 00 01 00 00 length 131072 SMID 455 command timeout cm 0xfffffe0000b186f8 ccb 0xfffff8000bbde000 Dec 21 08:18:28 freenas (da5:mps0:0:7:0): READ(10). CDB: 28 00 69 26 78 c0 00 01 00 00 length 131072 SMID 344 command timeout cm 0xfffffe0000b0f8c0 ccb 0xfffff8003899b800 Dec 21 08:18:29 freenas (da6:mps0:0:8:0): WRITE(10). CDB: 2a 00 ec 44 6a 48 00 00 18 00 length 12288 SMID 745 command timeout cm 0xfffffe0000b2fa88 ccb 0xfffff80034413000 Dec 21 08:18:29 freenas (noperiph:mps0:0:4294967295:0): SMID 5 Aborting command 0xfffffe0000b2fa88 Dec 21 08:18:29 freenas mps0: Sending reset from mpssas_send_abort for target ID 8 Dec 21 08:18:29 freenas (da7:mps0:0:9:0): WRITE(10). CDB: 2a 00 ec 44 6a 48 00 00 18 00 length 12288 SMID 446 command timeout cm 0xfffffe0000b17b70 ccb 0xfffff8003818c000 Dec 21 08:18:29 freenas (noperiph:mps0:0:4294967295:0): SMID 6 Aborting command 0xfffffe0000b17b70 Dec 21 08:18:29 freenas mps0: Sending reset from mpssas_send_abort for target ID 9
Trying to rule out the recent update I booted into the previous build from the bootloader but here's what I got after 13mins:
Code:
Dec 21 08:46:30 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 e0 40 fc 88 00 00 08 00 length 4096 SMID 262 command timeout cm 0xfffffe0000b0cfb0 ccb 0xfffff8003d9f3800 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 1 Aborting command 0xfffffe0000b0cfb0 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 6 Dec 21 08:46:30 freenas (da3:mps0:0:3:0): READ(10). CDB: 28 00 e0 40 fc 88 00 00 08 00 length 4096 SMID 213 command timeout cm 0xfffffe0000b090e8 ccb 0xfffff8003a587800 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 2 Aborting command 0xfffffe0000b090e8 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 3 Dec 21 08:46:30 freenas (da5:mps0:0:7:0): READ(10). CDB: 28 00 03 f6 b0 78 00 00 08 00 length 4096 SMID 200 command timeout cm 0xfffffe0000b08040 ccb 0xfffff8003a61c800 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 3 Aborting command 0xfffffe0000b08040 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 7 Dec 21 08:46:30 freenas (da7:mps0:0:9:0): READ(10). CDB: 28 00 03 ea d7 f8 00 00 08 00 length 4096 SMID 818 command timeout cm 0xfffffe0000b39810 ccb 0xfffff802f7654800 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 4 Aborting command 0xfffffe0000b39810 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 9 Dec 21 08:46:30 freenas (da1:mps0:0:1:0): READ(10). CDB: 28 00 03 f7 47 40 00 00 08 00 length 4096 SMID 612 command timeout cm 0xfffffe0000b29020 ccb 0xfffff80322156000 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 5 Aborting command 0xfffffe0000b29020 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 1 Dec 21 08:46:30 freenas (da2:mps0:0:2:0): READ(10). CDB: 28 00 03 f7 6f b8 00 00 08 00 length 4096 SMID 988 command timeout cm 0xfffffe0000b471e0 ccb 0xfffff803871d4000 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 6 Aborting command 0xfffffe0000b471e0 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 2 Dec 21 08:46:30 freenas (da6:mps0:0:8:0): READ(10). CDB: 28 00 03 ec 59 d0 00 00 08 00 length 4096 SMID 477 command timeout cm 0xfffffe0000b1e328 ccb 0xfffff8000ba36800 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 7 Aborting command 0xfffffe0000b1e328 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 8 Dec 21 08:46:30 freenas (da5:mps0:0:7:0): READ(10). CDB: 28 00 e0 40 fc 88 00 00 08 00 length 4096 SMID 89 command timeout cm 0xfffffe0000aff208 ccb 0xfffff8003d9ea800 Dec 21 08:46:30 freenas (da0:mps0:0:0:0): WRITE(10). CDB: 2a 00 03 f6 83 f0 00 01 00 00 length 131072 SMID 123 command timeout cm 0xfffffe0000b01d98 ccb 0xfffff8003a58e800 Dec 21 08:46:30 freenas (noperiph:mps0:0:4294967295:0): SMID 8 Aborting command 0xfffffe0000b01d98 Dec 21 08:46:30 freenas mps0: Sending reset from mpssas_send_abort for target ID 0 Dec 21 08:46:30 freenas (da1:mps0:0:1:0): READ(10). CDB: 28 00 03 f6 8f 60 00 00 c0 00 length 98304 SMID 763 command timeout cm 0xfffffe0000b35198 ccb 0xfffff802f7875800 Dec 21 08:46:30 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 03 f7 17 60 00 00 08 00 length 4096 SMID 794 command timeout cm 0xfffffe0000b37950 ccb 0xfffff802f7875000 Dec 21 08:46:30 freenas (da6:mps0:0:8:0): READ(10). CDB: 28 00 03 ec 69 d8 00 00 08 00 length 4096 SMID 654 command timeout cm 0xfffffe0000b2c5f0 ccb 0xfffff80167c5a800 Dec 21 08:46:30 freenas (da6:mps0:0:8:0): READ(10). CDB: 28 00 03 ec 9c 88 00 00 08 00 length 4096 SMID 875 command timeout cm 0xfffffe0000b3e118 ccb 0xfffff8003a4f3000 Dec 21 08:46:30 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 03 f7 c9 80 00 00 08 00 length 4096 SMID 226 command timeout cm 0xfffffe0000b0a190 ccb 0xfffff8003d1af000 Dec 21 08:46:30 freenas (da7:mps0:0:9:0): READ(10). CDB: 28 00 03 ec 69 d8 00 00 08 00 length 4096 SMID 963 command timeout cm 0xfffffe0000b451d8 ccb 0xfffff8003a61c000 Dec 21 08:46:30 freenas (da5:mps0:0:7:0): READ(10). CDB: 28 00 03 f6 8e a0 00 01 00 00 length 131072 SMID 907 command timeout cm 0xfffffe0000b40a18 ccb 0xfffff8003d9f0800 Dec 21 08:46:30 freenas (da5:mps0:0:7:0): READ(10). CDB: 28 00 03 f7 17 60 00 00 08 00 length 4096 SMID 523 command timeout cm 0xfffffe0000b21e18 ccb 0xfffff8058a295800 Dec 21 08:46:30 freenas (da7:mps0:0:9:0): READ(10). CDB: 28 00 03 ec 9c 88 00 00 08 00 length 4096 SMID 883 command timeout cm 0xfffffe0000b3eb58 ccb 0xfffff8058a296800 Dec 21 08:46:30 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 03 f8 03 a0 00 00 08 00 length 4096 SMID 771 command timeout cm 0xfffffe0000b35bd8 ccb 0xfffff8003a593800 Dec 21 08:46:30 freenas (da5:mps0:0:7:0): READ(10). CDB: 28 00 03 f7 c9 80 00 00 08 00 length 4096 SMID 848 command timeout cm 0xfffffe0000b3be80 ccb 0xfffff8003a4fa000 Dec 21 08:46:30 freenas (da1:mps0:0:1:0): READ(10). CDB: 28 00 03 f6 90 70 00 00 40 00 length 32768 SMID 574 command timeout cm 0xfffffe0000b25f70 ccb 0xfffff803221a9000 Dec 21 08:46:30 freenas (da2:mps0:0:2:0): READ(10). CDB: 28 00 03 f6 91 f8 00 00 80 00 length 65536 SMID 460 command timeout cm 0xfffffe0000b1cd60 ccb 0xfffff8003a619800 Dec 21 08:46:30 freenas (da3:mps0:0:3:0): READ(10). CDB: 28 00 03 f6 90 30 00 01 00 00 length 131072 SMID 96 command timeout cm 0xfffffe0000affb00 ccb 0xfffff8003d1b0800 Dec 21 08:46:30 freenas (da4:mps0:0:6:0): READ(10). CDB: 28 00 03 f6 90 30 00 01 00 00 length 131072 SMID 350 command timeout cm 0xfffffe0000b14070 ccb 0xfffff8003a6b5800 Dec 21 08:46:30 freenas (da2:mps0:0:2:0): READ(10). CDB: 28 00 03 f6 92 f8 00 00 40 00 length 32768 SMID 634 command timeout cm 0xfffffe0000b2ac50 ccb 0xfffff8000bbdf000 Dec 21 08:46:32 freenas (da2:mps0:0:2:0): READ(10). CDB: 28 00 b2 f7 56 78 00 00 08 00 length 4096 SMID 498 command timeout cm 0xfffffe0000b1fe10 ccb 0xfffff802a2134800 Dec 21 08:46:32 freenas (da5:mps0:0:7:0): WRITE(10). CDB: 2a 00 b3 04 06 10 00 00 40 00 length 32768 SMID 981 command timeout cm 0xfffffe0000b468e8 ccb 0xfffff8003a2a4800 Dec 21 08:46:32 freenas (da0:mps0:0:0:0): WRITE(10). CDB: 2a 00 b3 04 06 18 00 00 40 00 length 32768 SMID 690 command timeout cm 0xfffffe0000b2f410 ccb 0xfffff80322155800 Dec 21 08:46:32 freenas (da6:mps0:0:8:0): WRITE(10). CDB: 2a 00 ea f2 65 f0 00 00 40 00 length 32768 SMID 368 command timeout cm 0xfffffe0000b15780 ccb 0xfffff8000bbd5000 Dec 21 08:46:32 freenas (da7:mps0:0:9:0): WRITE(10). CDB: 2a 00 ea f2 65 f0 00 00 40 00 length 32768 SMID 978 command timeout cm 0xfffffe0000b46510 ccb 0xfffff8003d0ef800 Dec 21 08:46:32 freenas (da2:mps0:0:2:0): WRITE(10). CDB: 2a 00 b3 04 06 18 00 00 40 00 length 32768 SMID 522 command timeout cm 0xfffffe0000b21cd0 ccb 0xfffff8000bbd0000 Dec 21 08:46:32 freenas (da1:mps0:0:1:0): WRITE(10). CDB: 2a 00 b3 04 06 18 00 00 40 00 length 32768 SMID 277 command timeout cm 0xfffffe0000b0e2e8 ccb 0xfffff8003a4ff000 Dec 21 08:46:32 freenas (da3:mps0:0:3:0): WRITE(10). CDB: 2a 00 b3 04 06 18 00 00 40 00 length 32768 SMID 184 command timeout cm 0xfffffe0000b06bc0 ccb 0xfffff802a21ef000 Dec 21 08:46:32 freenas (da4:mps0:0:6:0): WRITE(10). CDB: 2a 00 b3 04 06 18 00 00 40 00 length 32768 SMID 715 command timeout cm 0xfffffe0000b31418 ccb 0xfffff8003d9f0000 Dec 21 08:46:33 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 9 Dec 21 08:46:33 freenas (da7:mps0:0:9:0): WRITE(10). CDB: 2a 00 ea f2 81 08 00 00 18 00 Dec 21 08:46:33 freenas (da7:mps0:0:9:0): CAM status: CAM subsystem is busy Dec 21 08:46:33 freenas (da7:mps0:0:9:0): Retrying command Dec 21 08:46:33 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 8 Dec 21 08:46:33 freenas (da6:mps0:0:8:0): WRITE(10). CDB: 2a 00 ea f2 81 10 00 00 10 00 Dec 21 08:46:33 freenas (da6:mps0:0:8:0): CAM status: CAM subsystem is busy Dec 21 08:46:33 freenas (da6:mps0:0:8:0): Retrying command Dec 21 08:46:33 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 9 Dec 21 08:46:33 freenas (da7:mps0:0:9:0): WRITE(10). CDB: 2a 00 ea f2 81 08 00 00 18 00 Dec 21 08:46:33 freenas (da7:mps0:0:9:0): CAM status: CAM subsystem is busy Dec 21 08:46:33 freenas (da7:mps0:0:9:0): Retrying command Dec 21 08:46:33 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 8 Dec 21 08:46:33 freenas (da6:mps0:0:8:0): WRITE(10). CDB: 2a 00 ea f2 81 10 00 00 10 00 Dec 21 08:46:33 freenas (da6:mps0:0:8:0): CAM status: CAM subsystem is busy Dec 21 08:46:33 freenas (da6:mps0:0:8:0): Retrying command Dec 21 08:46:34 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 9 Dec 21 08:46:34 freenas (da7:mps0:0:9:0): WRITE(10). CDB: 2a 00 ea f2 81 08 00 00 18 00 Dec 21 08:46:34 freenas (da7:mps0:0:9:0): CAM status: CAM subsystem is busy Dec 21 08:46:34 freenas (da7:mps0:0:9:0): Retrying command Dec 21 08:46:34 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 8 Dec 21 08:46:34 freenas (da6:mps0:0:8:0): WRITE(10). CDB: 2a 00 ea f2 81 10 00 00 10 00 Dec 21 08:46:34 freenas (da6:mps0:0:8:0): CAM status: CAM subsystem is busy Dec 21 08:46:34 freenas (da6:mps0:0:8:0): Retrying command Dec 21 08:46:34 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 9 Dec 21 08:46:34 freenas (da7:mps0:0:9:0): WRITE(10). CDB: 2a 00 ea f2 81 08 00 00 18 00 Dec 21 08:46:34 freenas (da7:mps0:0:9:0): CAM status: CAM subsystem is busy Dec 21 08:46:34 freenas (da7:mps0:0:9:0): Retrying command Dec 21 08:46:34 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 8 Dec 21 08:46:34 freenas (da6:mps0:0:8:0): WRITE(10). CDB: 2a 00 ea f2 81 10 00 00 10 00 Dec 21 08:46:34 freenas (da6:mps0:0:8:0): CAM status: CAM subsystem is busy Dec 21 08:46:34 freenas (da6:mps0:0:8:0): Retrying command Dec 21 08:46:35 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 9 Dec 21 08:46:35 freenas (da7:mps0:0:9:0): WRITE(10). CDB: 2a 00 ea f2 81 08 00 00 18 00 Dec 21 08:46:35 freenas (da7:mps0:0:9:0): CAM status: CAM subsystem is busy Dec 21 08:46:35 freenas (da7:mps0:0:9:0): Error 5, Retries exhausted Dec 21 08:46:35 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 9 Dec 21 08:46:35 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 8 Dec 21 08:46:35 freenas (da7:mps0:0:9:0): READ(10). CDB: 28 00 00 40 02 90 00 00 10 00 Dec 21 08:46:35 freenas (da7:mps0:0:9:0): CAM status: CAM subsystem is busy Dec 21 08:46:35 freenas (da7:mps0:0:9:0): Retrying command Dec 21 08:46:35 freenas (da6:mps0:0:8:0): WRITE(10). CDB: 2a 00 ea f2 81 10 00 00 10 00 Dec 21 08:46:35 freenas (da6:mps0:0:8:0): CAM status: CAM subsystem is busy Dec 21 08:46:35 freenas (da6:mps0:0:8:0): Error 5, Retries exhausted Dec 21 08:46:35 freenas mps0: mpssas_action_scsiio: Freezing devq for target ID 8 Dec 21 08:46:35 freenas (da6:mps0:0:8:0): READ(10). CDB: 28 00 00 40 02 90 00 00 10 00 Dec 21 08:46:35 freenas (da6:mps0:0:8:0): CAM status: CAM subsystem is busy Dec 21 08:46:35 freenas (da6:mps0:0:8:0): Retrying command
Once these messages pop up the system freezes for a few minutes and then the IPMI console starts to spit out tons of random text so fast it's unreadable. I managed a quick screen grab though here :

Also from time to time the reboot fails halfway, the system locks up and tries again but usually comes good after that, until the next time. Lol.
It appears it is all the drives attached to the M1015 that are crashing(da0 - da7) so could this be the cause? Is there anything I can do to confirm this before I try to get another one? If you need any more info please don't hesitate to ask.
Thanks again for your assistance.