Spontaneous reboot?

Status
Not open for further replies.

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
Progress continued last week and I was getting to the stage of thinking that we might be close to using the system in anger.

However, looking at the logs from today, it looks as though the system spontaneously rebooted at the start of this log (there is nothing before that to indicate what triggered a reboot), and again at 6:19. Assuming I am interpreting the log correctly.

Q, where do I start to look?

http://pastebin.com/embed_js.php?i=u4it1avw
 

Jailer

Not strong, but bad
Joined
Sep 12, 2014
Messages
4,977
Your going to need to provide a bit more detail on hardware specs.

Is you controller flashed to IT mode?
 

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
Your going to need to provide a bit more detail on hardware specs.

Is you controller flashed to IT mode?

Yes, BIOS 7.29.00.00 or is more than that required?

Cheers
 
Joined
Oct 2, 2014
Messages
925
Yes, BIOS 7.29.00.00 or is more than that required?

Cheers
Full system spec's is preferred, such as CPU;make model, motherboard;make model, RAM, powersupply make/model
 

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
Full system spec's is preferred, such as CPU;make model, motherboard;make model, RAM, powersupply make/model
I'd welcome some coaching please. How does the equipment list I have fall short? How could I improve it?

Cheers
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
Possibly just people failed to see it, especially if ageing eyes like mine! I'd still ask about the LSI HBA; as well as the BIOS you mention, what is the operating firmware it is using? I don't know much about these myself, but I know that people have problems if they are not in IT mode with the right firmware for the FreeNAS drivers.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
I'd welcome some coaching please. How does the equipment list I have fall short? How could I improve it?

Cheers
He means that your specs aren't visible. Not everyone can see sigs (tapatalk, mobile website).

For reference, here's GrahamBB's sig:

Dell R710, Xeon Quad E5504 2Ghz, 48Gb ECC HP RAM 800Mhz 1.5V, 6 * ST8000AS002-1NA AR13, RAIDZ2. LSI MPTSAS2, BIOS 7.29.00.00, 9.3-STABLE

" LSI MPTSAS2, BIOS 7.29.00.00" does not help us at all. It sounds somewhat like hardware RAID. You might want to start by giving us the output of dmidecode (CODE tags, please).
 

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
He means that your specs aren't visible. Not everyone can see sigs (tapatalk, mobile website).

For reference, here's GrahamBB's sig:

Dell R710, Xeon Quad E5504 2Ghz, 48Gb ECC HP RAM 800Mhz 1.5V, 6 * ST8000AS002-1NA AR13, RAIDZ2. LSI MPTSAS2, BIOS 7.29.00.00, 9.3-STABLE

" LSI MPTSAS2, BIOS 7.29.00.00" does not help us at all. It sounds somewhat like hardware RAID. You might want to start by giving us the output of dmidecode (CODE tags, please).


Ahhh, I see. I followed the advice in the forum rules to put the info in my sig. Is there a best practice for how to list them other than that?

I'll find the details on the HBA, but just to note it IS flashed as per the forum recommendations.

Cheers
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
It's best practice to put the system specs in the post where you ask for help even if you already have them in your sig ;)
 

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
He means that your specs aren't visible. Not everyone can see sigs (tapatalk, mobile website).

For reference, here's GrahamBB's sig:

Dell R710, Xeon Quad E5504 2Ghz, 48Gb ECC HP RAM 800Mhz 1.5V, 6 * ST8000AS002-1NA AR13, RAIDZ2. LSI MPTSAS2, BIOS 7.29.00.00, 9.3-STABLE

" LSI MPTSAS2, BIOS 7.29.00.00" does not help us at all. It sounds somewhat like hardware RAID. You might want to start by giving us the output of dmidecode (CODE tags, please).
 

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
Let's see if I got this right! I don't see anything on the HBA in there - but perhaps I don't know what I'm looking for L;-) -, it is an M1015 and then flashed to match the 9.3 driver recommendations. I'm assuming that I should only post the info relevant to theHBA? The whole file is larger than the limit for a post. Again, my apologies for my new user dipstickness!
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
I think people generally post things like dmidecode output in pastebin or somewhere like that and give us a link. If you are using an M1015, what is the role of the LSI MPTSAS2 in your sig? Forgive me if I am confused, I don't know much about these things.
 

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
I think people generally post things like dmidecode output in pastebin or somewhere like that and give us a link. If you are using an M1015, what is the role of the LSI MPTSAS2 in your sig? Forgive me if I am confused, I don't know much about these things.
Perhaps I take things too literally :smile:. I understood the request to be to be to post the date in a CODE fragment in the reply - but again I may be missing something!

As to the HBA, we flashed it bask to a LSI card and that is what it reports as after flashing.

Cheers
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
Perhaps I take things too literally :). I understood the request to be to be to post the date in a CODE fragment in the reply - but again I may be missing something!

As to the HBA, we flashed it bask to a LSI card and that is what it reports as after flashing.

Cheers
You were indeed asked to do that, but if it doesn't work it is clearly not your fault! I was just saying that people often do use pastebin (as you did yourself above) so that it will be just as helpful.

About the LSI card, that's fascinating. If already has IT firmware 16 and the experts can't pick up any other problems in dmidecode it is beginning to look like a hardware problem, don't you think?
 

GrahamBB

Explorer
Joined
Sep 6, 2014
Messages
77
You were indeed asked to do that, but if it doesn't work it is clearly not your fault! I was just saying that people often do use pastebin (as you did yourself above) so that it will be just as helpful.

About the LSI card, that's fascinating. If already has IT firmware 16 and the experts can't pick up any other problems in dmidecode it is beginning to look like a hardware problem, don't you think?
Oh yes I agree! But where to start?

The smb config problems seem benign, but without confidence in the hardware, we can't do much :-(
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
Nothing in the IPMI event log? Loose connection e.g. mains? PSU failing? Any UPS (otherwise could be power glitch). Then all the rest of the hardware I suppose. Just had an intermittent printer that proved to be the IEC mains plug creeping out to an intermittent contact.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
The HBA is running P16 IT mode (or maybe IR, but that's also fine, as long as no configuration is done).

Like rogerh suggested, I'd check the IPMI log (if it exists). If that doesn't help, I guess you'll have to wait and see if it happens again.
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
Another observation, assuming the log you posted originally is continuous. Before the machine rebooted at 06.19 there were no log messages after 04.46. So either it was locked up by some software process that prevented logging for an hour and a half or we must be dealing with two intermittent power or connection issues, one that stopped it and another that restarted it. Does that make sense?
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Another observation, assuming the log you posted originally is continuous. Before the machine rebooted at 06.19 there were no log messages after 04.46. So either it was locked up by some software process that prevented logging for an hour and a half or we must be dealing with two intermittent power or connection issues, one that stopped it and another that restarted it. Does that make sense?
Or there was simply nothing to log. It is the middle of the night. Notice the 2-hour gap from 02:something to 04:something.
 
Status
Not open for further replies.
Top