TrueNAS Command constantly loses connection to TrueNAS core appliances

TheUsD

Contributor
Joined
May 17, 2013
Messages
116
Two TrueNAS core devices running on same version: TrueNAS-13.0-U3
One TrueCommand System Version: 2.2.2 Middleware Version: 2.2.2-20221018 running on Debian 11 (VM)

TrueCommand randomly, but constantly loses connection to TrueNAS Core devices. In order to resolve issue, I must delete the device from TrueCommand and then add it back. It does not matter if I re-use the same API keys or if I generate new keys. No changes are made to the environment in any fashion.

Error messages are consistent in TrueCommand:
1670731229914.png

The Certificates are valid, they are OV wildcard certs from a very well-known public CA. I have tried adding in via FQDN and by IP. Same results.
 
Last edited:

morganL

Captain Morgan
Administrator
Moderator
iXsystems
Joined
Mar 10, 2018
Messages
2,694
Two TrueNAS core devices running on same version: TrueNAS-13.0-U3
One TrueCommand System Version: 2.2.2 Middleware Version: 2.2.2-20221018 running on Debian 11 (VM)

TrueCommand randomly, but constantly loses connection to TrueNAS Core devices. In order to resolve issue, I must delete the device from TrueCommand and then add it back. It does not matter if I re-use the same API keys or if I generate new keys. No changes are made to the environment in any fashion.

Error messages are consistent in TrueCommand: View attachment 60789
The Certificates are valid, they are OV wildcard certs from a very well-known public CA. I have tried adding in via FQDN and by IP. Same results.


Thanks ... has this always happened or after an update?

Can you estimate frequency per node?

Is anyone else seeing a similar issue?
 

TheUsD

Contributor
Joined
May 17, 2013
Messages
116
Thanks ... has this always happened or after an update?
This was happening when the two nodes were on the previous version of Core 13 but I am unaware of which version of TrueCommand I started with the first time I attempted to use it. I ended up deleting the VM and spinning up a new VM on 11/30. Since then, I have lost the nodes 2 to 3 times. I do not always restore the nodes.

Can you estimate frequency per node?
To help give a better frequency, I attempted to delete and re-add both nodes. However, I am now being presented with a new error.
1670764569648.png


Again, these are office OV wildcard certs on said nodes. Same wildcard certs I use in my entire domain, for my company. The information I am blocking is the domain name (in the picture of OP and the IP in this post's photo.

I even went and switched back to the default cert on the NAS-01.
 

TheUsD

Contributor
Joined
May 17, 2013
Messages
116
@morganL This may need to be moved to the right forum. I just realized I put this in the Scale forums, not Command.
 

HoneyBadger

actually does care
Administrator
Moderator
iXsystems
Joined
Feb 6, 2014
Messages
5,112
This was happening when the two nodes were on the previous version of Core 13 but I am unaware of which version of TrueCommand I started with the first time I attempted to use it. I ended up deleting the VM and spinning up a new VM on 11/30. Since then, I have lost the nodes 2 to 3 times. I do not always restore the nodes.


To help give a better frequency, I attempted to delete and re-add both nodes. However, I am now being presented with a new error.
View attachment 60803

Again, these are office OV wildcard certs on said nodes. Same wildcard certs I use in my entire domain, for my company. The information I am blocking is the domain name (in the picture of OP and the IP in this post's photo.

I even went and switched back to the default cert on the NAS-01.
That screenshot is hinting at the answer and it's something I've had to fight with when it comes to wildcard certificates. It's looking for the IP address in the Subject Alternate Name field of the extended properties of your SSL cert.

This is an external CA - do they allow duplicate issues of the cert with additional custom values?

Your original issue seems to imply that your SSL cert isn't being trusted by the TrueCommand VM - is there perhaps a missing intermediate certificate? Might need to open a console session on the TC instance and use some command line OpenSSL to check for validity errors.
 

TheUsD

Contributor
Joined
May 17, 2013
Messages
116
I would chalk this up as nothing more than an issue with TrueCommand. The OV cert is issued by Sectigo. But moving on with it being a "my" cert issue, the same error message is generated when using the native FreeNAS cert that comes with TC / Core.

Uploaded the OV's .CRT and .KEY in all three appliances. The Cores and TC have no issues using the OV when being accessed via webgui. Also, to note, this cert is used in many appliances including FWs, Switches, that utilize SSL for VPNs and about 30 other applications, hardware and services.

Hope that info helps.
 

morganL

Captain Morgan
Administrator
Moderator
iXsystems
Joined
Mar 10, 2018
Messages
2,694
I would chalk this up as nothing more than an issue with TrueCommand. The OV cert is issued by Sectigo. But moving on with it being a "my" cert issue, the same error message is generated when using the native FreeNAS cert that comes with TC / Core.

Uploaded the OV's .CRT and .KEY in all three appliances. The Cores and TC have no issues using the OV when being accessed via webgui. Also, to note, this cert is used in many appliances including FWs, Switches, that utilize SSL for VPNs and about 30 other applications, hardware and services.

Hope that info helps.
Please report-a-bug with the cert information.

@wsoteros - can we move this thread to TrueCommand. forum.
 

Redcoat

MVP
Joined
Feb 18, 2014
Messages
2,925
Can you please remind me on how to report a bug, properly?
See "Report a Bug" on the masthead
1670852733051.png


You'll have to make a Jira account if you don't already have one.
 
Top