Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

give a status of connection to proxy/auth via CLI #43405

Open
anthonysomerset opened this issue Jun 24, 2024 · 2 comments
Open

give a status of connection to proxy/auth via CLI #43405

anthonysomerset opened this issue Jun 24, 2024 · 2 comments
Labels
feature-request Used for new features in Teleport, improvements to current should be #enhancements robustness Resistance to crashes and reliability

Comments

@anthonysomerset
Copy link
Contributor

What would you like Teleport to do?

So this is tangentially related to #3657 I needed to login to some nodes and was confused when i couldn't find them in teleport

i had to go into backup SSH to see some random error in when checking teleport service status about failing to resolve tunnel address (some transient DNS issue) where it got stuck however systemd thought eh service was active and running without issues

in my case restarting the service brought it back onto our cluster and able to be connected to, however this doesn't help me

I would like for the teleport binary to be able to report the status of its connectivity/health to its configured auth/proxy servers perhaps via some CLI command or perhaps to properly and gracefully error to the point that systemd can pick it up and restart

If i can poll the health of a given connection then i can monitor it and also take proactive action

What problem does this solve?

ability to proactively remediate issues with the teleport agent

If a workaround exists, please include it.

@anthonysomerset anthonysomerset added the feature-request Used for new features in Teleport, improvements to current should be #enhancements label Jun 24, 2024
@stevenGravy stevenGravy added the tsh tsh - Teleport's command line tool for logging into nodes running Teleport. label Jun 24, 2024
@zmb3 zmb3 added robustness Resistance to crashes and reliability and removed tsh tsh - Teleport's command line tool for logging into nodes running Teleport. labels Jun 24, 2024
@zmb3
Copy link
Collaborator

zmb3 commented Jun 24, 2024

It seems the root of the issue here is that a transient networking issue put Teleport in a state where the service was running but did not have a healthy connection to the control plane, which was resolved by a restart.

There are several ways to address this:

  • Have the service detect that it is unhealthy and exit (so that systemd can restart it)
  • Have the service recover on its own so that a restart is not necessary

The latter seems the better option, as it does not rely on any particular supervisor. In order for us to diagnose why Teleport wasn't able to function correctly without a restart, we'll need more details - this includes the specific versions you are running and reproduction steps.

@rosstimothy
Copy link
Contributor

It is also already possible to determine the health of a Teleport process, see https://goteleport.com/docs/management/diagnostics/monitoring/ for more details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature-request Used for new features in Teleport, improvements to current should be #enhancements robustness Resistance to crashes and reliability
Projects
None yet
Development

No branches or pull requests

4 participants