-
Notifications
You must be signed in to change notification settings - Fork 289
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Worker cannot connect to Controller while upgrade to 0.8.0 #2063
Comments
FWIW; That may not be the root issue though; if you are using a DNS-based loadbalancer in front of your controllers and your health checks are failing that may show up as that since there are no healthy hosts. This could potentially be related to #2072 which could happen if your are using port 9201 as your health check endpoint and the load balancer is hitting it with an unexpected packet body - causing it to crash/stop listening. |
Thanks for raising this @justenwalker - I won't rule out #2072 being related here, but just to be sure since this does look DNS related, can you exec into the worker container and run a nslookup or telnet to the IP it's unable to connect to? |
@incubator4 raised the issue, so they'd have to try this. Just added comment to the other issue since I encountered this problem because of loadbalancer health checks; so it seems plausibly related. |
In fact, is an aws external ELB with public address (just like |
Hi there -- has this been addressed in later releases for you? |
Previously
My company has been using
Boundary
as a secure connection tool for a long time since version0.6.2
We use aws kms and postgresql for Dependency, and deploy controlller and a few workers in different regions as K8S Deployment in EKS Cluster.
It workers well for now. Because there is a new release version, I decide to upgrade our infrastructure to latest release version
0.8.0
.Describe the bug
I follow the Document Upgrade and Database Migration .
Backup database -> scale controller deployment to
replicas: 0
-> run the migration Job -> Upgrade the Controller pod image from0.6.2
to0.8.0
Actually, I use to allocated less resources to the controller, then the plugin load faild again and again.Then I found similar problem in #1813 , and I allocate more resource to controller and the problem was solvedMaybe the error message can be more friendly in feature.
It seems works well until now.But workers cannot connect to controller with following log.
Try to connect
Failure log
It keeps reporting errors.
Workers cannot connect to controller use both of version
0.6.2
and0.8.0
,then I rollback upgrade and restore database.To Reproduce
Steps to reproduce the behavior:
0.6.2
to0.8.0
The text was updated successfully, but these errors were encountered: