-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Kuma Control Plane Stuck on creating default mesh #614
Comments
Hi @somejavadev, I will try to reproduce it first and then come back to you |
@somejavadev so I managed to reproduce the issue, and I find out the problem is related to control-plane not being able to communicate with k8s api-server, which is related to the "hardened" configuration of the cluster. I'm not an expert in GCP so I didn't manage to find a fix for you, as I don't know enough the flags used to create cluster, but will do my best to work on it next week. I wanted to let you know about my findings, as maybe you are more proficient with GCP itself and maybe you'll have better idea about this. |
@somejavadev could you please try re-create control-plane deployment with |
The issue is that in a private GKE cluster, by default, the kubernetes control plane is only allowed access to port 443 and 10250 of the pods and nodes. But it seems that there is some involvement of admission webhooks when creating a mesh. And that is why you will have to make custom firewall rules so that the k8s control plane can be allowed the right ports. From here, https://kuma.io/docs/1.0.1/documentation/networking/#kuma-cp-ports, it seems that it is port This might help you, https://cloud.google.com/kubernetes-engine/docs/how-to/private-clusters#add_firewall_rules UPDATE: |
Nice! Can we update the Kuma deployment docs with what you needed to change? |
Since @arjunsalyan seems to have disappeared, I just ran into this issue and adding 5443 to the list of allowed ports in the master<->nodes firewall rule fixed it. |
@itspngu Oops! I thought my comment above was elaborative enough for someone running into this issue. But it will make more sense to add this to documentation somewhere. |
This issue was inactive for 30 days it will be reviewed in the next triage meeting and might be closed. |
Off to kuma-website to add this to the docs |
This port is listed there: https://kuma.io/docs/1.4.0/networking/networking/#kuma-cp-ports Closing this ticket. |
Summary
I am trying to create the kuma control plane on a GKE cluster by following the standalone instructions as detailed here although after everything has completed it seems kuma is stuck at creating the default mesh. The control plane logs indicates
with
INFO defaults trying to create default Mesh
repeating.I have tried to creating a mesh manually by applying the follow with kubectl -f mesh.yml:
mesh.yml
but this resulted in the following error:
Error from server (InternalError): error when creating "test.yml": Internal error occurred: failed calling webhook "mesh.defaulter.kuma-admission.kuma.io": Post https://kuma-control-plane.kuma-system.svc:443/default-kuma-io-v1alpha1-mesh?timeout=30s: context deadline exceeded
Steps To Reproduce
The
limited-gke-service-account
has a Kubernetes Engine Admin IAM role as well as the Metric Writer role.Error from server (InternalError): error when creating "test.yml": Internal error occurred: failed calling webhook "mesh.defaulter.kuma-admission.kuma.io": Post https://kuma-control-plane.kuma-system.svc:443/default-kuma-io-v1alpha1-mesh?timeout=30s: context deadline exceeded
Additional Details & Logs
Tested on both kuma 1.1.1 and 1.1.0
There doesn't seem to be any other logs indicating error besides the repeating of the
trying to create default Mesh
INFO entriesUpdate: leaving the control plane in this state after a while results in this error message:
ERROR mesh-insight-resyncer component terminated with an error {"generationID": 1, "error": "stop channel was closed", "errorVerbose": "stop channel was closed\ngithub.heygears.com/kumahq/kuma/pkg/events.(*reader).Recv\n\t/go/src/github.com/kumahq/kuma/pkg/events/eventbus.go:57\ngithub.heygears.com/kumahq/kuma/pkg/insights.(*resyncer).Start\n\t/go/src/github.com/kumahq/kuma/pkg/insights/resyncer.go:101\ngithub.heygears.com/kumahq/kuma/pkg/core/runtime/component.(*resilientComponent).Start.func1\n\t/go/src/github.com/kumahq/kuma/pkg/core/runtime/component/resilient.go:43\nruntime.goexit\n\t/usr/local/go/src/runtime/asm_amd64.s:1374"}
Although it could be due to GKE restarting the pod.
Default quick start, no custom changes.
GKE
This occurs in the latest helm charts as well as with the kumactl client.
The text was updated successfully, but these errors were encountered: