-
Notifications
You must be signed in to change notification settings - Fork 118
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Service network alias is not inmediately available #1977
Comments
In the sense that this may slow down test execution, it is worth fixing. From a testing perspective, IMO this is a feature, every network request we ever make should have retries on it. Outside of the system tests, real servers are not always going to be available when they should be. |
The system tests pass for elastic/integrations#9926 do pass for me locally on Linux. Looking at this part of the logs of a failing CI build,
I notice Health check options for |
Yes, they pass in some machines. But for example it constantly fails for me locally, also on Linux, and they fail in CI too.
Good point, during some time the container is unresponsive because it is compiling the binary. So this is probably all, and no issue on elastic-package side, I will close this issue. Tested on my machine to add a healthcheck and it fixes the issue, @ShourieG could you add this in your PR to unblock it? This is the patch I am testing with: diff --git a/packages/websocket/_dev/deploy/docker/docker-compose.yml b/packages/websocket/_dev/deploy/docker/docker-compose.yml
index ba3545e54b..d7a9fdf7a9 100644
--- a/packages/websocket/_dev/deploy/docker/docker-compose.yml
+++ b/packages/websocket/_dev/deploy/docker/docker-compose.yml
@@ -7,5 +7,9 @@ services:
volumes:
- ./websocket-mock-service:/app
ports:
- - "80:80"
+ - "3000:3000"
command: ["go", "run", "main.go"]
+ healthcheck:
+ test: "wget --no-verbose --tries=1 --spider http://localhost:3000/health || exit 1"
+ interval: 10s
+ timeout: 5s
diff --git a/packages/websocket/_dev/deploy/docker/websocket-mock-service/main.go b/packages/websocket/_dev/deploy/docker/websocket-mock-service/main.go
index 86f5f829ce..44f7bfa53a 100644
--- a/packages/websocket/_dev/deploy/docker/websocket-mock-service/main.go
+++ b/packages/websocket/_dev/deploy/docker/websocket-mock-service/main.go
@@ -13,10 +13,16 @@ import (
func main() {
http.HandleFunc("/", handleWebSocket)
- log.Fatal(http.ListenAndServe(":80", nil))
+ log.Fatal(http.ListenAndServe(":3000", nil))
}
func handleWebSocket(w http.ResponseWriter, r *http.Request) {
+ log.Println(r.Method, r.URL.String())
+
+ if r.URL.Path == "/health" {
+ return
+ }
+
if r.URL.Path == "/testbasicauth" {
// Check if the 'Authorization' header is set for basic authentication
authHeader := r.Header.Get("Authorization") |
Yup, the tests are now passing, this is awesome. Thanks a lot @chrisberkhout & @jsoriano. Till now I was under the impression that the health checks were automatically added downstream somewhere, but this clarified that my understanding was wrong. This knowledge will help a lot in future with similar mock services. |
When a service is started, it is connected to the network where the agent is running, with an alias prefixed with
svc-
. It looks like adding this alias doesn't make the service inmediately available in the network of the agent, and the agent may fail to connect with the service if it tries soon enough.We saw this in elastic/integrations#9926, where the agent could not connect to the service, but other commands executed manually could do it. In this particular case the input didn't have any retry mechanism, so after the first failure the test could not succeed. DNS could be resolved, because we saw the IP in the logs:
This might be related to some flaky issues caused by not found documents in system tests.
Service deployer setup should not return if the service is still unreachable in the agent network.
The text was updated successfully, but these errors were encountered: