-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Feature to expose BB metrics based user query key #61
Comments
Are you aware that all Buildbarn components expose a huge number of Prometheus metrics? Just point your browser to /metrics on the HTTP endpoint. Are there any statistics there that you feel are missing? |
Yes, I am aware. We could not find any relevant data by consuming Prometheus data in Grafana. |
Which metrics would you be missing? It would be better to extend the current metrics set, rather than introduce another way to gather metrics, considering Prometheus is pretty much the standard for this usecase. |
Let me start over, sorry for not making things clear in the first place. We are using BB also as a bazel remote cache. bazel communicates with the frontend and fetches data from a scalable storage backend, which we can scale as we need. The problem is, there is no way to ask BB how many hits in the cache it has for a particular job. Here's the problem: BB doesn't know which cache entry belongs to which client. Clients may be individual developers and CI jobs. By using the Essentially, I would like to be able to query BB for the hit rate for the |
What we could do is that we extend MetricsBlobAccess gains support for exposing metrics based on some user-defined key. That sounds like a fair request. A couple of things to look out for, though:
|
I like the idea of having white/blacklist for metrics, that's what for instance kube-state-metrics is doing, since it exposes a lot of metrics as well (see its |
This is a placeholder for a feature request for a gRPC "GetStats" channel for getting the statistics for remote execution, remote bazel cache.
For remote cache lookup using Build Barn, one can use the
--remote_cache_header
for a cache get/put/update. The "GetStats" could take the key used on the cache operation to get a statistic on how well the cache is performing (hit rate, hit rate %, etc.).The text was updated successfully, but these errors were encountered: