ref(statsd): Use statsdproxy to pre-aggregate metrics in-memory #2425

untitaker · 2023-08-24T20:42:32Z

use experimental statsdproxy hackweek project to aggregate counters and
gauges (i.e. the "easy stuff") in memory before sending it over the UDP
buffer.

We use the same code in rust consumers to pre-aggregate metrics. The
performance improvement is a wash (neither improves nor degrades perf),
but it should load on veneur, so it may still amount to cost savings.

Arpad has a kind of pre-aggregation that results in actual cost savings
within the application itself, in the future we may replace statsdproxy
with that.

use experimental statsdproxy hackweek project to aggregate counters and gauges (i.e. the "easy stuff") in memory before sending it over the UDP buffer. this might not work perfectly and most bizarrely, aggregating only some metric types probably will mess with timestamp accuracy (even though the flush interval is at a very low 1s). however, currently it's possible that we are dropping metrics because the udp send buffer is at its limits. so who knows really if this makes metrics more or less reliable...

getsantry · 2023-10-18T07:00:10Z

This issue has gone three weeks without activity. In another week, I will close it.

But! If you comment or otherwise update it, I will reset the clock, and if you remove the label Waiting for: Community, I will leave it alone ... forever!

"A weed is but an unloved flower." ― Ella Wheeler Wilcox 🥀

relay-config/src/config.rs

relay-statsd/src/lib.rs

untitaker · 2024-02-22T16:53:44Z

@Dav1dde some additional context that i forgot to share: we use statsdproxy in rust consumers to send data to DDM.

the way this works is that we pre-aggregate using statsdproxy, then multiplex to the rust SDK, in order to offset the performance overhead that the rust SDK has.

I think long-term statsdproxy is not the right abstraction for this, and in fact @Swatinem is already working on what I think could be a replacement for all of this. but in the short-term this would allow you to dogfood DDM in relay with minimal overhead (and no code locations). take a look at statsd.rs in snuba if you're interested.

Dav1dde · 2024-02-23T08:08:57Z

the way this works is that we pre-aggregate using statsdproxy, then multiplex to the rust SDK, in order to offset the performance overhead that the rust SDK has.

Thanks, that seems like a good approach and something we wanted to do anyways.

Dav1dde

Let's try it.

We need to test this properly on Canary and S4S first, please don't merge if you don't have enough time to do that.

If you want I can pick this up and do the rollout sometime beginning of next week.

jjbayer · 2024-02-23T09:46:57Z

relay/src/setup.rs

@@ -69,6 +69,7 @@ pub fn init_metrics(config: &Config) -> Result<()> {
        &addrs[..],
        default_tags,
        config.metrics_buffering(),
+        config.metrics_aggregation(),


nit: With this number of arguments, it might be nice to pass a StatsdConfig object instead. Not a blocker though.

Plan is to get rid of the options all together: #2425 (comment)

What do you think?

jjbayer · 2024-02-23T09:47:47Z

relay-statsd/src/lib.rs

+                    flush_interval: 1,
+                    flush_offset: 0,
+                    max_map_size: None,


Should these be configurable?

they should probably not have been options to begin with tbh

As discussed in #2425, removes the options, there is no reason not to buffer and not to aggregate/use statsdproxy. Also cleans up the configuration a bit.

untitaker requested a review from a team August 24, 2023 20:42

jan-auer changed the title ~~use statsdproxy to pre-aggregate metrics in-memory~~ hackweek: Use statsdproxy to pre-aggregate metrics in-memory Aug 25, 2023

untitaker marked this pull request as draft August 25, 2023 12:10

jan-auer added the question Further information is requested label Sep 4, 2023

getsantry bot added the Stale label Oct 18, 2023

Dav1dde assigned untitaker Feb 19, 2024

untitaker added 2 commits February 21, 2024 13:55

Merge remote-tracking branch 'origin/master' into statsdproxy-sink

225900f

upgrade statsdproxy

740f57f

untitaker changed the title ~~hackweek: Use statsdproxy to pre-aggregate metrics in-memory~~ ref(statsd): Use statsdproxy to pre-aggregate metrics in-memory Feb 21, 2024

untitaker marked this pull request as ready for review February 21, 2024 13:29

untitaker requested a review from a team as a code owner February 21, 2024 13:29

pacify clippy

346f060

Dav1dde reviewed Feb 21, 2024

View reviewed changes

relay-config/src/config.rs Outdated Show resolved Hide resolved

relay-statsd/src/lib.rs Show resolved Hide resolved

switch aggregation default to true

ea0acea

add changelog

eabb3c2

Dav1dde approved these changes Feb 23, 2024

View reviewed changes

Dav1dde removed question Further information is requested Stale labels Feb 23, 2024

Dav1dde self-assigned this Feb 23, 2024

jjbayer approved these changes Feb 23, 2024

View reviewed changes

untitaker merged commit 1dbd9bf into master Feb 23, 2024
20 checks passed

untitaker deleted the statsdproxy-sink branch February 23, 2024 13:42

Dav1dde mentioned this pull request Feb 29, 2024

ref(statsd): Remove buffering/aggregation config options #3184

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ref(statsd): Use statsdproxy to pre-aggregate metrics in-memory #2425

ref(statsd): Use statsdproxy to pre-aggregate metrics in-memory #2425

untitaker commented Aug 24, 2023 •

edited

Loading

getsantry bot commented Oct 18, 2023

untitaker commented Feb 22, 2024

Dav1dde commented Feb 23, 2024

Dav1dde left a comment •

edited

Loading

jjbayer Feb 23, 2024

Dav1dde Feb 23, 2024

jjbayer Feb 23, 2024

jjbayer Feb 23, 2024

untitaker Feb 23, 2024

ref(statsd): Use statsdproxy to pre-aggregate metrics in-memory #2425

ref(statsd): Use statsdproxy to pre-aggregate metrics in-memory #2425

Conversation

untitaker commented Aug 24, 2023 • edited Loading

getsantry bot commented Oct 18, 2023

untitaker commented Feb 22, 2024

Dav1dde commented Feb 23, 2024

Dav1dde left a comment • edited Loading

Choose a reason for hiding this comment

jjbayer Feb 23, 2024

Choose a reason for hiding this comment

Dav1dde Feb 23, 2024

Choose a reason for hiding this comment

jjbayer Feb 23, 2024

Choose a reason for hiding this comment

jjbayer Feb 23, 2024

Choose a reason for hiding this comment

untitaker Feb 23, 2024

Choose a reason for hiding this comment

untitaker commented Aug 24, 2023 •

edited

Loading

Dav1dde left a comment •

edited

Loading