feat(metrics): Extract transaction metrics in external relays [INGEST-1477] #1344

iker-barriocanal · 2022-07-19T23:32:45Z

Currently, transaction metrics are only extracted in processing relays (the function call that extracts metrics is wrapped in an if_processing!). Once we also extract metrics in external relays, we must not extract them more than once from the same transaction, because that would duplicate metrics (and billing).

The approach implemented in this PR is to extract metrics in the very first relay in the chain, mark the transaction as extracted, and skip extraction for that transaction in subsequent relays. The simplest way to accomplish that is to set the existing metrics_extracted header on items that had metrics extracted. Both the approach and the header are reused from session metrics extraction, so nothing here is new.
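For illustration, the extract-once rule can be sketched as follows. This is a minimal sketch under stated assumptions, not Relay's actual code: `Item`, `Metric`, `extract_once`, and the metric name are hypothetical stand-ins, and only the `metrics_extracted` flag comes from the description above.

```rust
/// Hypothetical, simplified stand-in for an envelope item; not Relay's real `Item`.
#[derive(Debug, Default)]
struct Item {
    /// Mirrors the `metrics_extracted` item header described above.
    metrics_extracted: bool,
    payload: Vec<u8>,
}

/// Hypothetical placeholder for an extracted metric.
#[derive(Debug, PartialEq)]
struct Metric(String);

/// Extracts metrics at most once per item, however many relays it passes through.
fn extract_once(item: &mut Item) -> Vec<Metric> {
    if item.metrics_extracted {
        // An upstream relay already extracted metrics; extracting again
        // would duplicate them (and the billing that depends on them).
        return Vec::new();
    }

    // Made-up metric, only here so the sketch has something to return.
    let metrics = vec![Metric(format!("payload_bytes:{}", item.payload.len()))];

    // Mark the item so that downstream relays skip extraction.
    item.metrics_extracted = true;
    metrics
}
```

Calling `extract_once` twice on the same item models two relays in a chain: the first call returns metrics and sets the flag, the second returns nothing.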

Implementation-wise, removing that if_processing triggers a lot of changes, because features that are currently processing-only must become available in non-processing relays too. In addition, we store a new flag transaction_metrics_extracted on ProcessEnvelopeState to make it available to the whole state processing function and to transfer it to the outgoing event. Ideally, process_state would be refactored so that not every piece of data we need ends up on the envelope state object, but that is out of scope for this PR.

untitaker · 2022-07-20T08:58:38Z

relay-server/src/actors/envelopes.rs

+                .metric_conditional_tagging
+                .as_slice();
+
+            let (event, _) = self.event_from_json_payload(item, Some(EventType::Transaction))?;


I don't think this should be necessary. Don't you have access to state.event here?

untitaker · 2022-07-20T08:59:49Z

relay-server/src/actors/envelopes.rs

@@ -346,6 +346,8 @@ struct ProcessEnvelopeState {
    /// extracted.
    event: Annotated<Event>,

+    transaction_item: Option<Item>,


You can probably store only the item headers here; not sure if it's feasible, though.

jjbayer · 2022-07-21T08:13:12Z

relay-server/src/actors/envelopes.rs

+                .metric_conditional_tagging
+                .as_slice();
+
+            let (event, _) = self.event_from_json_payload(item, Some(EventType::Transaction))?;


Why parse the event again here?

jjbayer · 2022-07-21T08:16:39Z

relay-server/src/actors/envelopes.rs

-        let event_type = state.event_type().unwrap_or_default();
-        let mut event_item = Item::new(ItemType::from_event_type(event_type));
+        let mut event_item = match state.event_type().unwrap_or_default() {
+            EventType::Transaction => state.transaction_item.take().unwrap(),


There's usually a better way than unwrap(). E.g. store event_item on the state instead of event_type and transaction_item, and then get the type from the item.

jjbayer · 2022-07-21T08:22:44Z

relay-server/src/metrics_extraction/transactions.rs

-#[cfg(feature = "processing")]
+impl TransactionMetricsConfig {
+    pub fn is_enabled(&self) -> bool {
+        self.version > 0 && self.version <= EXTRACT_MAX_VERSION


Quick thought: We could actually remove the self.version > 0 check and set EXTRACT_MAX_VERSION = 0. That way, we would not need any changes on the sentry side. Or am I missing something crucial?

Actually, let's keep it as-is to be consistent with session metrics extraction.
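As a sketch of how this version gate behaves, assuming a `u16` version field and `EXTRACT_MAX_VERSION = 1` (both are assumptions for illustration; the thread does not show the real type or value):

```rust
/// Assumed for illustration: the highest config version this build understands.
const EXTRACT_MAX_VERSION: u16 = 1;

/// Simplified stand-in holding only the field used by the check.
#[derive(Debug, Default)]
struct TransactionMetricsConfig {
    /// Version supplied through the project config; 0 means "disabled".
    version: u16,
}

impl TransactionMetricsConfig {
    /// Extraction is enabled only for versions in `1..=EXTRACT_MAX_VERSION`.
    fn is_enabled(&self) -> bool {
        self.version > 0 && self.version <= EXTRACT_MAX_VERSION
    }
}
```

With these assumed values, version 0 disables extraction, version 1 enables it, and any higher version disables it again, which is what lets an older Relay stop extracting when the supplied version is too high.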

jjbayer · 2022-07-21T08:30:42Z

relay-server/src/actors/envelopes.rs

+        let mut event_item = match state.event_type().unwrap_or_default() {
+            EventType::Transaction => state.transaction_item.take().unwrap(),
+            ty => Item::new(ItemType::from_event_type(ty)),
+        };


I assume we reuse the item here because we do not want to lose the metrics_extracted information. But this could have side effects (I don't know what else is in that item header). So it might be safer to explicitly transfer the metrics_extracted information to the new item, as we do with sample_rates a few lines below.
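A minimal sketch of that suggestion, using hypothetical simplified types (`Item`, `ItemType`, and this cut-down `ProcessEnvelopeState` are stand-ins, and `serialize_event_item` is a made-up name): always build a fresh item and copy over only the one flag.

```rust
/// Hypothetical, simplified item type; the real item has more variants and headers.
#[derive(Debug, Clone, Copy, PartialEq)]
enum ItemType {
    Event,
    Transaction,
}

#[derive(Debug)]
struct Item {
    ty: ItemType,
    metrics_extracted: bool,
}

impl Item {
    fn new(ty: ItemType) -> Self {
        // A fresh item starts with default headers only.
        Item { ty, metrics_extracted: false }
    }
}

/// Hypothetical slice of the envelope state: only the fields this sketch needs.
struct ProcessEnvelopeState {
    event_type: ItemType,
    transaction_metrics_extracted: bool,
}

/// Builds the outgoing event item without reusing the incoming one.
fn serialize_event_item(state: &ProcessEnvelopeState) -> Item {
    let mut item = Item::new(state.event_type);
    // Transfer only the flag we care about: no `unwrap()` on an optional
    // item, and no unrelated headers carried over by accident.
    item.metrics_extracted = state.transaction_metrics_extracted;
    item
}
```

Storing a plain boolean on the state also removes the need for the `Option<Item>` field and the `unwrap()` flagged in the earlier comment.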

Just like we did for session metrics extraction, add a version to the project config protocol such that older Relays will stop extracting metrics when the version supplied by Sentry is too high. This is a prerequisite for getsentry/relay#1344.

iker-barriocanal added 15 commits July 19, 2022 15:48

Extract non-processing imports in envelopes actor

86a47ee

Move all non-processing imports out of the processing cfg config in the envelopes actor.

Don't extract transactions when extracted header is present

f5a8abd

The `metrics_extracted` header must only be set on items that had metrics extracted. This means that Relay must not extract metrics from items carrying this header, so that metrics are not duplicated.

Get config from state using the function

9e6dc3c

Use the function to get the config, instead of accessing the struct's property directly.

Check if tx metrics config matches required version

b8f8863

Ensure relay must extract metrics before extracting config

62718dc

Mark item as metrics_extracted right after extracting items

94a0206

Use existing tx item in envelope serialization

d8bd021

Add an integration test

50aa865

Add version in tx metrics config

8ebf596

Fix: set payload for all items, including txs

c11736a

The transaction item was reused to keep the item's headers. However, no additional payload was being set and this was causing missing data such as missing breakdowns.

test: ensure no extra events were sent

71a2391

test: ensure metrics_extracted header exists

ccb37d7

Split big integration test into two smaller tests

9bb4e0b

Merge branch 'master' into iker/feat/mep-support-ext-relays

3d758ee

Add test: use too low and big versions

68fdad1

iker-barriocanal requested a review from a team July 19, 2022 23:32

iker-barriocanal self-assigned this Jul 19, 2022

Update changelog

9884b47

untitaker reviewed Jul 20, 2022

jjbayer reviewed Jul 21, 2022

This was referenced Jul 22, 2022

ref(metrics): Simplify metrics_extracted logic [INGEST-1477] #1348

Merged

feat(metrics): Transaction metrics extraction version [INGEST-1478] getsentry/sentry#36967

Merged

ref(metrics): Simplify metrics_extracted logic

75949a6

Two minor tweaks to #1344, based on comments on that PR:

- Do not parse the event payload twice.
- Store a boolean on the envelope state, rather than the full event item.

jjbayer added 2 commits July 25, 2022 10:42

ref: Minimize diff

33ccf63

Merge remote-tracking branch 'origin/master' into iker/feat/mep-suppo…

eacf4d8

…rt-ext-relays

jjbayer requested a review from a team July 25, 2022 09:08

jjbayer self-assigned this Jul 25, 2022

jjbayer requested a review from untitaker July 25, 2022 09:09

untitaker approved these changes Jul 25, 2022

jjbayer merged commit 5f9dc2b into master Jul 25, 2022

jjbayer deleted the iker/feat/mep-support-ext-relays branch July 25, 2022 11:45

jan-auer mentioned this pull request Jul 28, 2022

fix(metrics): Fix broken extraction of ops breakdown and conditional tagging [INGEST-1529] #1357

Merged
