fix(spool): Revert to page counts for size estimate #3379
Conversation
@@ -1569,26 +1569,25 @@ mod tests {
     "buffer.writes:1|c",
     "buffer.envelopes_written:3|c",
     "buffer.envelopes_disk_count:3|g",
-    "buffer.disk_size:1031|h",
+    "buffer.disk_size:24576|h",
This is pretty far off. But the relative error should be lower for larger disk buffers.
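(Note: assuming SQLite's default page size of 4096 bytes, which the diff does not show, 24576 is exactly 6 pages. The page-granular estimate can only move in 4 KiB steps, so it will always look far off next to the previous figure of 1031 bytes on a tiny test database.)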
@@ -74,7 +74,9 @@ pub fn delete<'a>(key: QueueKey) -> Query<'a, Sqlite, SqliteArguments<'a>> {
    ///
    /// This info used to calculate the current allocated database size.
    pub fn current_size<'a>() -> Query<'a, Sqlite, SqliteArguments<'a>> {
Can you update the docs for this function and the calling function?
👍 fun fact: The doc on the calling function was already correct because we never updated it.
Not quite, it didn't subtract the free pages.
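For illustration, a minimal sketch of a free-page-aware estimate, assuming sqlx and SQLite's pragma table-valued functions; this is not necessarily the exact query used in the PR:

```rust
use sqlx::query::Query;
use sqlx::sqlite::{Sqlite, SqliteArguments};

/// Sketch only (not the PR's actual implementation): estimate the allocated
/// database size as (total pages - freelist pages) * page size, using cheap
/// PRAGMA counters instead of scanning the `dbstat` virtual table.
pub fn current_size<'a>() -> Query<'a, Sqlite, SqliteArguments<'a>> {
    sqlx::query(
        "SELECT (pc.page_count - fc.freelist_count) * ps.page_size AS size
         FROM pragma_page_count() AS pc,
              pragma_freelist_count() AS fc,
              pragma_page_size() AS ps",
    )
}
```

Because these counters are read from the database header rather than by walking pages, the cost of the query does not depend on how many envelopes are spooled.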
// Result:
// pgsize      = 2'408'464'463  t.elapsed() = 7.007533833s
// pgsize_agg  = 2'408'464'463  t.elapsed() = 5.010104791s
// brute_force = 1'750'000'000  t.elapsed() = 7.893590875s
// pragma      = 3'036'307'456  t.elapsed() = 213.417µs
Nice! But it overestimates by quite a bit (> 1 GiB)?
Edit: To clarify, I think that's okay, just surprising.
Yep, the estimates are pretty far off; this is something we will have to keep in mind.
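For context, a guess at the query shape behind the `pgsize` measurement above (the benchmark code itself is not part of this diff): summing `pgsize` over the `dbstat` virtual table has to visit every page of the database, which is why its runtime grows with the file size, while the PRAGMA-based estimate sketched earlier only reads header counters.

```rust
// Assumed shape of the dbstat-based estimate (the `pgsize` row above); the
// actual benchmark code is not shown in this diff. `dbstat` is only available
// when SQLite is compiled with SQLITE_ENABLE_DBSTAT_VTAB, and scanning it
// touches every page, so its cost scales with the database size.
const DBSTAT_SIZE_QUERY: &str = "SELECT SUM(pgsize) FROM dbstat";
```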
I think this is changelog-worthy; in my PR I had:
**Bug Fixes**:
- Fix performance regression in the spooler. ([#3378](https://github.com/getsentry/relay/pull/3378))
As discussed with @olksdr: Because the spool size estimation is also used to decide whether to transition back to memory mode, and the new computation overestimates the spool size, this PR might require a follow-up if we notice that Relay gets stuck in disk mode. With the performance regression fixed, getting stuck in disk mode should not be a big performance problem. We could even consider always operating in disk mode to simplify the state machine.
INC-703 showed that `estimate_spool_size` via `dbstat` is too slow. This PR replaces the estimate with the number of non-free pages multiplied by the page size. Though inaccurate, benchmarks show that this is fast regardless of the db size.

Fixes https://github.com/getsentry/team-ingest/issues/307.
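In other words, using the quantities already discussed in the review thread above, the new estimate is roughly `(page_count - freelist_count) * page_size`: it trades accuracy for a lookup that stays cheap at any database size.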