
Add high throughput integration test #5655

Open · rdettai wants to merge 2 commits into main from test-client-retries
Conversation

@rdettai (Collaborator) commented Jan 28, 2025

Description

This PR reuses the tests and docs proposed in #5644, which itself is no longer necessary now that the status code was fixed to be 429 when shards need scaling up (#5651).

It also adds to the CLI ingest command a small indication of the number of retries that occurred. This is handy for troubleshooting and shows users concretely that retries are often necessary.
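
As a rough sketch of the kind of indication this could be (purely illustrative; the actual CLI wording and plumbing may differ, and the assumption that `num_too_many_requests` from the merged response backs the retry count is mine):

```rust
// Hypothetical sketch only, not the actual CLI code.
// `num_too_many_requests` is the field visible in the response `merge`
// excerpt quoted further down in this conversation.
fn print_retry_hint(num_too_many_requests: u64) {
    if num_too_many_requests > 0 {
        println!("{num_too_many_requests} batch(es) were retried after a 429 Too Many Requests response");
    }
}

fn main() {
    // e.g. three batches hit a 429 and had to be retried
    print_retry_hint(3);
}
```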

How was this PR tested?

Integration tests and running the CLI ingest command on the HDFS dataset.

@rdettai rdettai changed the base branch from retry-no-shard to main on January 28, 2025 11:10
Comment on lines +93 to +123
pub fn merge(self, other: RestIngestResponse) -> Self {
Self {
num_docs_for_processing: self.num_docs_for_processing + other.num_docs_for_processing,
num_ingested_docs: apply_op(self.num_ingested_docs, other.num_ingested_docs, |a, b| {
a + b
}),
num_rejected_docs: apply_op(self.num_rejected_docs, other.num_rejected_docs, |a, b| {
a + b
}),
parse_failures: apply_op(self.parse_failures, other.parse_failures, |a, b| {
a.into_iter().chain(b).collect()
}),
num_too_many_requests: self.num_too_many_requests,
}
}
@rdettai (Collaborator, Author) commented:

I moved this back here: it makes more sense than in the API model, because accumulating responses is quite specific to the REST client.
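
For context, the `apply_op` helper is not part of this excerpt; a minimal sketch of what such a helper presumably does (the signature in the actual codebase may differ) could be:

```rust
/// Combines two optional values: applies `f` when both sides are present,
/// otherwise keeps whichever side is `Some` (or stays `None`).
fn apply_op<T>(a: Option<T>, b: Option<T>, f: impl FnOnce(T, T) -> T) -> Option<T> {
    match (a, b) {
        (Some(a), Some(b)) => Some(f(a, b)),
        (Some(a), None) => Some(a),
        (None, Some(b)) => Some(b),
        (None, None) => None,
    }
}
```

With `merge` defined on the client side, accumulating the per-batch responses of a chunked ingest is then just a reduction, e.g. something like `responses.into_iter().reduce(|acc, r| acc.merge(r))`.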

@rdettai rdettai force-pushed the test-client-retries branch 4 times, most recently from de8f2a1 to aa98399 on January 30, 2025 11:05
@rdettai rdettai force-pushed the test-client-retries branch 2 times, most recently from aa77d7e to c2069e4 on February 6, 2025 20:03
@rdettai rdettai force-pushed the test-client-retries branch from c2069e4 to ce4501f on February 6, 2025 20:08
// TODO: when using the default 10MiB batch size, we get persist
// timeouts with code 500 on some lower performance machines (e.g.
// GitHub runners). We should investigate why this happens exactly.
Some(5_000_000),
@rdettai (Collaborator, Author) commented Feb 6, 2025

@guilload I didn't find a good explanation for why this timeout occurs here in the persist:

let persist_result = tokio::time::timeout(
PERSIST_REQUEST_TIMEOUT,
ingester.persist(persist_request),
)
.await
.unwrap_or_else(|_| {
let message = format!(
"persist request timed out after {} seconds",
PERSIST_REQUEST_TIMEOUT.as_secs()
);
Err(IngestV2Error::Timeout(message))
});

Persisting 10MB should not take 6 sec, even on a slow system and in debug mode, should it?
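
One way to start looking at it, as a self-contained sketch (a simulated slow persist stands in for `ingester.persist`, and the names and durations are illustrative, not the actual Quickwit code; assumes a tokio runtime):

```rust
use std::time::{Duration, Instant};

// Stand-in for `ingester.persist(...)`: just sleeps for a configurable time.
async fn simulated_persist(busy_for: Duration) -> Result<(), String> {
    tokio::time::sleep(busy_for).await;
    Ok(())
}

#[tokio::main]
async fn main() {
    // Stand-in for PERSIST_REQUEST_TIMEOUT.
    let persist_request_timeout = Duration::from_secs(6);
    let start = Instant::now();
    let persist_result = tokio::time::timeout(
        persist_request_timeout,
        simulated_persist(Duration::from_secs(7)),
    )
    .await
    .unwrap_or_else(|_| {
        Err(format!(
            "persist request timed out after {} seconds",
            persist_request_timeout.as_secs()
        ))
    });
    // Logging the elapsed time next to the result helps tell a genuinely
    // slow persist apart from one that is stuck waiting on something else.
    println!("{persist_result:?} after {:?}", start.elapsed());
}
```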
