[Feature] MongoDB Resume Tokens #196

stevensJourney · 2025-02-06T08:25:52Z

Context

Our current MongoDB Change Stream replicator uses the cluster time as the equivalent of an LSN (Log Sequence Number).

Typically, the LSN tracks replication progress and associates user write checkpoints with the active replication state. In MongoDB, the LSN (cluster time) is used as the startAfter value when resuming Change Stream consumption after a restart.

However, relying solely on cluster time can lead to edge cases where replication becomes inconsistent. For example, if replication connection details change after replication has started, the replicator may attempt to consume a Change Stream from a new database using the previously stored cluster time. Since MongoDB does not always return an error in this scenario, replication can silently fail.

MongoDB Change Streams provide Resume Tokens, which are unique to a specific database and can be used to manage cursor positioning more reliably. Using Resume Tokens as the Change Stream cursor should help prevent these inconsistencies.
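As a rough illustration of preferring a Resume Token over cluster time, the sketch below picks the change stream start options from a stored position. `StoredPosition`, `StartOptions` and `buildStartOptions` are hypothetical names, not the service's actual types. `resumeAfter` and `startAtOperationTime` are real MongoDB change stream options, but the PR text does not state which option it passes. In the real driver `startAtOperationTime` is a BSON `Timestamp`, which is modelled here as a plain object.

```typescript
// Hypothetical shape of a stored replication position (assumption for this sketch).
interface StoredPosition {
  // Seconds and increment, mirroring the fields of a BSON Timestamp.
  clusterTime: { t: number; i: number };
  // Opaque token taken from a change event's `_id`, if one was stored.
  resumeToken?: unknown;
}

// Subset of the change stream options relevant to resuming.
interface StartOptions {
  resumeAfter?: unknown;
  startAtOperationTime?: { t: number; i: number };
}

// Prefer the resume token when present, otherwise fall back to cluster time.
function buildStartOptions(position: StoredPosition): StartOptions {
  if (position.resumeToken != null) {
    return { resumeAfter: position.resumeToken };
  }
  return { startAtOperationTime: position.clusterTime };
}
```

With a token, the server can reject a resume attempt against a deployment that never issued that token, rather than silently starting from an unrelated point in time.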

Feature

This PR introduces support for storing Resume Tokens alongside cluster time in the MongoDB LSN. This approach mirrors how we store the binary log position alongside the GTID for MySQL.

If a Resume Token is available, it will be used when resuming Change Stream consumption. The LSN format remains lexicographically comparable, ensuring compatibility with existing LSNs. Unit tests have been added to verify that comparisons work correctly across both new and old LSN formats, which is critical for user write checkpoints.
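One way an LSN can carry a Resume Token and still compare lexicographically is sketched below. The exact encoding used by the PR is not shown in this description, so the fixed-width hex prefix, the `|` separator and the names `serializeLsn` and `parseLsn` are all assumptions made for illustration.

```typescript
interface ClusterTime {
  t: number; // seconds
  i: number; // increment within the second
}

// Fixed-width hex keeps string order equal to numeric order.
function toHex32(n: number): string {
  return (n >>> 0).toString(16).padStart(8, "0");
}

// Old format: 16 hex chars. New format (assumed): same prefix, then "|" and the token.
function serializeLsn(time: ClusterTime, resumeToken?: string): string {
  const prefix = toHex32(time.t) + toHex32(time.i);
  return resumeToken === undefined ? prefix : `${prefix}|${resumeToken}`;
}

function parseLsn(lsn: string): { time: ClusterTime; resumeToken: string | undefined } {
  const sep = lsn.indexOf("|");
  const prefix = sep === -1 ? lsn : lsn.slice(0, sep);
  const resumeToken = sep === -1 ? undefined : lsn.slice(sep + 1);
  const time = {
    t: parseInt(prefix.slice(0, 8), 16),
    i: parseInt(prefix.slice(8, 16), 16)
  };
  return { time, resumeToken };
}
```

Because the timestamp prefix has a fixed width, any two LSNs with different cluster times are ordered by that prefix alone, whether or not a token follows. For equal cluster times, an old-format LSN is a strict prefix of the new-format one and therefore sorts first; the order between two different tokens at the same cluster time is arbitrary in this sketch.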

Additionally, a unit test has been added to cover scenarios where replication connection details change after replication has started. MongoDB will now behave similarly to Postgres when a replication slot is missing: replication will restart from an initial snapshot with a new set of sync rule data.

changeset-bot · 2025-02-06T08:25:55Z

🦋 Changeset detected

Latest commit: 18ce978

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 14 packages

Name	Type
@powersync/service-errors	Minor
@powersync/service-module-mongodb	Minor
@powersync/service-image	Minor
@powersync/lib-services-framework	Patch
@powersync/service-rsocket-router	Patch
@powersync/service-core	Patch
@powersync/lib-service-mongodb	Patch
@powersync/lib-service-postgres	Patch
@powersync/service-module-mongodb-storage	Patch
@powersync/service-module-mysql	Patch
@powersync/service-module-postgres-storage	Patch
@powersync/service-module-postgres	Patch
@powersync/service-core-tests	Patch
test-client	Patch

stevensJourney · 2025-02-06T08:27:14Z

modules/module-mongodb/src/replication/ChangeStream.ts

-export class ChangeStreamInvalidatedError extends Error {
-  constructor(message: string) {
-    super(message);
+export class ChangeStreamInvalidatedError extends DatabaseConnectionError {


We could do something similar for Postgres and MySQL.

stevensJourney · 2025-02-06T08:30:40Z

modules/module-mongodb/src/replication/ChangeStream.ts

@@ -589,6 +595,11 @@ export class ChangeStream {

          const originalChangeDocument = await stream.tryNext();

+          // The stream was closed, we will only ever receive `null` from it
+          if (!originalChangeDocument && stream.closed) {


I noticed this behaviour when testing dropping a MongoDB database while the PowerSync service was actively replicating. The tryNext call would return null in an endless loop. In normal operation we receive a null response about once per second, but the frequency is much higher after a database has been dropped, which typically causes the process to hang.
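The check in the diff above can be sketched in isolation as follows. `PollableStream` is a minimal stand-in for the two change stream members involved (`tryNext` and `closed`), and `StreamClosedError`, `classifyPoll` and `consume` are hypothetical names. What the PR does inside the `if` branch is not visible in the fragment, so throwing here is an assumption.

```typescript
// Minimal stand-in for the parts of a change stream used in this sketch.
interface PollableStream<T> {
  readonly closed: boolean;
  tryNext(): Promise<T | null>;
}

// Hypothetical error type; the PR itself uses ChangeStreamInvalidatedError.
class StreamClosedError extends Error {}

type PollResult = "change" | "idle" | "closed";

// A null document is only a normal idle poll while the stream is still open.
function classifyPoll(doc: unknown, closed: boolean): PollResult {
  if (doc != null) return "change";
  return closed ? "closed" : "idle";
}

async function consume<T>(
  stream: PollableStream<T>,
  onChange: (doc: T) => void,
  shouldStop: () => boolean
): Promise<void> {
  while (!shouldStop()) {
    const doc = await stream.tryNext();
    const result = classifyPoll(doc, stream.closed);
    if (result === "closed") {
      // A closed stream only ever yields null, so stop instead of spinning.
      throw new StreamClosedError("Change stream closed; no further events will arrive");
    }
    if (result === "change") {
      onChange(doc as T);
    }
  }
}
```

Surfacing the closed state as an error gives the caller a chance to restart replication rather than polling a dead stream.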

stevensJourney · 2025-02-06T08:33:46Z

modules/module-mongodb/test/src/change_stream_utils.ts

@@ -85,7 +85,7 @@ export class ChangeStreamTestContext {
  }

  startStreaming() {
-    this.streamPromise = this.walStream.streamChanges();
+    return (this.streamPromise = this.walStream.streamChanges());


This helps catch errors from the streaming process.

An alternative would be to await the dispose method, since it awaits the streamPromise. Unfortunately, dispose also aborts the replication, which can make errors tricky to detect.
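To show why returning the promise helps, here is a small stand-alone sketch. `StreamingContext` and its `run` callback are hypothetical stand-ins for the test context and `walStream.streamChanges()`, and the `dispose` shown here only approximates the behaviour described above.

```typescript
class StreamingContext {
  private streamPromise: Promise<void> | undefined;
  private readonly run: () => Promise<void>;

  constructor(run: () => Promise<void>) {
    this.run = run;
  }

  // Returning the promise lets a test await it and observe a rejection directly.
  startStreaming(): Promise<void> {
    this.streamPromise = this.run();
    return this.streamPromise;
  }

  // Cleanup waits for streaming to settle but swallows its error,
  // so a failure is easy to miss if this is the only thing awaited.
  async dispose(): Promise<void> {
    await this.streamPromise?.catch(() => undefined);
  }
}
```

A test can then write `await expect(context.startStreaming()).rejects.toThrow()` (or the equivalent in its test framework) instead of inferring the failure from side effects.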

stevensJourney added 5 commits February 4, 2025 17:05

wip: mongodb resume tokens

44099a0

add comparable tests for MongoLSN

0da4c50

detect dropped databases

c90e5be

add unit test

f5e2ad6

Improve ChangeStreamInvalidatedError to extend service errors.

273b89e

stevensJourney added 2 commits February 6, 2025 10:53

Merge remote-tracking branch 'origin/main' into feat/resume-token-lsn

3904106

Add test for postgres storage

ab8da19

stevensJourney marked this pull request as ready for review February 6, 2025 09:46

stevensJourney requested a review from rkistner February 6, 2025 09:46

Merge branch 'main' into feat/resume-token-lsn

18ce978
