TheDataEngine/mongodb-arrow-connector

MongoDB Apache Arrow Connector

A Rust library for reading and writing Apache Arrow batches from and to MongoDB.

Licensed under the Apache 2.0 license.

Motivation

We are currently writing this library because we need to read MongoDB data into dataframes.

Features

  • Read from a collection to batches
  • Write from batches to a collection
  • Infer collection schema
  • Projection push-down
  • Filter predicate push-down
  • Data types
    • Primitive types that MongoDB supports
    • List types
    • Nested structs (bson::Document)
    • Arbitrary binary data
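
The feature list does not say how collection schema inference works. As a rough illustration of the general idea only, the sketch below scans a sample of documents and merges the type seen for each field into a single column type. Everything in it is an assumption made for illustration: `Value`, `FieldType`, `Field`, and `infer_schema` are hypothetical stand-ins built on the standard library, not this crate's API and not the `bson` or `arrow` types, and the merge rules (integers widen to floats, other conflicts fall back to strings) are just one possible choice.

```rust
use std::collections::BTreeMap;

/// Hypothetical stand-in for a BSON value (not the `bson` crate's type).
#[derive(Debug, Clone, PartialEq)]
pub enum Value {
    Null,
    Bool(bool),
    Int(i64),
    Double(f64),
    Str(String),
}

/// Hypothetical stand-in for an Arrow data type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum FieldType {
    Null,
    Boolean,
    Int64,
    Float64,
    Utf8,
}

/// One inferred column: its type and whether it may hold nulls.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Field {
    pub data_type: FieldType,
    pub nullable: bool,
}

fn type_of(value: &Value) -> FieldType {
    match value {
        Value::Null => FieldType::Null,
        Value::Bool(_) => FieldType::Boolean,
        Value::Int(_) => FieldType::Int64,
        Value::Double(_) => FieldType::Float64,
        Value::Str(_) => FieldType::Utf8,
    }
}

/// Assumed merge rules: equal types are kept, nulls defer to the other
/// type, ints widen to floats, and any other conflict becomes Utf8.
fn merge(a: FieldType, b: FieldType) -> FieldType {
    use FieldType::*;
    match (a, b) {
        (x, y) if x == y => x,
        (Null, x) | (x, Null) => x,
        (Int64, Float64) | (Float64, Int64) => Float64,
        _ => Utf8,
    }
}

/// Infers one field per key seen in the sampled documents. A field is
/// nullable if any sampled value is null or any document omits the key.
/// Assumes keys are unique within a document.
pub fn infer_schema(docs: &[Vec<(&str, Value)>]) -> BTreeMap<String, Field> {
    let mut schema: BTreeMap<String, Field> = BTreeMap::new();
    let mut seen: BTreeMap<String, usize> = BTreeMap::new();
    for doc in docs {
        for (name, value) in doc {
            let t = type_of(value);
            let field = schema
                .entry(name.to_string())
                .or_insert(Field { data_type: t, nullable: false });
            field.data_type = merge(field.data_type, t);
            field.nullable |= t == FieldType::Null;
            *seen.entry(name.to_string()).or_insert(0) += 1;
        }
    }
    for (name, field) in schema.iter_mut() {
        if seen[name] < docs.len() {
            field.nullable = true;
        }
    }
    schema
}
```

Calling `infer_schema` on a slice of sampled documents returns a map from field name to inferred `Field`. A real connector would also have to handle the list, nested document, and binary types listed above, which this sketch leaves out.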