๐ Learning and exploring Apache Arrow.
Apache Arrow is a cross-language development platform for in-memory analytics.
-- Apache Arrow
This project is a playground for me to learn and explore Apache Arrow with executable code and my own verbose in-line comments and notes.
This repository illustrates different concepts, patterns and examples via standalone sub-projects. Each sub-project is completely independent of the others and do not depend on the root project. This standalone sub-project constraint forces the sub-projects to be complete and maximizes the reader's chances of successfully running, understanding, and re-using the code.
The sub-projects include:
A runnable "hello world" program featuring Apache Arrow.
See the README in basic/.
Sorting and searching vectors with Apache Arrow.
See the README in sort-and-search/.
General clean-ups, TODOs and things I wish to implement for this project:
- DONE Model ZIP code data in a small hardcoded vector. I like using ZIP code data to explore data products, like I do in my other projects
- DONE Split into subprojects
- DONE Incorporate a full copy of ZIP code data.