Cylon is a fast, scalable, distributed-memory parallel runtime with a Pandas-like DataFrame.
Updated Jun 9, 2024 - C++
TPC-H queries in Apache Spark SQL using native DataFrames API
Java application that uses Apache Spark to handle both batch and streaming processing
mainframe - a lightweight dataframe library for C++
Apache Spark project for Advanced Topics on Databases course
Semester assignment for ECE NTUA 3189 Advanced Topics in Database Systems
API converting NYC Department of Health: https://github.com/nychealth/coronavirus-data
Constructs source files to match the target files using Spark and its DataFrame API
Soundhopper project - lets users skip ahead to specified sections of a track - built using Python and Jupyter Notebook.
Analysis of American Time Use Survey (ATUS): https://www.kaggle.com/bls/american-time-use-survey
Makes columnar Spark files easier to use