🌊 Online machine learning in Python
-
Updated
Feb 24, 2025 - Python
🌊 Online machine learning in Python
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Python Stream Processing
Python stream processing for Kafka
Real-time stream processing for python
A machine learning package for streaming data in Python. The other ancestor of River.
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
A distributed, structured concurrency runtime for Python (and friends)
Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)
A stream processing runtime that allows connecting any streaming data source to any destination and act on it
Clustering for arbitrary data and dissimilarity function
Materialize is a streaming database for real-time analytics. This is a collection of Materialize demos and tutorials.
CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python
Stream Processing Made Easy
A Pythonic and ultra fast template engine DSL.
Streaming API for pandas applied to big datasets
Add a description, image, and links to the streaming-data topic page so that developers can more easily learn about it.
To associate your repository with the streaming-data topic, visit your repo's landing page and select "manage topics."