Skip to content
This repository has been archived by the owner on Feb 14, 2021. It is now read-only.
/ scalaz-analytics Public archive

A high-performance, purely-functional library for doing computational analysis and statistics over data in a type-safe way

License

Notifications You must be signed in to change notification settings

scalaz/scalaz-analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

scalaz-analytics

Gitter

Goal

Scalaz Analytics provides a high-performance, purely-functional library for doing computational analysis and statistics over data in a type-safe way.

Introduction & Highlights

Scalaz Analytics is a principled functional programming library for data processing and analytics.

  • Simple and principled
  • First class support for analytics and data science
  • Pure type-safe, functional interface that integrates with other Scalaz projects
  • Supports batch and streaming
  • Efficient on both small and large data sets, single machine and distributed
  • Can be used from a REPL for interactive analysis or as a library for applications

Other libraries

Below is a selection of Analytics/Data processing Libraries that we are being used as inspiration. Some of these metrics are somewhat subjective but they give an idea for what we are looking at from each library. Note that these metrics assume native support, so libraries that achieve these things via another library are not considered.

Library Scales to Big Data Supports Batch Supports Streaming FP Easy to Debug Out of the box analytics
Spark ✔ (mini batch)
Flink
Pandas
R ?
Dask ?
Apex ?
Beam ?

Background

About

A high-performance, purely-functional library for doing computational analysis and statistics over data in a type-safe way

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages