kanterov

Organizations

@apache


Open, Multi-modal Catalog for Data & AI

Python 2,660 436 Updated Feb 13, 2025

English SDK for Apache Spark

Python 851 130 Updated Jun 12, 2024

The ultimate resource for becoming a freelancer in Sweden 🇸🇪 👨‍💻

492 53 Updated Apr 17, 2024

Kubernetes-like control planes for form-factors and use-cases beyond Kubernetes and container workloads.

Go 2,432 394 Updated Feb 17, 2025

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Java 142 40 Updated Jun 3, 2024

Java/Scala library for easily authoring Flyte tasks and workflows

Java 43 28 Updated Feb 10, 2025

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go 6,008 690 Updated Feb 17, 2025

A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.

Java 578 43 Updated Feb 3, 2023

A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: HyperLogLog++; more to come.

Java 155 23 Updated Jun 24, 2022

ZetaSQL - Analyzer Framework for SQL

C++ 2,355 222 Updated Nov 13, 2024

Crucible is a library for symbolic simulation of imperative programs

Rust 696 42 Updated Feb 13, 2025

Build time tool for detecting link problems in java projects

Java 149 28 Updated Dec 17, 2024

A set of utilities designed for incremental building, merging and optimization of data transformations.

Java 1,205 141 Updated May 25, 2024

Build Systems à la Carte

TeX 250 18 Updated Jun 30, 2024

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 7,994 4,292 Updated Feb 17, 2025

120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.

Python 29,927 4,498 Updated May 8, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 289,388 48,157 Updated Dec 2, 2024

Out-of-memory sorting of large datasets map / reduce style processing

Rust 47 4 Updated Feb 10, 2025

A streaming query language.

Haskell 57 11 Updated Oct 20, 2020

Catch common Java mistakes as compile-time errors

Java 6,919 749 Updated Feb 17, 2025

Fast Apache Avro serialization/deserialization library

Java 43 14 Updated Oct 13, 2020

Python Helper library for Jupyter Notebooks

Jupyter Notebook 1,043 161 Updated Feb 16, 2021

Iceberg is a table format for large, slow-moving tabular data

Java 480 59 Updated Apr 10, 2023

An open-source, vendor-neutral data context service.

Java 159 51 Updated Mar 6, 2018

High-performance runtime for data analytics applications

Rust 2,996 256 Updated Jun 22, 2022

Parsing and analysis of Vertica, Hive, and Presto SQL.

Haskell 1,079 145 Updated Feb 16, 2022

a way to develop software with Nix

Nix 333 24 Updated May 23, 2021

Compilation and Verification of Data-Centric Languages

Coq 56 9 Updated Jul 17, 2024

Scala library for free applicative schemas capable of parsing/rendering sums-of-products data structures.

Scala 108 12 Updated Aug 30, 2018

Optics library for Scala

Scala 1,667 203 Updated Feb 17, 2025