Skip to content

Navigation Menu

MachineLearningSystem

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
#

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

#

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

MachineLearningSystem

Overview
Repositories
Projects
Packages
People

More

Overview
Repositories
Projects
Packages
People

Popular repositories Loading

25ASPLOS-Medusa Public

Forked from thustorage/Medusa

Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]

HTML 10 1
24MLSYS-prompt-cache Public

Forked from yale-sys/prompt-cache

Modular and structured prompt caching for low-latency LLM inference

Python 6
24PPOPP-Liger Public

C++ 5
Optimus-CC Public

[ASPLOS'23] Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression

Python 3 4
ATC23-Legion Public

Forked from JIESUN233/Legion

RC4ML GNN System Projects

C++ 3
Awesome-DL-Scheduling-Papers Public

Forked from S-Lab-System-Group/Awesome-DL-Scheduling-Papers

2

Repositories

Loading

Type

Select type

All Public Sources Forks Archived Mirrors Templates

Language

Select language

All C C++ Cuda Fortran Go HTML Java JavaScript Jsonnet Jupyter Notebook MATLAB Python Rust Shell

Sort

Select order

Last updated Name Stars

Showing 10 of 628 repositories

OSDI25-PipeANN Public Forked from thustorage/PipeANN
[OSDI'25] Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search Algorithm with SSD

C++ 0 2 0 0 Updated Apr 24, 2025
25ASPLOS-Hetu-Galvatron Public Forked from PKU-DAIR/Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

Python 0 Apache-2.0 11 0 0 Updated Apr 17, 2025
25SIGMOD-Apt-Serve Public Forked from eddiegaoo/Apt-Serve

Python 0 1 0 0 Updated Apr 12, 2025
specreason Public Forked from ruipeterpan/specreason
PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [arXiv '25]

Python 0 5 0 0 Updated Apr 10, 2025
ember Public Forked from pyember/ember

Python 0 MIT 25 0 0 Updated Apr 8, 2025
AReaL Public Forked from inclusionAI/AReaL
Distributed RL System for LLM Reasoning

Python 0 Apache-2.0 52 0 0 Updated Apr 4, 2025
Triton-distributed Public Forked from ByteDance-Seed/Triton-distributed
Distributed Triton for Parallel Systems

C++ 0 MIT 34 0 0 Updated Apr 4, 2025
25NSDI-ByteCheckpoint Public Forked from ByteDance-Seed/ByteCheckpoint
ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 0 Apache-2.0 6 0 0 Updated Apr 2, 2025
25ASPLOS-Ayo Public Forked from NetX-lab/Ayo
[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 0 MIT 2 0 0 Updated Mar 31, 2025
async_rlhf Public Forked from mnoukhov/async_rlhf
Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Python 0 Apache-2.0 5 0 0 Updated Mar 26, 2025

View all repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.