First Year ML PhD@Northwestern IEMS.
Undergrad@University of Science and Technology of China.
First Year ML PhD@Northwestern IEMS.
Undergrad@University of Science and Technology of China.
Course project for EE3001 Machine Learning
Jupyter Notebook 1
Forked from punica-ai/punica
Serving multiple LoRA finetuned LLM as one
Python 1
Redesigned course project for Compiler Principle 2023 Fall
C++
Forked from FMInference/FlexLLMGen
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput generation.
Python
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python