Pinned
-
SmoothQuant_in_SpecInfer (Public, forked from mit-han-lab/smoothquant)
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Python 1
-
AWQ_in_SpecInfer (Public, forked from mit-han-lab/llm-awq)
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Jupyter Notebook 2
-
SPViT_tflite_ver (Public, forked from PeiyanFlying/SPViT)
This includes the TFLite conversion code for SPViT.
Python