thahn1230

Follow

sungsomi thahn1230

Follow

3 followers · 3 following

Organizations

Pinned Loading

SmoothQuant_in_SpecInfer SmoothQuant_in_SpecInfer Public

Forked from mit-han-lab/smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1
AWQ_in_SpecInfer AWQ_in_SpecInfer Public

Forked from mit-han-lab/llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Jupyter Notebook 2
SPViT_tflite_ver SPViT_tflite_ver Public

Forked from PeiyanFlying/SPViT

This include tflite conversion codes in SPViT

Python
llama-ssp-quant llama-ssp-quant Public

Cuda