Here we compare the standard power-of-two quantization formula against our improved quantization formula. We use ResNet, MobileNet, and MobileViT models, all of which are available in the models directory, and we test on the CIFAR and ImageNet datasets, which are available in the data directories. The main launch script is train_launch.sh, which we describe how to use below.
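As background, here is a minimal sketch of a generic power-of-two weight quantizer in PyTorch. It illustrates the standard idea only: the function name pow2_quantize, the clipping range, and the small epsilon are assumptions, and the exact standard and improved formulas used in this repository are defined in its quantizer code.
import torch

def pow2_quantize(w: torch.Tensor, bits: int = 4) -> torch.Tensor:
    # Illustration of standard power-of-two quantization: snap each weight to a
    # signed power of two, with the exponent range set by the bit width.
    max_exp = torch.floor(torch.log2(w.abs().max()))   # largest exponent kept
    min_exp = max_exp - (2 ** (bits - 1) - 1)          # assumed clipping range
    exp = torch.round(torch.log2(w.abs().clamp_min(1e-12)))
    exp = torch.clamp(exp, min_exp, max_exp)
    return torch.sign(w) * torch.pow(2.0, exp)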
# provision a Google Cloud VM with 4 T4 GPUs
python3 create_vm.py --project_id="high-performance-ml" --vm_name="sleds" --disk_size=100 --gpu_type="nvidia-tesla-t4" --gpu_count=4 --machine_type="n1-standard-8"
# install dependencies and download the datasets
pip install -r requirements.txt
python download_data.py --dataset=cifar
huggingface-cli login # enter your Hugging Face token first; required to download ImageNet
python download_data.py --dataset=imagenet
# example: a single full-precision run (quantizer_type=none) of resnet20 on CIFAR with 4 GPUs
export LD_LIBRARY_PATH=
export OMP_NUM_THREADS=1
torchrun --standalone --nnodes=1 --nproc-per-node=4 train.py --model_type=resnet20 --dataset=cifar --quantizer_type=none --bits=4 --num_epochs=164 --batch_size=128 --lr=0.1 --seed=8
For a given model and dataset, train_launch.sh performs full-precision training followed by every QAT configuration. We supply the remaining hyperparameters, in our case num_epochs=164, batch_size=128, lr=0.1, and num_gpus=4. To build a complete collection of training runs, each configuration is repeated across multiple seeds; here we start at seed=1 and run num_seeds=10 seeds in total. The positional arguments are: model, dataset, num_epochs, batch_size, lr, num_gpus, start_seed, num_seeds.
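The sketch below shows the assumed structure of train_launch.sh; it is not the script itself, and the quantizer_type values other than none are placeholders (check train.py for the names it actually accepts).
#!/bin/bash
# Hedged sketch of train_launch.sh (assumed structure, not the actual script).
# Positional arguments: model dataset num_epochs batch_size lr num_gpus start_seed num_seeds
MODEL=$1; DATASET=$2; EPOCHS=$3; BATCH=$4; LR=$5
NUM_GPUS=$6; START_SEED=$7; NUM_SEEDS=$8

# "none" is full precision; the remaining names are placeholders for the QAT quantizers.
QUANT_TYPES="none power_of_two improved"

export LD_LIBRARY_PATH=
export OMP_NUM_THREADS=1
for (( seed = START_SEED; seed < START_SEED + NUM_SEEDS; seed++ )); do
  for quant in $QUANT_TYPES; do
    torchrun --standalone --nnodes=1 --nproc-per-node=$NUM_GPUS train.py \
      --model_type=$MODEL --dataset=$DATASET --quantizer_type=$quant --bits=4 \
      --num_epochs=$EPOCHS --batch_size=$BATCH --lr=$LR --seed=$seed
  done
done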
./train_launch.sh resnet20 cifar 164 128 0.1 4 1 10
./train_launch.sh resnet32 cifar 164 128 0.1 4 1 10
./train_launch.sh resnet44 cifar 164 128 0.1 4 1 10
./train_launch.sh resnet56 cifar 164 128 0.1 4 1 10
./train_launch.sh mobilenet cifar 164 128 0.1 4 1 10
./train_launch.sh mobilevit cifar 164 128 0.1 4 1 10
# only perform full precision training for imagenet
export LD_LIBRARY_PATH=
export OMP_NUM_THREADS=1
torchrun --standalone --nnodes=1 --nproc-per-node=4 train.py --model_type=resnet56 --dataset=imagenet --quantizer_type=none --bits=4 --num_epochs=164 --batch_size=128 --lr=0.1 --seed=8
torchrun --standalone --nnodes=1 --nproc-per-node=4 train.py --model_type=mobilenet --dataset=imagenet --quantizer_type=none --bits=4 --num_epochs=164 --batch_size=128 --lr=0.1 --seed=8
torchrun --standalone --nnodes=1 --nproc-per-node=4 train.py --model_type=mobilevit --dataset=imagenet --quantizer_type=none --bits=4 --num_epochs=164 --batch_size=128 --lr=0.1 --seed=8
# get test results for full precision, PTQ and QAT across all seeds
python test.py --model_type=resnet20 --dataset=cifar
python test.py --model_type=resnet32 --dataset=cifar
python test.py --model_type=resnet44 --dataset=cifar
python test.py --model_type=resnet56 --dataset=cifar
python test.py --model_type=mobilenet --dataset=cifar
python test.py --model_type=mobilevit --dataset=cifar
# get test results for full precision and PTQ for imagenet
python test.py --model_type=resnet56 --dataset=imagenet --skip_qat=True
python test.py --model_type=mobilenet --dataset=imagenet --skip_qat=True
python test.py --model_type=mobilevit --dataset=imagenet --skip_qat=True
For results, charts, and tables, see analysis.ipynb. So far the results are mixed: the improved formulas sometimes give better generalization error and quantization error than the standard power-of-two formula, but not always. We intend to rerun each experiment with additional seeds to get more robust results.
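As a point of reference, quantization error here can be read as the distance between full-precision weights and their quantized counterparts; the snippet below shows one common way to measure it (a relative L2 norm), which is an assumption rather than necessarily what analysis.ipynb reports.
import torch

def quantization_error(w_fp: torch.Tensor, w_q: torch.Tensor) -> float:
    # Relative L2 distance between full-precision and quantized weights
    # (assumed definition; analysis.ipynb may compute it per layer or differently).
    return (torch.norm(w_fp - w_q) / torch.norm(w_fp)).item()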