```bibtex
@inproceedings{
  cai2022network,
  title={Network Augmentation for Tiny Deep Learning},
  author={Han Cai and Chuang Gan and Ji Lin and Song Han},
  booktitle={International Conference on Learning Representations},
  year={2022},
  url={https://openreview.net/forum?id=TYw3-OlrRm-}
}
```
Requirements:
- Python 3.8.5
- PyTorch 1.8.2
- torchpack
- torchprofile
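A possible way to set up this environment (a sketch only; the environment name and exact install commands are assumptions, not taken from this repo):

```bash
# Hypothetical setup commands; adjust to your own environment.
conda create -n netaug python=3.8.5 -y
conda activate netaug
# Install PyTorch 1.8.2 (LTS); pick the command matching your CUDA version from pytorch.org.
pip install torch torchvision
pip install torchpack torchprofile
```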
Model | #Params | #MACs | ImageNet Top-1 (%) | Pretrained weights
---|---|---|---|---
MobileNetV2-Tiny + NetAug | 0.75M | 23.5M | 53.3 | pth
MCUNet + NetAug | 0.74M | 81.8M | 62.7 | pth
ProxylessNAS-Mobile (w0.35, r160) + NetAug | 1.8M | 35.7M | 60.8 | pth
More are available on Google Drive.
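As a rough sanity check, the #Params and #MACs columns can be reproduced with torchprofile (listed in the requirements above). The sketch below assumes a `build_model` helper, which is a placeholder for however the repo actually constructs its networks:

```python
import torch
from torchprofile import profile_macs

# `build_model` is a hypothetical placeholder for the repo's model construction code.
model = build_model("proxylessnas-0.35")
model.eval()

# image_size=160, matching the evaluation example below
inputs = torch.randn(1, 3, 160, 160)
macs = profile_macs(model, inputs)  # number of multiply-accumulate operations
params = sum(p.numel() for p in model.parameters())
print(f"#Params: {params / 1e6:.2f}M, #MACs: {macs / 1e6:.1f}M")
```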
To evaluate pretrained models, please run eval.py.
Example:
```bash
torchpack dist-run -np 1 python eval.py \
    --dataset imagenet --data_path /dataset/imagenet/ \
    --image_size 160 \
    --model proxylessnas-0.35 \
    --init_from <path_of_pretrained_weight>
```
Scripts for training models with NetAug on ImageNet are available under the folder `bash/imagenet`.
Notes:
- With NetAug, the expand ratio of the augmented model becomes very large. We find that the fan-out (`fout`) initialization strategy does not work well for such models, so we use `nn.init.kaiming_uniform_` initialization when NetAug is enabled.
- At the beginning of each epoch, we sort the channels by their L1 norm, which forces the target model to keep the most important channels.
- We stop augmenting the width multiplier (i.e., the width-multiplier augmentation ratio is fixed to 1.0) in the second half of the training epochs, which slightly improved the results in our early experiments.
- With NetAug, the running mean and running variance in BN layers are not accurate. Therefore, whenever NetAug is used, we re-estimate the BN running statistics on a subset of the training images after obtaining the trained model (see the sketch below).
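A minimal sketch of the BN re-estimation step from the last note above, assuming a standard PyTorch training DataLoader; the actual procedure in this repo (number of images, averaging scheme) may differ:

```python
import torch
import torch.nn as nn

@torch.no_grad()
def reset_bn_statistics(model, data_loader, num_batches=100):
    """Re-estimate BN running mean/var on a subset of the training images."""
    for m in model.modules():
        if isinstance(m, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)):
            m.reset_running_stats()   # clear the inaccurate statistics
            m.momentum = None         # accumulate a simple average over all forwarded batches
    model.train()  # BN layers update running stats only in training mode
    for i, (images, _) in enumerate(data_loader):
        if i >= num_batches:
            break
        model(images)  # forward pass only; no loss or backward needed
    model.eval()
```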
To run transfer learning experiments, first download our pretrained weights or train the models on the pretraining dataset yourself. Scripts are available under the folder `bash/transfer/`.
Related projects:
- TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning (NeurIPS'20)
- MCUNet: Tiny Deep Learning on IoT Devices (NeurIPS'20, spotlight)
- Once-for-All: Train One Network and Specialize it for Efficient Deployment (ICLR'20)
- ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware (ICLR'19)