Weight Distillation (WD)

Requirements

pytorch >= 1.0, python >= 3.6.0, cuda >= 9.2

How to Reproduce

Take the NIST12 Chinese-English (Zh-En) for Example.

To reproduce the experiments, please run:

# train the baseline models
sh train_zh2en.sh
# translate to produce the pseudo data
sh translate_kd.sh
# train the wd phase1 models
sh train_wd_zh2en_kd_phase1.sh
# convert wd phase1 models to wd phase2 models
python3 fair_wd_weight_to_weight.py -model model_wd_phase1.pt
# train the wd phase2 models
sh train_wd_zh2en_kd_phase2.sh
# translate and score on the test and valid sets
sh translate_zh2en.sh

Implementations

The code files that implements WD are located in:

fairseq/models/wd_transformer.py

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
docs		docs
examples		examples
fairseq		fairseq
fairseq_cli		fairseq_cli
scripts		scripts
tests		tests
zh2en_ref		zh2en_ref
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
PATENTS		PATENTS
README.md		README.md
eval_lm.py		eval_lm.py
fair_wd_weight_to_weight.py		fair_wd_weight_to_weight.py
fairseq.gif		fairseq.gif
fairseq_logo.png		fairseq_logo.png
generate.py		generate.py
interactive.py		interactive.py
multi-bleu.perl		multi-bleu.perl
preprocess.py		preprocess.py
rerank.py		rerank.py
score.py		score.py
setup.py		setup.py
train.py		train.py
train_wd_zh2en_kd_phase1.sh		train_wd_zh2en_kd_phase1.sh
train_wd_zh2en_kd_phase2.sh		train_wd_zh2en_kd_phase2.sh
train_zh2en.sh		train_zh2en.sh
translate_decode.sh		translate_decode.sh
translate_kd.sh		translate_kd.sh
translate_zh2en.sh		translate_zh2en.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Weight Distillation (WD)

Requirements

How to Reproduce

Implementations

About

Releases

Packages

Languages

License

Lollipop321/weight-distillation

Folders and files

Latest commit

History

Repository files navigation

Weight Distillation (WD)

Requirements

How to Reproduce

Implementations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages