A replication of EBGU's Diffusion-LM, trained on a small part of the ROCStories dataset on a single GPU. The implementation is simple and easy to understand.
- Run `python trainer.py` for training.
- Run `python infer.py` to generate samples unconditionally.
- You may want to modify the `.yaml` file to change config settings (see the sketch after this list).
- Multi-GPU training settings and the full dataset can be found in the original codebase.
The main idea of the code is in `diffusion_bert.py`:

- Noise is added only to the word embeddings, before encoding by the BERT encoder (self-attention and feed-forward layers).
- The word embedding (with noise), position embedding, token_type embedding, and time embedding are summed and fed to the BERT encoder, which produces a dim=768 vector per position.
- For sampling, `bert.cls.predictions` predicts a token from the vocabulary at each position by taking the argmax of the logits. A forward-pass sketch follows this list.
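A minimal, self-contained sketch of a single noisy forward pass as described above, using Hugging Face `transformers`. The schedule value `alpha_bar` and the time embedding are placeholders, not the actual values computed in `diffusion_bert.py`:

```python
import torch
from transformers import BertForMaskedLM

model = BertForMaskedLM.from_pretrained("bert-base-uncased")
emb = model.bert.embeddings

input_ids = torch.tensor([[101, 2198, 2359, 1037, 6513, 102]])  # arbitrary token ids for illustration
B, L = input_ids.shape

with torch.no_grad():
    word = emb.word_embeddings(input_ids)            # (B, L, 768)

    # Noise is added only to the word embeddings.
    alpha_bar = torch.tensor(0.9)                    # placeholder schedule value at step t
    noisy_word = alpha_bar.sqrt() * word + (1 - alpha_bar).sqrt() * torch.randn_like(word)

    # Sum the (noisy) word, position, token_type, and time embeddings.
    position_ids = torch.arange(L).unsqueeze(0)
    token_type_ids = torch.zeros_like(input_ids)
    time_emb = torch.randn(B, 1, 768)                # placeholder timestep embedding
    hidden = (noisy_word
              + emb.position_embeddings(position_ids)
              + emb.token_type_embeddings(token_type_ids)
              + time_emb)
    hidden = emb.LayerNorm(hidden)

    # BERT encoder (self-attention + feed-forward) -> (B, L, 768).
    encoded = model.bert.encoder(hidden).last_hidden_state

    # Prediction head over the vocab, argmax per position.
    logits = model.cls.predictions(encoded)          # (B, L, vocab_size)
    tokens = logits.argmax(dim=-1)
```

In the real sampler this step would be run iteratively over the reverse diffusion timesteps; only one step is shown here.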
Some changes:

- Added a loss curve in the `ckpt` folder.
- Added sample results on the test data.
The output looks like reasonable English, but the model is overfitted:
> [CLS] john was wanted a girlfriend so so he got dating dating. he went went on on date with her friend named joy came. she got five minutes. he got got angry for an angry friend. joy had a problem he had anger problem and problem, no hive hives. anna prevented him from finding she. [SEP]
Correct me if I'm wrong or if you have a different understanding. Thanks!
The original paper is Diffusion-LM ("Diffusion-LM Improves Controllable Text Generation", Li et al., 2022, https://arxiv.org/abs/2205.14217).