# ANTOR

This is the implementation of the COLING 2022 paper:
**Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning**. [[arXiv]](https://arxiv.org/abs/2209.07873)
## Requirements
Python == 3.7
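For example, you can set up a dedicated environment before installing anything. This is a minimal sketch, assuming conda is available; the environment name `antor` is our placeholder, not part of the official steps:

```bash
# Create and activate an isolated Python 3.7 environment (conda assumed)
conda create -n antor python=3.7
conda activate antor
```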
## Setup
- Clone the repository
  ```bash
  git clone --recursive git@github.com:nu-dialogue/antor.git
  cd antor
  ```
- Install ConvLab-2
  ```bash
  cd ConvLab-2
  pip install -e .
  cd ../
  ```
- Install antor
  ```bash
  pip install -e .
  python -m spacy download en_core_web_sm
  ```
It is okay to ignore pip's dependency errors involving ConvLab-2.
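To confirm that the installation succeeded despite those errors, a quick import check can be run. This check is our suggestion rather than an official step; `convlab2` is ConvLab-2's Python package name:

```bash
# Verify that ConvLab-2 is importable after installation
python -c "import convlab2; print('ConvLab-2 OK')"
```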
## Experiments
You can reproduce the three experiments from the paper by following the steps below.

### Preparing the NLG and NLU models
Before running the experiments, the NLG and NLU models must be prepared.
- GPT-2 (base model for reinforcement learning) and SC-GPT
  - Go to `sys_nlg/gpt2` and `sys_nlg/scgpt`, and prepare each model following the `README.md` in the respective directory.
- SC-LSTM
  - No preparation is required, since we use the pre-trained model available in ConvLab-2.
- MILU and BERT NLU
  - Go to `user_nlu/milu` and `user_nlu/bert`, and prepare each model following the `README.md` in the respective directory.
### Experiment 1: Fine-tuning NLG with NLU
- Fine-tune the NLG (GPT-2) with an NLU (BERT NLU or MILU)
  - Go to `experiments/ppo` and follow the `README.md` in the directory.
- Evaluate the fine-tuned model
  - Go to `experiments/evaluate_model` and follow the `README.md` in the directory (a sketch of this two-stage flow follows the list).
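Each experiment follows the same two-stage pattern of fine-tuning and evaluation. The sketch below only shows the directory flow; the concrete commands are documented in each directory's `README.md` and are not reproduced here:

```bash
# Directory flow shared by all three experiments; actual commands are
# described in each directory's README.md.
cd experiments/ppo             # 1. fine-tune the NLG with PPO
# ... follow experiments/ppo/README.md ...
cd ../evaluate_model           # 2. evaluate the fine-tuned model
# ... follow experiments/evaluate_model/README.md ...
```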
### Experiment 2: ASR error simulation
- Build the confusion matrix for ASR error simulation
  - Build the confusion matrix by performing the steps described in the `README.md` of `experiments/text2speech_data`, `experiments/noisy_speech_data`, `experiments/speech2text_data`, and `experiments/speech_error_simulation`, in that order (see the sketch after this list).
  - You can skip this step by directly using the final confusion matrices we used in our paper; their usage is described in `experiments/speech_error_simulation`. In addition, we also publish the noisy transcribed text data used to build the confusion matrices; see `experiments/speech2text_data`.
- Fine-tune the NLG in ASR error simulation
  - Go to `experiments/ppo` and follow the `README.md` in the directory.
- Evaluate the fine-tuned model
  - Go to `experiments/evaluate_model` and follow the `README.md` in the directory.
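The four data-building stages above must be run in order. The sketch below shows only that order; each stage's purpose is inferred from its directory name, and the concrete commands live in each `README.md`:

```bash
# Ordered pipeline for building the ASR confusion matrix; stage
# descriptions are inferred from the directory names.
cd experiments/text2speech_data    # 1. synthesize speech from text
# ... follow its README.md ...
cd ../noisy_speech_data            # 2. add noise to the synthesized speech
# ... follow its README.md ...
cd ../speech2text_data             # 3. transcribe the noisy speech with ASR
# ... follow its README.md ...
cd ../speech_error_simulation      # 4. build the confusion matrix from transcriptions
# ... follow its README.md ...
```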
### Experiment 3: Vocabulary levels
- Prepare the vocabulary levels
  - Go to `experiments/vocabulary_level` and follow the `README.md` in the directory.
- Train the NLU models on data restricted to certain vocabulary levels
  - Go to `user_nlu/milu` or `user_nlu/bert` and train each model following the `README.md` in the directory.
- Fine-tune the NLG
  - Go to `experiments/ppo` and follow the `README.md` in the directory.
- Evaluate the fine-tuned model
  - Go to `experiments/evaluate_model` and follow the `README.md` in the directory.
## Citation
```bibtex
@article{ohashi2022adaptive,
  title={Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning},
  author={Ohashi, Atsumoto and Higashinaka, Ryuichiro},
  journal={arXiv preprint arXiv:2209.07873},
  year={2022}
}
```