Active Learning to Guide Labeling Efforts for Question Difficulty Estimation

This repository contains the code to reproduce the results of the paper Active Learning to Guide Labeling Efforts for Question Difficulty Estimation.

The paper is available here as an arXiv preprint.

Workflow

Download the RACE (here) and RACE-c (here) dataset. Together, these datasets form the RACE++ dataset. Save the datasets in the data/raw/ folder.

Prepare the datasets by running:

python data_preparation.py
python create_small_dev.py race_pp --size 1000

Create the environment from the environment.yml file and activate it:

conda env create -f environment.yml
conda activate qdet_active

The configuration files are located in the folder src/config/race_pp/. Run the experiments with:

python main.py race_pp

Finally, inspect the results by running the analysis.ipynb notebook.

Cite as

If you use this code in your workflow or scientific publication, please cite the corresponding paper:

@article{thuy2024active,
  title={Active Learning to Guide Labeling Efforts for Question Difficulty Estimation},
  author={Thuy, Arthur and Loginova, Ekaterina and Benoit, Dries F},
  journal={arXiv preprint arXiv:2409.09258},
  year={2024},
  doi={10.48550/arXiv.2409.09258}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Active Learning to Guide Labeling Efforts for Question Difficulty Estimation

Workflow

Cite as

Files

README.md

Latest commit

History

README.md

File metadata and controls

Active Learning to Guide Labeling Efforts for Question Difficulty Estimation

Workflow

Cite as