This is the official implementation for fine-tuning models through representation editing. We provide trained models for quick evaluation, along with the corresponding training code to reproduce our work.
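As a rough illustration of the underlying idea (a sketch under our own assumptions, not the repo's actual implementation), representation editing keeps the base model frozen and learns lightweight edit vectors, such as a per-dimension scale and bias applied to hidden representations. The module name `EditLayer` and the identity initialization below are our assumptions:

```python
import torch
import torch.nn as nn

class EditLayer(nn.Module):
    """Hypothetical edit layer: element-wise scale and bias on hidden states.

    The base model's weights stay frozen; only `scale` and `bias` are trained.
    """
    def __init__(self, hidden_size: int):
        super().__init__()
        # Initialized to the identity transformation (scale = 1, bias = 0).
        self.scale = nn.Parameter(torch.ones(hidden_size))
        self.bias = nn.Parameter(torch.zeros(hidden_size))

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        return hidden_states * self.scale + self.bias
```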
There are several directories in this repo:
- `bash` contains scripts for both training and evaluation
- `model` contains the checkpoints and generation results we have trained with RED for quick evaluation
- `RED` contains the source code for training and evaluation of RED
- `RED++` contains the source code for training and evaluation of the variant of RED
- `lm-evaluation-harness` contains the source code for automated evaluation on Llama
- Install requirements

```bash
conda create -n RED python=3.8
conda activate RED
pip install -r requirements.txt
pip install torch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 --index-url https://download.pytorch.org/whl/cu118
```
- You also need to install Java 1.8 (required by the e2e-metrics evaluation tools)
- Download datasets and evaluation packages

```bash
# Download datasets
python download_dataset.py
# Download evaluation packages
git clone https://github.com/huggingface/evaluate.git
git clone https://github.com/tuetschek/e2e-metrics.git
```
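To sanity-check that the evaluation dependencies are working, the cloned huggingface/evaluate library can load standard generation metrics. The metric choice below is illustrative only; the repo's scripts may rely on the E2E metrics from e2e-metrics instead:

```python
import evaluate

# Load a standard generation metric (illustrative choice only).
bleu = evaluate.load("bleu")
result = bleu.compute(
    predictions=["the cafe serves cheap food"],
    references=[["the cafe serves cheap food near the river"]],
)
print(result["bleu"])
```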
- Evaluate RED models

Here we provide a simple script to evaluate the model we trained with RED on the E2E_NLG dataset; you can go to the `bash` folder to learn more detailed test scripts.

```bash
cd bash/GPT-2/Test
# Evaluate based on the GPT-2 generation results we have provided
bash gpt2_medium_evaluation.sh
# Use the model we have provided and the specified decoding strategy to generate samples,
# save them in the pred.txt file, and then use these generation results for evaluation
bash gpt2_medium_generation.sh
```
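For context, the generation step conceptually resembles the sketch below; the checkpoint path, prompt format, decoding parameters, and output handling are our assumptions for illustration and are not taken from gpt2_medium_generation.sh:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Hypothetical checkpoint path; the real path depends on the `model` directory layout.
model = GPT2LMHeadModel.from_pretrained("model/gpt2_medium_e2e")
tokenizer = GPT2Tokenizer.from_pretrained("gpt2-medium")
model.eval()

prompt = "name : The Vaults | Type : pub | price : cheap"
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    # Beam search is a common decoding strategy for E2E NLG (assumed here).
    output_ids = model.generate(
        **inputs,
        num_beams=10,
        max_new_tokens=64,
        pad_token_id=tokenizer.eos_token_id,
    )

text = tokenizer.decode(output_ids[0], skip_special_tokens=True)
with open("pred.txt", "w") as f:
    f.write(text + "\n")
```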
Here we provide a simple script for training the RoBERTa-base model with the RED method on the STS-B dataset; more detailed training settings can be found in the `bash` folder.

- Execute scripts for training

```bash
cd bash/RED/RoBERTa/RoBERTa_base
bash stsb.sh
```
- Training RoBERTa-base on STS-B
```bash
CUDA_VISIBLE_DEVICES=0 python ../../../../RED/RoBERTa/roberta_base.py \
    --seed $seed \
    --weight_decay 0.0 \
    --dataset_name "stsb" \
    --batch_size 32 \
    --lr 0.003 \
    --do_train \
    --do_eval \
    --do_test \
    --warmup_rate 0.06 \
    --operation_key "ffn_all_layer" \
    --model_type "roberta_base" \
    --epochs 40
```
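Our reading of the `--operation_key "ffn_all_layer"` flag (an interpretation, not confirmed against the repo's code) is that edit vectors are attached to the FFN output of every transformer layer. Below is a minimal sketch using forward hooks on a Hugging Face RoBERTa model; the hook target `layer.output.dense` and the `EditLayer` module are assumptions:

```python
import torch
import torch.nn as nn
from transformers import RobertaModel

class EditLayer(nn.Module):
    # Same hypothetical scale-and-bias edit as sketched above.
    def __init__(self, hidden_size: int):
        super().__init__()
        self.scale = nn.Parameter(torch.ones(hidden_size))
        self.bias = nn.Parameter(torch.zeros(hidden_size))

    def forward(self, h):
        return h * self.scale + self.bias

model = RobertaModel.from_pretrained("roberta-base")
for p in model.parameters():
    p.requires_grad = False  # freeze the base model; only edit vectors train

# Attach one edit layer to each transformer layer's FFN output projection
# (our guess at what "ffn_all_layer" selects).
edits = nn.ModuleList()
for layer in model.encoder.layer:
    edit = EditLayer(model.config.hidden_size)
    edits.append(edit)
    # Returning a value from a forward hook replaces the module's output.
    layer.output.dense.register_forward_hook(
        lambda module, inputs, output, edit=edit: edit(output)
    )
```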