Incentive Q-Flow (IQ-Flow)

This is the code for experiments in the paper IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas. This implementation benefits from LIO and JAXRL.

Setup

Python 3.8
Tensorflow 2.2.1
Flax 0.3.4
Jax 0.2.17
Jaxlib 0.3.0
Gym 0.26.2
Follow the setup instructions for the official repository of the paper Learning to Incentivize Other Learning Agents at LIO.
Run $ pip install -e . from the root.
In order to do hyperparameter tuning, install Optuna.

Navigation

alg/ - Implementation of IQ-Flow and baselines.
config/ - Configuration files for experiments.
env/ - Implementation of the Escape Room game and wrappers around the SSD environment from LIO and Iterated Matrix Games.
eval/ - Evaluation scripts
networks/ - Neural network implementations
trainer/ - Training scripts
utils/ - Utilities

Examples

Train IQ-Flow on Iterated Matrix Games

Set config values in config/config_img_qflow.py
cd into the trainer folder
Execute training script $ python train_multiprocess.py --alg qflow --exp ipd --n_seeds 5 --config config_img_qflow to run Iterated Prisoner's Dilemma with 5 seeds. You can see the argument options for further configuration details.
Execute training script $ python train_multiprocess.py --alg qflow --exp chicken --n_seeds 5 --config config_img_qflow to run Chicken Game with 5 seeds.
Execute training script $ python train_multiprocess.py --alg qflow --exp stag_hunt --n_seeds 5 --config config_img_qflow to run Stag Hunt with 5 seeds.

Train IQ-Flow on Escape Room

Set config values in config/config_er_qflow.py
cd into the trainer folder
Execute training script $ python train_multiprocess.py --alg qflow --exp er --n_seeds 5 --config config_er_qflow to run Escape with 5 seeds. You can see the argument options for further configuration details.

Train IQ-Flow on Cleanup

Set config values in config/config_ssd_qflow.py
cd into the trainer folder
Execute training script $ python train_multiprocess.py --alg qflow --exp ssd --n_seeds 5 --config config_ssd_qflow to run Cleanup with 5 seeds. You can see the argument options for further configuration details.

Train Incentive Designer(ID) on Escape Room

Set config values in config/config_er_id.py
cd into the trainer folder
Execute training script $ python train_multiprocess.py --alg id --exp er --n_seeds 5 --config config_er_id to run Escape with 5 seeds. You can see the argument options for further configuration details.

Train Incentive Designer(ID) on Cleanup

Set config values in config/config_ssd_id.py
cd into the trainer folder
Execute training script $ python train_multiprocess.py --alg id --exp ssd --n_seeds 5 --config config_ssd_id to run Cleanup with 5 seeds. You can see the argument options for further configuration details.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
iqflow		iqflow
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Incentive Q-Flow (IQ-Flow)

Setup

Navigation

Examples

Train IQ-Flow on Iterated Matrix Games

Train IQ-Flow on Escape Room

Train IQ-Flow on Cleanup

Train Incentive Designer(ID) on Escape Room

Train Incentive Designer(ID) on Cleanup

About

Releases

Packages

Languages

data-and-decision-lab/IQ-Flow

Folders and files

Latest commit

History

Repository files navigation

Incentive Q-Flow (IQ-Flow)

Setup

Navigation

Examples

Train IQ-Flow on Iterated Matrix Games

Train IQ-Flow on Escape Room

Train IQ-Flow on Cleanup

Train Incentive Designer(ID) on Escape Room

Train Incentive Designer(ID) on Cleanup

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages