ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

This repository contains the code for the paper "ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization".

arXiv | Website

Installation

ORSO requires Python ≥ 3.8

Create a new conda environment with

conda create -n orso python=3.8
conda activate orso

Install IsaacGym. Follow the instruction to download the package.

tar -xvf IsaacGym_Preview_4_Package.tar.gz
cd isaacgym/python
pip install -e .
(test installation) python examples/joint_monkey.py

Install ORSO

pip install -e .
cd isaacgymenvs
pip install -e .

Set an environemnt variable for the OpenAI API key
```
export OPENAI_API_KEY= "YOUR_API_KEY"
```

Running Experiments

Navigate to the src directory and run:

python train_orso_budget.py env={env}

The full set of hyperparameters can be found in src/config/config.yaml and src/config/envs/{env}.yaml for environment specific parameters.

Work in Progress

An implementation of ORSO with a fixed set of reward functions and without language model will be available soon. We will also provide a minimal implementation framework with CleanRL for practitioners to easily integrate ORSO into their projects.

Citation

If you find this code useful, please consider citing our paper:

@inproceedings{zhang2025orso,
  title={{ORSO}: Accelerating Reward Design via Online Reward Selection and Policy Optimization},
  author={Chen Bo Calvin Zhang and Zhang-Wei Hong and Aldo Pacchiano and Pulkit Agrawal},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025},
  url={https://openreview.net/forum?id=0uRc3CfJIQ}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
isaacgymenvs		isaacgymenvs
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

Installation

Running Experiments

Work in Progress

Citation

About

Releases

Packages

Languages

License

Improbable-AI/orso

Folders and files

Latest commit

History

Repository files navigation

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

Installation

Running Experiments

Work in Progress

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages