TSP or Terrain Based Switching
There are 3 types of envs:
- Cobblestone
- Gravel
- Sandpaper
├── README.md
├── images
├── TSP
│ ├── xml
│ │ ├── cobblestone_hopper.xml
│ │ ├── cobblestone_hopper_loose_v1.xml
│ │ ├── cobblestone_hopper_loose_v2.xml
│ │ ├── sandpaper_hopper.xml
│ │ └── gravel_hopper.xml
│ ├── notebooks
| | ├── train
│ │ | ├── baseline_online_RL.ipynb
│ │ | ├── cobblestone_hopper_loose_v1.ipynb
│ │ | ├── standard_hopper.ipynb
│ │ | ├── sandpaper_hopper.ipynb
│ │ | ├── TSP_oracle.ipynb
│ │ | └── TSP.ipynb
│ │ └── test
│ │ ├── Evaluation.ipynb
│ │ └── Visualization.ipynb
│ ├── policies
│ │ ├── cobblestone_hopper_loose_v1
│ │ ├── standard_hopper
│ │ ├── sandpaper_hopper
| | ├── online_RL_policy
| | ├── TSP_oracle
│ │ └── TSP
│ └── data
│ └── data_generation
├── archive (temporary)
usr/local/lib/python3.7/dist-packages/gym/envs/mujoco/assets
Here we include a short video of the TSP agent in action. The agent is trained on the cobblestone environment and is tested on the sandpaper environment. The agent is able to adapt to the new environment and perform well.