-
Notifications
You must be signed in to change notification settings - Fork 8.6k
Pendulum v1
Hadi edited this page Jan 15, 2022
·
1 revision
- Name: Pendulum-v1
- Category: Classic Control
- Leaderboard Page
- Old links:
Try to keep a frictionless pendulum standing up.
Unknown
Type: Box(3)
Num | Observation | Min | Max |
---|---|---|---|
0 | cos(theta) | -1.0 | 1.0 |
1 | sin(theta) | -1.0 | 1.0 |
2 | theta dot | -8.0 | 8.0 |
Type: Box(1)
Num | Action | Min | Max |
---|---|---|---|
0 | Joint effort | -2.0 | 2.0 |
The precise equation for reward:
-(theta^2 + 0.1*theta_dt^2 + 0.001*action^2)
Theta is normalized between -pi and pi. Therefore, the lowest reward is -(pi^2 + 0.1*8^2 + 0.001*2^2) = -16.2736044
, and the highest reward is 0
. In essence, the goal is to remain at zero angle (vertical), with the least rotational velocity, and the least effort.
Random angle from -pi to pi, and random velocity between -1 and 1
There is no specified termination. Adding a maximum number of steps might be a good idea.
NOTE: Your environment object could be wrapped by the TimeLimit wrapper, if created using the "gym.make" method. In that case it will terminate after 200 steps.
None yet specified
- Gym Repository
- Wiki Home
- Leaderboard
- Environments
- FAQ
- Resources
- Feature Requests