[Question] Is the length of trajectory (episode) controlled by the done in step() function? #814
Closed
2 tasks done
Labels
question
Further information is requested
Question
In RL, a trajectory (or episode) is a sequence of states and actions in the world, see the following link for more explanations:
https://spinningup.openai.com/en/latest/spinningup/rl_intro.html
In SB3, is the length of the trajectory (episode) controlled by the done in step() function?
If I have an infinite horizon, then the done is always False during all total_timesteps in the model.learn() function, right? If I have a finite horizon, I just need to set the done = True when one trajectory (episode) ends and the model.learn() function will identify it automatically, right? Thanks in advance.
Checklist
The text was updated successfully, but these errors were encountered: