Add a tutorial for vectorized envs using A2C. #234
Content:
- How to create vectorized envs
- Using vectorized envs for domain randomization to train more robust agents
- Building an A2C agent and training it on LunarLander-v2 using vector envs
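For readers skimming this thread, here is a minimal, library-free sketch of the first two points: several environments stepped together as one batch, each built with a slightly different physics parameter. It is an illustration only, not the tutorial's code and not the Gymnasium API; `ToyEnv`, `ToySyncVectorEnv`, `make_randomized_env_fns`, and the gravity range are all hypothetical names and values chosen for the example.

```python
import random


class ToyEnv:
    """Hypothetical 1-D environment: `gravity` pulls the state down each step."""

    def __init__(self, gravity):
        self.gravity = gravity
        self.state = 0.0

    def reset(self):
        self.state = 0.0
        return self.state

    def step(self, action):
        # The action pushes the state up, gravity pulls it down.
        self.state += action + self.gravity
        reward = -abs(self.state)  # staying near zero is rewarded
        terminated = abs(self.state) > 10.0
        return self.state, reward, terminated


class ToySyncVectorEnv:
    """Steps several environments one after another and batches the results."""

    def __init__(self, env_fns):
        # Each entry is a zero-argument factory, so every copy is independent.
        self.envs = [fn() for fn in env_fns]
        self.num_envs = len(self.envs)

    def reset(self):
        return [env.reset() for env in self.envs]

    def step(self, actions):
        observations, rewards, terminations = [], [], []
        for env, action in zip(self.envs, actions):
            obs, reward, terminated = env.step(action)
            if terminated:
                obs = env.reset()  # restart finished episodes automatically
            observations.append(obs)
            rewards.append(reward)
            terminations.append(terminated)
        return observations, rewards, terminations


def make_randomized_env_fns(num_envs, seed=0):
    """Domain randomization: draw a different gravity for every environment."""
    rng = random.Random(seed)
    gravities = [rng.uniform(-1.2, -0.8) for _ in range(num_envs)]
    # Bind each gravity as a default argument so every factory keeps its own value.
    return [lambda g=g: ToyEnv(gravity=g) for g in gravities]


envs = ToySyncVectorEnv(make_randomized_env_fns(3))
obs = envs.reset()
obs, rewards, terminations = envs.step([1.0, 1.0, 1.0])
```

The same action produces a different observation in each copy because the dynamics differ, which is the property that domain randomization relies on to train agents that do not overfit to a single parameter setting.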
The tutorial looks good.
You note that the purpose of the tutorial is to speed up training.
Not sure how painful this would be, but could you:
- Create a new graph with the time taken (I believe the `RecordEpisodes` wrapper includes this info) for X time steps using different numbers of environments: 1, 3, 10, 32. Log the performance as well.
- Compare the performance of different vector implementations: `AsyncVectorEnv`, `SyncVectorEnv`, plus `EnvPool` or `SampleFactory` if you have time.
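A rough sketch of the kind of measurement requested here, assuming a fixed budget of transitions split across 1, 3, 10, and 32 environments. `dummy_step`, `time_rollout`, and the budget of 960 steps are hypothetical placeholders; a real comparison would replace the inner loop with a call to the vector environment's `step` and would vary the vector implementation as well.

```python
import time


def dummy_step(action):
    """Hypothetical stand-in for one environment step."""
    return action * 0.5


def time_rollout(num_envs, total_steps):
    """Time how long it takes to collect `total_steps` transitions.

    Returns the elapsed seconds and the number of batched step calls made.
    """
    batched_calls = -(-total_steps // num_envs)  # ceiling division
    start = time.perf_counter()
    for _ in range(batched_calls):
        # One batched call advances every environment by one step.
        [dummy_step(1.0) for _ in range(num_envs)]
    return time.perf_counter() - start, batched_calls


TOTAL_STEPS = 960  # arbitrary budget chosen for the example

results = {}
for num_envs in (1, 3, 10, 32):
    seconds, calls = time_rollout(num_envs, TOTAL_STEPS)
    results[num_envs] = {
        "seconds": seconds,
        "batched_calls": calls,
        # Guard against a zero reading on very coarse timers.
        "steps_per_second": TOTAL_STEPS / max(seconds, 1e-9),
    }

for num_envs, row in results.items():
    print(num_envs, row["batched_calls"], f"{row['steps_per_second']:.0f}")
```

With a sequential stand-in like this one, the wall-clock time barely changes with the number of environments; the interesting differences only show up once the real step cost and the parallel implementations are plugged in.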
Everything incorporated now.
- fix type hints
- add performance analysis section
Not sure why isort is failing here; locally it's passing.
New performance section looks good and interesting.
For pre-commit, if you run it twice in a row does it fix the issue?
Content: