-
Notifications
You must be signed in to change notification settings - Fork 705
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Remove the unnecessary regular advantage code in PPO #287
Conversation
To resolve issue vwxyzjn#207 in cleanrl, extra advantage code not needed
To resolve issue vwxyzjn#207 in cleanrl, extra advantage calc code unnecessary
Updated to resolve issue vwxyzjn#207, unncessary additional advantage calc code
Updated to resolve issue vwxyzjn#207
Updated to resolve issue vwxyzjn#207, unnecessary additional advantage calc code for ppo implementations
Updated to resolve issue vwxyzjn#207
Updated to resolve issue vwxyzjn#207
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
Thanks @bragajj, it looks good. I would also remove the Lines 58 to 59 in 49168b8
The |
GAE flags removed from all ppo files, isaac gym and ppo_rnd_envpool.py are also now updated to reflect GAE revisions |
Fixed styling of lines 432-436
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Thanks @bragajj!
Description
Resolving issue #207
Unnnecessary ppo code removed, numerical accuracy was ensured by team members through debugger. Additional runs showing performance without the extra code can be found at the following wandb link: https://wandb.ai/bragajj/ppo_advcalc
Types of changes
Checklist:
pre-commit run --all-files
passes (required).