Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Remove the unnecessary regular advantage code in PPO #287

Merged
merged 18 commits into from
Oct 4, 2022

Conversation

bragajj
Copy link
Contributor

@bragajj bragajj commented Oct 3, 2022

Description

Resolving issue #207
Unnnecessary ppo code removed, numerical accuracy was ensured by team members through debugger. Additional runs showing performance without the extra code can be found at the following wandb link: https://wandb.ai/bragajj/ppo_advcalc

Types of changes

  • Bug fix
  • New feature
  • New algorithm
  • Documentation

Checklist:

  • I've read the CONTRIBUTION guide (required).
  • I have ensured pre-commit run --all-files passes (required).

To resolve issue vwxyzjn#207 in cleanrl, extra advantage code not needed
To resolve issue vwxyzjn#207 in cleanrl, extra advantage calc code unnecessary
Updated to resolve issue vwxyzjn#207, unncessary additional advantage calc code
Updated to resolve issue vwxyzjn#207, unnecessary additional advantage calc code for ppo implementations
Updated to resolve issue vwxyzjn#207
@vercel
Copy link

vercel bot commented Oct 3, 2022

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Updated
cleanrl ✅ Ready (Inspect) Visit Preview Oct 4, 2022 at 0:52AM (UTC)

@vwxyzjn vwxyzjn changed the title Issue #207 Resolution PPO Remove the unnecessary regular advantage code in PPO Oct 3, 2022
@vwxyzjn
Copy link
Owner

vwxyzjn commented Oct 3, 2022

Thanks @bragajj, it looks good. I would also remove the gae flag.

parser.add_argument("--gae", type=lambda x: bool(strtobool(x)), default=True, nargs="?", const=True,
help="Use GAE for advantage computation")

The --gae flag exists in other scripts as well. If you could do the same for them, that would be great!

image

@bragajj
Copy link
Contributor Author

bragajj commented Oct 3, 2022

GAE flags removed from all ppo files, isaac gym and ppo_rnd_envpool.py are also now updated to reflect GAE revisions

Fixed styling of lines 432-436
Copy link
Owner

@vwxyzjn vwxyzjn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks @bragajj!

@vwxyzjn vwxyzjn merged commit f0bbf49 into vwxyzjn:master Oct 4, 2022
@vwxyzjn vwxyzjn mentioned this pull request Oct 19, 2022
vwxyzjn added a commit to jseppanen/cleanrl that referenced this pull request Nov 20, 2022
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants