Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Training on BipedalWalkerHardcore seems to result in a negative reward #7

Open
kirk86 opened this issue Oct 9, 2018 · 1 comment
Open

Comments

@kirk86
Copy link

kirk86 commented Oct 9, 2018

Hi and thanks for sharing the code.
I've tried to run the training process on a different environment such as the BipedalWalkerHardcore-v2 but it seems that is not able to learn anything. I even tried with different shift values as noted in the code comments but still in the end I get a negative reward. Should we train for longer or there any hyperparams that we are missing?

@ar8372
Copy link

ar8372 commented Jun 25, 2022

Hey @kirk86 , I am having similar issue did you solve it?
Do look at this thread for my exact issue.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants