0.4.6.post1 1.10.0 1.21.2 3.9.10 | packaged by conda-forge | (main, Feb 1 2022, 21:24:11)
[GCC 9.4.0] linux
Hi,
I think the RNN, as currently implemented, is not working. I've been testing it with several algorithms, and these are the results I got:
DQN on the environments:
- CartPole: learns in less than one epoch, but "forgets" if it is run for longer
- LunarLander: doesn't learn
- MountainCar: doesn't learn
SAC on the environments:
- Ant-v3: doesn't learn
- HalfCheetah-v3: doesn't learn
Could you please refer me to the implementation you are trying to replicate?
BFAnas changed the title from "What paper or reference, is th RNN implementation trying to replicate?" to "What paper or reference is the RNN implementation trying to replicate?" on Mar 11, 2022
There are known issues, pointed out in #486. Unfortunately, I won't have time to fix them for another two months (after graduation). Feel free to submit PRs, and I'm really sorry about that.