Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Agent/m rl #79

Merged
merged 8 commits into from
Dec 14, 2021
Merged

Agent/m rl #79

merged 8 commits into from
Dec 14, 2021

Conversation

leonard-q
Copy link
Collaborator

🌟 Hello! Thanks for contributing JORLDY!

Checklist

Please check if you consider the following items.

  • [v] My code follows the style guidelines of this project contributing
  • [v] My code follows the naming convention of documentation
  • [v] I have commented my code, particularly in hard-to-understand areas
  • [v] My changes generate no new warnings or errors

Types of changes

Please describe the types of changes! (ex. Bugfix, New feature, Documentation, ...)
New Feature

Test Configuration

  • OS: Linux Ububtu
  • Python version: 3.8
  • Additional libraries: None

Description

Please describe the details of your contribution
Add M-RL (M-DQN, M-IQN) algorithms.
They are implemented with simple modification to DQN and IQN.
Therefore, the performance is compared with Original DQN and IQN.
The figure shows the performance between M-RL and Original algorithms (same parameters).

  • light blue, green: DQN
  • Gray, Pink: M-DQN
  • Orange: IQN
  • Blue: M-IQN
    image

@leonard-q leonard-q requested a review from ramanuzan as a code owner December 14, 2021 00:24
root and others added 2 commits December 14, 2021 11:37
Co-authored-by: leonard-q <leonard-q@users.noreply.github.com>
@ramanuzan ramanuzan merged commit 369acff into master Dec 14, 2021
@ramanuzan ramanuzan deleted the agent/m-rl branch December 14, 2021 04:38
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants