Build software better, together

nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

reinforcement-learning deep-learning deep-reinforcement-learning pytorch policy-gradient reinforcement-learning-algorithms pytorch-tutorial proximal-policy-optimization ppo pytorch-implmention ppo-pytorch

Updated Jul 9, 2024
Python

Lizhi-sjtu / DRL-code-pytorch

Star

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

reinforcement-learning pytorch rainbow-dqn dqn-pytorch ddpg-pytorch ppo-pytorch sac-pytorch ppo-gru ppo-lstm td3-pytorch

Updated Mar 29, 2023
Python

taherfattahi / ppo-rocket-landing

Star

Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment

machine-learning reinforcement-learning ai pytorch ppo ppo-pytorch

Updated Nov 2, 2024
Python

CherryPieSexy / imitation_learning

Star

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

reinforcement-learning deep-learning deep-reinforcement-learning pytorch policy-gradient imitation-learning proximal-policy-optimization ppo advantage-actor-critic a2c gail ppo-pytorch ppo-algo recurrent-ppo gail-ppo

Updated Nov 15, 2021
Python

reiniscimurs / DRL-robot-navigation-IR-SIM

Star

Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

ddpg obstacle-avoidance sac drl ppo robot-navigation obstacle-avoidance-robot td3 ddpg-pytorch ppo-pytorch sac-pytorch drl-pytorch td3-pytorch ir-sim

Updated Mar 17, 2025
Python

philtabor / ProtoRL

Star

A Torch Based RL Framework for Rapid Prototyping of Research Papers

dqn ddpg sac actor-critic dueling-network-architecture dueling-dqn proximal-policy-optimization ppo prioritized-experience-replay td3 soft-actor-critic dqn-pytorch dueling-ddqn ddpg-pytorch dueling-dqn-pytorch ppo-pytorch sac-pytorch td3-pytorch twin-delayed-policy-gradient

Updated Jan 6, 2025
Python

dvalenciar / ReinforceUI-Studio

Sponsor

Star

ReinforceUI-Studio. A Python-based application with a graphical user interface designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC

machine-learning reinforcement-learning deep-learning pytorch reinforcement-learning-algorithms gymnasium mujoco reinforcement-learning-agent dm-control soft-actor-critic ppo-pytorch

Updated Mar 20, 2025
Python

akjayant / PPO_Lagrangian_PyTorch

Star

Implementation of PPO Lagrangian in PyTorch

reinforcement-learning lagrangian ppo safe-reinforcement-learning pytorch-implementation ppo-pytorch ppo-lagrangian

Updated Aug 29, 2022
Python

faildeny / Multi_Agent_PPO

Star

Multi agent PPO implementation in Pytorch for Unity ML Agents environments.

reinforcement-learning multi-agent-reinforcement-learning unity-ml-agents reacher-environment ppo-pytorch

Updated Jul 25, 2024
Python

jatinarora2702 / gail-pytorch

Star

PyTorch implementation of GAIL and PPO reinforcement learning algorithms

reinforcement-learning openai-gym pytorch policy-gradient imitation-learning gail cartpole-v0 ppo-pytorch

Updated May 7, 2021
Python

rvdweerd / simmodel

Star

Solving pursuit-evasion problems on graphs using Reinfocement Learning and GNNs

reinforcement-learning deep-reinforcement-learning reinforcement-learning-algorithms lstm-neural-networks pomdp pursuit-evasion graph-neural-networks graph-representation-learning pytorch-geometric dqn-pytorch ppo-pytorch partial-observability gnn-algorithm

Updated Oct 12, 2022
Python

paulchen2713 / RIS-MISO-HWI-DRL

Star

Implementation of the IEEE WCNC 2025 'Worst-Case MSE Minimization for RIS-Assisted mmWave MU-MISO Systems With Hardware Impairments and Imperfect CSI' paper

reinforcement-learning gymnasium wireless-communication ppo-pytorch stable-baselines3 digital-beamforming reconfigurable-intelligent-surfaces

Updated Mar 6, 2025
Python

Solrikk / CriptoWhisper

Star

TradeWhisperer is a sophisticated cryptocurrency trading bot that leverages advanced Reinforcement Learning techniques, specifically the Proximal Policy Optimization (PPO) algorithm, to navigate the complex world of crypto markets. Built with a focus on adaptability and risk management, this bot combines technical analysis with machine learning.

python finance trading pytorch trading-algorithms trade-bot bybit bybit-api ppo-pytorch stable-baselines3 bybit-bot criptotrading aitrade tradingapi

Updated Dec 8, 2024
Python

houssameehsain / CutnFill_DeepRL

Star

Positioning a building mass on topography while minimizing the required cut and fill excavation volume using actor critic methods.

python reinforcement-learning deep-reinforcement-learning urban-planning pytorch grasshopper topography urban-design a2c ppo-pytorch

Updated Feb 28, 2022
Python

wegfawefgawefg / wegs-drl-baselines

Star

Minimum viable reinforcement learning algorithms for your educational convenience.

machine-learning reinforcement-learning deep-reinforcement-learning openai-gym pytorch dqn neural-networks reinforcement-learning-algorithms actor-critic dueling-dqn ppo reinforcement-learning-agent td3 world-models rainbow-dqn dqn-pytorch world-models-rl ppo-pytorch noisy-dqn

Updated Jun 25, 2023
Python

francofgp / Tic-Tac-Toe-Gym

Star

This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning

python data-science machine-learning reinforcement-learning ai gym stable-baselines ppo-pytorch

Updated Nov 1, 2021
Python

CherryPieSexy / rl_mario

Star

Reinforcement learning (PPO) plays Mario.

reinforcement-learning super-mario-bros ppo ppo-pytorch

Updated Aug 4, 2021
Python

Git-123-Hub / reinforcement-learning-algorithm

Star

implementation of reinforcement learning algorithm that is easy to read and understand

reinforcement-learning deep-reinforcement-learning pytorch dqn reinforce ddpg sac ddqn dueling-dqn ppo prioritized-experience-replay td3 ppo-pytorch ddqn-per reinforce-baseline

Updated Feb 28, 2022
Python

nkoorty / rl_parking

Star

Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The project was implemented using Python, and used PyGame, OpenAI Gym, and the Stable Baselines-3 libraries in order to implement a Proximal Policy Optimisation (PPO) algorithm.

reinforcement-learning pygame ppo ppo-pytorch stablebaselines3

Updated Jul 20, 2023
Python

mominalix / Humanoid-Robot-Reinforcement-Learning-PPO

Star

This repository contains a project that leverages reinforcement learning to make a humanoid robot walk in a PyBullet simulation. It uses a custom Gym environment, a Proximal Policy Optimization (PPO) agent, and a provided URDF file for the robot model. The training process prints rewards per generation and visualizes the robot's behavior.

python reinforcement-learning tensorflow pytorch humanoid-robot pybullet gym-environment ppo ppo-pytorch

Updated Jun 24, 2024
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ppo-pytorch

Here are 63 public repositories matching this topic...

nikhilbarhate99 / PPO-PyTorch

Lizhi-sjtu / DRL-code-pytorch

taherfattahi / ppo-rocket-landing

CherryPieSexy / imitation_learning

reiniscimurs / DRL-robot-navigation-IR-SIM

philtabor / ProtoRL

dvalenciar / ReinforceUI-Studio

akjayant / PPO_Lagrangian_PyTorch

faildeny / Multi_Agent_PPO

jatinarora2702 / gail-pytorch

rvdweerd / simmodel

paulchen2713 / RIS-MISO-HWI-DRL

Solrikk / CriptoWhisper

houssameehsain / CutnFill_DeepRL

wegfawefgawefg / wegs-drl-baselines

francofgp / Tic-Tac-Toe-Gym

CherryPieSexy / rl_mario

Git-123-Hub / reinforcement-learning-algorithm

nkoorty / rl_parking

mominalix / Humanoid-Robot-Reinforcement-Learning-PPO

Improve this page

Add this topic to your repo