Karakarawowow / machin

python data-science algorithm scikit-learn regression logistic pytorch dqn smo knn datamining sac azure-machine-learning ppo reinforcementlearning td3 adaboost-algorithm a3c-pytorch

Updated Feb 12, 2026
Jupyter Notebook

Skw3mdy / Reinforcement-Learning-Projects

🤖 Explore reinforcement learning techniques with projects including a taxi agent using Q-Learning and a DQN-based Space Invaders agent.

machine-learning robotics unity simulation deep-reinforcement-learning dcgan gym neural-networks cartpole sac augmentation ppo erfnet td3 semantic-segmentation-models pytorch-template intrinsic-reward huggingface

Updated Feb 12, 2026
Jupyter Notebook

ramzibjd19 / Deep-Reinforcement-Learning-With-Pytorch

🤖 Implement classic and state-of-the-art deep reinforcement learning algorithms using clear PyTorch code for easy understanding and application.

reinforcement-learning deep-learning unity tensorflow deep-reinforcement-learning recurrent-neural-networks gym reinforce alphago actor-critic pytorch-a3c acer emergent-behavior double-dqn trpo a2c td3 multi-agent-reinforcement-learning

Updated Feb 12, 2026
Python

Jaehyun-Jeong / 100LinesRL

Clean RL algorithm implementations in under 100 lines each.

python machine-learning reinforcement-learning deep-learning pytorch dqn reinforcement-learning-algorithms rl educational sac gymnasium ppo td3 rl-algorithms minimal-implementation 100-line-code

Updated Feb 12, 2026
Python

SarodYatawatta / smart-calibration

Deep reinforcement learning for smart calibration of radio telescopes. Automatic hyper-parameter tuning.

reinforcement-learning distributed-computing openai-gym pytorch hyperparameter-optimization radio-astronomy ddpg sac sagecal fuzzy-controller radio-telescopes td3 elastic-net-regression influence-maps

Updated Feb 11, 2026
Python

ItsTSV / RoboDRL

Implementation and usage of advanced deep reinforcement learning algorithms in robotic control and complex simulated environments.

reinforcement-learning textual pytorch reinforcement-learning-algorithms sac mujoco ppo td3

Updated Feb 10, 2026
Python

AlirezaShamsoshoara / RL-from-zero

Comprehensive collection of reinforcement learning algorithms implemented from scratch in PyTorch. Features tabular, value-based, policy gradient, actor-critic, offline RL, and multi-agent methods. Includes YAML-based configuration, W&B integration, and a unified CLI.

python machine-learning reinforcement-learning deep-learning q-learning pytorch policy-gradient a3c deep-q-network ddpg sac gymnasium actor-critic multiagent-reinforcement-learning ppo td3 multi-agent-reinforcement-learning marl pettingzoo

Updated Feb 12, 2026
Python

RS2002 / Triple-BERT

[ICLR 2026 oral] Official Repository for The Paper, Triple-BERT: Do We Really Need MARL for Order Dispatch on Ride-Sharing Platforms?

reinforcement-learning bert td3 ride-pool order-dispatchment

Updated Feb 9, 2026
Python

PrayYoung / r2pa

R²PA — Regime-aware reinforcement learning for portfolio allocation (RL, regime signals, LLM oracle)

reinforcement-learning quantitative-finance sac portfolio-allocation ppo a2c td3 quantative-trading llms

Updated Feb 5, 2026
Python

RS2002 / MA2SA

Official Repository for The Paper, Beyond Multi‑Agent Reinforcement Learning: Scalable Centralized Control for Large-Scale Dynamic Trip-Vehicle Assignment

reinforcement-learning bert td3

Updated Jan 31, 2026
Python

ayushraj09 / TradingAgent

PPO-based trading agent with automated fine-tuning every 2 hours and SHAP/LIME explainability. Achieved 22.56% return with 2.318 Sharpe ratio.

reinforcement-learning deep-reinforcement-learning fintech sac drl lime ppo trading-automation explainable-ai xai td3 trading-signals shap drl-trading-agents stablebaselines3

Updated Jan 29, 2026
Jupyter Notebook

reiniscimurs / DRL-robot-navigation-IR-SIM

Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

ddpg obstacle-avoidance sac drl ppo robot-navigation obstacle-avoidance-robot td3 ddpg-pytorch ppo-pytorch sac-pytorch drl-pytorch td3-pytorch ir-sim

Updated Jan 28, 2026
Python

Tahernezhad / Continuous-Control-Workbench

A clean PyTorch implementation of PPO, SAC, and TD3 made from scratch. It is built for testing and comparing continuous control RL algorithms on complex environments such as BipedalWalker-v3.

reinforcement-learning deep-learning pytorch policy-gradient from-scratch sac gymnasium actor-critic ppo td3 bipedalwalker continuous-control-tasks

Updated Jan 28, 2026
Python

liyc-ai / RL-pytorch

A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.

pytorch dqn reinforcement-learning-algorithms ddpg sac ddqn trpo ppo td3 dueldqn

Updated Jan 27, 2026
Python

Degas01 / quadruped-gait-rl

Quadruped gait control using reinforcement learning (TD3, DDPG) with MATLAB/Simulink and Simscape simulation.

machine-learning reinforcement-learning robotics matlab deep-reinforcement-learning simulink control-systems ddpg legged-robots mechatronics quadruped autonomous-robots td3 simscape simscape-multibody robot-locomotion

Updated Jan 23, 2026
MATLAB

jviquerat / dragonfly

Paper repository: "Dragonfly: a modular deep reinforcement learning library"

deep-reinforcement-learning gym ddpg sac drl mujoco ppo td3

Updated Jan 12, 2026
Roff

arminlotfyFP / Safe-Proximal-Policy-Optimization-with-Predictive-and-Memory_Aware-Battery-Management-for-BEVs

For this paper, two RLlib versions are relevant: 2.10 and 2.8. Version 2.10 does not include TD3, so we use version 2.8. In higher versions, the policy of trained agents cannot be used manually. The reward function has been deliberately removed!

rl sac gymnasium ppo cnn-lstm td3 tensorflow2

Updated Jan 12, 2026
Python

smtmRadu / DeepUnity

An open source deep learning library for Unity.

reinforcement-learning deep-learning unity ddpg sac ppo td3 llm-inference

Updated Jan 8, 2026
Jupyter Notebook

datawhalechina / easy-rl

A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/

reinforcement-learning deep-reinforcement-learning q-learning dqn policy-gradient sarsa a3c ddpg imitation-learning double-dqn dueling-dqn ppo td3 easy-rl

Updated Dec 30, 2025
Jupyter Notebook

Omkarkkale / Autonomous-Robotic-Arm-Control-using-Reinforcement-Learning

Autonomous Robotic Arm Control (Franka Panda) using Twin Delayed DDPG (TD3) in Robosuite/MuJoCo. An implementation of Deep Reinforcement Learning for continuous control tasks like Door Opening.

reinforcement-learning deep-learning continuous-control autonomous-systems mujoco td3 robosuite robotics-pytorch franka-emika-panda

Updated Dec 29, 2025
Python

td3

197 public repositories match this topic.
