-
Updated
Feb 12, 2026 - Jupyter Notebook
td3
Here are 197 public repositories matching this topic...
🤖 Explore reinforcement learning techniques with projects including a taxi agent using Q-Learning and a DQN-based Space Invaders agent.
-
Updated
Feb 12, 2026 - Jupyter Notebook
🤖 Implement classic and state-of-the-art deep reinforcement learning algorithms using clear PyTorch code for easy understanding and application.
-
Updated
Feb 12, 2026 - Python
Clean RL algorithm implementations in under 100 lines each.
-
Updated
Feb 12, 2026 - Python
Deep reinforcement learning for smart calibration of radio telescopes. Automatic hyper-parameter tuning.
-
Updated
Feb 11, 2026 - Python
Implementation and usage of advanced deep reinforcement learning algorithms in robotic control and complex simulated environments.
-
Updated
Feb 10, 2026 - Python
Comprehensive collection of reinforcement learning algorithms implemented from scratch in PyTorch. Features tabular methods, value-based , policy gradient, actor-critic, offline RL, and multi-agent methods Includes YAML-based configuration, W&B integration, and unified CLI.
-
Updated
Feb 12, 2026 - Python
[ICLR 2026 oral] Official Repository for The Paper, Triple-BERT: Do We Really Need MARL for Order Dispatch on Ride-Sharing Platforms?
-
Updated
Feb 9, 2026 - Python
R²PA — Regime-aware reinforcement learning for portfolio allocation (RL, regime signals, LLM oracle)
-
Updated
Feb 5, 2026 - Python
Official Repository for The Paper, Beyond Multi‑Agent Reinforcement Learning: Scalable Centralized Control for Large-Scale Dynamic Trip-Vehicle Assignment
-
Updated
Jan 31, 2026 - Python
PPO-based trading agent with automated fine-tuning every 2 hours and SHAP/LIME explainability. Achieved 22.56% return with 2.318 Sharpe ratio.
-
Updated
Jan 29, 2026 - Jupyter Notebook
Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.
-
Updated
Jan 28, 2026 - Python
A clean PyTorch implementation of PPO, SAC, and TD3 made from scratch. It is built for testing and comparing continuous control RL algorithms on complex environments such as BipedalWalker-v3.
-
Updated
Jan 28, 2026 - Python
Quadruped gait control using reinforcement learning (TD3, DDPG) with MATLAB/Simulink and Simscape simulation.
-
Updated
Jan 23, 2026 - MATLAB
An open source deep learning library for Unity.
-
Updated
Jan 8, 2026 - Jupyter Notebook
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
-
Updated
Dec 30, 2025 - Jupyter Notebook
Autonomous Robotic Arm Control (Franka Panda) using Twin Delayed DDPG (TD3) in Robosuite/MuJoCo. An implementation of Deep Reinforcement Learning for continuous control tasks like Door Opening.
-
Updated
Dec 29, 2025 - Python
Improve this page
Add a description, image, and links to the td3 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the td3 topic, visit your repo's landing page and select "manage topics."