ppo

Star

Here are 8 public repositories matching this topic...

phonism / LLMNotes

Star

LLM 学习笔记：Transformer 架构、强化学习 (RLHF/DPO/PPO)、分布式训练、推理优化。含完整数学推导与Slides。

reinforcement-learning notes transformer moe learning-notes ppo dpo llm rlhf rlvr

Updated Feb 28, 2026
TeX

LetteraUnica / BriscolaBot

Star

Reinforcement Learning agent that plays Briscola, a famous Italian card game

game python machine-learning reinforcement-learning ai deep-learning ml deep-reinforcement-learning pytorch game-theory actor-critic proximal-policy-optimization ppo multi-agent-reinforcement-learning briscola

Updated Jan 24, 2024
TeX

jviquerat / dragonfly

Star

Paper repository: "Dragonfly: a modular deep reinforcement learning library"

deep-reinforcement-learning gym ddpg sac drl mujoco ppo td3

Updated Mar 24, 2026
TeX

phschoepf / cs-bachelor-thesis

Star

My bachelor thesis in Computer Science, "Hypernetwork-PPO for Continual Reinforcement Learning".

hypernetworks ppo continual-learning

Updated Feb 8, 2023
TeX

zamweis / sumo-marl-traffic-control

Star

A framework for training and evaluating multi-agent reinforcement learning models for adaptive traffic light control in SUMO.

sumo traffic-simulation ppo marl sumo-rl

Updated Nov 24, 2025
TeX

Sproc01 / highway_RL_agent

Star

RL agents for the highway environment

python reinforcement-learning python3 pytorch dqn autonomous-driving dueling-dqn ppo

Updated Mar 31, 2025
TeX

papetronics / case-studies-final-project

Star

Reinforcement Learning for Yahtzee: A2C, PPO, REINFORCE

reinforcement-learning reinforce yahtzee ppo a2c msai msai-ut-austin

Updated Dec 16, 2025
TeX

hadamove / deep_rl_presentations

Star

Deep RL topics presented at FI MUNI

drl ppo dpo deepseek-math deepseek-r1 grpo

Updated Jun 9, 2025
TeX

Improve this page

Add a description, image, and links to the ppo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ppo topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ppo

Here are 8 public repositories matching this topic...

phonism / LLMNotes

LetteraUnica / BriscolaBot

jviquerat / dragonfly

phschoepf / cs-bachelor-thesis

zamweis / sumo-marl-traffic-control

Sproc01 / highway_RL_agent

papetronics / case-studies-final-project

hadamove / deep_rl_presentations

Improve this page

Add this topic to your repo