Data Center Environment and Reinforcement Learning (RL) Control
-
Updated
Oct 29, 2023 - HTML
Data Center Environment and Reinforcement Learning (RL) Control
Hybrid Reinforcement Learning and minimax agent for Tablut game. Combines PPO trained value networks with alpha beta search for competitive play.
RL for insulin dosing in Type-1 Diabetes patients using Simglucose simulator
🛒 Sistema de pedidos em TypeScript usando POO e arquitetura orientada a eventos (projeto da Unidade 4).
Fine-tunes FLAN-T5 using Reinforcement Learning (PPO) and PEFT to generate less toxic summaries, leveraging Meta AI's hate speech reward model for detoxification.
Comparing the performance of MPC based racing and RL based racing
Add a description, image, and links to the ppo topic page so that developers can more easily learn about it.
To associate your repository with the ppo topic, visit your repo's landing page and select "manage topics."