I create MIT bugs.
- Italy
-
14:08
(UTC +01:00) - https://mateonunez.co/
- @mmateonunez
- in/mateo-nunez
Highlights
Stars
๐ RL
2 repositories
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Fine-tuning & Reinforcement Learning for LLMs. ๐ฆฅ Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.