Stars
4 stars written in Python
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
A live-streamed development of RL tuning for LLM agents
AI Manus is a general-purpose AI Agent system that supports running various tools and operations in a sandbox environment.