Stars
Simplifying reinforcement learning for complex game environments
Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/
Solve puzzles. Improve your pytorch.
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
Scalable RL solution for advanced reasoning of language models
Tongyi Deep Research, the Leading Open-source Deep Research Agent
My learning notes/codes for ML SYS.
SkyRL: A Modular Full-stack RL Library for LLMs
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
Optimize prompts, code, and more with AI-powered Reflective Text Evolution
slime is an LLM post-training framework for RL Scaling.
Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects
Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.
Training-Ready RL Environments + Evals
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
A PyTorch native platform for training generative AI models
Environments for LLM Reinforcement Learning
verl: Volcano Engine Reinforcement Learning for LLMs
Qwen Code is a coding agent that lives in the digital world.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
Supabase CLI. Manage postgres migrations, run Supabase locally, deploy edge functions. Postgres backups. Generating types from your database schema.