Stars
Lightweight and portable LLM sandbox runtime (code interpreter) Python library.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
NexRL is an ultra-loosely-coupled LLM post-training framework.
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
🤗 A PyTorch-native and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
SGLang model provider for Strands Agents for on-policy agentic RL training.
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.
A construction kit for reinforcement learning environment management.
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
A lightweight, powerful framework for multi-agent workflows
Fully Open Framework for Democratized Multimodal Training
InternLM / GroupedGEMM
Forked from fanshiqing/grouped_gemmPyTorch bindings for CUTLASS and CUBLAS Grouped GEMM, Permute and Unpermute.
InternLM / AdaptiveGEMM
Forked from deepseek-ai/DeepGEMMAdaptiveGEMM: FP8 GEMM with Adaptation to Various Lengths of Group M
A debugging and profiling tool that can trace and visualize python code execution
how to optimize some algorithm in cuda.
Reference PyTorch implementation and models for DINOv3
Implementation for FP8/INT8 Rollout for RL training without performence drop.
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo