[Experimental] Miles-diffusion is an post-training framework for large-scale diffusion model training and production workloads, forked from and co-evolving with miles.

Python 17 5 Updated Jun 12, 2026

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 593 38 Updated Nov 26, 2025

SandAI-org / MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 850 58 Updated Jun 15, 2026

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,396 285 Updated Feb 20, 2026

vllm-project / speculators

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 516 104 Updated Jun 13, 2026

verl-project / uni-agent

A unified framework for building, running, and training general agents at scale.

Python 340 44 Updated Jun 15, 2026

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,813 79 Updated Jan 20, 2026

verl-project / verl-mint

Open MinT training runtime on veRL

Python 235 14 Updated May 18, 2026

lightseekorg / tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,434 156 Updated Jun 15, 2026

walkinglabs / hands-on-modern-rl

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Python 2,898 178 Updated Jun 12, 2026

verl-project / vexact

verl Zero-Mismatch Dense/MoE HuggingFace Rollout

Python 53 5 Updated Jun 11, 2026

verl-project / verl-omni

Multimodal RL training framework for diffusion & omni models

Python 359 55 Updated Jun 15, 2026

sail-sg / odc

On demand communication

Python 34 2 Updated Apr 16, 2026

deepseek-ai / TileKernels

A kernel library written in tilelang

Python 1,587 138 Updated Apr 23, 2026

skyzh / chicv

A minimal and fully-customizable CV template for Typst.

Typst 715 51 Updated Apr 6, 2025

wusyong / resume.typ

Resume template for Typst. Mirror to https://typst.app/project/rVVa3y9vXemUKyvNKnabKV

Typst 152 13 Updated Jul 15, 2025

ice-kylin / typst-cv-miku

A simple, elegant, academic style CV template for typst. Support for English and Chinese (and more).

96 7 Updated Apr 9, 2023

redai-infra / Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 423 47 Updated Jun 13, 2026

LMIS-ORG / slime-agentic

A project implementing various agentic RL based on the Slime post-training framework

Python 465 32 Updated Apr 11, 2026

icerain-alt / FSDPToys

Learning and Debugging for FSDP/FSDP2 Training

Python 17 Updated Feb 7, 2026

mingyin0312 / RLFromScratch

Python 633 66 Updated Aug 28, 2025

mit-han-lab / fastrl

[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Python 173 18 Updated Feb 27, 2026