Stars
Data recipes and robust infrastructure for training AI agents
This is a pytorch implementation of k-means clustering algorithm
JohannesBuchner / imagehash
Forked from bunchesofdonald/photohashA Python Perceptual Image Hashing Module
verl: Volcano Engine Reinforcement Learning for LLMs
Scalable toolkit for efficient model reinforcement
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
Official PyTorch implementation for "Large Language Diffusion Models"
get things from one computer to another, safely
An open-source implementation for training LLaVA-NeXT.
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning"
Understanding R1-Zero-Like Training: A Critical Perspective
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
A light-weight tool for evaluating LLMs in rule-based ways.
Wan: Open and Advanced Large-Scale Video Generative Models
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Efficient Triton Kernels for LLM Training
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
A unified inference and post-training framework for accelerated video generation.
Fully open data curation for reasoning models
A library for advanced large language model reasoning
Synthetic data curation for post-training and structured data extraction