-
20:28
(UTC -12:00)
Lists (1)
Sort Name ascending (A-Z)
Stars
Efficient Triton Kernels for LLM Training
An open-source AI agent that brings the power of Gemini directly into your terminal.
Simple, scalable AI model deployment on GPU clusters
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Production-ready platform for agentic workflow development.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
DeepEP: an efficient expert-parallel communication library
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
Official PyTorch implementation for "Large Language Diffusion Models"
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
[🔥 Yuwei Niu's Academic Personal Homepage]
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
High quality training free inpaint for every stable diffusion model. Supports ComfyUI
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
SEED-Voken: A Series of Powerful Visual Tokenizers
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling