-
21:29
(UTC -12:00)
Lists (1)
Sort Name ascending (A-Z)
Stars
Open-Sora: Democratizing Efficient Video Production for All
Fully open reproduction of DeepSeek-R1
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Wan: Open and Advanced Large-Scale Video Generative Models
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Wan: Open and Advanced Large-Scale Video Generative Models
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Efficient Triton Kernels for LLM Training
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Simple, scalable AI model deployment on GPU clusters
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
Official PyTorch implementation for "Large Language Diffusion Models"
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
VideoSys: An easy and efficient system for video generation
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback