Stars
A Flexible and Powerful Parameter Server for large-scale machine learning
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
Official Code of "Distribution Matching Distillation Meets Reinforcement Learning"
Official inference repo for FLUX.2 models
Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models
The world's simplest facial recognition API for Python and the command line
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement
[ICCV 2025] Official implementation of the paper "VACE: All-in-One Video Creation and Editing"
Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.