Lists (1)
Sort Name ascending (A-Z)
Stars
Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
Community trainer for Lightricks' LTX Video model 🎬 ⚡️
[CVPR 2026] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Wan: Open and Advanced Large-Scale Video Generative Models
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
[ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[IEEE TPAMI 2026] Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" (IEEE TPAMI, 2026) and Aweso…
Generative Models by Stability AI
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Latent Consistency Model for AUTOMATIC1111 Stable Diffusion WebUI
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
WebUI extension for ControlNet
Refine high-quality datasets and visual AI models
Official code for PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking (ICCV 2023)
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
animatediff prompt travel
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
[SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters