Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A generative world for general-purpose robotics & embodied AI learning.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Generative Models by Stability AI
Google Research
[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
A curated list of recent diffusion models for video generation, editing, and various other applications.
[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
links to conference publications in graph-based deep learning
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
A collection of AWESOME things about domain adaptation
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
A little word cloud generator in Python
Official inference repo for FLUX.1 models
[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
(CVPR 2025) DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
[ICCV 2023] GETAvatar: Generative Textured Meshes for Animatable Human Avatars
[ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437