Starred repositories
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
Implementation of the AAAI 2025 paper "SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers".
JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation
lihaoyun6 / FlashVSR_plus
Forked from OpenImagingLab/FlashVSRTowards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
The official repository of our ICLR 2026 paper "Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration".
Real-time 3D full-body reconstruction from a single camera, Multiperson BVH output, Pure C++ runtime, ONNX + ggml, 70-joint skeleton with hands.
[ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.
[AAAI 2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull
[CVPR2026] VOSR: A Vision-Only Generative Model for Image Super-Resolution
[ICCV 2025] This is the official PyTorch codes for the paper: "DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution"
[CVPR2026] ODTSR: This repo is the official implementation of "One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution"
[ICML26 Spotlight] UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
delldu / TRELLIS
Forked from microsoft/TRELLISOfficial repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
delldu / Hunyuan3D-2
Forked from Tencent-Hunyuan/Hunyuan3D-2High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
[ICCV 2025] Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
[ICLR 2025 spotlight] 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
ComfyUI-OmniGen - A ComfyUI custom node implementation of OmniGen, a powerful text-to-image generation and editing model.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Lightweight Python framework that provides a high-level API for creating and rendering scenes with Blender.
delldu / chitu
Forked from thu-pacman/chituHigh-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
delldu / Wan2.1
Forked from Wan-Video/Wan2.1Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models
delldu / cpp-httplib
Forked from yhirose/cpp-httplibA C++ header-only HTTP/HTTPS server and client library