Starred repositories
[ICLR'25] SplatFormer: Point Transformer for Robust 3D Gaussian Splatting
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
MVGS: Multi-View Regulated Gaussian Splatting for Novel View Synthesis
[AAAI 2025🔥] Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
[3DV 2025 Best Paper] We present Object Images (Omages): An homage to the classic Geometry Images.
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR 2025)
Transparent Image Layer Diffusion using Latent Transparency
Original reference implementation of "GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering" [CVPR 2024]
Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis
[ECCV 2024] "FSGS: Real-Time Few-Shot View Synthesis using Gaussian Splatting", Zehao Zhu*, Zhiwen Fan*, Yifan Jiang, Zhangyang Wang
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
[SIGGRAPH 2023] We provide a unified formula for neural fields (Factor Fields) and a novel dictionary factorization (Dictionary Fields)
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment; code and models are fully open source; supports Windows, macOS, Linux)
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
[ICCV 2023] Consistent Image Synthesis and Editing
A curated list of recent diffusion models for video generation, editing, and various other applications.
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators