- Singapore
- https://xingangpan.github.io/
Stars
[ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
[3DV 2026] FastMesh: Efficient Artistic Mesh Generation via Component Decoupling
Dynamic 3D Foundation Model using Causal Transformer
[ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation
Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
[CVPR 2025] Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion
[CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)
[SIGGRAPH Asia 2025] Official code for "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models."
[SIGGRAPH Asia 2024] I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
[ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control
Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"
[NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller
High-quality and editable surfel 3D Gaussian generation through native 3D diffusion (ICLR 2025)
A collection of resources and papers on Diffusion Models
A community-maintained Python framework for creating mathematical animations.
Understanding Deep Learning - Simon J.D. Prince
Live2Diff: A pipeline that processes live video streams with a uni-directional video diffusion model.
Code for the SIGGRAPH Asia paper "An Implicit Neural Representation for the Image Stack: Depth, All in Focus, and High Dynamic Range"
✨✨Latest Advances on Multimodal Large Language Models
[ECCV 2024] LN3Diff creates high-quality 3D object meshes from text within 8 V100-seconds.
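A practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…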
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
DragGAN meets GET3D for interactive mesh generation and editing.