Nanyang Technological University
Singapore
shangchenzhou.com
@ShangchenZhou
Stars
Wan: Open and Advanced Large-Scale Video Generative Models
Dynamic 3D Foundation Model using Causal Transformer
Reference PyTorch implementation and models for DINOv3
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
Official SeedVR2 Video Upscaler for ComfyUI
Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
ObjectClear: Complete Object Removal via Object-Effect Attention
[SIGGRAPH Asia 2025] Official code for "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models."
[CVPR2025 Highlight] SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and power…
[CVPR 2025] 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement
[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization
Author's Implementation for E-LatentLPIPS
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Concept Sliders for Precise Control of Diffusion Models
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
Official inference repo for FLUX.1 models
BokehMe: When Neural Rendering Meets Classical Rendering (CVPR 2022 Oral)
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"
High-quality and editable surfel 3D Gaussian generation through native 3D diffusion (ICLR 2025)
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation