-
CUHK
- Hong Kong, China
-
03:35
(UTC +08:00) - https://wbhu.github.io/
- @wbhu_cuhk
- in/huwenbo
Lists (3)
Sort Name ascending (A-Z)
Stars
A latent text-to-image diffusion model
A simple screen parsing tool towards pure vision based GUI agent
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
High-Resolution Image Synthesis with Latent Diffusion Models
《Python Cookbook》 3rd Edition Translation
Code release for NeRF (Neural Radiance Fields)
Reference PyTorch implementation and models for DINOv3
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Image restoration with neural networks but without learning.
Inpaint anything using Segment Anything and inpainting models.
Using Low-rank adaptation to quickly fine-tune diffusion models.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
A unified framework for 3D content generation.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
Scenarios, tutorials and demos for Autonomous Driving
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
An unsupervised learning framework for depth and ego-motion estimation from monocular videos
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Discovering Interpretable GAN Controls [NeurIPS 2020]
A Modular Framework for 3D Gaussian Splatting and Beyond
A suite of image and video neural tokenizers