Skip to content
View wbhu's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@TencentARC

Block or report wbhu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
108 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 72,344 10,580 Updated Jun 18, 2024

Google Research

Jupyter Notebook 37,225 8,323 Updated Feb 5, 2026

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,341 2,112 Updated Sep 12, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,462 2,339 Updated Dec 25, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,829 1,712 Updated Feb 29, 2024

《Python Cookbook》 3rd Edition Translation

Jupyter Notebook 12,015 2,970 Updated Jul 24, 2024

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,794 1,445 Updated Apr 12, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,511 725 Updated Nov 20, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,605 556 Updated Nov 10, 2025

Image restoration with neural networks but without learning.

Jupyter Notebook 8,066 1,448 Updated Apr 27, 2023

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 7,592 657 Updated Feb 29, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,522 500 Updated Mar 22, 2024

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 7,371 1,101 Updated Aug 6, 2024

A unified framework for 3D content generation.

Jupyter Notebook 6,974 547 Updated Dec 16, 2024

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,303 1,427 Updated Jun 12, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,294 360 Updated Nov 27, 2025

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 3,366 321 Updated Oct 27, 2024

Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework

Jupyter Notebook 3,272 577 Updated Oct 1, 2022

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,102 356 Updated Feb 5, 2026

NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning

Jupyter Notebook 2,808 468 Updated Aug 3, 2023
Jupyter Notebook 2,595 454 Updated Dec 16, 2023

This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

Jupyter Notebook 2,516 291 Updated Sep 23, 2024

Scenarios, tutorials and demos for Autonomous Driving

Jupyter Notebook 2,413 570 Updated Aug 25, 2025

Puzzles for learning Triton

Jupyter Notebook 2,284 195 Updated Nov 18, 2024

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Jupyter Notebook 2,120 398 Updated Jun 7, 2022

An unsupervised learning framework for depth and ego-motion estimation from monocular videos

Jupyter Notebook 2,014 556 Updated Oct 26, 2021

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Jupyter Notebook 1,975 257 Updated Jan 24, 2024

Discovering Interpretable GAN Controls [NeurIPS 2020]

Jupyter Notebook 1,798 266 Updated Jan 20, 2023

A Modular Framework for 3D Gaussian Splatting and Beyond

Jupyter Notebook 1,709 96 Updated Nov 5, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,703 85 Updated Feb 11, 2025
Next