Skip to content
View ZhendongWang6's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report ZhendongWang6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
62 stars written in Jupyter Notebook
Clear filter

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,379 6,227 Updated Sep 18, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,511 3,897 Updated Jul 23, 2024

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。

Jupyter Notebook 19,304 5,427 Updated Oct 14, 2021

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,474 2,337 Updated Dec 25, 2024

stable diffusion webui colab

Jupyter Notebook 15,972 2,655 Updated Dec 16, 2025

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 15,000 3,283 Updated Nov 14, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,833 1,712 Updated Feb 29, 2024
Jupyter Notebook 12,276 1,436 Updated Jan 30, 2026

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,158 1,097 Updated Nov 18, 2024

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,795 1,446 Updated Apr 12, 2025

Public facing notes page

Jupyter Notebook 10,783 4,155 Updated Sep 7, 2025

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 9,692 1,028 Updated Feb 5, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,609 557 Updated Nov 10, 2025

数据科学的笔记以及资料搜集

Jupyter Notebook 8,528 3,133 Updated Aug 16, 2021

Image restoration with neural networks but without learning.

Jupyter Notebook 8,065 1,448 Updated Apr 27, 2023

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,523 500 Updated Mar 22, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,454 414 Updated Jun 28, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,425 1,229 Updated Jul 30, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,792 545 Updated Aug 29, 2025

[ICCV 2019] Monocular depth estimation from a single image

Jupyter Notebook 4,454 989 Updated Aug 10, 2024

SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners

Jupyter Notebook 4,452 659 Updated May 22, 2023

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,914 317 Updated Jun 12, 2025

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,584 317 Updated Feb 18, 2025

exercise for nndl

Jupyter Notebook 3,321 1,457 Updated Jul 19, 2024

Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework

Jupyter Notebook 3,272 577 Updated Oct 1, 2022
Jupyter Notebook 3,046 286 Updated Feb 27, 2023

The Python Code Tutorials

Jupyter Notebook 2,965 1,994 Updated Feb 5, 2026
Jupyter Notebook 2,596 454 Updated Dec 16, 2023

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Jupyter Notebook 1,975 257 Updated Jan 24, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,843 102 Updated Feb 1, 2025
Next