University of Science and Technology of China
Beijing, China
https://zhendongwang6.github.io/
https://scholar.google.com.hk/citations?user=Ya5VDjQAAAAJ&hl=zh-CN
Lists (27)
chatgpt
clip
controlnet
dataset
diffusion model
face-anti-spoofing
face-forgery-detection
flow
gan
img2img
interview
knowledge distillation
large language models
large vision model
ocr
pretrain
r1
sam series
score metrics
segmentation
subject driven generation
survey
tools
vae
video generation
vision_language
visual text generation
Stars
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image.
This project reimplements the original MXNet code from the book Dive into Deep Learning (《动手学深度学习》) in PyTorch.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
stable diffusion webui colab
This repository contains the source code for the paper First Order Motion Model for Image Animation
High-Resolution Image Synthesis with Latent Diffusion Models
LAVIS - A One-stop Library for Language-Vision Intelligence
Code release for NeRF (Neural Radiance Fields)
Public facing notes page
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
[NeurIPS 2024 Best Paper Award] [GPT beats diffusion🔥] [scaling laws in visual generation📈] Official implementation of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction".
Image restoration with neural networks but without learning.
Using Low-rank adaptation to quickly fine-tune diffusion models.
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Taming Transformers for High-Resolution Image Synthesis
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
[ICCV 2019] Monocular depth estimation from a single image
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Open-source and strong foundation image recognition models.
Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework
The Python Code Tutorials
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)