24 stars written in Jupyter Notebook

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,381 6,227 Updated Sep 18, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,387 1,572 Updated Sep 5, 2024

✔ (Completed) Highly comprehensive deep learning notes [Tudui: PyTorch] [Mu Li: Dive into Deep Learning] [Andrew Ng: Deep Learning] [Dafei: Large Model Agents]

Jupyter Notebook 16,859 1,955 Updated Jan 12, 2026

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,833 1,712 Updated Feb 29, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,158 1,097 Updated Nov 18, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,611 557 Updated Nov 10, 2025

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,425 1,230 Updated Jul 30, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,813 345 Updated Jan 21, 2025

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,584 317 Updated Feb 18, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,347 208 Updated May 19, 2025

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,103 357 Updated Feb 5, 2026

Updated for PyTorch 1.0. Supports CPU testing and demo. (Use detectron2, it's a masterpiece)

Jupyter Notebook 1,818 468 Updated Nov 12, 2020

Official PyTorch repo for GAN's N' Roses. Diverse im2im and vid2vid selfie to anime translation.

Jupyter Notebook 1,152 150 Updated May 27, 2022

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,109 63 Updated Mar 20, 2025

This repository is intended to host tools and demos for ActivityNet

Jupyter Notebook 966 328 Updated Mar 21, 2024

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Jupyter Notebook 752 64 Updated Oct 17, 2023

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 602 31 Updated Oct 6, 2024

A simple PyTorch implementation of GradCAM and GradCAM++

Jupyter Notebook 396 98 Updated Apr 23, 2019

Evaluating text-to-image/video/3D models with VQAScore

Jupyter Notebook 374 34 Updated Sep 22, 2025

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 290 14 Updated Jun 2, 2025
Jupyter Notebook 116 3 Updated Nov 8, 2025

Official PyTorch implementation of FlowMo.

Jupyter Notebook 110 7 Updated Apr 7, 2025

COCO API Customized for OVIS evaluation

Jupyter Notebook 16 1 Updated Nov 8, 2021
Jupyter Notebook 3 Updated May 8, 2021