View cshizhe's full-sized avatar

Highlights

  • Pro

Organizations

@AIM3-RUC

20 stars written in Jupyter Notebook

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,380 6,133 Updated Sep 18, 2024

Google Research

Jupyter Notebook 36,667 8,231 Updated Oct 30, 2025

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Jupyter Notebook 31,435 3,819 Updated Jul 23, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,547 2,182 Updated Dec 25, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,097 1,548 Updated Sep 5, 2024

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,003 1,266 Updated Oct 27, 2025

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,332 1,684 Updated Jul 2, 2025

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,567 725 Updated Aug 5, 2024

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Jupyter Notebook 3,292 333 Updated Mar 3, 2024

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Jupyter Notebook 2,594 258 Updated May 6, 2025

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook 2,330 506 Updated Jul 10, 2025

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Jupyter Notebook 1,708 129 Updated Jan 29, 2024

An open-source library for GPU-accelerated robot learning and sim-to-real transfer.

Jupyter Notebook 1,534 229 Updated Oct 3, 2025

A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…

Jupyter Notebook 1,171 236 Updated Oct 25, 2024

Neural question generation using transformers

Jupyter Notebook 1,138 352 Updated Apr 5, 2024

A PyTorch reimplementation of bottom-up-attention models

Jupyter Notebook 304 76 Updated Apr 7, 2022

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Jupyter Notebook 198 23 Updated Nov 13, 2023

[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Jupyter Notebook 124 15 Updated Sep 29, 2023

Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."

Jupyter Notebook 113 14 Updated Oct 23, 2025

Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation (NeurIPS 2023)

Jupyter Notebook 22 1 Updated Oct 1, 2023