Skip to content
View jesanli's full-sized avatar

Block or report jesanli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,474 90 Updated Jun 26, 2025

[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,825 417 Updated Feb 18, 2026

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

Python 221 20 Updated Oct 15, 2025

The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"

Python 170 16 Updated Jul 23, 2025

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 105,819 7,685 Updated Mar 27, 2026
Python 49 9 Updated Jun 19, 2024
Jupyter Notebook 41 5 Updated Jun 3, 2022

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,968 3,970 Updated Mar 25, 2026

The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"

Python 47 5 Updated Nov 4, 2024

[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM

Python 86 5 Updated Oct 25, 2024

[NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking

Python 73 5 Updated Sep 30, 2025

A vision-language tracking paper list, articles related to visual language tracking have been documented.

42 2 Updated Dec 15, 2024

[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

Python 153 11 Updated Jul 13, 2024

[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.

Jupyter Notebook 132 11 Updated Nov 10, 2025
Python 196 28 Updated Feb 27, 2024

[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion

Python 101 4 Updated Oct 29, 2025

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Python 2,335 581 Updated Apr 23, 2020

SOTA Re-identification Methods and Toolbox

Python 3,877 874 Updated Jul 30, 2024

code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction

Python 445 53 Updated Jun 13, 2024

The official implement of TrackSSM

Python 36 1 Updated Oct 13, 2024
Python 67 15 Updated Aug 22, 2024

https://arxiv.org/abs/2302.11813

Python 261 24 Updated May 6, 2023

A Confidence-Aware Matching Strategy For Generalized Multi-Object Tracking

Python 11 Updated Feb 9, 2025

MOT using deepsort and yolov3 with pytorch

Python 3,015 734 Updated Jul 16, 2024

[CVPR 2023] Referring Multi-Object Tracking

Python 153 18 Updated Jul 2, 2024

[CVPR 2025] Multiple Object Tracking as ID Prediction

Python 495 42 Updated Aug 20, 2025

Code for RA-L paper "PKF: Probabilistic Data Association Kalman Filter for Multi-Object Tracking"

Python 19 5 Updated Sep 5, 2025

[CVPR 2024] iKUN: Speak to Trackers without Retraining

Python 147 3 Updated Jun 19, 2024

Attentive Generative Adversarial Network for Raindrop Removal from A Single Image (CVPR 2018)

Python 547 114 Updated Jun 29, 2018

[CVPR2022] DanceTrack: Multiple Object Tracking in Uniform Appearance and Diverse Motion

Python 449 38 Updated Mar 19, 2026
Next