Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.

Python 89 12 Updated Dec 15, 2025

Awesome Incremental Learning

4,396 624 Updated Jan 29, 2026

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 449 29 Updated Sep 18, 2025

This repository collects and organises state‑of‑the‑art papers on spatial reasoning for Multimodal Vision–Language Models (MVLMs).

279 15 Updated Feb 10, 2026

Awesome Spatial Intelligence (Personal Use)

48 2 Updated Jan 7, 2026

[ICLR 2026] Official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs"

Python 249 13 Updated Oct 31, 2025

[NeurIPS 2025 Spotlight] The official repository of "ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection".

Python 17 1 Updated Oct 20, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,132 82 Updated Jan 25, 2026

My research experience

10,184 530 Updated Feb 10, 2026

[TMLR 2025🔥] A survey of autoregressive models in vision.

786 22 Updated Nov 8, 2025

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 531 48 Updated Oct 27, 2025

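🧑‍🚀 Summary of the world's best LLM resources (multimodal generation, agents, coding assistance, AI paper reviewing, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).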

7,537 730 Updated Feb 15, 2026

The Hugging Face course on Transformers

MDX 3,709 1,250 Updated Feb 5, 2026

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 2,276 241 Updated Sep 11, 2025

[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,795 414 Updated Oct 3, 2025

Ultralytics YOLO 🚀

Python 53,358 10,215 Updated Feb 17, 2026

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,224 1,176 Updated Mar 14, 2025

A distilled Segment Anything (SAM) model capable of running in real time with NVIDIA TensorRT

Python 854 70 Updated Nov 20, 2023

[CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"

Jupyter Notebook 109 5 Updated Nov 24, 2025

The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".

Python 391 20 Updated Jun 23, 2025

This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,622 565 Updated Dec 19, 2025

[CVPR 2024] Official implementation of "Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation"

Python 397 22 Updated Aug 5, 2025

YOLO-UniOW: Efficient Universal Open-World Object Detection

Python 176 17 Updated Jan 17, 2025

Data distillation benchmark

HTML 72 5 Updated Jun 13, 2025

Official GitHub repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark for evaluating the safety of LLMs.

Python 42 2 Updated May 12, 2025

An awesome paper list of Semi-Supervised Learning under realistic settings.

Shell 128 14 Updated Nov 21, 2024

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

1,355 57 Updated Mar 14, 2024

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 26,447 3,393 Updated Feb 17, 2026

cliptrase

Jupyter Notebook 47 6 Updated Sep 1, 2024

Writing AI Conference Papers: A Handbook for Beginners

3,405 119 Updated Jul 16, 2025