fanq15

Qi Fan fanq15

119 followers · 17 following

Tsinghua University

Achievements

Stars

niki-amini-naieni / CountVid

Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.

Python 89 12 Updated Dec 15, 2025

xialeiliu / Awesome-Incremental-Learning

Awesome Incremental Learning

4,396 624 Updated Jan 29, 2026

JIA-Lab-research / VisionThink

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 449 29 Updated Sep 18, 2025

zhengxuJosh / Awesome-Multimodal-Spatial-Reasoning

This repository collects and organises state‑of‑the‑art papers on spatial reasoning for Multimodal Vision–Language Models (MVLMs).

279 15 Updated Feb 10, 2026

lif314 / Awesome-Spatial-Intelligence

Awesome Spatial Intelligence (Personal Use)

48 2 Updated Jan 7, 2026

Gorilla-Lab-SCUT / PaDT

[ICLR 2026] Official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs"

Python 249 13 Updated Oct 31, 2025

haoweiz23 / ReCon

[NeurIPS 2025 Spotlight] The official repository of "ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection".

Python 17 1 Updated Oct 20, 2025

IDEA-Research / Rex-Omni

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,132 82 Updated Jan 25, 2026

pengsida / learning_research

本人的科研经验

10,184 530 Updated Feb 10, 2026

ChaofanTao / Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

786 22 Updated Nov 8, 2025

tue-mps / eomt

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 531 48 Updated Oct 27, 2025

WangRongsheng / awesome-LLM-resources

🧑‍🚀 全世界最好的LLM资料总结（多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.

7,537 730 Updated Feb 15, 2026

huggingface / course

The Hugging Face course on Transformers

MDX 3,709 1,250 Updated Feb 5, 2026

IDEA-Research / detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 2,276 241 Updated Sep 11, 2025

sunsmarterjie / yolov12

[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,795 414 Updated Oct 3, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 53,358 10,215 Updated Feb 17, 2026

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,224 1,176 Updated Mar 14, 2025

NVIDIA-AI-IOT / nanosam

A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT

Python 854 70 Updated Nov 20, 2023

Koorye / DePT

[CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"

Jupyter Notebook 109 5 Updated Nov 24, 2025

Leiyi-HU / mona

The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".

Python 391 20 Updated Jun 23, 2025

ChaoningZhang / MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,622 565 Updated Dec 19, 2025

w1oves / Rein

[CVPR 2024] Official implement of <Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation>

Python 397 22 Updated Aug 5, 2025

THU-MIG / YOLO-UniOW

YOLO-UniOW: Efficient Universal Open-World Object Detection

Python 176 17 Updated Jan 17, 2025

NUS-HPC-AI-Lab / DD-Ranking

Data distillation benchmark

HTML 72 5 Updated Jun 13, 2025

drivetosouth / SafeDialBench-Dataset

Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.

Python 42 2 Updated May 12, 2025

RabbitBoss / Awesome-Realistic-Semi-Supervised-Learning

An awesome paper list of Semi-Supervised Learning under realistic settings.

Shell 128 14 Updated Nov 21, 2024

Computer-Vision-in-the-Wild / CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,355 57 Updated Mar 14, 2024

HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 26,447 3,393 Updated Feb 17, 2026

leaves162 / CLIPtrase

cliptrase

Jupyter Notebook 47 6 Updated Sep 1, 2024

hzwer / WritingAIPaper

Writing AI Conference Papers: A Handbook for Beginners

3,405 119 Updated Jul 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qi Fan fanq15

Achievements

Achievements

Block or report fanq15

Stars

niki-amini-naieni / CountVid

xialeiliu / Awesome-Incremental-Learning

JIA-Lab-research / VisionThink

zhengxuJosh / Awesome-Multimodal-Spatial-Reasoning

lif314 / Awesome-Spatial-Intelligence

Gorilla-Lab-SCUT / PaDT

haoweiz23 / ReCon

IDEA-Research / Rex-Omni

pengsida / learning_research

ChaofanTao / Autoregressive-Models-in-Vision-Survey

tue-mps / eomt

WangRongsheng / awesome-LLM-resources

huggingface / course

IDEA-Research / detrex

sunsmarterjie / yolov12

ultralytics / ultralytics

THU-MIG / yolov10

NVIDIA-AI-IOT / nanosam

Koorye / DePT

Leiyi-HU / mona

ChaoningZhang / MobileSAM

w1oves / Rein

THU-MIG / YOLO-UniOW

NUS-HPC-AI-Lab / DD-Ranking

drivetosouth / SafeDialBench-Dataset

RabbitBoss / Awesome-Realistic-Semi-Supervised-Learning

Computer-Vision-in-the-Wild / CVinW_Readings

HumanSignal / label-studio

leaves162 / CLIPtrase

hzwer / WritingAIPaper