Skip to content
View raojingson's full-sized avatar

Block or report raojingson

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 203 7 Updated Apr 17, 2026

[ICCV 2025] MobileViCLIP: An Efficient Video-Text Model for Mobile Devices

Python 22 1 Updated Dec 11, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,245 145 Updated Mar 25, 2026

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 586 45 Updated Jun 7, 2024

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 582 32 Updated Feb 4, 2026

Visual tracking library based on PyTorch.

Python 3,495 613 Updated Aug 8, 2024

Yet Another Document Translator

Python 8,138 644 Updated Apr 13, 2026

Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".

Python 1,634 171 Updated Nov 18, 2025

[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 312 15 Updated Aug 25, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,310 550 Updated May 5, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 73,470 7,923 Updated Mar 11, 2026

Official project website for the CVPR 2021 paper "Exploring intermediate representation for monocular vehicle pose estimation"

Python 185 21 Updated Apr 25, 2022

Student Classroom Behavior dataset

447 47 Updated Sep 18, 2025

Everything about the SmolLM and SmolVLM family of models

Python 3,720 287 Updated Apr 2, 2026

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,776 212 Updated Nov 15, 2025

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,492 117 Updated Apr 15, 2026

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,534 2,126 Updated May 19, 2025

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 2,168 162 Updated Mar 13, 2025

[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking

Python 212 32 Updated Apr 20, 2024

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 630 43 Updated Dec 30, 2024

记录近期的 1) 图像/视频的超分增强等low level vision任务; 2) 图像生成 等任务相关论文, 主要为18年以后的DL based方法.

549 52 Updated Mar 6, 2025

A PyTorch library and evaluation platform for end-to-end compression research

Python 1,562 272 Updated Mar 31, 2026

X-Super-Resolution is dedicated to presenting the research efforts of XPixel in the realm of image super-resolution.

49 4 Updated Aug 24, 2023

Collect super-resolution related papers, data, repositories

3,046 367 Updated Apr 17, 2026

Collection of public available person re-identification datasets

1,077 170 Updated Oct 23, 2025

Awesome Person Re-identification

1,345 201 Updated Jun 18, 2024

The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content…

Python 900 132 Updated Nov 5, 2023
Jupyter Notebook 106 15 Updated Nov 11, 2019

The dataset for drone based detection and tracking is released, including both image/video, and annotations.

2,234 232 Updated Sep 24, 2023
Next