Skip to content
View raojingson's full-sized avatar

Block or report raojingson

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 197 7 Updated Feb 8, 2026

[ICCV 2025] MobileViCLIP: An Efficient Video-Text Model for Mobile Devices

Python 18 1 Updated Dec 11, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,230 144 Updated Mar 25, 2026

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 586 45 Updated Jun 7, 2024

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 575 32 Updated Feb 4, 2026

Visual tracking library based on PyTorch.

Python 3,497 614 Updated Aug 8, 2024

Yet Another Document Translator

Python 8,056 634 Updated Mar 30, 2026

Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".

Python 1,621 168 Updated Nov 18, 2025

[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 312 15 Updated Aug 25, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,295 549 Updated May 5, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 72,835 7,820 Updated Mar 11, 2026

Official project website for the CVPR 2021 paper "Exploring intermediate representation for monocular vehicle pose estimation"

Python 184 21 Updated Apr 25, 2022

Student Classroom Behavior dataset

436 46 Updated Sep 18, 2025

Everything about the SmolLM and SmolVLM family of models

Python 3,697 284 Updated Apr 2, 2026

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,762 210 Updated Nov 15, 2025

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,478 116 Updated Oct 9, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,499 2,121 Updated May 19, 2025

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 2,150 162 Updated Mar 13, 2025

[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking

Python 208 31 Updated Apr 20, 2024

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 630 43 Updated Dec 30, 2024

记录近期的 1) 图像/视频的超分增强等low level vision任务; 2) 图像生成 等任务相关论文, 主要为18年以后的DL based方法.

549 52 Updated Mar 6, 2025

A PyTorch library and evaluation platform for end-to-end compression research

Python 1,550 271 Updated Mar 31, 2026

X-Super-Resolution is dedicated to presenting the research efforts of XPixel in the realm of image super-resolution.

49 4 Updated Aug 24, 2023

Collect super-resolution related papers, data, repositories

3,037 368 Updated Feb 5, 2026

Collection of public available person re-identification datasets

1,071 170 Updated Oct 23, 2025

Awesome Person Re-identification

1,340 201 Updated Jun 18, 2024

The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content…

Python 897 132 Updated Nov 5, 2023
Jupyter Notebook 106 15 Updated Nov 11, 2019

The dataset for drone based detection and tracking is released, including both image/video, and annotations.

2,198 228 Updated Sep 24, 2023
Next