raojingson

raojingson

Stars

InternLM / CapRL

[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 203 7 Updated Apr 17, 2026

MCG-NJU / MobileViCLIP

[ICCV 2025] MobileViCLIP: An Efficient Video-Text Model for Mobile Devices

Python 22 1 Updated Dec 11, 2025

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,245 145 Updated Mar 25, 2026

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 586 45 Updated Jun 7, 2024

iSEE-Laboratory / LLMDet

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 582 32 Updated Feb 4, 2026

visionml / pytracking

Visual tracking library based on PyTorch.

Python 3,495 613 Updated Aug 8, 2024

funstory-ai / BabelDOC

Yet Another Document Translator

Python 8,138 644 Updated Apr 13, 2026

iMoonLab / yolov13

Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".

Python 1,634 171 Updated Nov 18, 2025

Victorwz / Open-Qwen2VL

[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 312 15 Updated Aug 25, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,310 550 Updated May 5, 2025

dair-ai / Prompt-Engineering-Guide

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 73,470 7,923 Updated Mar 11, 2026

Nicholasli1995 / EgoNet

Official project website for the CVPR 2021 paper "Exploring intermediate representation for monocular vehicle pose estimation"

Python 185 21 Updated Apr 25, 2022

Whiffe / SCB-dataset

Student Classroom Behavior dataset

447 47 Updated Sep 18, 2025

deepseek-ai / DeepSeek-V3

Python 102,671 16,660 Updated Aug 28, 2025

huggingface / smollm

Everything about the SmolLM and SmolVLM family of models

Python 3,720 287 Updated Apr 2, 2026

EleutherAI / pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,776 212 Updated Nov 15, 2025

apple / ml-mobileclip

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,492 117 Updated Apr 15, 2026

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

22,534 2,126 Updated May 19, 2025

YvanYin / Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 2,168 162 Updated Mar 13, 2025

MCG-NJU / MixFormerV2

[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking

Python 212 32 Updated Apr 20, 2024

Ucas-HaoranWei / Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 630 43 Updated Dec 30, 2024

lcybuzz / Low-Level-Vision-Paper-Record

记录近期的 1) 图像/视频的超分增强等low level vision任务; 2) 图像生成等任务相关论文, 主要为18年以后的DL based方法.

549 52 Updated Mar 6, 2025

InterDigitalInc / CompressAI

A PyTorch library and evaluation platform for end-to-end compression research

Python 1,562 272 Updated Mar 31, 2026

XPixelGroup / X-Super-Resolution

X-Super-Resolution is dedicated to presenting the research efforts of XPixel in the realm of image super-resolution.

49 4 Updated Aug 24, 2023

ChaofWang / Awesome-Super-Resolution

Collect super-resolution related papers, data, repositories

3,046 367 Updated Apr 17, 2026

NEU-Gou / awesome-reid-dataset

Collection of public available person re-identification datasets

1,077 170 Updated Oct 23, 2025

bismex / Awesome-person-re-identification

Awesome Person Re-identification

1,345 201 Updated Jun 18, 2024

megvii-research / mdistiller

The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content…

Python 900 132 Updated Nov 5, 2023

hunjung-lim / awesome-vehicle-datasets

Jupyter Notebook 106 15 Updated Nov 11, 2019

VisDrone / VisDrone-Dataset

The dataset for drone based detection and tracking is released, including both image/video, and annotations.

2,234 232 Updated Sep 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly