Skip to content
View yangjie-cv's full-sized avatar

Block or report yangjie-cv

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning

Python 36 1 Updated Jun 10, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,780 364 Updated Mar 26, 2026

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,388 60 Updated Feb 26, 2026

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,966 661 Updated Mar 27, 2026

An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

Python 157 8 Updated Jan 5, 2026
1 Updated Oct 29, 2024

Kolors Team

Python 4,610 357 Updated Nov 13, 2024
Python 1,842 61 Updated Jun 28, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,611 202 Updated Feb 16, 2025

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 21,153 3,065 Updated Oct 17, 2025

A universal summary of current robotics simulators

TypeScript 543 28 Updated Jul 22, 2025

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,643 171 Updated Oct 15, 2025
Python 10 Updated Oct 30, 2023

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Python 788 39 Updated Aug 16, 2024

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 2,273 244 Updated Sep 11, 2025

[ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"

Python 182 5 Updated Apr 12, 2025

[ICCV 2023] Official implementation of the paper "Neural Interactive Keypoint Detection"

Python 85 4 Updated Oct 12, 2023

Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"

Python 66 1 Updated Aug 9, 2023

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,921 1,014 Updated Aug 12, 2024

[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"

275 4 Updated Oct 3, 2023

[ICCV-2023] Official code for work "HumanMAC: Masked Motion Completion for Human Motion Prediction".

Python 323 19 Updated May 5, 2024

[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "

Python 185 12 Updated Sep 20, 2023