Skip to content
View ronghanghu's full-sized avatar

Organizations

@BVLC @DarrellGroup

Block or report ronghanghu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
113 results for source starred repositories
Clear filter

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,355 378 Updated Feb 3, 2026

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,628 512 Updated Feb 4, 2026

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,578 994 Updated Feb 3, 2026

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,122 80 Updated Jan 25, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,038 3,203 Updated Feb 6, 2026

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,147 141 Updated Jan 22, 2026

Microsoft PowerToys is a collection of utilities that supercharge productivity and customization on Windows

C# 129,083 7,677 Updated Feb 6, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,470 2,337 Updated Dec 25, 2024

The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".

Python 12 1 Updated Oct 17, 2023

[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer

Python 779 108 Updated Jan 15, 2024

[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors

Python 466 61 Updated Feb 28, 2023

A PyTorch implementation of Connected Components Labeling

Jupyter Notebook 122 28 Updated Jun 8, 2023

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,169 76 Updated Oct 21, 2024

Modern WebSocket support for Flask.

Python 314 25 Updated Jan 6, 2025

Monocular Depth Estimation Toolbox based on MMSegmentation.

Python 968 111 Updated Jul 21, 2025

Model parallel transformers in JAX and Haiku

Python 6,364 886 Updated Jan 21, 2023

JAX-based neural network library

Python 3,183 283 Updated Jan 30, 2026

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,346 208 Updated May 19, 2025

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 547 70 Updated Jan 13, 2026

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Python 469 108 Updated Jul 4, 2022

Making large AI models cheaper, faster and more accessible

Python 41,337 4,538 Updated Jan 19, 2026

JAX - A curated list of resources https://github.com/google/jax

2,036 157 Updated Jan 20, 2026

Abseil Common Libraries (Python)

Python 2,430 270 Updated Feb 6, 2026

Abseil Common Libraries (C++)

C++ 17,008 2,965 Updated Feb 6, 2026

ConvMAE: Masked Convolution Meets Masked Autoencoders

Python 523 44 Updated Mar 14, 2023

A paper list of some recent Transformer-based CV works.

1,428 147 Updated Nov 19, 2025

torch-optimizer -- collection of optimizers for Pytorch

Python 3,160 310 Updated Mar 22, 2024

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,747 564 Updated Dec 18, 2025
Next