Skip to content
View ronghanghu's full-sized avatar

Organizations

@BVLC @DarrellGroup

Block or report ronghanghu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,097 335 Updated Dec 20, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,401 461 Updated Dec 18, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,274 727 Updated Dec 21, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,012 66 Updated Dec 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,662 2,861 Updated Dec 21, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,942 127 Updated Dec 18, 2025

Microsoft PowerToys is a collection of utilities that help you customize Windows and streamline everyday tasks

C# 126,651 7,546 Updated Dec 21, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,077 2,287 Updated Dec 25, 2024

The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".

Python 12 1 Updated Oct 17, 2023

[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer

Python 760 107 Updated Jan 15, 2024

[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors

Python 456 61 Updated Feb 28, 2023

A PyTorch implementation of Connected Components Labeling

Jupyter Notebook 122 28 Updated Jun 8, 2023

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,167 74 Updated Oct 21, 2024

Modern WebSocket support for Flask.

Python 312 25 Updated Jan 6, 2025

Monocular Depth Estimation Toolbox based on MMSegmentation.

Python 965 111 Updated Jul 21, 2025

Pipeline Parallelism for PyTorch

Python 783 88 Updated Aug 21, 2024

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,584 97 Updated Feb 16, 2024

Model parallel transformers in JAX and Haiku

Python 6,356 889 Updated Jan 21, 2023

JAX-based neural network library

Python 3,147 271 Updated Dec 18, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,285 203 Updated May 19, 2025

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 542 71 Updated Dec 20, 2025

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Python 467 108 Updated Jul 4, 2022

Making large AI models cheaper, faster and more accessible

Python 41,298 4,546 Updated Dec 8, 2025

JAX - A curated list of resources https://github.com/google/jax

1,993 154 Updated Sep 2, 2025

Abseil Common Libraries (Python)

Python 2,418 267 Updated Dec 19, 2025

Abseil Common Libraries (C++)

C++ 16,744 2,923 Updated Dec 19, 2025

ConvMAE: Masked Convolution Meets Masked Autoencoders

Python 520 42 Updated Mar 14, 2023

A paper list of some recent Transformer-based CV works.

1,393 145 Updated Nov 19, 2025
Next