Skip to content
View ShangGaoG's full-sized avatar

Block or report ShangGaoG

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding

Python 70 3 Updated Apr 29, 2026

[CVPR 2026 Hightlight] OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer

Python 340 28 Updated May 21, 2026

Text-driven human motion generation surveys, datasets and models.

TypeScript 95 3 Updated Aug 17, 2025

This repository collects papers on Human-Interaction-Motion-Generation applications. We will update new papers irregularly.

296 16 Updated Oct 21, 2025

A paper list of some recent works about Token Compress for Vit and VLM

921 43 Updated Jun 2, 2026

A curated list of awesome LLM/VLM/VLA/World Model for Autonomous Driving(LLM4AD) resources (continually updated)

1,850 108 Updated May 28, 2026

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 12,176 1,944 Updated Mar 16, 2026

a collection of visualization function

Python 446 42 Updated Jan 15, 2022

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

2,222 142 Updated Apr 16, 2026

A collection of papers on diffusion models for 3D generation.

1,255 61 Updated Jan 16, 2026

Segment Any RGBD

Python 868 52 Updated May 24, 2023

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,963 513 Updated Dec 13, 2025

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…

Python 1,774 104 Updated Aug 29, 2023