donglala165

donglala165

1 follower · 0 following

Stars

deeptibhegde / CLIP-goes-3D

Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"

Python 242 14 Updated May 1, 2023

hanxunyu / Inst3D-LMM

[CVPR 2025 Highlight] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning"

Python 130 7 Updated Jan 30, 2026

jiaqihuang01 / DETRIS

[AAAI-2025] The official code of Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

Python 73 4 Updated May 21, 2025

kkakkkka / ETRIS

[ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation

Python 138 6 Updated Jun 26, 2025

RL4VLM / RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 413 40 Updated Dec 15, 2024

ZhenyangLiu / ReasonGrounder

Forked from nerfies/nerfies.github.io

Python 15 1 Updated Jul 11, 2025

heshuting555 / ReferSplat

[ICML2025 Oral] ReferSplat: Referring Segmentation in 3D Gaussian Splatting

Jupyter Notebook 147 8 Updated May 26, 2026

xuxiaoxxxx / 3DSS-VLG

[ECCV 2024] The offical implementation of paper 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Python 14 2 Updated Mar 23, 2025

Kunhao-Liu / 3D-OVS

[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation

Python 128 6 Updated May 5, 2026

MarkMoHR / Awesome-Referring-Image-Segmentation

📚 A collection of papers about Referring Image Segmentation.

826 64 Updated Jan 28, 2026

ZzZZCHS / WS-3DVG

[ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding

Python 14 2 Updated Oct 2, 2024

DFQ-Dojo / dfq-toolkit

[ICCV 2025] Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Python 27 8 Updated Sep 26, 2025

Qi-Zhangyang / GPT4Scene-and-VLN-R1

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

Python 528 24 Updated Mar 2, 2026

ZhaochongAn / GFS-VL

[CVPR 2025] Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model

Python 63 3 Updated May 7, 2025

nickgkan / butd_detr

Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"

Python 95 10 Updated Jun 9, 2023

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,650 1,508 Updated May 19, 2026

zyn213 / DEGround

11 Updated Jun 6, 2025

yanmin-wu / EDA

[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding

Python 134 3 Updated Oct 11, 2023

daveredrum / D3Net

[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Python 44 5 Updated Aug 27, 2022

xibi777 / 3DLFVG

5 1 Updated Mar 27, 2024

Leon1207 / 3DGCTR

This is a PyTorch implementation of 3DGCTR proposed by our paper “Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization”

Python 6 Updated Dec 30, 2024

ZCMax / ScanReason

[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities

Python 84 3 Updated Oct 10, 2024

GWxuan / TSP3D

[CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding

Python 251 15 Updated Jun 11, 2025

TheShadow29 / awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

1,125 103 Updated Sep 21, 2025

eslambakr / CoT3D_VG

Chain_of_Thoughts_3D_Visual_Grounding

Python 21 2 Updated Apr 20, 2024

iris0329 / SeeGround

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

Python 222 10 Updated Apr 21, 2025

liudaizong / Awesome-3D-Visual-Grounding

😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.

281 6 Updated Jan 14, 2026

jimtsai23 / PseudoEmbed

[ECCV 2024] Pseudo-Embedding for Generalized Few-Shot 3D Segmentation

Python 6 Updated May 22, 2025

ZhaochongAn / Multimodality-3D-Few-Shot

[ICLR 2025 Spotlight] Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation

Python 73 5 Updated May 7, 2025

ZrrSkywalker / I2P-MAE

[CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

Python 230 18 Updated Aug 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly