wxkwywr

wxkwywr

Stars

[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.

Python 31 1 Updated Nov 13, 2025

[TPAMI 2025] Towards Visual Grounding: A Survey

Shell 299 26 Updated Nov 18, 2025

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,562 3,442 Updated Mar 18, 2026

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,943 1,015 Updated Aug 12, 2024

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 57,135 17,449 Updated Mar 18, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,844 1,709 Updated Jan 30, 2026