-
University of North Texas
- Denton
- https://hengfan2010.github.io/
Stars
This repository will contain the official implementation of paper 'PRVQL: Progressive Knowledge-guided Refinement for Robust Egocentric Visual Query Localization'
[ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
Code for "Self-supervised Denoising and Bulk Motion Artifact Removal of 3D Optical Coherence Tomography Angiography of Awake Brain" @ MICCAI 2024
This is an official implementation of the MICCAI2024 paper "Self-supervised 3D Skeleton Completion for Vascular Structures".
[IROS 2024] Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning
[ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking
[ICRA 2025] LaMOT: Language-Guided Multi-Object Tracking
[T-NNLS 2024] AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes
This repository can be used to visualize objects of KITTI in camera image, point cloud and bird's eye view. It can be adapted to visualize objects in other point cloud datasets.
[NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
COST: Collaborative Three-Stream Transformers for Video Captioning
[ICCV 2023] Accurate and Fast Compressed Video Captioning
[ICCV 2023] PlanarTrack: A Large-scale Challenging Benchmark for Planar Object Tracking.
[ECCV22] High-Fidelity Image Inpainting with GAN Inversion
[CVPR 2019 & IJCV 2021] LaSOT: A High-quality Benchmark for Large-scale Single Object Tracking