Lists (7)
Sort Name ascending (A-Z)
Stars
Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline
[ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
This repo contains the code for paper "nuCarla: A nuScenes-Style Bird’s-Eye View Perception Dataset for CARLA Simulation"
Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seam…
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Fisheye image correction and distortion table conversion.
Configs and boilerplates for Label Studio's Machine Learning backend
Universal Trajectory Optimization Framework for Differential Drive Robot Class
An object tracking project with YOLOv8 and ByteTrack, speed up by C++ and TensorRT.
C++ implementation of ByteTrack that does not include an object detection algorithm.
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
C++ implementation of BoT-SORT MOT algorithm with Re-ID and Camera Motion Compensation
support deepsort and bytetrack MOT(Multi-object tracking) using yolov5 with C++
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Simple, online, and realtime tracking of multiple objects in a video sequence.
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Tools for directing, throttling, selecting, and otherwise manipulating ROS 2 topics at a meta-level.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[CVPR 2022] "MonoScene: Monocular 3D Semantic Scene Completion": 3D Semantic Occupancy Prediction from a single image
Metric depth estimation from a single image
A GPU-accelerated TSDF and ESDF library for robots equipped with RGB-D cameras.
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
PyTorch code for training EfficientPS for Panoptic Segmentation
Python sample codes and textbook for robotics algorithms.
[ICCV'25] 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Awesome Monocular 3D detection