Skip to content
View jayson-yxj's full-sized avatar

Block or report jayson-yxj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2026] Official implementation of "ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models"

Python 151 31 Updated Apr 3, 2026

CoWTracker: Tracking by Warping instead of Correlation

Python 144 6 Updated Feb 5, 2026

Visual scheme for the Lilliput project

Python 2 Updated Mar 3, 2026

π RuView: WiFi DensePose turns commodity WiFi signals into real-time human pose estimation, vital sign monitoring, and presence detection — all without a single pixel of video.

Rust 50,828 6,719 Updated Apr 30, 2026

Official implementation of paper [DeepTag: A General Framework for Fiducial Marker Design and Detection]

Python 137 24 Updated Mar 3, 2023

This is a lightweight GAN developed for real-time deblurring. The model has a super tiny size and a rapid inference time. The motivation is to boost marker detection in robotic applications, howeve…

Python 54 11 Updated Sep 12, 2023

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,280 73 Updated Jan 5, 2026

[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"

Python 596 56 Updated Aug 20, 2025

A flexible, high-performance 3D simulator for Embodied AI research.

C++ 3,645 529 Updated Feb 24, 2026

PyTorch Implementation of MVSNet

Python 680 92 Updated May 11, 2022

A GPU-accelerated TSDF and ESDF library for robots equipped with RGB-D cameras.

C++ 1,166 144 Updated Mar 6, 2026

MonSter++: A Unified Geometric Foundation Model for Stereo and Multi-View Depth Estimation via the Unleashing of Monodepth Priors

Python 264 24 Updated Dec 23, 2025

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 2,178 161 Updated Mar 13, 2025

RTAB-Map's ROS package.

C++ 1,414 646 Updated Apr 20, 2026

国内首个占据栅格网络全栈课程《从BEV到Occupancy Network,算法原理与工程实践》,包含端侧部署。Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code) 在线课程主页:http://111.229.117.200:8100/ (作者独立搭建)

Python 766 89 Updated Oct 30, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 5,477 414 Updated Apr 21, 2025

Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 5,376 721 Updated Aug 23, 2024

[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.

Python 1,344 126 Updated Sep 7, 2024

This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"

Python 1,656 305 Updated Jun 27, 2023

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 830 59 Updated Mar 13, 2026

Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM

C++ 1,341 189 Updated Mar 24, 2026

[CVPR 2025] JamMa is a lightweight image matcher that enables fast internal and mutual interaction of images with joint Mamba.

Python 232 16 Updated May 29, 2025

[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching

Python 2,665 242 Updated Dec 19, 2025

在 ROS 上run的 llama3.1-8b 聊天机器人,无需联网,直接 ollama 本地部署调用!

CMake 1 Updated Nov 14, 2025

Joint deep network for feature line detection and description

Jupyter Notebook 586 77 Updated Dec 26, 2023

This code contains an algorithm to compute stereo visual SLAM by using both point and line segment features.

C++ 787 248 Updated Nov 24, 2019

Depth Anything 3

Python 5,155 547 Updated Mar 21, 2026

[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R

Python 1,153 76 Updated Oct 18, 2025

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Python 1,995 149 Updated Jan 9, 2026
Next