[CVPR 2026] Official implementation of "ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models"

Python 127 20 Updated Apr 3, 2026

CoWTracker: Tracking by Warping instead of Correlation

Python 139 6 Updated Feb 5, 2026

Visual scheme for the Lilliput project

Python 2 Updated Mar 3, 2026

π RuView: WiFi DensePose turns commodity WiFi signals into real-time human pose estimation, vital sign monitoring, and presence detection — all without a single pixel of video.

Rust 46,209 6,241 Updated Apr 10, 2026

Official implementation of paper [DeepTag: A General Framework for Fiducial Marker Design and Detection]

Python 136 24 Updated Mar 3, 2023

This is a lightweight GAN developed for real-time deblurring. The model has a super tiny size and a rapid inference time. The motivation is to boost marker detection in robotic applications, howeve…

Python 53 11 Updated Sep 12, 2023

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,269 71 Updated Jan 5, 2026

[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"

Python 575 48 Updated Aug 20, 2025

A flexible, high-performance 3D simulator for Embodied AI research.

C++ 3,619 529 Updated Feb 24, 2026

PyTorch Implementation of MVSNet

Python 677 92 Updated May 11, 2022

A GPU-accelerated TSDF and ESDF library for robots equipped with RGB-D cameras.

C++ 1,149 140 Updated Mar 6, 2026

MonSter++: A Unified Geometric Foundation Model for Stereo and Multi-View Depth Estimation via the Unleashing of Monodepth Priors

Python 315 32 Updated Dec 23, 2025

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 2,156 162 Updated Mar 13, 2025

RTAB-Map's ROS package.

C++ 1,404 644 Updated Mar 21, 2026

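China's first full-stack course on occupancy grid networks, "From BEV to Occupancy Network: Algorithm Principles and Engineering Practice", including on-device deployment. Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code). Online course homepage: http://111.229.117.200:8100/ (built independently by the author)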

Python 759 87 Updated Oct 30, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 5,427 407 Updated Apr 21, 2025

Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 5,361 721 Updated Aug 23, 2024

[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.

Python 1,338 126 Updated Sep 7, 2024

This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"

Python 1,654 303 Updated Jun 27, 2023

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 814 58 Updated Mar 13, 2026

Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM

C++ 1,320 189 Updated Mar 24, 2026

[CVPR 2025] JamMa is a lightweight image matcher that enables fast internal and mutual interaction of images with joint Mamba.

Python 228 14 Updated May 29, 2025

[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching

Python 2,632 237 Updated Dec 19, 2025

A llama3.1-8b chatbot that runs on ROS, with no internet connection required; it is deployed and called locally through ollama!

CMake 1 Updated Nov 14, 2025

Joint deep network for feature line detection and description

Jupyter Notebook 585 77 Updated Dec 26, 2023

This code contains an algorithm to compute stereo visual SLAM by using both point and line segment features.

C++ 789 248 Updated Nov 24, 2019

Depth Anything 3

Python 4,946 514 Updated Mar 21, 2026

[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R

Python 1,134 75 Updated Oct 18, 2025

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Python 1,988 150 Updated Jan 9, 2026