Lists (1)
Sort Name ascending (A-Z)
Stars
Spec-driven development (SDD) for your team's workflow. Kiro style commands that enforce structured requirements→design→tasks workflow and steering, transforming how you build with AI. Support Clau…
GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal.
End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with self-supervision
HTFormer: Human Topology Aware Transformer for 3D Human Pose Estimation
[PR 2024] GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation
[CVPR 2025] JamMa is a lightweight image matcher that enables fast internal and mutual interaction of images with joint Mamba.
MambaGlue: Fast and Robust Local Feature Matching With Mamba @ ICRA'25
Official repository for the NeurIPS 2025 paper "Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era".
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
A highly robust and accurate LiDAR-only, LiDAR-inertial odometry
[NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.
Official repository for OmniVLA training and inference code
[CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Official implementation of "Synthetic vs. Real Training Data for Visual Navigation".
VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition (ECCV 2024)
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
Official code and checkpoint release for "GNM: A General Navigation Model to Drive Any Robot".
Official implementation of the ICRA 2024 paper "PlaceNav: Topological Navigation through Place Recognition"
Official Pytorch Implementation of: "Asymmetric Loss For Multi-Label Classification"(ICCV, 2021) paper
Schedule-Free Optimization in PyTorch
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
[RA-L 2022] Ctrl-VIO: Continuous-Time Visual-Inertial Odometry for Rolling Shutter Cameras