Starred repositories
STL files and lerobot integration code for modified so101 arms
XLeRobot: Practical Dual-Arm Mobile Home Robot for $660
Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding (ECCV 2026)
A collection of agent skills for CAD, robotics and hardware design
Transform 3D modeling from complex menus to simple conversations. Create and modify CAD models using plain English descriptions.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
A tutorial and a set of tools to compute depth-from-stereo with Project Aria Gen2 devices. This includes stereo image rectification as well as disparity estimation
[IEEE TPAMI 2026] Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" (IEEE TPAMI, 2026) and Aweso…
Seamlessly extend any image in any direction with AI. Open-source web app powered by Gemini via OpenRouter, with Poisson-blended seams and best-of-3 variant picker.
Official implementation of "WorldKV: Efficient World Memory with World Retrieval and Compression"
Stemdeck is an modern stem extraction platform for musicians,producers and hobbyists, designed to isolate vocals, drums, bass, piano and guitar for practice, transcription, remixing, and creative a…
Simulation platform for general-purpose robotics & embodied AI learning.
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Research reading list on memory for robotics
Cambrian-P: Pose-Grounded Video Understanding
Prevents your Mac from going to sleep.
Algorithm powering the For You feed on X
Native and Compact Structured Latents for 3D Generation
Multilingual Document Layout Parsing in a Single Vision-Language Model
Fast Rust library for PDF inspection, classification, and text extraction. Intelligently detects scanned vs text-based PDFs to enable smart routing decisions.
Inference repo for Falcon-Perception and Falcon-OCR model, early-fusion, natively multimodal, dense Autoregressive Transformer models.
The agent that grows with you
📚 Two books on harness engineering — the design philosophies behind Claude Code & Codex: constraints, query loops, context governance, multi-agent verification. harness-books.agentway.dev
Build smaller, faster, and more secure desktop and mobile applications with a web frontend.