Highlights
- Pro
Stars
分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Dexbotic: Open-Source Vision-Language-Action Toolbox
XLeRobot: Practical Dual-Arm Mobile Home Robot for $660
An Open-Source Library for Robust Object Manipulation via Uncertainty-aware Task-specific Intuitive Physics
[RSS 2026] Causal video-action world model for generalist robot control
A community collection of OpenClaw use cases for making life easier.
RoboBrain 2.5: Advanced version of RoboBrain. Depth in Sight, Time in Mind. 🎉🎉🎉
The memory harness for proactive AI agents — structured storage, intent capture, 10x token reduction.
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
Lightweight, open-source AI agent for your tools, chats, and workflows.
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seam…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A lightweight point-based visualization tool used for inspecting Gaussian data, designing camera motion, and exporting setups for external Gaussian renderers.
A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
Official repo for GraspGen: A Diffusion-based Framework for 6-DOF Grasping
[CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
COLMAP - Structure-from-Motion and Multi-View Stereo
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
💪 [TPAMI 2025] Pytorch implementation of 'HAC++: Towards 100X Compression of 3D Gaussian Splatting'