Stars
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
open Multi-View Stereo reconstruction library
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Benchmarking Knowledge Transfer in Lifelong Robot Learning
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide
Official Algorithm Codebase for the Paper "BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities"
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
仅需Python基础,从0构建自己的具身智能机器人;从0逐步构建VLA/OpenVLA/SmolVLA/Pi0, 深入理解具身智能
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
SGLang is a high-performance serving framework for large language models and multimodal models.
Beam prediction based on large language models, IEEE Wireless Communications Letters
The repository contains code, report and presentation for the solution of Team TII for ITU AI/ML in 5G Grand Challenge 2022: ML5G-PS-011: Multi Modal Beam Prediction: Towards Generalization
Sionna: An Open-Source Library for Research on Communication Systems
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Event-driven network library for multi-threaded Linux server in C++11
[IEEE T-PAMI 2023] Awesome BEV perception research and cookbook for all level audience in autonomous diriving
图解计算机网络、操作系统、计算机组成、数据库,共 1000 张图 + 50 万字,破除晦涩难懂的计算机基础知识,让天下没有难懂的八股文!🚀 在线阅读:https://xiaolincoding.com
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD
Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection