- Shanghai, China
Lists (1)
Sort Name ascending (A-Z)
Stars
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
Elevate your AI research writing, no more tedious polishing ✨
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
Robo3R: Enhancing Robotic Manipulation with Accurate Feed-Forward 3D Reconstruction
动手学CS146S中文版课程,包含assignments,vibe coding工具等,本项目将长期持续维护,致力于打造中文最好的vibe coding教程。
Code for "EgoX: Egocentric Video Generation from a Single Exocentric Video"
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
[SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
A unified framework for feed-forward neural networks
[CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation"
[ICLR 2026]ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation
InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.
A versatile, all-in-one toolbox for whole-body humanoid robot control.
A simulation platform for versatile Embodied AI research and developments.
An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.
InternRobotics' open platform for building generalized navigation foundation models.
[ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
[NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
[ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
[AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide
Collect and summarize point cloud sota methods.