Stars
Elevate your AI research writing, no more tedious polishing ✨
Lightweight, open-source AI agent for your tools, chats, and workflows.
Official code of "Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding"
[ICML 2026] 🏂 World Guidance: World Modeling in Condition Space for Action Generation
[IEEE Star 2023 | Nreal AR JAM Challenge 2022] A pocket-size metatrainer: fitness is only a gesture away
Animation of an SMPLX character in an augmented reality application
[TPAMI 2025] Official Code for "SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation"
CUDA accelerated rasterization of gaussian splatting
[CVPR 2025] SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens
[ICLR 2026] Streaming 4D Visual Geometry Transformer
Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments accepted at CVPR 2025. This repo includes both the dat…
[3DV 2026 Oral] Official Repo of "SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization"
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
[ISBI 2024] An implementation of SAM3D which adapts Segment Anything Model for Volumetric Medical Image Segmentation
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
Programmer's guide about how to cook at home.
XaiR: An XR Platform that Integrates Large Language Models with the Physical World
Official inference repo for FLUX.1 models
A Sample Project for Passthrough Camera API in Unity.
SynCity: Training-Free Generation of 3D Worlds
Fully open reproduction of DeepSeek-R1
XR-Objects is an open-source prototype that anchors contextual interactions onto analog objects to not only convey information but also to initiate digital actions, such as querying LLMs for detail…
Simulation platform for general-purpose robotics & embodied AI learning.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Meta-Transformer for Unified Multimodal Learning