dragonlong

Follow

Xiaolong dragonlong

Follow

3D AI Researcher, Nvidia

113 followers · 43 following

Santa Clara
12:19 (UTC -12:00)
https://dragonlong.github.io/
@lxiaol9

Achievements

Achievements

Lists (1)

Sort

🔮 Future ideas

Stars

mli0603 / openpi-comet

Team Comet's 2025 BEHAVIOR Challenge Codebase

Python 157 7 Updated Dec 17, 2025

nvidia-cosmos / cosmos-reason2

Cosmos-Reason2 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 7 1 Updated Dec 20, 2025

NVIDIA / multi-storage-client

Unified high-performance Python client for object and file stores.

Python 53 8 Updated Dec 19, 2025

InternRobotics / InternVLA-A1

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation

Python 55 1 Updated Sep 18, 2025

apple / ml-egodex

EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

Python 90 2 Updated Aug 20, 2025

InternRobotics / InternVLA-M1

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Python 315 16 Updated Dec 17, 2025

microsoft / VITRA

VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

Python 200 7 Updated Dec 12, 2025

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,019 584 Updated Dec 20, 2025

xiaomi-research / dggt

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

Python 333 28 Updated Dec 11, 2025

InternRobotics / InternNav

InternRobotics' open platform for building generalized navigation foundation models.

Jupyter Notebook 502 57 Updated Dec 19, 2025

open-gigaai / giga-world-0

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python 750 60 Updated Dec 3, 2025

xiaoxiaoxh / TactAR_APP

[RSS 2025] TactAR teleopeartion APP in "Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation"

C# 58 7 Updated Jul 11, 2025

xiaoxiaoxh / reactive_diffusion_policy

[RSS 2025] Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation

Python 270 12 Updated Dec 12, 2025

google-deepmind / dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Python 4,379 733 Updated Dec 4, 2025

ARISE-Initiative / robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Python 1,211 325 Updated Nov 10, 2025

ARISE-Initiative / robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Python 2,103 632 Updated Dec 2, 2025

real-stanford / universal_manipulation_interface

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 1,164 213 Updated Jul 21, 2025

ToyotaResearchInstitute / lbm_eval

Simulation benchmark from Toyota Research Institute containing 49 tasks that measure the performance of Large Behavior Model policies

Python 54 3 Updated Nov 6, 2025

nvidia-cosmos / cosmos-cookbook

Post-training scripts and samples for NVIDIA Cosmos ecosystem

Python 145 31 Updated Dec 20, 2025

NVIDIA / dgx-spark-playbooks

Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.

TypeScript 261 85 Updated Dec 19, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 38,953 4,927 Updated Dec 9, 2025

thu-ml / RDT2

Official code of RDT 2

Python 605 28 Updated Dec 3, 2025

OpenDriveLab / AgiBot-World

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,661 186 Updated Dec 16, 2025

AgibotTech / Genie-Envisioner

Python 341 17 Updated Dec 19, 2025

AIDC-AI / Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,424 85 Updated Sep 22, 2025

The-AI-Alliance / GEO-Bench-VLM

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

Python 87 6 Updated Jul 1, 2025

DangMinh21 / Multimodal-and-Multi-task-Fusion-for-Spatial-Reasoning

Python 2 Updated Sep 19, 2025

mingyin0312 / RLFromScratch

Python 465 37 Updated Aug 28, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 11,177 1,054 Updated Dec 20, 2025

yeliudev / R2-Tuning

🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)

Python 90 5 Updated Jul 2, 2024