-
Tsinghua University
- China
-
23:10
(UTC +08:00) - https://xuxw98.github.io
- in/xiuwei-xu-69ab551ab
- @xxw21_thu
Lists (1)
Sort Name ascending (A-Z)
Stars
high-performance inference and serving library for interactive autoregressive video and world models
Controlling diverse robots by inferring jacobian fields with deep networks! Let's make robots understand their bodies!
Official implementation of "Turning Video Models into Generalist Robot Policies"
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".
iMac: Translating Actions into Motion and Contact Images for Embodied World Models
Kimi Code CLI — The Starting Point for Next-Gen Agents
repository for training action-conditioned latent diffusion world models for robot video generation
Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…
[CVPR 2026] AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
GesVLA: Gesture-Aware Vision-Language-Action Model with Embedded Representations
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Enjoy the magic of Diffusion models!
[CVPR 2026] Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout
Heuristic Learning Blog Post
A optimized PyTorch framework for behavior cloning with flow related generative models.
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Simulation platform for general-purpose robotics & embodied AI learning.
The official Lark/Feishu CLI tool, maintained by the larksuite team — built for humans and AI Agents. Covers core business domains including Messenger, Docs, Base, Sheets, Calendar, Mail, Tasks, Me…
An all-in-one VLA engineering platform for embodied AI — from data to real-robot deployment.
[RSS 2026] R2RGen: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation
[RSS 2026] Code for RISE: Self-Improving Robot Policy with Compositional World Model
NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.