- Tsinghua University
- China
- UTC +08:00
- https://xuxw98.github.io
- in/xiuwei-xu-69ab551ab
- @xxw21_thu
Stars
Official code of Motus: A Unified Latent Action World Model
Our inference and training framework to run on the Cosmos Models
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
Source code for 👏🏻"CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos"
High-performance inference and serving library for interactive autoregressive video and world models
Controlling diverse robots by inferring jacobian fields with deep networks! Let's make robots understand their bodies!
Official implementation of "Turning Video Models into Generalist Robot Policies"
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".
iMac: Translating Actions into Motion and Contact Images for Embodied World Models
Kimi Code CLI — The Starting Point for Next-Gen Agents
Repository for training action-conditioned latent diffusion world models for robot video generation
Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…
[CVPR 2026] AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
GesVLA: Gesture-Aware Vision-Language-Action Model with Embedded Representations
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Enjoy the magic of Diffusion models!
[CVPR 2026] Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout
Heuristic Learning Blog Post
An optimized PyTorch framework for behavior cloning with flow-related generative models.
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Simulation platform for general-purpose robotics & embodied AI learning.
The official Lark/Feishu CLI tool, maintained by the larksuite team — built for humans and AI Agents. Covers core business domains including Messenger, Docs, Base, Sheets, Calendar, Mail, Tasks, Me…