

Official code of Motus: A Unified Latent Action World Model

Python 1,152 65 Updated Jan 5, 2026

Our inference and training framework to run on the Cosmos Models

Python 267 35 Updated Jun 18, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 7,479 1,207 Updated Jun 17, 2026

Official repository for LTX-Video

Python 10,523 1,042 Updated Jan 5, 2026

Source code for 👏🏻"CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos"

Python 35 Updated Jun 14, 2026

High-performance inference and serving library for interactive autoregressive video and world models

Python 336 21 Updated Jun 17, 2026

Controlling diverse robots by inferring jacobian fields with deep networks! Let's make robots understand their bodies!

Jupyter Notebook 224 32 Updated Dec 9, 2025

Official implementation of "Turning Video Models into Generalist Robot Policies"

14 Updated Jun 14, 2026
Python 4 Updated Jun 12, 2026

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,652 97 Updated Mar 16, 2025

Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".

Jupyter Notebook 318 9 Updated Jun 1, 2026

iMac: Translating Actions into Motion and Contact Images for Embodied World Models

Python 14 Updated Jun 9, 2026

Kimi Code CLI — The Starting Point for Next-Gen Agents

TypeScript 2,546 300 Updated Jun 18, 2026
Python 225 12 Updated Jun 1, 2026

Repository for training action-conditioned latent diffusion world models for robot video generation

Python 66 2 Updated May 29, 2026

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 63,505 5,246 Updated Jun 18, 2026

[CVPR 2026] AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation

Python 48 Updated May 24, 2026

[CVPR 2026 Oral] VGGT Omega

Python 3,072 135 Updated May 18, 2026

GesVLA: Gesture-Aware Vision-Language-Action Model with Embedded Representations

Python 24 Updated May 22, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 8,331 654 Updated Jun 18, 2026

Enjoy the magic of Diffusion models!

Python 12,595 1,232 Updated Jun 18, 2026

LaTeX Thesis Template for Tsinghua University

TeX 5,391 1,163 Updated Jun 17, 2026

[CVPR 2026] Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Python 82 4 Updated Mar 21, 2026

Heuristic Learning Blog Post

Python 567 58 Updated May 25, 2026
Python 127 2 Updated Mar 24, 2026

An optimized PyTorch framework for behavior cloning with flow-related generative models.

Python 277 13 Updated May 5, 2026

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,370 82 Updated Aug 7, 2025

Simulation platform for general-purpose robotics & embodied AI learning.

Python 29,376 2,786 Updated Jun 17, 2026

FASTER: Rethinking Real-Time Flow VLAs

Python 130 8 Updated May 14, 2026

The official Lark/Feishu CLI tool, maintained by the larksuite team — built for humans and AI Agents. Covers core business domains including Messenger, Docs, Base, Sheets, Calendar, Mail, Tasks, Me…

Go 14,375 990 Updated Jun 18, 2026