Skip to content
View xuxw98's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report xuxw98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

high-performance inference and serving library for interactive autoregressive video and world models

Python 321 19 Updated Jun 13, 2026

Controlling diverse robots by inferring jacobian fields with deep networks! Let's make robots understand their bodies!

Jupyter Notebook 223 32 Updated Dec 9, 2025

Official implementation of "Turning Video Models into Generalist Robot Policies"

12 Updated Jun 11, 2026
Python 2 Updated Jun 12, 2026

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,647 94 Updated Mar 16, 2025

Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".

Jupyter Notebook 293 9 Updated Jun 1, 2026

iMac: Translating Actions into Motion and Contact Images for Embodied World Models

Python 13 Updated Jun 9, 2026

Kimi Code CLI — The Starting Point for Next-Gen Agents

TypeScript 2,366 271 Updated Jun 14, 2026
Python 212 12 Updated Jun 1, 2026

repository for training action-conditioned latent diffusion world models for robot video generation

Python 66 2 Updated May 29, 2026

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 59,033 4,903 Updated Jun 11, 2026

[CVPR 2026] AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation

Python 48 Updated May 24, 2026

[CVPR 2026 Oral] VGGT Omega

Python 2,965 122 Updated May 18, 2026

GesVLA: Gesture-Aware Vision-Language-Action Model with Embedded Representations

Python 24 Updated May 22, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 8,241 639 Updated Jun 10, 2026

Enjoy the magic of Diffusion models!

Python 12,576 1,228 Updated Jun 12, 2026

LaTeX Thesis Template for Tsinghua University

TeX 5,385 1,162 Updated Jun 14, 2026

[CVPR 2026] Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Python 82 4 Updated Mar 21, 2026

Heuristic Learning Blog Post

Python 562 58 Updated May 25, 2026
Python 125 2 Updated Mar 24, 2026

A optimized PyTorch framework for behavior cloning with flow related generative models.

Python 276 12 Updated May 5, 2026

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,365 82 Updated Aug 7, 2025

Simulation platform for general-purpose robotics & embodied AI learning.

Python 29,338 2,781 Updated Jun 13, 2026

FASTER: Rethinking Real-Time Flow VLAs

Python 129 8 Updated May 14, 2026

The official Lark/Feishu CLI tool, maintained by the larksuite team — built for humans and AI Agents. Covers core business domains including Messenger, Docs, Base, Sheets, Calendar, Mail, Tasks, Me…

Go 14,074 963 Updated Jun 14, 2026

An all-in-one VLA engineering platform for embodied AI — from data to real-robot deployment.

Python 463 50 Updated Jun 10, 2026

[RSS 2026] R2RGen: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation

Jupyter Notebook 10 Updated Apr 29, 2026

[RSS 2026] Code for RISE: Self-Improving Robot Policy with Compositional World Model

Python 281 17 Updated Jun 4, 2026

Cosmos Policy

Python 803 79 Updated Jan 23, 2026

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python 7,335 1,257 Updated Jun 12, 2026
Next