Skip to content
View wgsxm's full-sized avatar

Highlights

  • Pro

Block or report wgsxm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

Jupyter Notebook 10,298 678 Updated Jun 16, 2026

PyMuPDF4LLM

Python 1,850 228 Updated Jun 15, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,855 353 Updated Jun 17, 2026

[RSS 2026] LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Python 264 18 Updated May 26, 2026

[browser-agent] Never send a human to do a machine's job.

Python 9 1 Updated May 9, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 985 102 Updated Apr 3, 2026

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 673 64 Updated Jun 10, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

Python 1,797 60 Updated Jun 13, 2026

Flow Policy Gradients for Robot Control

Python 257 12 Updated May 29, 2026

日麻从入门到入土,相关新手入门教程和指南书籍

142 24 Updated Feb 26, 2025

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 2,278 193 Updated Apr 19, 2026

[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation" & Causal Forcing++

Python 787 45 Updated Jun 17, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,103 79,342 Updated Jun 17, 2026

Advancing Open-source World Models

Python 3,928 351 Updated May 22, 2026

Cosmos Policy

Python 807 79 Updated Jan 23, 2026

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 1,956 422 Updated Mar 15, 2025

A Large-scale Video Action Dataset

Python 473 13 Updated Jan 16, 2026

[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide

14,306 912 Updated Mar 12, 2026

Open-source implementation of AlphaEvolve

Python 6,564 1,049 Updated Mar 18, 2026

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 942 123 Updated Sep 8, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

3,267 150 Updated Jun 16, 2026

A Foundation Model for Generalist Gaming Agents

Python 2,084 234 Updated Jan 25, 2026
Python 9 Updated Jan 9, 2026

Native and Compact Structured Latents for 3D Generation

Python 8,373 1,023 Updated Jun 5, 2026

[ICLR 2026] Official Repo For "BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration"

Python 335 19 Updated Jan 28, 2026

[CVPR' 2026] JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Python 404 8 Updated Feb 22, 2026

[CVPR 2026] Official Implementation of Particulate: Feed-Forward 3D Object Articulation

Python 145 8 Updated Apr 1, 2026

[CVPR 2026] Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models.

Python 110 4 Updated Apr 9, 2026
Next