Skip to content
View fx-hit's full-sized avatar

Highlights

  • Pro

Block or report fx-hit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3 Updated May 7, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 999 105 Updated Apr 3, 2026

[CVPR 2026 Oral] VGGT Omega

Python 3,079 138 Updated May 18, 2026

LaST-R1

Python 101 5 Updated May 6, 2026

Bash is all you need - A nano claude code–like γ€Œagent harness」, built from 0 to 1

Python 67,463 10,961 Updated Jun 15, 2026

Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)

Python 388 21 Updated Jun 11, 2026

[CVPR2026] TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering

Python 50 1 Updated Feb 26, 2026

Cool Papers - Immersive Paper Discovery

JavaScript 775 18 Updated Mar 27, 2026
Python 63 3 Updated Feb 12, 2026

A minimal CLI tool to organize, load, and switch between your project's environment contexts.

Python 2 Updated Jun 18, 2026

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Python 387 26 Updated May 2, 2026

[RSS 2026] Causal video-action world model for generalist robot control

Python 1,351 117 Updated Apr 29, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,876 361 Updated Jun 18, 2026

[AAAI 2026 Oral] SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

Python 62 1 Updated Jun 13, 2026

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Jupyter Notebook 240 8 Updated Nov 28, 2025

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

814 43 Updated Dec 17, 2025
Python 65 3 Updated Feb 20, 2025

Official code of Motus: A Unified Latent Action World Model

Python 1,152 65 Updated Jan 5, 2026

Ego-Vision World Model for Humanoid Contact Planning

Python 182 7 Updated Dec 24, 2025

Humanoid dataset for learning

Python 142 1 Updated May 10, 2026

[ICLR 2026] Towards Unified Latent VLA for Whole-body Loco-manipulation Control

486 12 Updated May 25, 2026

πŸ”₯ The first open-sourced diffusion vision-langauge-action model. [ICLR 2026]

Python 181 9 Updated Mar 12, 2026

The official code of paper WristWorld.

Python 26 Updated Nov 8, 2025

[CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

Python 92 1 Updated Jun 5, 2026
Python 115 12 Updated Oct 27, 2025

RoboBrain 2.5: Advanced version of RoboBrain. Depth in Sight, Time in Mind. πŸŽ‰πŸŽ‰πŸŽ‰

Python 1,103 109 Updated Feb 28, 2026

[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.

Python 554 44 Updated Oct 13, 2025

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,732 113 Updated Jan 6, 2026
Next