Python 3 Updated May 7, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 1,011 108 Updated Apr 3, 2026

[CVPR 2026 Oral] VGGT Omega

Python 3,135 145 Updated May 18, 2026

LaST-R1

Python 101 5 Updated May 6, 2026

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

Python 67,744 11,020 Updated Jun 21, 2026

Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)

Python 389 21 Updated Jun 11, 2026

[CVPR2026] TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering

Python 50 1 Updated Feb 26, 2026

Cool Papers - Immersive Paper Discovery

JavaScript 776 18 Updated Mar 27, 2026
Python 63 3 Updated Feb 12, 2026

A minimal CLI tool to organize, load, and switch between your project's environment contexts.

Python 2 Updated Jun 18, 2026

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Python 396 28 Updated May 2, 2026

[RSS 2026] Causal video-action world model for generalist robot control

Python 1,360 118 Updated Apr 29, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,884 362 Updated Jun 22, 2026

[AAAI 2026 Oral] SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

Python 62 1 Updated Jun 13, 2026

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Jupyter Notebook 241 8 Updated Nov 28, 2025

A curated list of 3D vision papers related to robotics in the era of large models (i.e., LLMs/VLMs), inspired by awesome-computer-vision, including papers, code, and related websites

814 43 Updated Dec 17, 2025
Python 65 3 Updated Feb 20, 2025

Official code of Motus: A Unified Latent Action World Model

Python 1,155 65 Updated Jan 5, 2026

Ego-Vision World Model for Humanoid Contact Planning

Python 183 7 Updated Dec 24, 2025

Humanoid dataset for learning

Python 143 1 Updated Jun 21, 2026

[ICLR 2026] Towards Unified Latent VLA for Whole-body Loco-manipulation Control

492 13 Updated May 25, 2026

🔥 The first open-sourced diffusion vision-language-action model. [ICLR 2026]

Python 182 9 Updated Mar 12, 2026

The official code of paper WristWorld.

Python 26 Updated Nov 8, 2025

[CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

Python 92 1 Updated Jun 5, 2026
Python 115 12 Updated Oct 27, 2025

RoboBrain 2.5: Advanced version of RoboBrain. Depth in Sight, Time in Mind. 🎉🎉🎉

Python 1,104 109 Updated Feb 28, 2026

[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.

Python 554 44 Updated Oct 13, 2025

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,735 113 Updated Jan 6, 2026