  • Beijing


Replication of an EgoScale-style data collection tool based on the Unitree G1 robot.

Python 1 Updated Jun 3, 2026

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in PyTorch

Python 540 16 Updated Jan 18, 2026

A curated awesome list for dexterous robot manipulation, tactile sensing, dexterous hands, robot learning, datasets, benchmarks, and simulators.

8 Updated May 11, 2026

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos, CVPR 2025

Python 255 33 Updated Jun 16, 2026

[ICLR 2026] Sat3DGen: Comprehensive Street-Level 3D Scene Generation from Single Satellite Image

Python 97 7 Updated May 18, 2026

Deep Learning for Visual-Inertial Odometry

Python 158 16 Updated Nov 9, 2024

A Curated List of Vision-Language-Action (VLA) and World Action Models (WAM) Research and Beyond

765 26 Updated Jun 16, 2026

[CVPR 2026] UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos

Python 146 7 Updated Mar 31, 2026

[CVPR 2026] ZipMap: Linear-Time Stateful 3D Reconstruction via Test-Time Training

Python 452 11 Updated Jun 11, 2026
Python 90 6 Updated Dec 18, 2024

This repository holds the code that wraps habitat-sim. The main purpose of this code is data collection. Datasets like [mvl-dataset](https://huggingface.co/datasets/EnriqueSolarte/mvl_datasets) wer…

Python 5 Updated Sep 23, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,990 161 Updated Jun 9, 2026

An agentic skills framework & software development methodology that works.

Shell 232,541 20,657 Updated Jun 18, 2026
Python 182 24 Updated May 29, 2026

Project Lyra: Open Generative 3D World Models

Python 2,106 223 Updated Jun 11, 2026

A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…

TeX 605 17 Updated Jun 4, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

Python 1,804 60 Updated Jun 18, 2026

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

3,033 125 Updated Jun 12, 2026

Paper list for robot learning from human videos (LfHV)

129 4 Updated Jun 11, 2026

Official Implementation of SAGE-GRPO: Manifold-Aware Exploration for Reinforcement Learning in Video Generation

Python 124 2 Updated Apr 2, 2026

[CVPR 2026] Official code for "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 199 3 Updated Mar 19, 2026

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 65,132 7,231 Updated May 22, 2026

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python 1,888 136 Updated May 22, 2026

Simulate and correct images for dichromatic color blindness

Python 92 20 Updated Sep 4, 2024

Code implementation of Pi-Long

Python 188 12 Updated Apr 16, 2026

RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.

Python 58 6 Updated Mar 18, 2026

CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine

Python 40 5 Updated Feb 2, 2026

Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"

Python 272 29 Updated Jan 20, 2026

Python pdb for multiple processes

Python 82 9 Updated May 24, 2025

A Survey on Reinforcement Learning of Vision-Language-Action Models for Robotic Manipulation

747 22 Updated May 18, 2026