🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,424 3,348 Updated Dec 24, 2025

The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

Python 67 1 Updated Dec 2, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 1,723 100 Updated Dec 25, 2025

Sharp Monocular View Synthesis in Less Than a Second

Python 5,051 322 Updated Dec 19, 2025

RLP: Reinforcement as a Pretraining Objective

218 13 Updated Oct 5, 2025
Python 119 7 Updated Aug 29, 2024

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 789 98 Updated Sep 8, 2025

[NeurIPS 2025] Official implementation for "Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling"

Python 117 18 Updated Nov 27, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,447 750 Updated Dec 21, 2025

A series of LLM + RL algorithm variants implemented in native PyTorch

Python 3 Updated Dec 17, 2025

A Foundation Model for Generalist Gaming Agents

Python 918 107 Updated Dec 23, 2025

Visual Geometry Transformer for Autonomous Driving

Python 97 3 Updated Dec 19, 2025

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 4,223 676 Updated Aug 15, 2024

Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image

Python 56 1 Updated Dec 23, 2025

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

Python 344 30 Updated Dec 11, 2025

A Video Tokenizer Evaluation Dataset

Python 145 10 Updated Jan 13, 2025

Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction"

Python 49 1 Updated Nov 25, 2025
Python 7,838 461 Updated Dec 25, 2025

[ECCV 2024] Official implementation of the paper "DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model".

Python 555 17 Updated Dec 15, 2023

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

304 12 Updated Mar 14, 2024

🚀🚀 "Large Model": Train a small 26M-parameter GPT completely from scratch in just 2h! 🌏

Python 36,095 4,262 Updated Dec 24, 2025

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 1,019 1,346 Updated Aug 29, 2025

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 26,537 3,775 Updated Dec 24, 2025

Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets

Python 171 10 Updated Oct 8, 2025

Bird's-eye view for the CARLA simulator

Python 219 27 Updated Aug 30, 2024

All-in-one Chrome extension for annotated learning, real-time collaboration, and reading focus tools.

JavaScript 18 1 Updated Dec 20, 2025
Python 484 30 Updated Nov 26, 2025

PyTorch code for the paper "Model-Based Imitation Learning for Urban Driving".

Python 416 39 Updated Apr 21, 2023

MiMo-Embodied

Python 326 11 Updated Nov 21, 2025