Skip to content
View Dou-Yiming's full-sized avatar

Block or report Dou-Yiming

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 159,442 24,823 Updated Feb 4, 2026

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,507 109 Updated Feb 3, 2026

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

Python 328 25 Updated Dec 17, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,574 2,171 Updated Feb 3, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,526 4,702 Updated Feb 3, 2026

Realtime & high-frequency control interfaces for various robot arms including bi-manual I2RT YAM, Franka Panda, with manual tele-operation control or autonomous policy control

Python 24 4 Updated Jan 26, 2026

Dexbotic: Open-Source Vision-Language-Action Toolbox

Python 688 57 Updated Jan 20, 2026

Modern, minimal, and modular LaTeX CV template ✨ 📄

TeX 23 1 Updated Dec 4, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 21,416 3,680 Updated Feb 3, 2026

The official Meta Llama 3 GitHub site

Python 29,218 3,513 Updated Jan 26, 2025

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 1,014 121 Updated Sep 9, 2025

Official implementation of Diffusion Policy Policy Optimization, arxiv 2024

Python 749 92 Updated Feb 4, 2025

DiWA: Diffusion Policy Adaptation with World Models

Python 70 2 Updated Aug 26, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 2,922 316 Updated Aug 28, 2025

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,374 95 Updated Jan 31, 2025
Python 71 27 Updated Jan 23, 2026

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 5,186 624 Updated Mar 23, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,493 723 Updated Nov 20, 2025

Distributed Robot Interaction Dataset.

Jupyter Notebook 317 55 Updated Sep 15, 2025

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 2,411 217 Updated Jul 17, 2024
Python 365 47 Updated Mar 24, 2025

Official code for the CVPR 2025 paper "Navigation World Models".

Python 525 47 Updated Nov 24, 2025

robomimic: A Modular Framework for Robot Learning from Demonstration

Python 1,268 344 Updated Nov 10, 2025

DROID Policy Learning and Evaluation

Python 264 26 Updated Apr 22, 2025

Text-to-Audio/Music Generation

Python 2,575 205 Updated Sep 29, 2024

DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation

C 171 17 Updated Oct 2, 2025

Official Implementation of Paper Transfer between Modalities with MetaQueries

Python 297 11 Updated Oct 12, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,365 194 Updated Jun 5, 2025
Next