Dou-Yiming

Follow

Yiming Dou Dou-Yiming

Follow

CS PhD student @ Cornell Tech

179 followers · 183 following

Cornell Tech
Shanghai ↔️ New York
www.yimingdou.com
@_YimingDou

Achievements

Achievements

Lists (17)

Sort

Audio

11 repositories

Causality

Graphics

lists

Multimodal

24 repositories

nerf

NLP

11 repositories

prompt

reading

reasoning

RL

Robotics

11 repositories

simulator

Tactile

11 repositories

Templates

tools

12 repositories

Vision

39 repositories

Stars

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 159,442 24,823 Updated Feb 4, 2026

jonyzhang2023 / awesome-embodied-vla-va-vln

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,507 109 Updated Feb 3, 2026

haidog-yaqub / EzAudio

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

Python 328 25 Updated Dec 17, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,574 2,171 Updated Feb 3, 2026

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,526 4,702 Updated Feb 3, 2026

uynitsuj / robots_realtime

Realtime & high-frequency control interfaces for various robot arms including bi-manual I2RT YAM, Franka Panda, with manual tele-operation control or autonomous policy control

Python 24 4 Updated Jan 26, 2026

dexmal / dexbotic

Dexbotic: Open-Source Vision-Language-Action Toolbox

Python 688 57 Updated Jan 20, 2026

giladturok / CleanCV

Modern, minimal, and modular LaTeX CV template ✨ 📄

TeX 23 1 Updated Dec 4, 2025

CharlesQ9 / Self-Evolving-Agents

848 80 Updated Oct 15, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 21,416 3,680 Updated Feb 3, 2026

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,218 3,513 Updated Jan 26, 2025

moojink / openvla-oft

Forked from openvla/openvla

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 1,014 121 Updated Sep 9, 2025

irom-princeton / dppo

Official implementation of Diffusion Policy Policy Optimization, arxiv 2024

Python 749 92 Updated Feb 4, 2025

acl21 / diwa

DiWA: Diffusion Policy Adaptation with World Models

Python 70 2 Updated Aug 26, 2025

facebookresearch / vjepa2

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 2,922 316 Updated Aug 28, 2025

allenzren / open-pi-zero

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,374 95 Updated Jan 31, 2025

i2rt-robotics / i2rt

Python 71 27 Updated Jan 23, 2026

Physical-Intelligence / openpi

Python 10,119 1,429 Updated Dec 27, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 5,186 624 Updated Mar 23, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,493 723 Updated Nov 20, 2025

droid-dataset / droid

Distributed Robot Interaction Dataset.

Jupyter Notebook 317 55 Updated Sep 15, 2025

dmlc / decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 2,411 217 Updated Jul 17, 2024

gaoyuezhou / dino_wm

Python 365 47 Updated Mar 24, 2025

facebookresearch / nwm

Official code for the CVPR 2025 paper "Navigation World Models".

Python 525 47 Updated Nov 24, 2025

ARISE-Initiative / robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Python 1,268 344 Updated Nov 10, 2025

droid-dataset / droid_policy_learning

DROID Policy Learning and Evaluation

Python 264 26 Updated Apr 22, 2025

haoheliu / AudioLDM2

Text-to-Audio/Music Generation

Python 2,575 205 Updated Sep 29, 2024

real-stanford / DexUMI

DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation

C 171 17 Updated Oct 2, 2025

facebookresearch / metaquery

Official Implementation of Paper Transfer between Modalities with MetaQueries

Python 297 11 Updated Oct 12, 2025

test-time-training / ttt-video-dit

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,365 194 Updated Jun 5, 2025