Skip to content
View lixin4ever's full-sized avatar
🍉
I may be slow to respond before the due date of ACL.
🍉
I may be slow to respond before the due date of ACL.

Organizations

@dmlc @textmine

Block or report lixin4ever

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 40 3 Updated Mar 31, 2026

HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos

Python 204 20 Updated Apr 5, 2025

[CVPR 2026] UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos

Python 82 3 Updated Mar 31, 2026

Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.

Python 43 Updated Apr 17, 2023

Interactive World Simulator for Robot Policy Training and Evaluation

Python 205 11 Updated Mar 20, 2026

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python 1,537 128 Updated Dec 3, 2025

Cosmos Policy

Python 698 58 Updated Jan 23, 2026

Moonshot's most powerful model

1,703 186 Updated Jan 31, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 352,367 71,007 Updated Apr 9, 2026

RynnBrain: Open Embodied Foundation Models

Jupyter Notebook 718 67 Updated Mar 10, 2026

RynnScale: Scalable VLM and VLA Development Kits

Python 17 2 Updated Feb 28, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 1,994 79 Updated Apr 3, 2026

A Large-scale Video Action Dataset

Python 444 12 Updated Jan 16, 2026

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 585 50 Updated Mar 25, 2026

Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.

402 28 Updated Jan 21, 2026

AllenAI's post-training codebase

Python 3,681 531 Updated Apr 9, 2026

Galaxea's open-source VLA repository

Python 562 40 Updated Feb 14, 2026

MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B

Jupyter Notebook 1,770 176 Updated Mar 20, 2026

Official code of Motus: A Unified Latent Action World Model

Python 942 43 Updated Jan 5, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,105 1,146 Updated Mar 31, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,450 248 Updated Apr 8, 2026
33 Updated Dec 17, 2025

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Python 57 Updated Mar 11, 2026

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,612 267 Updated Jul 31, 2024

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,061 147 Updated Apr 9, 2026

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

21,991 2,248 Updated Dec 12, 2025

[ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Python 83 2 Updated Mar 13, 2026

Code for [AAAI 2026] AffordDex: Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Python 29 Updated Dec 26, 2025

A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flexible deployment across diverse robot platforms.

Python 26 1 Updated Apr 4, 2026
Next