Skip to content
View lixin4ever's full-sized avatar
🍉
I may be slow to respond before the due date of ACL.
🍉
I may be slow to respond before the due date of ACL.

Organizations

@dmlc @textmine

Block or report lixin4ever

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SenseNova-U series: Native Unified Paradigm with NEO-Unify from the First Principles

Python 116 3 Updated Apr 28, 2026

[CVPR 2026] Scaling Spatial Intelligence with Multimodal Foundation Models

Python 212 11 Updated Apr 19, 2026
Python 3 Updated Apr 27, 2026

Allen Institute for AI: WildDet3D: Scaling Promptable 3D Detection in the Wild

Python 521 38 Updated Apr 27, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,507 755 Updated Apr 28, 2026
Python 76 5 Updated Mar 31, 2026

HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos

Python 219 24 Updated Apr 16, 2026

[CVPR 2026] UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos

Python 107 4 Updated Mar 31, 2026

Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.

Python 43 Updated Apr 17, 2023

[RSS 2026] Interactive World Simulator for Robot Policy Training and Evaluation

Python 224 12 Updated Mar 20, 2026

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python 1,565 131 Updated Dec 3, 2025

Cosmos Policy

Python 731 68 Updated Jan 23, 2026

Moonshot's most powerful model

1,875 224 Updated Jan 31, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 365,542 74,904 Updated Apr 28, 2026

RynnBrain: Open Embodied Foundation Models

Jupyter Notebook 742 70 Updated Apr 15, 2026

RynnScale: Scalable VLM and VLA Development Kits

Python 17 2 Updated Feb 28, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 2,017 82 Updated Apr 3, 2026

A Large-scale Video Action Dataset

Python 453 14 Updated Jan 16, 2026

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 632 52 Updated Apr 13, 2026

Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.

403 28 Updated Jan 21, 2026

AllenAI's post-training codebase

Python 3,703 536 Updated Apr 28, 2026

Galaxea's open-source VLA repository

Python 573 42 Updated Feb 14, 2026

MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B

Jupyter Notebook 1,792 178 Updated Apr 20, 2026

Official code of Motus: A Unified Latent Action World Model

Python 1,001 49 Updated Jan 5, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,571 1,207 Updated Apr 28, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,477 250 Updated Apr 15, 2026

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Python 57 Updated Mar 11, 2026

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,635 271 Updated Jul 31, 2024
Next