Skip to content
View lixin4ever's full-sized avatar
🍉
I may be slow to respond before the due date of ACL.
🍉
I may be slow to respond before the due date of ACL.

Organizations

@dmlc @textmine

Block or report lixin4ever

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation

Python 391 11 Updated Apr 30, 2026

A coding agent optimized to smaller LLMs

TypeScript 837 56 Updated Apr 28, 2026

SenseNova-U series: Native Unified Paradigm with NEO-Unify from the First Principles

Python 700 23 Updated Apr 30, 2026

[CVPR 2026] Scaling Spatial Intelligence with Multimodal Foundation Models

Python 230 12 Updated Apr 29, 2026
Python 3 Updated Apr 27, 2026

Allen Institute for AI: WildDet3D: Scaling Promptable 3D Detection in the Wild

Python 529 38 Updated Apr 27, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,539 764 Updated Apr 30, 2026
Python 78 5 Updated Mar 31, 2026

HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos

Python 221 24 Updated Apr 16, 2026

[CVPR 2026] UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos

Python 110 5 Updated Mar 31, 2026

Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.

Python 43 Updated Apr 17, 2023

[RSS 2026] Interactive World Simulator for Robot Policy Training and Evaluation

Python 235 12 Updated Mar 20, 2026

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python 1,566 131 Updated Dec 3, 2025

Cosmos Policy

Python 736 70 Updated Jan 23, 2026

Moonshot's most powerful model

1,892 233 Updated Jan 31, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 366,756 75,324 Updated Apr 30, 2026

RynnBrain: Open Embodied Foundation Models

Jupyter Notebook 746 70 Updated Apr 15, 2026

RynnScale: Scalable VLM and VLA Development Kits

Python 18 2 Updated Feb 28, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 2,016 83 Updated Apr 3, 2026

A Large-scale Video Action Dataset

Python 456 14 Updated Jan 16, 2026

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 636 53 Updated Apr 13, 2026

Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.

403 28 Updated Jan 21, 2026

AllenAI's post-training codebase

Python 3,704 537 Updated Apr 30, 2026

Galaxea's open-source VLA repository

Python 578 43 Updated Feb 14, 2026

MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B

Jupyter Notebook 1,794 178 Updated Apr 20, 2026

Official code of Motus: A Unified Latent Action World Model

Python 1,019 50 Updated Jan 5, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,591 1,217 Updated Apr 29, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,480 250 Updated Apr 15, 2026
Next