- Shatin, N.T., HKSAR
- https://lixin4ever.github.io/
- @lixin4ever
Stars
SenseNova-U series: Native Unified Paradigm with NEO-Unify from the First Principles
[CVPR 2026] Scaling Spatial Intelligence with Multimodal Foundation Models
Allen Institute for AI: WildDet3D: Scaling Promptable 3D Detection in the Wild
slime is an LLM post-training framework for RL Scaling.
HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos
[CVPR 2026] UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
[RSS 2026] Interactive World Simulator for Robot Policy Training and Evaluation
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
RynnBrain: Open Embodied Foundation Models
RynnScale: Scalable VLM and VLA Development Kits
Fast, Sharp & Reliable Agentic Intelligence
[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.
MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
Official code of Motus: A Unified Latent Action World Model
DeepEP: an efficient expert-parallel communication library
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.