LI XIN lixin4ever

🍉

I may be slow to respond before the ACL deadline.

PhD@CUHK, Research Engineer@Alibaba

435 followers · 46 following

Stars

thu-ml / TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 1,943 121 Updated Dec 25, 2025

GuanhuaJi / oxe-auge

17 Updated Dec 17, 2025

InternRobotics / MMSI-Video-Bench

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Python 40 Updated Dec 23, 2025

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,485 242 Updated Jul 31, 2024

radixark / miles

Python 627 60 Updated Dec 25, 2025

PicoTrex / Awesome-Nano-Banana-images

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

19,148 1,998 Updated Dec 12, 2025

InternRobotics / MMSI-Bench

[arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Python 67 Updated Dec 23, 2025

Maxwell-Zhao / AffordDex

Code for [AAAI 2026] AffordDex: Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Python 12 Updated Nov 20, 2025

alibaba-damo-academy / RynnMotion

A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flexible deployment across diverse robot platforms.

C++ 15 Updated Dec 21, 2025

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 5,101 479 Updated Dec 16, 2025

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,467 754 Updated Dec 21, 2025

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 447 25 Updated Dec 15, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,372 52 Updated Nov 28, 2025

FlagOpen / RoboBrain-X0

Python 100 11 Updated Oct 27, 2025

MiniMax-AI / MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows.

2,125 161 Updated Nov 13, 2025

EvolvingLMMs-Lab / lmms-engine

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 681 27 Updated Dec 23, 2025

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)

10,215 695 Updated Dec 3, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,573 1,929 Updated Oct 25, 2025

Maxwell-Zhao / RoboSimGS

Code for "High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting"

Python 45 Updated Oct 27, 2025

Espere-1119-Song / VideoNSA

VideoNSA: Native Sparse Attention Scales Video Understanding

Python 75 1 Updated Nov 16, 2025

thu-ml / RDT2

Official code of RDT 2

Python 606 30 Updated Dec 3, 2025

LeCAR-Lab / HDMI

Python 486 28 Updated Nov 29, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,164 193 Updated Oct 9, 2025

huggingface / fineVideo

Python 95 5 Updated Sep 19, 2024

EvolvingLMMs-Lab / LLaVA-OneVision-1.5

Fully Open Framework for Democratized Multimodal Training

Python 663 53 Updated Dec 15, 2025

MiroMindAI / MiroThinker

MiroThinker is a series of open-source agentic models trained for deep research and complex tool use scenarios.

Python 1,369 95 Updated Dec 23, 2025

MiroMindAI / MiroFlow

MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowserComp and xBench.

Python 1,623 175 Updated Nov 30, 2025

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,577 161 Updated Dec 18, 2025

unitreerobotics / Qmini

499 69 Updated Sep 17, 2025

PRIME-RL / SimpleVLA-RL

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,147 63 Updated Oct 13, 2025
