LiveStarry

LiveStarry

Stars

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,356 1,902 Updated Apr 1, 2026

Open-source unified multimodal model

Python 5,802 513 Updated Oct 27, 2025

C++ 74 1 Updated Mar 29, 2025

official code for "3D Question Answering via only 2D Vision-Language Models"

Python 24 1 Updated Mar 4, 2026

Python 4,632 456 Updated Sep 14, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 7,970 1,088 Updated Apr 13, 2026

[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Python 328 17 Updated Feb 11, 2026

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 1,192 74 Updated Jun 6, 2024