A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,356 1,902 Updated Apr 1, 2026

Open-source unified multimodal model

Python 5,802 513 Updated Oct 27, 2025
C++ 74 1 Updated Mar 29, 2025

Official code for "3D Question Answering via only 2D Vision-Language Models"

Python 24 1 Updated Mar 4, 2026
Python 4,632 456 Updated Sep 14, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 7,970 1,088 Updated Apr 13, 2026

[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Python 328 17 Updated Feb 11, 2026

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 1,192 74 Updated Jun 6, 2024