LaurentMazare

Organizations

@kyutai-labs

12 stars written in Python

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,076 827 Updated Nov 3, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,561 259 Updated Sep 22, 2025

Convolutional neural network model for video classification trained on the Kinetics dataset.

Python 1,805 468 Updated Sep 12, 2019

A voice chat app

Python 1,170 147 Updated May 21, 2025

Make text LLMs listen and speak

Python 956 168 Updated Nov 3, 2025

Google's NotebookLM but local

Python 597 81 Updated Sep 18, 2025

Kyutai with an "eye"

Python 223 28 Updated Mar 26, 2025

Simple high-throughput inference library

Python 149 10 Updated May 14, 2025

Fine-tuning Moshi/J-Moshi on your own spoken dialogue data

Python 73 8 Updated Jul 31, 2025
Python 29 4 Updated Apr 28, 2025

A text embedding extension for the Polars Dataframe library.

Python 26 Updated Nov 21, 2024