Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 12,261 1,588 Updated Mar 17, 2026

MiniMax-AI / MiniMax-M2.1

MiniMax M2.1, a SOTA model for real-world dev & agents.

542 46 Updated Jan 28, 2026

XiaomiMiMo / MiMo-Audio

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 1,056 104 Updated Jun 17, 2026

zlab-princeton / SoFlow

[ICLR 2026] SoFlow: Solution Flow Models for One-Step Generative Modeling

Python 160 7 Updated Apr 8, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,554 321 Updated May 26, 2026

disco-speech / DisCo-Speech

90 7 Updated Dec 31, 2025

ddlBoJack / Omni-Captioner

[ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.

Python 138 Updated Apr 7, 2026

ASLP-lab / MeanVC

A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows

Python 291 21 Updated Jan 8, 2026

ASLP-lab / DiffRhythm2

Forked from xiaomi-research/diffrhythm2

Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching

Python 166 12 Updated Nov 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Liumeng Xue lmxue

Achievements

Achievements

Organizations

Block or report lmxue

Stars

dieKarotte / ASAudio

ASLP-lab / ArxivWatcher

qualialabsAI / SmoothConv-DuplexConv

halspeech / julius-speech-foundation-model

cwx-worst-one / WavTTS

ddlBoJack / MMAE

Soul-AILab / SoulX-Transcriber

ASLP-lab / Speaker-Reasoner

ASLP-lab / MINT-Bench-Demo

ASLP-lab / MINT-Bench

ZeyueT / Audio-Omni

jeremychee4 / AffectSpeech

k2-fsa / OmniVoice

HKUDS / CLI-Anything

X-LANCE / Xmart

QwenLM / Qwen3-TTS

MiniMax-AI / MiniMax-M2.1

XiaomiMiMo / MiMo-Audio

zlab-princeton / SoFlow

facebookresearch / sam-audio

disco-speech / DisCo-Speech

ddlBoJack / Omni-Captioner

ASLP-lab / MeanVC

ASLP-lab / DiffRhythm2

Soul-AILab / SoulX-Podcast

OpenBMB / VoxCPM

wuzhiyue111 / Codec-Evaluation

diodiogod / TTS-Audio-Suite

opendilab / HH-Codec

xiquan-li / MeanAudio