ASLP-lab

ASLP-lab

336 followers · 2 following

Achievements

Stars

qualialabsAI / SmoothConv-DuplexConv

HTML 66 Updated Jun 12, 2026

Soul-AILab / SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 3,444 440 Updated Dec 11, 2025

ASLP-lab / FlashTTS

Fast Streaming TTS with MTP Acceleration and X-pred Mean Flow Distillation

Python 37 1 Updated Jun 9, 2026

ASLP-lab / ArxivWatcher

Python 16 1 Updated Jun 12, 2026

Stability-AI / stable-audio-3

Python 485 58 Updated Jun 9, 2026

ASLP-lab / FMSU

Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model

Python 25 1 Updated May 21, 2026

ASLP-lab / MINT-Bench

Python 47 2 Updated May 2, 2026

ASLP-lab / Speaker-Reasoner

Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR

Python 88 2 Updated May 13, 2026

openvpi / SOME

SOME: Singing-Oriented MIDI Extractor.

Python 695 54 Updated Mar 7, 2026

schrodingercatss / tuning_playbook_zh_cn

一本系统地教你将深度学习模型的性能最大化的战术手册。

3,205 287 Updated May 27, 2023

LAION-AI / scaled-echo-tts

Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024 GPUs)

Python 24 Updated Mar 29, 2026

ASLP-lab / YingMusic-Singer-Plus

YingMusic-Singer-Plus: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance

Python 66 4 Updated Apr 12, 2026

ASLP-lab / Stream-TN

HTML 13 Updated Mar 25, 2026

ASLP-lab / OmniCodec

OmniCodec: Low Frame Rate Universal Audio Codec with Semantic–Acoustic Disentanglement

Python 40 1 Updated Apr 17, 2026

ASLP-lab / M7-TTS

M7-TTS: A Mini-Scale Multilingual and Multi-Dialect Text-to-Speech Language Model with Mimi codec and Multi Token Prediction

20 1 Updated Mar 19, 2026

ASLP-lab / SmartGlasses

This challenge focuses on evaluating speech recognition and semantic understanding capabilities of AI glasses in complex real-world environments.

18 Updated Jun 14, 2026

ASLP-lab / OSUM-Pangu

An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs

Python 32 Updated Mar 15, 2026

ASLP-lab / WenetSpeech-Wu-Repo

A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations

Python 152 4 Updated Feb 6, 2026

ASLP-lab / VoiceSculptor

An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.

Python 250 12 Updated Feb 26, 2026

ASLP-lab / DiffRhythm2

Forked from xiaomi-research/diffrhythm2

Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching

Python 165 12 Updated Nov 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ASLP-lab

Achievements

Achievements

Block or report ASLP-lab

Stars

qualialabsAI / SmoothConv-DuplexConv

Soul-AILab / SoulX-Podcast

ASLP-lab / FlashTTS

ASLP-lab / ArxivWatcher

Stability-AI / stable-audio-3

ASLP-lab / FMSU

ASLP-lab / MINT-Bench

ASLP-lab / Speaker-Reasoner

openvpi / SOME

schrodingercatss / tuning_playbook_zh_cn

LAION-AI / scaled-echo-tts

ASLP-lab / YingMusic-Singer-Plus

ASLP-lab / Stream-TN

ASLP-lab / OmniCodec

ASLP-lab / M7-TTS

ASLP-lab / SmartGlasses

ASLP-lab / OSUM-Pangu

ASLP-lab / WenetSpeech-Wu-Repo

ASLP-lab / VoiceSculptor

ASLP-lab / DiffRhythm2

ASLP-lab / MeanVC

ASLP-lab / SongFormer

ASLP-lab / Easy-Turn

ASLP-lab / WenetSpeech-Chuan

ASLP-lab / Automatic-Song-Aesthetics-Evaluation-Challenge

ASLP-lab / WenetSpeech-Yue

ASLP-lab / MSU-Bench

MrSupW / ContextASR-Bench

ASLP-lab / SongEval

ASLP-lab / LLaSA_Plus