Skip to content
View Natalia-T's full-sized avatar
  • Inria
  • France

Block or report Natalia-T

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'

Python 161 11 Updated Mar 26, 2026
Python 96 12 Updated Jan 28, 2026

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

Python 401 46 Updated Sep 14, 2021

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 10,395 969 Updated May 16, 2026

Codabench is a flexible, easy-to-use and reproducible benchmarking platform. Check our paper: https://hubs.li/Q01fwRWB0

Python 157 65 Updated Jun 12, 2026

Implementation of the VQ-VAE model described in "Privacy-oriented manipulation of speaker representations".

Python 3 Updated Sep 11, 2024

Recipe for training and testing RIR-Classifier

Python 6 Updated Sep 18, 2024

Lightweight highly configurable Python launcher based on microkernel architecture

Python 3 1 Updated Jun 9, 2026

Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor

Python 19 2 Updated Jun 5, 2023

A python package to analyze and compare voices with deep learning

Python 3,267 484 Updated Oct 12, 2023

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,285 220 Updated Apr 13, 2026
Python 410 28 Updated May 27, 2024

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 247 23 Updated Apr 20, 2024

The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"

Jupyter Notebook 34 2 Updated Nov 23, 2023

Awesome speech/audio LLMs, representation learning, and codec models

1,232 75 Updated Jun 1, 2026

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,793 1,173 Updated Apr 8, 2026

Model implementation and trained network for "A Two-Step Disentanglement Method" by Naama Hadad, Lior Wolf and Moni Shahar

Python 21 7 Updated Mar 21, 2018

Measuring Disentanglement: A Review of Metrics

Python 36 8 Updated Dec 18, 2020

MOS score prediction by fine-tuned wav2vec2.0 model

Python 180 22 Updated Oct 20, 2022

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 950 151 Updated Dec 1, 2024

The VoxTube dataset official repository

HTML 71 2 Updated Feb 14, 2024

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,445 133 Updated Apr 24, 2024

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,626 2,063 Updated Nov 3, 2023

Context Dependent Semantic Parsing: Awesome Paper List

20 3 Updated Nov 9, 2024