Stars
Automatic classification of stop consonant realisation with wav2vec2.0
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
This package contains functions for converting wav files into auditory representations and comparing them
String-to-String Algorithms for Natural Language Processing
Open-source vector similarity search for Postgres
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python library for downloading, loading & working with sound datasets
Tools for handling multimodal data in machine learning projects.
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Automatic download and forced alignment of YouTube videos using subtitles
A vocoder framework that has been widely used in the research community since 1999.
Read-only unofficial mirror of the OpenGrm NGram Library
A few small collections of voice onset times in a common format
Interface for running Praat scripts through Python
A benchmark framework for testing algorithms and pairwise metrics.
Development repository for Integrated Speech Corpus Analysis (ISCAN)
Large Scale Facial Model (LSFM) - an automatic pipeline for constructing 3D Morphable Models from large collections of facial meshes
A collection of links and notes on forced alignment tools
A Praat plug-in for performing interactive phonetic forced alignment
A pitch tracker using Camacho's SWIPE' algorithm, written in C