- University of Edinburgh
- Edinburgh, Scotland
- https://medium.com/@pilarsoledad
Stars
💻 A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech 🔈 from text
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A minimal PyTorch package implementing a gradient reversal layer.
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
A simple tutorial on Variational AutoEncoders with PyTorch
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
CodeSwitch is an NLP tool that can be used for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed data.
Command line utility for forced alignment using Kaldi
A technical report on convolution arithmetic in the context of deep learning
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
A list of efficient attention modules
An implementation of the efficient attention module.
Language Markup and Experimental Design Software -- for running experiments over the internet
A pitch tracker using Camacho's SWIPE' algorithm, written in C
Graph-to-sequence model implemented in PyTorch, combining graph convolutional networks and OpenNMT-py
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Contrastive Predictive Coding for Automatic Speaker Verification