Stars
Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Curated list of python software and packages related to scientific research in audio
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD, postdoc in audio research.
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.o…
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
Baseline for the Spoofing-aware Speaker Verification Challenge 2022
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".
A collection of research papers and software related to explainability in graph machine learning.
Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
A high-level toolbox for using complex valued neural networks in PyTorch
This repo contains annotated research papers that I found really good and useful
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
Pytorch library for fast transformer implementations
A library for speech data augmentation in time-domain
📓 Notes and summaries of various ML, Computer Vision & NLP papers.
A wavefunction ansatz based on Recurrent Neural Networks to perform Variational Monte-Carlo Simulations
Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779
Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Official repository for RawNet, RawNet2, and RawNet3