TakHemlata's starred repositories: 20 written in Python

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,721 799 Updated May 27, 2025

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 9,681 1,615 Updated Jun 26, 2024

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,211 334 Updated Sep 10, 2025

PyTorch library for fast transformer implementations

Python 1,765 191 Updated Mar 23, 2023

A high-level toolbox for using complex valued neural networks in PyTorch

Python 752 159 Updated Sep 4, 2024

A library for speech data augmentation in time-domain

Python 684 60 Updated Aug 30, 2021

Angular penalty loss functions in PyTorch (ArcFace, SphereFace, Additive Margin, CosFace)

Python 497 92 Updated Dec 13, 2023

Official repository for RawNet, RawNet2, and RawNet3

Python 399 58 Updated Mar 21, 2024

Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"

Python 273 73 Updated Jun 25, 2023

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Python 162 37 Updated Sep 26, 2023

Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)

Python 100 27 Updated Apr 20, 2020

This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.o…

Python 92 20 Updated Sep 17, 2023
Python 82 12 Updated Dec 1, 2023

This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".

Python 73 15 Updated Sep 24, 2023

Baseline for the Spoofing-aware Speaker Verification Challenge 2022

Python 66 22 Updated May 3, 2022

MelNet-Tensorflow implementation

Python 40 2 Updated Dec 1, 2020

PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779

Python 24 6 Updated Aug 7, 2019

PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779

Python 3 Updated Aug 7, 2019