Starred repositories
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Conferencing Speech Challenge
Code for the Active Speakers in Context Paper (CVPR2020)
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
Open source audio annotation tool for humans
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
torchsummaryX: Improved visualization tool of torchsummary
TRI-ML Monocular Depth Estimation Repository
Google Research
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
This is a PyTorch implementation of the paper "Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization (MMAL-Net)" (Fan Zhang, Meng Li, Guisheng Zhai, Yizhao Liu).
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
pytorch code of multi scale 1d resnet, we hope it will help your research
(ImageNet pretrained models) The official pytorch implemention of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Python implementation of the Short Term Objective Intelligibility measure
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
Implementation of Transformer model (originally from Attention is All You Need) applied to Time Series.
A temporal module for PyTorch-ComplexTensor
code and trained models for "Attentional Feature Fusion"
You like pytorch? You like micrograd? You love tinygrad! ❤️
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Tools for handling multimodal data in machine learning projects.