Starred repositories

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020) We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,865 310 Updated Mar 14, 2023

A PyTorch-based Speech Toolkit

Python 10,967 1,615 Updated Dec 24, 2025

Conferencing Speech Challenge

Python 95 35 Updated Apr 6, 2021

Code for the Active Speakers in Context Paper (CVPR2020)

Python 56 14 Updated May 19, 2021

Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"

Python 113 25 Updated Nov 16, 2020

Open source audio annotation tool for humans

TypeScript 1,125 139 Updated Dec 14, 2025

🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

Python 1,897 333 Updated Nov 7, 2022

torchsummaryX: Improved visualization tool of torchsummary

Python 302 32 Updated Mar 17, 2022

TRI-ML Monocular Depth Estimation Repository

Python 1,271 245 Updated Jul 16, 2023

Google Research

Jupyter Notebook 36,951 8,279 Updated Dec 23, 2025

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Python 428 102 Updated May 18, 2023
Python 1 3 Updated May 1, 2018

This is a PyTorch implementation of the paper "Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization (MMAL-Net)" (Fan Zhang, Meng Li, Guisheng Zhai, Yizhao Liu).

Python 257 56 Updated Dec 29, 2020

Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation

Python 14 2 Updated Nov 16, 2020

PyTorch code for a multi-scale 1D ResNet; we hope it will help your research

Python 224 49 Updated Mar 29, 2021

(ImageNet pretrained models) The official PyTorch implementation of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"

Python 1,112 217 Updated Dec 8, 2022

A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi

Python 344 54 Updated Dec 25, 2020
Python 114 25 Updated Jan 8, 2021

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python 805 196 Updated Apr 6, 2023

Python implementation of the Short Term Objective Intelligibility measure

MATLAB 355 58 Updated Dec 29, 2023

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,289 545 Updated Jul 27, 2024

gan_torch (cpu & gpu)

Jupyter Notebook 30 11 Updated Feb 18, 2021
Python 51 8 Updated May 16, 2021

A PyTorch implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Python 574 124 Updated Oct 1, 2020

Implementation of Transformer model (originally from Attention is All You Need) applied to Time Series.

Jupyter Notebook 905 167 Updated Jun 2, 2023

A temporal module for PyTorch-ComplexTensor

Python 44 18 Updated Jun 28, 2024

Code and trained models for "Attentional Feature Fusion"

Python 806 101 Updated Jul 23, 2021

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,932 3,793 Updated Dec 25, 2025

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,296 232 Updated Nov 19, 2025

Tools for handling multimodal data in machine learning projects.

Python 1,095 257 Updated Dec 15, 2025