Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos

Python 10 Updated Sep 2, 2024

This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer

Python 17 5 Updated Mar 29, 2022

PyTorch/Python implementation of the joint CNN-LSTM deep learning model

Python 24 3 Updated Aug 3, 2021

This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".

Python 70 14 Updated Sep 24, 2023

This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.o…

Python 3 1 Updated Oct 23, 2022

Implementation of "Monaural Speech Enhancement with Recursive Learning in the Time Domain"

Python 48 7 Updated Nov 4, 2020

The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"

Python 80 18 Updated Dec 8, 2022

This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by El…

Python 71 9 Updated Feb 10, 2022

Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"

Jupyter Notebook 132 32 Updated Aug 30, 2024

This repository contains some material of speech enhancement and dereverberation. On the one hand, I summarize this work for my further understanding. On the other hand, I hope that all beginners o…

47 12 Updated Jul 6, 2020

Deep Learning Based Monaural Speech Dereverberation Models: Hope We Can Get Better Performance of Dereverberation

Python 20 6 Updated Mar 16, 2022

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,953 1,171 Updated Dec 19, 2025

Python codes for Lite Audio-Visual Speech Enhancement.

Python 93 15 Updated May 3, 2024

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch

Python 4 Updated Jul 13, 2020

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch

Python 460 68 Updated Feb 14, 2023

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

3 Updated Jul 2, 2020

Spectral Subtraction, Wiener Filtering, MMSE

MATLAB 4 1 Updated Nov 29, 2019

Spectral Subtraction, Wiener Filtering, MMSE

MATLAB 127 39 Updated Nov 29, 2019
Python 42 20 Updated Oct 30, 2019

The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.

Python 3,674 650 Updated May 14, 2022

The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.

Python 3,082 544 Updated Feb 2, 2024
8 Updated Aug 16, 2020

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MATLAB 812 153 Updated Dec 1, 2020

A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".

Python 179 40 Updated Aug 5, 2020

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Python 300 57 Updated Jun 15, 2021

A must-read paper for speech separation based on neural networks

TypeScript 878 140 Updated Aug 11, 2025

A multi-GPU PyTorch implementation of Speech Transformer, an end-to-end Transformer-based ASR model for Mandarin Chinese. This code follows Kaituo Xu's work.

Python 10 5 Updated Dec 25, 2019

Deep learning based speech enhancement and dereverberation

Python 100 33 Updated Jan 30, 2024

A PyTorch implementation of Conv-TasNet

Python 46 11 Updated Nov 25, 2019