a897456

a897456

Achievements

Stars

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,175 2,602 Updated Mar 3, 2026

Chen-Shaowen / QHARMA-GAN

Jupyter Notebook 3 Updated Sep 23, 2025

yxlu-0102 / AP-BWE

Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction

Python 184 18 Updated Apr 15, 2025

flageval-baai / Chinese-LiPS

9 2 Updated Apr 22, 2025

Ereboas / MagiCodec

A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.

Python 114 7 Updated Jun 4, 2025

Rongjiehuang / FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Python 422 59 Updated Jun 20, 2024

Rongjiehuang / ProDiff

PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline

Python 432 51 Updated Apr 19, 2023

kan-bayashi / ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,639 352 Updated Apr 22, 2024

oucailab / OUC-LaTex-master

中国海洋大学硕士博士学位论文 LaTeX 模板（2025版）

TeX 84 11 Updated Feb 25, 2025

facebookresearch / FlowDec

An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.

Python 200 19 Updated Jul 14, 2025

bigpon / ComplexDec_demo

A demo page of ComplexDec

CSS 1 Updated Feb 5, 2025

yaoxunji / gen-se

GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling

Python 168 22 Updated Feb 28, 2025

XiaoMi / dasheng

Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Python 188 14 Updated Nov 7, 2025

sp-uhh / ears_benchmark

Generation scripts for EARS-WHAM and EARS-Reverb

Python 44 7 Updated Jul 4, 2025

yxlu-0102 / MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 478 73 Updated May 19, 2025

schmiph2 / pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Python 452 93 Updated Feb 15, 2025

neillu23 / CDiffuSE

Conditional Diffusion Probabilistic Model for Speech Enhancement

Python 250 35 Updated Dec 20, 2022

geetkhatri / speech-enhancement-psr

Speech enhancement using Wiener filtering and pitch-synchronous STFT phase reconstruction

MATLAB 3 3 Updated Sep 12, 2020

xixi219 / MOS

The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.

Jupyter Notebook 31 6 Updated Feb 16, 2024

hojonathanho / diffusion

Denoising Diffusion Probabilistic Models

Python 5,146 479 Updated Aug 29, 2023

sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 735 104 Updated Feb 1, 2026

lmnt-com / diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 886 132 Updated Mar 26, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,763 178 Updated Jan 26, 2026

chazo1994 / Amphion

Forked from open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 2 Updated Jun 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

a897456

Achievements

Achievements

Block or report a897456

Stars

facebookresearch / audiocraft

Chen-Shaowen / QHARMA-GAN

yxlu-0102 / AP-BWE

flageval-baai / Chinese-LiPS

Ereboas / MagiCodec

Rongjiehuang / FastDiff

Rongjiehuang / ProDiff

kan-bayashi / ParallelWaveGAN

oucailab / OUC-LaTex-master

facebookresearch / FlowDec

bigpon / ComplexDec_demo

yaoxunji / gen-se

XiaoMi / dasheng

sp-uhh / ears_benchmark

yxlu-0102 / MP-SENet

schmiph2 / pysepm

neillu23 / CDiffuSE

geetkhatri / speech-enhancement-psr

xixi219 / MOS

hojonathanho / diffusion

sp-uhh / sgmse

lmnt-com / diffwave

descriptinc / descript-audio-codec

chazo1994 / Amphion

SpeechResearch / speechresearch.github.io

tuanad121 / Python-WORLD

speechbrain / speechbrain

facebookresearch / AudioDec

HeCheng0625 / Amphion

open-mmlab / Amphion