Skip to content
View a897456's full-sized avatar

Block or report a897456

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,125 2,597 Updated Mar 3, 2026
Jupyter Notebook 3 Updated Sep 23, 2025

Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction

Python 180 18 Updated Apr 15, 2025

A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.

Python 114 7 Updated Jun 4, 2025

PyTorch Implementation of FastDiff (IJCAI'22)

Python 422 59 Updated Jun 20, 2024

PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline

Python 432 51 Updated Apr 19, 2023

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,639 353 Updated Apr 22, 2024

中国海洋大学硕士博士学位论文 LaTeX 模板(2025版)

TeX 84 11 Updated Feb 25, 2025

An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.

Python 198 19 Updated Jul 14, 2025

A demo page of ComplexDec

CSS 1 Updated Feb 5, 2025

GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling

Python 168 22 Updated Feb 28, 2025

Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Python 187 13 Updated Nov 7, 2025

Generation scripts for EARS-WHAM and EARS-Reverb

Python 44 7 Updated Jul 4, 2025

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 479 73 Updated May 19, 2025

Python implementation of performance metrics in Loizou's Speech Enhancement book

Python 451 93 Updated Feb 15, 2025

Conditional Diffusion Probabilistic Model for Speech Enhancement

Python 250 35 Updated Dec 20, 2022

Speech enhancement using Wiener filtering and pitch-synchronous STFT phase reconstruction

MATLAB 3 3 Updated Sep 12, 2020

The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.

Jupyter Notebook 31 6 Updated Feb 16, 2024

Denoising Diffusion Probabilistic Models

Python 5,125 475 Updated Aug 29, 2023

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 732 103 Updated Feb 1, 2026

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 889 132 Updated Mar 26, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,749 174 Updated Jan 26, 2026

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 2 Updated Jun 24, 2024
Python 154 34 Updated Dec 20, 2023

A PyTorch-based Speech Toolkit

Python 11,386 1,675 Updated Mar 30, 2026

An Open-source Streaming High-fidelity Neural Audio Codec

Python 502 29 Updated Mar 4, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 2 1 Updated Mar 1, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,728 799 Updated Mar 25, 2026
Next