Starred repositories
AFTER : Audio Features Transfer and Exploration in Real-time
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)
🌋LavaSR: Fast Speech restoration and enhancement
Neural network emulator for guitar amplifiers.
Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.
AVES: Animal Vocalization Encoder based on Self-Supervision
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
a list of demo websites for automatic music generation research
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
Official inference repo for FLUX.1 models
Simple implementation of TensorRT [I]RFFT[2] plugins.
Code for the paper "Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription"
Real-time audio to chords, lyrics, beat, and melody.
Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"
Export Hugging Face models to Core ML and TensorFlow Lite
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
A swift and unified toolkit for symbolic music processing
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The web framework for content-driven websites. ⭐️ Star to support our work!
Tiny AutoEncoder for Stable Diffusion (and other image models)
Machine Learning Engineering Open Book
A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation