Highlights
Lists (1)
Sort Name ascending (A-Z)
Stars
Code for the paper βAutomatic Music Sample Identification with Multi-Track Contrastive Learningβ.
Official Repository of Smule Renaissance, Smule's Vocal Restoration Models
kyutai-labs / nanoGPTaudio
Forked from karpathy/nanoGPTCode for the blog "Neural audio codecs: how to get audio into LLMs"
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
Encode and decode audio samples to/from continuous and discrete compressed representations!
Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
Personalized Image Generation with Large Multimodal Models
Explore and interpret large embeddings in your browser with interactive visualization! π
Symbolic Music NLP Artificial Intelligence Toolkit
Community-maintained faiss wheel builder
ππ Efficient implementations of Native Sparse Attention
Chordonomicon: A Dataset of 666,000 Chord Progressions
Efficient Training of Audio Transformers with Patchout
Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMIR25!)
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
Explorations into NEAT and some of its derivative research
Dual Diffusion is a generative diffusion model for music trained on video game soundtracks.
Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training