Stars
Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯
Music analysis application: chord recognition, beat tracking, guitar diagrams, lyrics transcription, and LLM context-aware analysis of uploaded audio and YouTube videos
Automatic audio labelling with laion-clap
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems
A tool to generate guitar chord-melody arrangements from MusicXML lead sheets, using music21 and LilyPond.
A library to inspect and extract intermediate layers of PyTorch models.
A curated list of software, services, and resources to create and distribute music
Audio generation using diffusion models, in PyTorch.
A timeline of the latest AI models for audio generation, starting in 2023!
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
🔉 spafe: Simplified Python Audio Features Extraction
PyTorch: Channel-wise subband (CWS) input for better voice and accompaniment separation
Speech recognition module for Python, supporting several engines and APIs, online and offline.
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
A JavaScript library for rendering music notation and guitar tablature.
Gender recognition by voice and speech analysis
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
Enhanced version of Cockos' iPlug - A simple-to-use C++ framework for developing cross-platform audio plugins and targeting multiple plugin APIs with the same code. VST / VST3 / AudioUnit / RTAS / …
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications