Build software better, together

ksasso1028 / audio-reverb-removal

Code to train a custom time-domain autoencoder to dereverb audio

audio dsp pytorch autoencoder convolutional-neural-networks time-domain denoising-autoencoders denoising multi-task-learning dereverberation autoencoder-neural-network demucs audio-denoising audio-machine-learning audio-ml audio-ai convtasnet

Updated Nov 30, 2023
Python

zhangzijie-pro / Speaker-Verification

Star

Dual-model speech AI toolkit for speaker verification and speaker-aware diarization, with streaming inference, meeting analysis, long-audio monitoring, and speaker-bank integration.

pytorch speaker-recognition speaker-verification speaker-diarization voice-ai open-set-identification audio-ml streaming-inference meeting-analysis

Updated Apr 23, 2026
Python

shiroonigami23-ui / 11-785-dual-teacher-distillation

Star

Unified dual-teacher distillation (ReDimNet + ASSIST) into Wav2Vec2 for speaker verification and deepfake detection

pytorch colab academic-project speaker-verification speech-processing knowledge-distillation anti-spoofing deepfake-detection wav2vec2 audio-ml

Updated Apr 24, 2026
Jupyter Notebook

ripunjay-kashyap / audio-sonic-mcp

Star

Local MCP server + CLI turning YouTube & local audio into rich sonic signatures. Extracts BPM, section-by-section key, vocal presence, transient punch, and 512-dim CLAP vibe embeddings. Powered by Demucs stem separation & librosa. 100% private, offline-first, and GPU-accelerated with graceful CPU/HPSS degradation.

machine-learning deep-learning mcp transformers music-information-retrieval librosa clap audio-processing source-seperation demucs audio-ml llm-tools stem-separation

Updated May 21, 2026
Python

alexjsmac / switch-jockey

Star

An intelligent, automated video switcher for live performances

webgl shaders glsl audio-ml

Updated Aug 13, 2020
JavaScript

Stavion-Colquitt / Audio_ML_Noise_Reduction

Star

Real-time speech enhancement pipeline — custom-trained U-Net denoising model, ONNX inference, Overlap-Add synthesis, and virtual audio routing for Teams, Zoom, and DAW use. CPU-only, no cloud dependency.

python deep-learning signal-processing pytorch dns-challenge u-net speech-enhancement onnx noise-suppression u-net-pytorch virtual-audio virtual-audio-cable real-time-audio-signal-processing audio-ml

Updated Apr 2, 2026
Python

catherinepereira / dinscribe

Star

machine-learning vad transcription whisper audio-processing denoising audio-ml whisper-ai

Updated Jun 9, 2026
Python

thavasix-gr8 / engine-identification-acoustic-ml

Star

Engine identification using acoustic signal analysis and machine learning to classify 8 vehicle types. Audio signals are processed using FFT and feature extraction, and a multi-class model predicts vehicle categories based on their unique sound patterns.

python flask machine-learning signal-processing feature-extraction audio-classification librosa svm-classifier fft-analysis scikit-learn-python vehicle-classification acoustic-analysis audio-ml

Updated Mar 29, 2026
Python

AbijahKaj / audio-ml

Star

Audio analysis in javascript/typescript

audio-analysis audio-processing audio-js audio-ml

Updated Jun 10, 2026
TypeScript

joshleh / nano-kws

Star

Edge-deployable keyword spotter: INT8-quantized DS-CNN on Google Speech Commands, exported to ONNX, with fp32 vs INT8 benchmarks, a live mic demo, and a C++ inference harness.

deep-learning cpp pytorch speech-recognition quantization keyword-spotting wake-word-detection onnx speech-commands edge-ai on-device-ml onnxruntime streamlit tinyml post-training-quantization edge-ml int8-quantization audio-ml ds-cnn

Updated May 6, 2026
Python

catherinepereira / dinnote

Star

machine-learning vad transcription whisper audio-processing denoising diarization audio-ml whisper-ai

Updated Jun 9, 2026
Python

kershrita / Music-Genre-Classification

Star

Machine learning system for music genre classification using feature engineering, stratified evaluation, SVC/XGBoost modeling, and reproducible prediction export.

python data-science machine-learning scikit-learn xgboost classification data-preprocessing feature-engineering music-genre-classification model-evaluation audio-ml music-analytics

Updated Apr 10, 2026
Jupyter Notebook

benny-conn / solo-trace

Star

Automated audio/video ML pipeline for detecting and transcribing jazz solos from live recordings. Runs nightly against Smalls Jazz Club archives: uses CLAP (instrument detection), Demucs (source separation), CLIP (performer identification), and basic-pitch (MIDI transcription). Results served via REST API.

python golang machine-learning computer-vision midi pytorch jazz demucs audio-ml

Updated Mar 16, 2026
Python

Devanik21 / Advanced-AI-voice

Star

Neural TTS and voice-cloning application using XTTS/VITS. Supports 3–30 s reference audio for speaker adaptation, real-time pitch/speed control, and WAV/MP3 export.

natural-language-processing text-to-speech deep-learning speech-synthesis neural-networks conversational-ai neural-tts voice-ai generative-ai audio-ml

Updated Mar 15, 2026
Python

Devanik21 / AI-audio-overview

Star

AI-generated audio summarisation pipeline — Whisper transcription, LLM key-insight extraction, and structured spoken summaries with TTS playback and Streamlit interface.

nlp deep-learning whisper audio-to-text large-language-models generative-ai audio-ml multimodal-ai content-summarization podcast-summarization

Updated Mar 15, 2026
Python

Devanik21 / audio-file-error-handling-using-gpt-4

Star

Audio file processing pipeline with GPT-4-powered error diagnosis — detects codec issues, sample rate mismatches, and corruption artefacts with automated remediation suggestions.

python deep-learning error-handling neural-networks audio-processing gpt-4 large-language-models generative-ai audio-ml robust-pipeline

Updated Mar 15, 2026
Python

catherinepereira / f1-2025-radio-transcriptions

Star

formula1 dataset transcription f1 audio-processing audio-ml

Updated Apr 28, 2026

Devanik21 / MusicVAE

Star

Key Features: Simple VAE architecture with encoder/decoder Synthetic music data generation for training Interactive training with progress tracking Music generation from latent space sampling Audio conversion and playback Downloadable audio files

deep-learning magenta music-generation variational-autoencoder hierarchical-lstm music-ai latent-space-interpolation generative-ai audio-ml sequential-generation

Updated May 8, 2026
Python

dexxdean / htdemucs-coreml

Star

Convert Meta's HTDemucs (Hybrid Transformer Demucs) to Apple Core ML. Real-valued STFT/ISTFT wrapper, manual MHA decomposition, pre-computed overlap-add. Includes Swift example.

audio macos swift ios pytorch audio-processing music-source-separation coreml coremltools apple-silicon demucs audio-ml stem-separation mlpackage htdemucs

Updated Apr 26, 2026
Python

Devanik21 / HarmoniaX

Star

Music harmony AI — chord progression analysis with Roman numeral labelling, voice leading checker, style-conditioned progression generation (Baroque/Jazz/Pop), and MIDI export via music21.

deep-learning neural-networks music-generation sound-synthesis creative-ai music-ai large-language-models generative-ai neural-audio audio-ml

Updated Mar 15, 2026
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

audio-ml

Here are 24 public repositories matching this topic...

ksasso1028 / audio-reverb-removal

zhangzijie-pro / Speaker-Verification

shiroonigami23-ui / 11-785-dual-teacher-distillation

ripunjay-kashyap / audio-sonic-mcp

alexjsmac / switch-jockey

Stavion-Colquitt / Audio_ML_Noise_Reduction

catherinepereira / dinscribe

thavasix-gr8 / engine-identification-acoustic-ml

AbijahKaj / audio-ml

joshleh / nano-kws

catherinepereira / dinnote

kershrita / Music-Genre-Classification

benny-conn / solo-trace

Devanik21 / Advanced-AI-voice

Devanik21 / AI-audio-overview

Devanik21 / audio-file-error-handling-using-gpt-4

catherinepereira / f1-2025-radio-transcriptions

Devanik21 / MusicVAE

dexxdean / htdemucs-coreml

Devanik21 / HarmoniaX

Improve this page

Add this topic to your repo