alexandergwm

Gavin alexandergwm

Acoustic

15 followers · 100 following

Acoustic and Speech
Shenzhen
10:32 (UTC +08:00)

Achievements

Stars

xzf-thu / Audio-Interaction

Python 382 18 Updated Jun 4, 2026

hugohe3 / ppt-master

AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images …

Python 27,015 2,407 Updated Jun 10, 2026

yuzhouhe2000 / OMLSA-IMCRA

Python implementation of OMLSA+IMCRA algorithm for speech enhancement.

Python 70 21 Updated Jun 29, 2021

Yaxin9Luo / FigMirror

Forked from VILA-Lab/FigMirror

An Automated AI Agent Tool for Plotting Your Data in Any Paper's Figure Style.

Python 1 Updated May 24, 2026

CaiBailin / MCRA-python

MCRA+OMLSA python version

Python 10 1 Updated Jul 11, 2024

ZFTurbo / asr_consilium

A repository for Automatic Speech Recognition (ASR) that ensembles multiple open-source models to achieve SOTA quality of recognition. Useful if you need to get the maximum quality of recognition d…

Python 15 1 Updated May 20, 2026

zerong7777-boop / gtcrn-light

Operator-level compressed GTCRN with ERB-CRM pipeline preserved and DPGRNN intact, ready for edge deployment.

Python 22 8 Updated Feb 11, 2026

Xiaobin-Rong / SEtrain

A training code template for DNN-based speech enhancement.

Python 199 46 Updated Sep 4, 2025

LeventureQys / AudioProcesser

This project focuses on audio processing and filter simulation research. It uses Python for simulation experiments and C++ for engineering implementation, covering extensive machine learning practi…

Jupyter Notebook 13 5 Updated May 21, 2026

Egonex-AI / Understand-Anything

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 58,158 4,842 Updated Jun 11, 2026

PandoraLS / traditional-speech-enhancement

Traditional Speech Enhancement Methods

MATLAB 148 34 Updated Sep 28, 2025

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 1,216 219 Updated May 8, 2026

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,206 333 Updated Sep 10, 2025

jingyaogong / minimind

🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!

Python 51,676 6,640 Updated Jun 1, 2026

Acceleration123 / Sound-Source-Localization

Python 4 1 Updated Jun 7, 2025

jingyaogong / minimind-o

🎙️ 「大模型」从0训练0.1B能听能说能看的全模态Omni模型！A 0.1B Omni model trained from scratch, capable of listening, speaking, and seeing!

Python 1,837 218 Updated Jun 8, 2026

cszheng-ioa / Sixty-years-of-frequency-domain-monaural-speech-enhancement

Python 160 30 Updated Jan 30, 2024

Ryuk17 / noise-xorcist

Single Channel Speech Enhancement Methods and Toolbox

Python 54 14 Updated Apr 8, 2026

WenzheLiu-Speech / awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

1,240 224 Updated Nov 14, 2023

Wang-Boxiang / PD-SFANC

2 Updated Apr 25, 2026

swagshaw / Awesome-Speech-and-Audio-Continual-Learning

A Survey of Continual Learning for Speech and Audio Models

5 Updated May 26, 2026

GuitarsAI / AudioCodingTutorials

Audio Coding Notebooks and Tutorials

Jupyter Notebook 89 14 Updated Dec 16, 2020

schmiph2 / pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Python 455 93 Updated Feb 15, 2025

iranroman / DCASE2026_Task3_SAISELD_baseline

Models for DCASE 2026 Semantic Acoustic Imaging for Sound Event Localization and Detection from Spatial Audio and Audiovisual Scenes

Python 14 1 Updated May 28, 2026

thomeou / SALSA-Lite

This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.

15 4 Updated Dec 3, 2021

Ryuk17 / SpeechAlgorithms

You can find the speech algorithms you want here

C 865 262 Updated Jan 25, 2026

DragonLiu1995 / xRIR_code

[CVPR 2025] Pytorch implementation of the paper "Hearing Anywhere in Any Environment"

Python 33 1 Updated Sep 18, 2025

SonyResearch / OpenVocabularySELD

[TASLP] Open-Vocabulary Sound Event Localization and Detection with Joint Learning of CLAP Embedding and Activity-Coupled Cartesian DOA Vector

Python 9 2 Updated Mar 25, 2026

FireRedTeam / FireRedVAD

A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD

Python 423 28 Updated May 6, 2026

kiiril / urban-sound-classification

Python 1 Updated Aug 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gavin alexandergwm

Achievements

Achievements

Block or report alexandergwm

Stars

xzf-thu / Audio-Interaction

hugohe3 / ppt-master

yuzhouhe2000 / OMLSA-IMCRA

Yaxin9Luo / FigMirror

CaiBailin / MCRA-python

ZFTurbo / asr_consilium

zerong7777-boop / gtcrn-light

Xiaobin-Rong / SEtrain

LeventureQys / AudioProcesser

Egonex-AI / Understand-Anything

PandoraLS / traditional-speech-enhancement

yeyupiaoling / Whisper-Finetune

lifeiteng / vall-e

jingyaogong / minimind

Acceleration123 / Sound-Source-Localization

jingyaogong / minimind-o

cszheng-ioa / Sixty-years-of-frequency-domain-monaural-speech-enhancement

Ryuk17 / noise-xorcist

WenzheLiu-Speech / awesome-speech-enhancement

Wang-Boxiang / PD-SFANC

swagshaw / Awesome-Speech-and-Audio-Continual-Learning

GuitarsAI / AudioCodingTutorials

schmiph2 / pysepm

iranroman / DCASE2026_Task3_SAISELD_baseline

thomeou / SALSA-Lite

Ryuk17 / SpeechAlgorithms

DragonLiu1995 / xRIR_code

SonyResearch / OpenVocabularySELD

FireRedTeam / FireRedVAD

kiiril / urban-sound-classification