kooBH

🎓

KooBH kooBH

🎓

Working on audio in C++, python, DSP and DNN

39 followers · 18 following

IIP, Sogang University

Organizations

Stars

nttcslab / dcase2026_task4_baseline

dcase2026_task4_baseline

Python 4 2 Updated Apr 16, 2026

mdeff / fma

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,609 452 Updated Jan 5, 2023

johnma2006 / mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,946 223 Updated Mar 8, 2024

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 12,363 1,398 Updated May 20, 2026

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 11,461 1,493 Updated Mar 17, 2026

aask1357 / fastenhancer

Speed-optimized streaming neural speech enhancement network

Python 114 31 Updated Apr 9, 2026

seongq / AGI_HER_MER

Python 30 11 Updated Dec 19, 2025

CHiME9-ECHI / CHiME9-ECHI

Baseline and Evaluation Framework for CHiME-9 ECHI Challenge

Python 14 8 Updated Nov 11, 2025

dmlguq456 / TF_Restormer

Official repository of TF-Restormer for speech restoration

Python 14 Updated May 14, 2026

sarulab-speech / UTMOSv2

UTokyo-SaruLab MOS Prediction System

Python 318 34 Updated Apr 2, 2026

cycfi / q

C++ Library for Audio Digital Signal Processing

C++ 1,384 177 Updated May 6, 2026

ITCoders / Human-detection-and-Tracking

Human-detection-and-Tracking

Python 873 301 Updated Dec 30, 2022

google-ai-edge / gallery

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 23,140 2,360 Updated May 19, 2026

serengil / retinaface

RetinaFace: Deep Face Detection Library for Python

Python 1,977 195 Updated May 13, 2026

BUTSpeechFIT / AMI-diarization-setup

55 30 Updated Oct 17, 2023

ncsoft / PhonMatchNet

Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)

Python 62 9 Updated Jun 3, 2024

urgent-challenge / urgent2024_challenge

Official data preparation scripts for the URGENT 2024 Challenge

Python 89 7 Updated May 21, 2025

state-spaces / mamba

Mamba SSM architecture

Python 18,275 1,734 Updated May 10, 2026

RoyChao19477 / SEMamba

This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)

Python 262 30 Updated Dec 12, 2025

vb000 / SemanticHearing

Real-time binaural target sound extraction model.

Python 99 20 Updated Mar 28, 2024

dmlguq456 / PIT_CSS

dual-path multi-channel network for speech separation

Python 6 Updated Jan 15, 2024

AlbertoAncilotto / NeSsi

Keras/Pytorch neural network size, operations and parameters counter

Python 16 3 Updated Mar 23, 2023

CPJKU / dcase2024_task1_baseline

Python 10 4 Updated Jun 6, 2024

google / XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

C 2,343 493 Updated May 20, 2026

FFTW / fftw3

DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)

C 3,069 704 Updated May 16, 2026

microsoft / DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,416 455 Updated Jul 25, 2024

openspeech-team / openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 718 115 Updated Oct 23, 2023

sooftware / kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 637 191 Updated May 27, 2023

JannesP / AudioMirror

An audio driver for Windows 10 (only tested on x64) that works as a virtual audio cable.

C++ 249 57 Updated Oct 15, 2022

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 66,779 6,715 Updated Jan 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly