  • IIP, Sogang University

Organizations

@IIP-Sogang @mpWAV


dcase2026_task4_baseline

Python 4 2 Updated Apr 16, 2026

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,609 452 Updated Jan 5, 2023

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,946 223 Updated Mar 8, 2024

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 12,363 1,398 Updated May 20, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 11,461 1,493 Updated Mar 17, 2026

Speed-optimized streaming neural speech enhancement network

Python 114 31 Updated Apr 9, 2026
Python 30 11 Updated Dec 19, 2025

Baseline and Evaluation Framework for CHiME-9 ECHI Challenge

Python 14 8 Updated Nov 11, 2025

Official repository of TF-Restormer for speech restoration

Python 14 Updated May 14, 2026

UTokyo-SaruLab MOS Prediction System

Python 318 34 Updated Apr 2, 2026

C++ Library for Audio Digital Signal Processing

C++ 1,384 177 Updated May 6, 2026

Human-detection-and-Tracking

Python 873 301 Updated Dec 30, 2022

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 23,140 2,360 Updated May 19, 2026

RetinaFace: Deep Face Detection Library for Python

Python 1,977 195 Updated May 13, 2026

Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)

Python 62 9 Updated Jun 3, 2024

Official data preparation scripts for the URGENT 2024 Challenge

Python 89 7 Updated May 21, 2025

Mamba SSM architecture

Python 18,275 1,734 Updated May 10, 2026

This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)

Python 262 30 Updated Dec 12, 2025

Real-time binaural target sound extraction model.

Python 99 20 Updated Mar 28, 2024

dual-path multi-channel network for speech separation

Python 6 Updated Jan 15, 2024

Keras/Pytorch neural network size, operations and parameters counter

Python 16 3 Updated Mar 23, 2023

High-efficiency floating-point neural network inference operators for mobile, server, and Web

C 2,343 493 Updated May 20, 2026

DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)

C 3,069 704 Updated May 16, 2026

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,416 455 Updated Jul 25, 2024

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 718 115 Updated Oct 23, 2023

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 637 191 Updated May 27, 2023

An audio driver for Windows 10 (only tested on x64) that works as a virtual audio cable.

C++ 249 57 Updated Oct 15, 2022

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 66,779 6,715 Updated Jan 22, 2026