Stars
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Speed-optimized streaming neural speech enhancement network
Baseline and Evaluation Framework for CHiME-9 ECHI Challenge
Official repository of TF-Restormer for speech restoration
Human-detection-and-Tracking
A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.
RetinaFace: Deep Face Detection Library for Python
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
Official data preparation scripts for the URGENT 2024 Challenge
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
Real-time binaural target sound extraction model.
dual-path multi-channel network for speech separation
Keras/Pytorch neural network size, operations and parameters counter
High-efficiency floating-point neural network inference operators for mobile, server, and Web
DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
An audio driver for Windows 10 (only tested on x64) that works as a virtual audio cable.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…