-
Hanoi University of Science and Technology
Stars
A toolkit for speaker diarization.
Python package for combining diarization system outputs.
Conditional Diffusion Probabilistic Model for Speech Enhancement
Adaptive Flow-Matching for Target Speaker Extraction
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Awesome speech/audio LLMs, representation learning, and codec models
This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
s3prl / LibriMix
Forked from ftshijt/LibriMixAn open source dataset for source separation
A PyTorch implementation of End-to-End Neural Diarization
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Run TensorFlow on ESP32 chips without pain
PyTorch implementation of YOLO-v1 including training
Conformer: Convolution-augmented Transformer for Speech Recognition