vad

Here are 92 public repositories matching this topic...

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Mar 17, 2026
Python

snakers4 / silero-vad

Star

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Mar 26, 2026
Python

smacke / ffsubsync

Sponsor

Star

Automagically synchronize subtitles with video.

Updated Nov 25, 2025
Python

CheshireCC / faster-whisper-GUI

Star

faster_whisper GUI with PySide6

openai vad whisper asr transcribe voice-transcription faster-whisper whisperx

Updated Dec 8, 2024
Python

amsehili / auditok

Star

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Apr 7, 2026
Python

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated May 5, 2025
Python

FireRedTeam / FireRedASR2S

Star

A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.

open-source speech-recognition vad automatic-speech-recognition asr lid language-identification sota voice-activity-detection asr-pipeline punctuation-restoration audio-event-classification llm punctuation-prediction industrial-grade multimodal-llm speechllm audio-event-detection

Updated Mar 24, 2026
Python

filippogiruzzi / voice_activity_detection

Star

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Mar 24, 2023
Python

EtienneAb3d / WhisperHallu

Star

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

text-to-speech sound-processing vad whisper audio-processing asr noise-removal vocals

Updated Nov 12, 2024
Python

FireRedTeam / FireRedVAD

Star

A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD

vad voice-activity-detection aed sound-event-detection audio-event-classification audio-event-detection

Updated Apr 4, 2026
Python

Picovoice / cobra

Star

On-device voice activity detection (VAD) powered by deep learning

speech-recognition vad voice-activity-detection on-device voice-activity voice-activity-detector

Updated Mar 26, 2026
Python

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

Star

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and natural interruption handling.

tts vad audio-processing asr voice-assistant conversational-ai speech-to-speech ollama kokoro-tts

Updated Nov 24, 2025
Python

voithru / voice-activity-detection

Star

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

vad voice-activity-detection

Updated Oct 26, 2021
Python

lef-fan / aria

Star

A local and uncensored AI entity.

python bot text-to-speech ai deep-learning speech pytorch tts assistant vad speech-to-text voice-assistant large-language-models llm xttsv2 localllama llamacpp-python kokoro-tts

Updated Aug 1, 2025
Python

fjchange / object_centric_VAD

Star

An Tensorflow Re-Implement of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"

vad anomaly cvpr2019

Updated May 6, 2022
Python

NickWilkinson37 / voxseg

Star

A python library for voice activity detection (VAD) for speech/non-speech segmentation.

python python-library speech vad speech-processing voice-activity-detection speech-segmentation

Updated Sep 7, 2022
Python

xucailiang / cascade

Star

Cascade is a production-ready, high-performance, and low-latency audio stream processing library designed for Voice Activity Detection (VAD). Built upon the excellent Silero VAD model, Cascade significantly reduces VAD processing latency while maintaining high accuracy through its 1:1:1 binding architecture and asynchronous streaming technology.

audio python streaming high-performance numpy vad async-await torchaudio onnxruntime

Updated Dec 22, 2025
Python

mounalab / LSTM-RNN-VAD

Star

Voice Activity Detection LSTM-RNN learning model

tensorflow lstm rnn vad rnn-tensorflow nlp-machine-learning lstm-neural-network

Updated Apr 17, 2018
Python

sooftware / End-to-End-Speech-Recognition-Models

Sponsor

Star

PyTorch implementation of automatic speech recognition models.

end-to-end pytorch transformer las vad e2e asr acoustic-model voice-activity-detection deepspeech2 listen-attend-and-spell

Updated Jan 10, 2021
Python

videosdk-live / NAMO-Turn-Detector-v1

Star

High-performance, semantic turn detection for conversational AI

vad voice-activity-detection turn-detection turn-detector

Updated Oct 1, 2025
Python

Improve this page

Add a description, image, and links to the vad topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vad topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vad

Here are 92 public repositories matching this topic...

modelscope / FunASR

snakers4 / silero-vad

smacke / ffsubsync

CheshireCC / faster-whisper-GUI

amsehili / auditok

DmitryRyumin / ICASSP-2023-24-Papers

FireRedTeam / FireRedASR2S

filippogiruzzi / voice_activity_detection

EtienneAb3d / WhisperHallu

FireRedTeam / FireRedVAD

Picovoice / cobra

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

voithru / voice-activity-detection

lef-fan / aria

fjchange / object_centric_VAD

NickWilkinson37 / voxseg

xucailiang / cascade

mounalab / LSTM-RNN-VAD

sooftware / End-to-End-Speech-Recognition-Models

videosdk-live / NAMO-Turn-Detector-v1

Improve this page

Add this topic to your repo