Replace pyannote with Silero VAD for voice activity detection #12

mogwai · 2025-11-10T12:57:32Z

This change simplifies the VAD implementation by replacing pyannote.audio
with Silero VAD, which offers several benefits:

No HuggingFace authentication required
Lighter weight and fewer dependencies
Simpler API and easier to use
Maintained function signature compatibility

Changes:

Rewrote src/vui/vad.py to use Silero VAD instead of pyannote
Removed pyannote.audio dependency from pyproject.toml
Updated readme.md to remove pyannote authentication instructions
Added merge_segments helper function for post-processing

This change simplifies the VAD implementation by replacing pyannote.audio with Silero VAD, which offers several benefits: - No HuggingFace authentication required - Lighter weight and fewer dependencies - Simpler API and easier to use - Maintained function signature compatibility Changes: - Rewrote src/vui/vad.py to use Silero VAD instead of pyannote - Removed pyannote.audio dependency from pyproject.toml - Updated readme.md to remove pyannote authentication instructions - Added merge_segments helper function for post-processing

Improvements: - Fixed double loading of Silero VAD model - Store both model and utils in pipeline for efficiency - Added comprehensive test suite for validation - Improved code documentation Test files added: - test_vad.py: Full integration test with synthetic audio - test_vad_code_review.py: Code structure validation - test_vad_simple.py: Module structure test

claude added 4 commits November 10, 2025 12:54

Add Silero VAD attribution and documentation to readme

d545fd4

Remove authentication mention from readme

5ff2397

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Replace pyannote with Silero VAD for voice activity detection #12

Replace pyannote with Silero VAD for voice activity detection #12

Uh oh!

mogwai commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Replace pyannote with Silero VAD for voice activity detection #12

Are you sure you want to change the base?

Replace pyannote with Silero VAD for voice activity detection #12

Uh oh!

Conversation

mogwai commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants