Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
-
Updated
Apr 15, 2026 - Swift
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
A real-time Voice Activity Detection (VAD) library for iOS and macOS using Silero models powered by ONNX Runtime. Includes advanced noise suppression and audio preprocessing with WebRTC APM, supporting seamless WAV data output with header metadata.
Spokestack: give your iOS app a voice interface!
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Silenci — AI-powered silence removal & subtitle generator for Final Cut Pro. macOS native app, Qwen3-ASR + MLX, 100% local & free.
Swift library for Voice Activity Detection (VAD) using NVIDIA NeMo MarbleNet model converted to CoreML. Detect speech segments in real-time on iOS/macOS with high accuracy and low latency.
🎤 A voice interface for OpenClaw AI bots on Telegram — talk to your AI hands-free, like Siri
Modular Swift package for on-device voice activity detection on Apple platforms using CoreML and Silero VAD.
Add a description, image, and links to the vad topic page so that developers can more easily learn about it.
To associate your repository with the vad topic, visit your repo's landing page and select "manage topics."