-
ByteMind
- Germany
- http://bytemind.de
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Faster Whisper transcription with CTranslate2
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
On-device wake word detection powered by deep learning
A fast local neural text to speech engine for Mycroft
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
End to end text to speech system using gruut and onnx
Open tools and data for cloudless automatic speech recognition
SEPIA server to support open-source speech recognition via WebSocket connection.
Mycroft's Mark II Rpi mechanical, electrical and industrial designs
Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)
Experiments to test different speech recognition systems for SEPIA Framework
An Internet-Radio built around a Waveshare 7.9" Display
Evaluation of STT models for german language
Python wrapper for phonetisaurus grapheme to phoneme tool
Basic python tornado app for handling websocket audio
fquirin / kaldi-adapt-lm
Forked from gooofy/kaldi-adapt-lmCreate and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Image for the Mark II based on Raspberry Pi OS
Wake-word tools and implementations for S.E.P.I.A.
Installation, test and evaluation of Scribosermo speech-to-text engine
The Python-bridge-server connects other SEPIA components to the Python runtime.