-
ByteMind
- Germany
- http://bytemind.de
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
Free and Open Source, Distributed, RESTful Search Engine
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Solidity, the Smart Contract Programming Language
Faster Whisper transcription with CTranslate2
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
kaldi-asr/kaldi is the official location of the Kaldi project.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A simple expressive web framework for java. Spark has a kotlin DSL https://github.com/perwendel/spark-kotlin
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
On-device wake word detection powered by deep learning
🎨 Pickr - A simple, multi-themed, responsive and hackable Color-Picker library. No dependencies, no jQuery. Compatible with all CSS Frameworks e.g. Bootstrap, Materialize. Supports alpha channel, r…
Cordova Local-Notification Plugin
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
Phaser CE is a fun, free and fast 2D game framework for making HTML5 games for desktop and mobile web browsers, supporting Canvas and WebGL rendering.
The binary distribution of openHAB
simple socket.io server for webrtc signaling
A fast local neural text to speech engine for Mycroft
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries