Speech to Text to Speech, sends text as OSC messages
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Robust Speech Recognition via Large-Scale Weak Supervision
Build your own AI friend
Speech-to-text, text-to-speech, and speaker recognition
Mice speech to text with MX Cinnamon OS ISO
In-App assistant SDK to build a multimodal conversational UX websites
Captcha solver extension for humans
A free, open source, and extensible speech-to-text application
Speech recognition module for Python
The behavior guidance framework for customer-facing LLM agents
Toolkit for conversational AI
Repo of Qwen2-Audio chat & pretrained large audio language model
Subtitle Creation Assistant
TEN, a voice agent framework to create conversational AI.
Conversational voice AI agents
C++ library for high performance inference on NVIDIA GPUs
Go efficient multilingual NLP and text segmentation
HTML5 js recording mp3 wav ogg webm amr format
elevenlabs-api is an open source Java wrapper around the ElevenLabs
In-App assistant SDK to build a multimodal conversational UX for iOS
Assistant SDK to build a multimodal conversational UX for Android
Industrial-strength Natural Language Processing (NLP)
Build voice-based LLM agents. Modular + open source
Stanford NLP Python library for many human languages