Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
-
Updated
Oct 19, 2024 - HTML
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Rhasspy voice assistant for offline home automation
This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @udacity.
Real-time transcription using faster-whisper
Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
VietGPT VoiceBot: Chatbot automatically recognizes Vietnamese voice and uses the ChatGPT API for natural language interaction.
This App allows users to convert their speech into text and send that text as a message. It records blobs in realtime! After every 10 seconds recorded blob is sent to server and there it is converted into text and send as a message to other user.
A MATLAB implementation of CHiME4 baseline Beamformit
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
Pytorch implementation of subband decomposition
Speakify is a web application that uses Edge TTS to convert text to speech using a variety of voices.
🔊 A fully basic voice synthesizer in vanillaJS
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).
webpage for maintaining the list of openly available DL, ML, RL, Vision, NLP, Optimization courses
A mobile web application that helps you convert spoken words to sharable/editable text 🎊
Generative AI Therapist built using Google-Cloud's Speech-To-Text Recognition and OpenAI's LLM. Deployed using Flask for server-side script rendering (Python).
Build IVR, run voice campaign, with machine detection, speech recognition and much more. Integrated support for twilio api. Asterisk integration planned.
Usar webkitSpeechRecognition para convertir voz a texto en la web con JavaScript
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."