Local voice-to-text engine for Windows | Whisper + optional LLM cleanup, real-time DSP, C++ UI. Press a key, talk, it types. No cloud, no API keys.
Updated Mar 6, 2026 - Python
A database of challenging voice utterances collected by the Biometrics Vision and Computing (BVC) group.
End-to-end pipeline for training a custom keyword detection model with TensorFlow and TFLite export.
Config files for my GitHub profile.
DΞCIBΞLION is an audio intelligence module forged in the labs of OBINexus, where noise meets logic and shouting is a feature, not a bug. It mathematically analyzes human vocal input to determine emotional projection through log-scaled loudness evaluation, using a sacred constant: 85 dB.
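The log-scaled loudness check described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names and the `calibration_offset_db` parameter are hypothetical, and it assumes float samples in the range [-1, 1]. Only the 85 dB constant comes from the description. Mapping digital levels to real-world dB SPL needs a microphone-specific calibration, which is represented here by a plain offset.

```python
import math

THRESHOLD_DB = 85.0  # the constant named in the description


def rms(samples):
    """Root-mean-square amplitude of a non-empty sequence of float samples."""
    if not samples:
        raise ValueError("samples must not be empty")
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def level_db(samples, calibration_offset_db=0.0, floor=1e-12):
    """Log-scaled level: 20*log10(rms) plus a hypothetical calibration offset.

    For samples in [-1, 1], 20*log10(rms) is the level relative to full scale.
    The offset (an assumption) shifts that onto a calibrated dB scale.
    The floor avoids log10(0) on pure silence.
    """
    return 20.0 * math.log10(max(rms(samples), floor)) + calibration_offset_db


def is_shouting(samples, calibration_offset_db=0.0):
    """True when the calibrated level reaches the 85 dB threshold."""
    return level_db(samples, calibration_offset_db) >= THRESHOLD_DB
```

With an assumed offset of 94 dB, a constant signal at half of full scale sits near 88 dB and counts as shouting, while one at 0.5% of full scale sits near 48 dB and does not.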
🎤 MotionVox - Python-based security camera with intelligent motion and voice detection, automatic recording triggers, and smart alert system
Voice detection, wake words and voice commands on the ESP32-S3 microcontroller.
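A sidekick AI that playfully calls out dad jokes (oyaji gags), Showa-era talk, and floods of emoji. Not correction, but coexistence. A tool for people who are aware of their own slide into "oyaji" (middle-aged man) habits.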
Deep learning model to detect real vs fake (deepfake) voices using CNN and MFCC features.
This is a full-stack Deepfake Voice Detection System built with Python's Flask framework. It employs an ensemble of machine learning models—including XGBoost, Random Forest, and SVM—to classify audio files as either genuine human speech or AI-generated synthetic voices.
Spoofing voice detection: 2nd YAICON
The Poetry Pronunciation Learning App is an interactive AI-powered tool that helps users practice and improve their pronunciation of poems. It uses real-time speech recognition, voice activity detection, and fuzzy word matching to provide instant feedback on spoken verses.
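As an illustration of the fuzzy word matching mentioned above, here is a minimal sketch using Python's standard `difflib`. It is not the app's implementation: the function names, the 0.8 threshold, and the simple position-by-position word alignment are all assumptions made for the example.

```python
from difflib import SequenceMatcher


def word_similarity(a, b):
    """Case-insensitive similarity in [0, 1] between two words."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def score_verse(expected, spoken, threshold=0.8):
    """Compare a recognized transcript to the expected verse, word by word.

    Returns a list of (expected_word, heard_word, ok) tuples. Words are
    aligned by position only (a simplification), and a missing word is
    treated as an empty string.
    """
    expected_words = expected.split()
    heard_words = spoken.split()
    results = []
    for i, word in enumerate(expected_words):
        heard = heard_words[i] if i < len(heard_words) else ""
        ok = word_similarity(word, heard) >= threshold
        results.append((word, heard, ok))
    return results
```

A near miss such as "compair" for "compare" passes the assumed threshold, while "tree" for "thee" does not. A real application would likely need proper sequence alignment so that one skipped word does not shift every later comparison.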
REST API for detecting AI-generated vs human voices across 5 Indian languages (Tamil, English, Hindi, Malayalam, Telugu) using audio analysis and machine learning
Uses a simple convolutional neural network to classify voice recordings by whether they contain a wake word.
A Python project that handles speech commands and retrieves results from Google or Wikipedia based on the spoken input. Functions are organized in separate files, with a single raw file to execute the project. This repository is intended for project purposes and will be updated with additional features in the future.
TranscribeTube is a Python tool that transcribes and generates subtitles for videos from local files or YouTube links using Hugging Face models. It features an interactive Gradio web interface, allowing users to easily upload videos, select languages, and download subtitles in SRT format.
AI system for detecting AI-generated voice clones using speech embeddings and acoustic signal analysis.
Kotlin Music App (PN) is a compact reference implementation demonstrating standard patterns for an Android music player: background playback, MediaSession integration, playlist and library management, and common authentication screens.
Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.