🌍 Advance world modeling with LingBot-World, an open-source simulator designed for video generation and high-quality environment interaction.
🖌️ Generate artistic creations using prompts, reference images, and trajectories with WorldCanvas, a versatile platform for visual expression and exploration.
ZPE-Prosody V0.0: DETERMINISTIC SPEECH PROSODY CODEC: Intonation | Rhythm | Stress | Emotional Contour | Pitch Transport | Duration Encoding
An unofficial Python reimplementation of legacy-STRAIGHT, the classic STRAIGHT speech analysis and synthesis system.
Converts .mp3 audio to a .txt transcript with WhisperModel.
This repository hosts a deep-learning model for detecting steganographic information in compressed speech. Compared with RNN-SM, CSW, and SFFN, the method detects and captures hidden information in compressed speech more effectively and supports steganography-detection classification. (This was also my first SCI paper, written as an undergraduate before AI tooling was widespread; it is entirely hand-built, and suggestions are welcome.)
CLI-based speech analysis pipeline: transcribes audio/video with OpenAI Whisper, then runs NLP analyzers (speech rate, complexity, vocabulary, wordcloud) via spaCy. Exports metrics and charts as PDF or JSON.
Turn AI into anyone: data-driven persona generation from real social media. Give it a name and the AI becomes that person, automatically collecting real data to generate characters 100× more realistic than hand-written personas.
Speaker State Trajectory analysis — treats voice as a nonlinear dynamical system and drives research with a Karpathy-style autoresearch loop.
Speech analysis system for detecting pauses and stuttered speech patterns using MFCC, cosine similarity, and phoneme-based reconstruction with a Streamlit interface.
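The repository above compares frames with cosine similarity over MFCC vectors. A minimal sketch of that comparison step (not the repo's actual code; the example vectors and the flagging threshold are illustrative):

```python
import numpy as np

def frame_similarity(mfcc_a, mfcc_b):
    """Cosine similarity between two MFCC feature vectors.

    A low similarity between nearby frames can flag abrupt breaks or
    repetitions; any threshold would be tuned per corpus.
    """
    num = float(np.dot(mfcc_a, mfcc_b))
    den = float(np.linalg.norm(mfcc_a) * np.linalg.norm(mfcc_b)) + 1e-12
    return num / den

a = np.array([1.0, 2.0, 3.0])
s_same = frame_similarity(a, 2 * a)                    # collinear → ~1.0
s_diff = frame_similarity(a, np.array([3.0, -1.0, 0.0]))
```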
Open-source toolkit for social interaction research: extract 400+ multimodal features from conversation videos, then analyze synchrony, conversational states, and impression dynamics
Code for audio-based autism spectrum disorder (ASD) classification using Transformer models, machine learning baselines, and SHAP analysis.
PolyglotDB is a package for phonetic corpus storage and analysis
Listening Between the Lines: An explainable multimodal framework for MCI detection from spontaneous speech. Leverages Selective State Space Models (Mamba) and Gated Fusion to integrate linguistic disfluencies and eGeMAPS biomarkers across multi-corpus benchmarks (Pitt, ADReSS, TAUKADIAL)
Local Sanskrit recitation coach for Bhagavad Gita shlokas with audio-based pronunciation analysis, shloka detection, practice mode, and LLM feedback.
AI-powered communication coach that analyzes real-time speech signals to detect confidence drops, hesitation, and nervousness, providing data-driven feedback for interviews and public speaking.
Streamlit app for time-domain audio signal analysis — silence detection, voiced/unvoiced classification, F0 estimation via autocorrelation and AMDF, and weighted multi-feature speech/music discrimination.
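Autocorrelation-based F0 estimation, as named above, picks the lag of the strongest autocorrelation peak inside a pitch search range. A minimal sketch under assumed defaults (the 50–500 Hz range is illustrative, not taken from the app):

```python
import numpy as np

def estimate_f0_autocorr(frame, sr, fmin=50.0, fmax=500.0):
    """Estimate F0 of a voiced frame via the autocorrelation method."""
    frame = frame - frame.mean()
    # Full autocorrelation; keep only non-negative lags.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lo = int(sr / fmax)                      # smallest lag in range
    hi = min(int(sr / fmin), len(ac) - 1)    # largest lag in range
    if hi <= lo:
        return 0.0
    peak = lo + int(np.argmax(ac[lo:hi]))    # lag of strongest peak
    return sr / peak

# 200 Hz sine at 16 kHz: the peak should sit at lag sr/200 = 80 samples.
sr = 16000
t = np.arange(0, 0.04, 1 / sr)
f0 = estimate_f0_autocorr(np.sin(2 * np.pi * 200 * t), sr)
```

AMDF works the same way but takes the lag minimizing the average magnitude difference instead of maximizing correlation.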
Streamlit app for frequency-domain audio signal analysis — FFT, spectrogram, spectral features (centroid, bandwidth, SFM, SCF), formant detection, and F0 estimation via cepstrum.
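Of the spectral features listed, the centroid is the magnitude-weighted mean frequency of a frame's spectrum. A minimal sketch (Hann windowing and the test tone are assumptions, not the app's code):

```python
import numpy as np

def spectral_centroid(frame, sr):
    """Magnitude-weighted mean frequency of a windowed frame."""
    win = frame * np.hanning(len(frame))
    mag = np.abs(np.fft.rfft(win))
    freqs = np.fft.rfftfreq(len(win), d=1.0 / sr)
    return float(np.sum(freqs * mag) / (np.sum(mag) + 1e-12))

sr = 8000
t = np.arange(1024) / sr
tone = np.sin(2 * np.pi * 1000 * t)
c = spectral_centroid(tone, sr)   # near 1000 Hz for a pure tone
```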
Lightweight Python toolkit for analyzing speech fluency features such as pauses and silence ratio.
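A silence ratio like the one this toolkit reports is typically the fraction of frames whose energy falls below a threshold. A minimal sketch, with illustrative frame length and threshold (real tools usually adapt the threshold to the recording's noise floor):

```python
import numpy as np

def silence_ratio(signal, sr, frame_ms=25, threshold=0.02):
    """Fraction of non-overlapping frames with RMS below `threshold`."""
    n = int(sr * frame_ms / 1000)
    frames = [signal[i:i + n] for i in range(0, len(signal) - n + 1, n)]
    rms = np.array([np.sqrt(np.mean(f ** 2)) for f in frames])
    return float(np.mean(rms < threshold))

sr = 16000
speech = np.sin(2 * np.pi * 220 * np.arange(sr) / sr)  # 1 s "voiced" tone
pause = np.zeros(sr)                                   # 1 s silence
ratio = silence_ratio(np.concatenate([speech, pause]), sr)  # ~0.5
```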
An automatic system that extracts Praat-like speech features from raw speech WAV files while also producing high-quality, low-WER (<10) transcriptions.