RU directed speech classifier (ruElectra, synthetic ASR noise)
Updated Mar 2, 2026 - Python
Detecting Speed and Tempo Alterations in Speech Recordings
This project implements a speech emotion classification system using neural networks and genetic algorithms for optimization. The system classifies emotions such as calm, happy, sad, angry, fearful, surprise, and disgust from speech audio using the RAVDESS dataset.
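The repository's own training code is not shown here, but the core idea of using a genetic algorithm to optimize a neural network can be sketched generically. In this toy sketch (all names are illustrative, not from the project), `fitness` is a stand-in for the validation score a network would achieve with a given hyperparameter vector; the real system would train and evaluate a model per candidate:

```python
import numpy as np

rng = np.random.default_rng(0)

def fitness(x):
    # Toy stand-in for validation accuracy of a network trained with
    # hyperparameters x (e.g. learning-rate exponent, hidden-size fraction).
    # Peaks at x = (0.3, 0.7); the real project would train a model here.
    return -np.sum((x - np.array([0.3, 0.7])) ** 2)

def evolve(pop_size=20, dims=2, gens=40, mut_sigma=0.1):
    # Random initial population in the unit box
    pop = rng.random((pop_size, dims))
    for _ in range(gens):
        scores = np.array([fitness(ind) for ind in pop])
        # Tournament selection: keep the better of two random parents
        a, b = rng.integers(0, pop_size, (2, pop_size))
        parents = np.where((scores[a] > scores[b])[:, None], pop[a], pop[b])
        # Uniform crossover between consecutive parents
        mask = rng.random((pop_size, dims)) < 0.5
        children = np.where(mask, parents, np.roll(parents, 1, axis=0))
        # Gaussian mutation, clipped back to the unit box
        pop = np.clip(children + rng.normal(0, mut_sigma, children.shape), 0, 1)
    return pop[np.argmax([fitness(ind) for ind in pop])]

best = evolve()
```

Selection, crossover, and mutation are the three standard GA operators; the clipping keeps candidates in a bounded search space, which is typical when hyperparameters are normalized.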
YAMNet for speech classification using C++ and ONNX Runtime - finalist solution for the 2025 Qualcomm Edge Intelligence Innovation Application Competition
This repository contains the code for the INTERSPEECH2025 paper: "Speech and Text Foundation Models for Depression Detection: Cross-Task and Cross-Language Evaluation"
Python implementation of the article "EMOVOME Database: Advancing Emotion Recognition in Speech Beyond Staged Scenarios"
This project aims to perform Emotion Recognition in Speech using Deep Neural Networks (DNNs)
A database of challenging voice utterances collected by the Biometrics Vision and Computing (BVC) group.
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
In this notebook, we recognize spoken commands as a multiclass classification task, using the SPEECHCOMMANDS dataset and the M5 deep convolutional model. The code is written in Python for the PyTorch platform.
Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch
CNN-based approach for audio file classification. Contains notebooks illustrating data preprocessing, feature extraction, model training, and model inference workflows, as well as the overall pipeline.
Qafar-af and Amharic voice command recognition project to control the movement of a wheelchair
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
This project presents my research on dementia classification using audio data.
In this challenge, the goal is to learn to recognize which of several English words is pronounced in an audio recording. This is a multiclass classification task.
Classification of 11 types of audio clips using MFCC features and an LSTM, pretrained on the Speech Commands dataset with intensive data augmentation.
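The MFCC front end that projects like this rely on is a standard pipeline (framing, windowing, power spectrum, mel filterbank, log, DCT). A minimal NumPy-only sketch of that pipeline, with the repository's actual parameters unknown and all function names hypothetical, might look like:

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mfcc(signal, sr=16000, n_fft=512, hop=160, n_mels=26, n_ceps=13):
    """Compute MFCCs for a mono signal; returns (n_frames, n_ceps)."""
    # Slice the signal into overlapping frames and apply a Hamming window
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(n_fft)
    # Per-frame power spectrum
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    # Triangular mel filterbank spanning 0 .. sr/2
    mel_pts = np.linspace(hz_to_mel(0), hz_to_mel(sr / 2), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        l, c, r = bins[i], bins[i + 1], bins[i + 2]
        fbank[i, l:c] = (np.arange(l, c) - l) / max(c - l, 1)
        fbank[i, c:r] = (r - np.arange(c, r)) / max(r - c, 1)
    log_energy = np.log(power @ fbank.T + 1e-10)
    # DCT-II decorrelates the filterbank energies; keep the first n_ceps
    n = np.arange(n_mels)
    dct = np.cos(np.pi * np.outer(np.arange(n_ceps), (2 * n + 1) / (2 * n_mels)))
    return log_energy @ dct.T
```

The resulting (frames, coefficients) matrix is exactly the kind of sequence an LSTM classifier consumes frame by frame; in practice libraries such as librosa or torchaudio compute these features with more options (pre-emphasis, liftering, energy replacement).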
This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utterances"
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Speech Classification using Continuous Attention Mechanisms