speech-classification

A convolutional neural network for gender classification, which achieved an F1-score of 94.3% when tested on the RAVDESS dataset. Created as postgraduate coursework, the report is included. The report also discusses Sodiq Adebiy's CNN, which I'd recommend looking at to anyone interested in emotion classification.

machine-learning deep-neural-networks deep-learning audio-analysis gender-recognition convolutional-neural-networks gender-classification speech-classification

Updated Jun 22, 2022
Jupyter Notebook

ndrco / directed-speech-ru

Star

RU directed speech classifier (ruElectra, synthetic ASR noise)

nlp machine-learning speech-recognizer ai-assistants russian-language speech-classification speech-classification-python

Updated Mar 2, 2026
Python

KrajShuffle / Classifying_SpeechAudio_CNN

Star

CNN Based Approach for Audio File Classification. Contains Notebooks Illustrating Data Preprocessing, Feature Extraction, Model Training, & Model Inference Workflows & Overall Pipeline

feature-extraction convolutional-neural-networks data-preprocessing feature-engineering metrics-visualization speech-classification model-inference model-training-and-evaluation

Updated Feb 8, 2024
Jupyter Notebook

acw-upv / INTERSPEECH2025_Depression

Star

This repository contains the code for the INTERSPEECH2025 paper: "Speech and Text Foundation Models for Depression Detection: Cross-Task and Cross-Language Evaluation"

speech depression-detection speech-classification natual-language-processing

Updated Jun 15, 2025
Python

aliyzd95 / Emotion-Recognition-In-Persian-Speech-Using-Deep-Neural-Networks

Star

This project aims to perform Emotion Recognition in Speech using Deep Neural Networks (DNNs)

deep-neural-networks opensmile librosa speech-processing ser emotion-recognition speech-emotion-recognition speech-classification

Updated May 22, 2025
Python

Mubarekethio / Voice-Recognition-Qafaraf-and-Amharic

Sponsor

Star

Qafar-af and Amharic voice Command Recognition project to control the movement of wheelchair

voice-commands voice-recognition speech-recognition amharic voice-control audio-classification keyword-spotting kws amharic-words speech-classification afar-language qafaraf-voice qafaraf afaraf

Updated Jan 24, 2024
Jupyter Notebook

mingzhi-c / ASD-Detection

Star

Code for audio-based autism spectrum disorder (ASD) classification using Transformer models, machine learning baselines, and SHAP analysis.

machine-learning transformer audio-classification asd autism-spectrum-disorder speech-analysis shap speech-classification

Updated Mar 31, 2026
Python

Rayyan9477 / speech_emotion_classification

Star

This project implements a speech emotion classification system using neural networks and genetic algorithms for optimization. The system classifies emotions such as calm, happy, sad, angry, fearful, surprise, and disgust from speech audio using the RAVDESS dataset.

python machine-learning neural-network speech-recognition emotion-detection speech-classification

Updated Nov 6, 2025
HTML

manashpratim / Frame-Level-Classification-of-Speech

Star

python deep-learning jupyter-notebook pytorch mlp-classifier google-colab google-colaboratory speech-classification

Updated May 28, 2020
Jupyter Notebook

ryanquinnnelson / CMU-11685-Utterance-to-Phoneme-Mapping

Star

Fall 2021 Introduction to Deep Learning - Homework 3 Part 2 (RNN-based phoneme recognition)

cnn torch lstm rnn octopus melspectrogram ctc-loss speech-classification ctcdecode

Updated Dec 3, 2021
Python

Amir-Hofo / Speech_commands_Classification

Star

In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.

machine-learning ai deep-learning cnn pytorch artificial-intelligence speech-recognition convolutional-neural-networks speech-to-text audio-classification torchaudio speech-classification

Updated Jun 18, 2024
Jupyter Notebook

MilanaShhanukova / uni-research-dementia-detection

Star

This project represents my research on dementia classification using audio data.

deep-learning attention-mechanism dementia-detection speech-classification

Updated May 20, 2023
Jupyter Notebook

Choise-ieee / yamnet_onnx_cpp_audio_speech_classification

Star

Yamnet for speech classification using CPP and ONNX-runtime-2025高通边缘智能创新应用大赛入围决赛方案

cpp qualcomm onnx speech-classification yamnet

Updated Oct 8, 2025
C++

sarthak268 / Multimedia-Computing-and-Applications

Sponsor

Star

This repository contains code for all assignments in the Multimedia Computing and Applications (CSE563) course.

multimedia text-retrieval text-representation speech-classification multimedia-computing

Updated May 16, 2020
Python

vishaal27 / IFN-Python

Star

A Python implementation of the Iterative Feature Normalization algorithm

machine-learning feature-extraction speech-classification feature-normalization

Updated May 12, 2020
Jupyter Notebook

deep-spin / speech-continuous-attention

Star

Speech Classification using Continuous Attention Mechanisms

speech-classification continuous-attention continuous-sparsemax continuous-softmax

Updated Jul 22, 2022
Python

Jason-Oleana / speech-classification

Star

In this challenge, the goal is to learn to recognize which of several English words is pronounced in an audio recording. This is a multiclass classification task.

convolutional-neural-network speech-classification mfcc-features

Updated Mar 25, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-classification topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-classification topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-classification

Here are 29 public repositories matching this topic...

TanukiDong / Sound-and-Tempo-Classification

acw-upv / SER_EMOVOME

OgeNI / BVC_Challenging_Voice_Set

Chris-Winnard / Speech-Gender-Classifier

ndrco / directed-speech-ru

KrajShuffle / Classifying_SpeechAudio_CNN

acw-upv / INTERSPEECH2025_Depression

aliyzd95 / Emotion-Recognition-In-Persian-Speech-Using-Deep-Neural-Networks

Mubarekethio / Voice-Recognition-Qafaraf-and-Amharic

mingzhi-c / ASD-Detection

Rayyan9477 / speech_emotion_classification

manashpratim / Frame-Level-Classification-of-Speech

ryanquinnnelson / CMU-11685-Utterance-to-Phoneme-Mapping

Amir-Hofo / Speech_commands_Classification

MilanaShhanukova / uni-research-dementia-detection

Choise-ieee / yamnet_onnx_cpp_audio_speech_classification

sarthak268 / Multimedia-Computing-and-Applications

vishaal27 / IFN-Python

deep-spin / speech-continuous-attention

Jason-Oleana / speech-classification

Improve this page

Add this topic to your repo