Chapter I. Introduction 1-13

This document is the table of contents of a thesis on speaker recognition. The thesis has six chapters: an introduction, a literature review, feature extraction, speaker modeling, dimensionality reduction of feature vectors, and conclusions with future work. It also lists appendices and references. The chapters cover the fundamentals of speaker recognition, a review of existing approaches, MFCC-based feature extraction with a proposed voice activity detection method, speaker modeling with neural networks and support vector machines, and feature selection with genetic algorithms, each with performance analysis.

Uploaded by Abdelkbir Ws

Declaration i
Certificate of the Supervisor ii
Acknowledgements iii
List of Publications iv
Abstract v
Table of Contents vii
List of Tables xii
List of Figures xiv
List of Abbreviations xvii

CHAPTER I. INTRODUCTION 1-13

1.1 Fundamentals of Speaker Recognition 1
1.2 Applications 7
1.3 Historical Achievements in Speaker Recognition Technology 8
1.4 Challenges to the Speaker Recognition System 9
1.5 Motivation 10
1.6 Problem Formulation 11
1.7 Objectives of Research 11
1.8 Organization of Thesis 12

CHAPTER II. LITERATURE REVIEW 14-43

2.1 Introduction 14
2.1.1 Speech Production Mechanism in Human Beings 15
2.1.2 Source Filter Model of Speech Production 17
2.1.3 Short Term Analysis of Speech Signal 19
2.2 Basic Structure of Speaker Recognition System 19
2.3 Voice Activity Detection 22
2.4 Feature Extraction Methods used in Speaker Recognition 23
2.4.1 Spectral Features 24
2.4.2 Dynamic Features 25
2.4.3 Prosodic Features 26
2.4.4 High-level Features 27
2.5 Speaker Modeling - Classical Approaches 27
2.5.1 Template Models 28
2.5.2 VQ Source Modeling 29
2.5.3 Hidden Markov Model 30
2.5.4 Neural Networks 31
2.5.5 Support Vector Machines 32
2.5.6 Gaussian Mixture Models 32
2.6 Dimensionality Reduction Techniques 35
2.7 Performance Terms for Speaker Recognition Task 36
2.8 Gaps in the Study 41
2.9 Conclusions 42

CHAPTER III. FEATURE EXTRACTION 44-76

3.1 Introduction 44
3.2 Pre-processing 47
3.2.1 Pre-emphasis 48
3.2.2 Voice Activity Detection 49
3.3 Proposed Method of Voice Activity Detection 52
3.4 Mel Frequency Cepstral Coefficients 54
3.4.1 Frame Blocking 55
3.4.2 Windowing 56
3.4.3 Short Term Fast Fourier Transform 57
3.4.4 Mel-Frequency Warping 57
3.4.5 Log Compression and Discrete Cosine Transform 59
3.4.6 Delta and Delta-Delta Coefficients 60
3.5 Simulation 62
3.5.1 Voice Activity Detection 63
3.5.2 MFCC 63
3.6 Feature Extraction using MFCC and its Derivatives 65
3.6.1 Number of filters in the filter bank vs. Identification Rate 65
3.6.2 Effect of variation in Type of Window 66
3.6.3 Effect of Adding Derivatives 67
3.7 Effect of VAD on Speaker Recognition Rate 69
3.8 Factors affecting MFCC performance 71
3.9 Conclusions 75

CHAPTER IV. SPEAKER MODELING 77-109

4.1 The Neural Network 77
4.2 Network Structures 80
4.3 Training of Artificial Neural Networks 82
4.4 Implementation of the Speaker Recognition System using Back Propagation Algorithm 86
4.5 Support Vector Machines 89
4.6 SVM Classification Mechanism 91
4.6.1 Linear Separable Case 91
4.6.2 Linear Non-separable Case 94
4.6.3 Nonlinear Case 95
4.7 Implementation of the Speaker Recognition System using SVM 97
4.8 Performance of the Speaker Recognition System 100
4.8.1 Performance of the Speaker Identification System in Presence of Noise 100
4.8.2 Relative Performance of SVM and Neural Network in a Speaker Recognition System 102
4.9 Real Time Speaker Recognition System for Hindi Words 103
4.9.1 Methodology 104
4.9.2 Graphical User Interface (GUI) for Real Time Speaker Recognition 106
4.9.3 Display on LCD 108
4.10 Conclusions 109

CHAPTER V. DIMENSIONALITY REDUCTION OF FEATURE VECTORS 110-128

5.1 Introduction 110
5.2 Genetic Algorithms 113
5.3 Feature Selection using GA 116
5.4 Performance of the Speaker Recognition System using GA 117
5.4.1 Effect of Noise on Speaker Recognition Rate 119
5.4.2 Processing Time 121
5.4.3 Effect of Number of Utterances per Speaker on Recognition Rate 122
5.4.4 Relative Performance of GA and PCA in a Speaker Recognition System 123
5.4.5 Performance of GA with Different Kernel Functions of SVM using Reduced Dimensional Feature Vectors 125
5.5 Conclusions 127

CHAPTER VI. CONCLUSIONS AND FUTURE WORK 129-135

6.1 Introduction 129
6.2 Summary and Findings 130
6.3 Future Scope 135

APPENDICES
A. Voicebox 136
B. Description of Speaker Databases 137

REFERENCES 139

BRIEF PROFILE OF THE RESEARCH SCHOLAR 151
