Sphinx Speech Recognition

The document discusses speech recognition in Python using the CMU Sphinx toolkit, specifically the Pocketsphinx library for offline applications. It outlines the installation process for necessary libraries and provides code examples for continuous speech recognition and keyword searching. The document concludes by emphasizing the utility of CMU Sphinx in various applications of speech recognition.

Uploaded by

Madhavan Jayarama Mohan


EXERCISE 2

Speech Recognition in Python using CMU Sphinx



“Hey, Siri!”, “Okay, Google!” and “Alexa, play some music” are phrases
that have become part of everyday life, because giving voice commands to
a virtual assistant makes many tasks easier. But have you ever wondered
how these devices accept commands by voice?
Do applications understand your voice? How does the computer decode
speech when it only understands 0s and 1s?
The answer is simple: the device uses speech recognition software to
decode the input captured by its microphone. The task of this software is
to convert the speech into a string (text) that the program can then
process.
One such toolkit is CMU Sphinx, an open-source toolkit for speech
recognition. It includes a lightweight recognizer library called
Pocketsphinx, which is used here to recognize speech. This library is
especially useful offline. When you have internet access, the Google API
available through the SpeechRecognition package is usually the better
choice because of its higher accuracy, but for a project that works
offline or runs on an offline embedded device, use pocketsphinx.

Recognition Process

Let’s look at how the library works behind the scenes to recognize
speech. It takes a waveform and splits it into utterances at silences.
For each utterance it then tries to work out what is being said: it
considers the possible combinations of words, matches them against the
audio and chooses the best-matching combination.
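The splitting step can be illustrated with a toy example. This is a minimal sketch, not Pocketsphinx's actual algorithm: it assumes the audio is a plain list of integer samples and treats a sample as silent when its absolute value is below a threshold. The function name split_on_silence and the parameters threshold and min_silence are made up for this illustration.

```python
def split_on_silence(samples, threshold=500, min_silence=3):
    """Return (start, end) index pairs for the non-silent runs in samples.

    end is exclusive. A pause shorter than min_silence samples does not
    end an utterance.
    """
    utterances = []
    start = None     # index where the current utterance began
    quiet_run = 0    # consecutive quiet samples seen inside an utterance
    for i, sample in enumerate(samples):
        if abs(sample) >= threshold:
            if start is None:
                start = i
            quiet_run = 0
        elif start is not None:
            quiet_run += 1
            if quiet_run >= min_silence:
                # the pause is long enough: close the utterance
                utterances.append((start, i - quiet_run + 1))
                start = None
                quiet_run = 0
    if start is not None:
        # the audio ended while an utterance was still open
        utterances.append((start, len(samples) - quiet_run))
    return utterances


samples = [0, 0, 900, 1200, 800, 0, 0, 0, 0, 1000, 1100, 0]
print(split_on_silence(samples))  # [(2, 5), (9, 11)]
```

A real recognizer works on frames of audio rather than single samples, but the idea is the same: each (start, end) pair marks one utterance that is then decoded on its own.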

Installation of modules
Since pocketsphinx is an external library, i.e. it is not built into
Python, we install it with the pip installer and then use import to
access its functionality.
NOTE: make sure that you have the latest version of pip installed. If
not, open your terminal and type the following command:
python -m pip install --upgrade pip setuptools wheel
If you already have the latest version of pip, proceed directly and type
the following command into your terminal:
pip install pocketsphinx
Now that pocketsphinx is installed on your machine, let’s move on.

Prerequisites

Two prerequisite libraries are used alongside pocketsphinx:
1. SpeechRecognition: used for speech recognition, with support for
several engines and APIs, online and offline.
2. PyAudio: used to play and record audio in Python.
It is recommended to install these two libraries with pip. PyAudio
depends on the PortAudio library; the brew command below installs it on
macOS with Homebrew, so on other systems use your own package manager for
that step if needed.
pip install SpeechRecognition
brew install portaudio
pip install pyaudio
All the required external libraries are now installed, so let’s move on
to the code.
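Before running the examples, it can help to confirm that the three packages are importable. The sketch below uses only the standard library; the helper name missing_modules is made up for this illustration. Note that the import names differ from the pip names: SpeechRecognition is imported as speech_recognition and PyAudio as pyaudio.

```python
import importlib.util

# import names of the packages installed above
REQUIRED = ["pocketsphinx", "speech_recognition", "pyaudio"]


def missing_modules(names):
    """Return the names from the list that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(REQUIRED)
if missing:
    print("Not installed:", ", ".join(missing))
else:
    print("All required modules are installed")
```

This only checks that the modules can be located; it does not open the microphone or load any speech models.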

LiveSpeech

LiveSpeech is an iterator class available in pocketsphinx that can be
used for continuous recognition or keyword search from a microphone.
Here is the code for continuous recognition.

 Python3

# import LiveSpeech
from pocketsphinx import LiveSpeech

# LiveSpeech() keeps listening on the microphone and yields results
for phrase in LiveSpeech():
    # str(phrase) is the text recognized for this result
    text = str(phrase)
    if text:
        print(text)
    else:
        print("Sorry! could not recognize what you said")

Output :

We used LiveSpeech in a basic for-in loop to fetch continuous speech
input from the user through the device microphone. Each recognized
utterance is converted to a string, stored in phrase and displayed.

Keyword searching

We create a variable named speech of type pocketsphinx.LiveSpeech by
invoking the LiveSpeech class with the arguments keyphrase, i.e. the
keyword to be searched for, and kws_threshold, the detection threshold
for that keyword. We then use a for-in loop on speech, which continuously
listens for voice input; whenever the user utters the word ‘forward’, it
is printed along with its segments.
 Python3

# importing LiveSpeech
from pocketsphinx import LiveSpeech

# listen only for the keyphrase 'forward'
speech = LiveSpeech(keyphrase='forward', kws_threshold=1e-20)

# a for-in loop to iterate over speech
for phrase in speech:
    # print the detected keyword along with its segments
    print(phrase.segments(detailed=True))
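Once a keyword such as ‘forward’ has been detected, a program usually maps it to an action. The following is a minimal sketch of that step only. It works on plain strings, so it runs without a microphone; with LiveSpeech the string would come from str(phrase). The names handle_phrase and actions, and the robot-style replies, are made up for this illustration.

```python
def handle_phrase(text, actions):
    """Run the action for the first known keyword in text, if any."""
    for word in text.lower().split():
        action = actions.get(word)
        if action is not None:
            return action()
    return None


# hypothetical actions for a voice-controlled robot
actions = {
    "forward": lambda: "moving forward",
    "stop": lambda: "stopping",
}

print(handle_phrase("Forward", actions))      # moving forward
print(handle_phrase("hello there", actions))  # None
```

Keeping the keyword-to-action mapping in a dictionary means new commands can be added without changing the loop that listens for speech.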

Output :

Test program

First, import speech_recognition under a short reference name, here aud.
Then create a variable a of type speech_recognition.Recognizer, which is
used to recognize the audio and convert it to text. After that, define
the microphone as the source of input and define a variable, say audio,
that listens, i.e. it takes the user's speech input and stores it. Inside
a try block we then invoke recognize_sphinx with audio as the argument
and print what the user said. The job of this method is to convert what
the user said (in the form of speech) into text, which is displayed in
the console; this step is the recognition itself.
If the speech cannot be understood, for example because the voice is
unclear, recognize_sphinx raises UnknownValueError, which we catch to
print a message. We also catch RequestError, which is raised when there
is a problem with the Sphinx installation.

 Python3

import speech_recognition as aud

# create a Recognizer, which converts captured audio to text
a = aud.Recognizer()

# declaring device microphone as the source to take audio input
with aud.Microphone() as source:
    print("Say something!")
    # listen on the microphone and store what the user said in audio
    audio = a.listen(source)

# invoking sphinx for speech recognition
try:
    # printing the recognized text
    print("You said " + a.recognize_sphinx(audio))
except aud.UnknownValueError:
    # if the voice is unclear
    print("Could not understand")
except aud.RequestError as e:
    # if Sphinx is missing or not set up correctly
    print("Error; {0}".format(e))

Output:

Conclusion

This winds up our discussion of speech recognition using CMU Sphinx.
There are many more applications of this useful library.
