0% found this document useful (0 votes)

84 views5 pages

Speech Recognition

The document provides an overview of speech recognition technology using Python, particularly focusing on the SpeechRecognition library. It includes instructions for installation, code examples for transcribing audio files, and capturing audio from a microphone. Additionally, it discusses the setup for using Google Speech Recognition and handling exceptions during the recognition process.

Uploaded by

gauravendra272002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

84 views5 pages

Speech Recognition

Uploaded by

gauravendra272002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

SpeechRecognition

Speech recognition is a technology that allows computers to understand and

process human speech. Python, with its simplicity and robust libraries, offers
several modules to tackle speech recognition tasks effectively. One of the most
popular libraries for this purpose is the SpeechRecognition library.

With SpeechRecognition Library

In this section, we will base our speech recognition system on this tutorial.
SpeechRecognition library offers many transcribing engines like Google Speech
Recognition, and that's what we'll be using.

 Before we get started, let's install the required libraries:

 $ pip install SpeechRecognition pydub

Open up a new file named speechrecognition.py, and add the following:

# importing libraries
import speech_recognition as sr
import os
from pydub import AudioSegment
from pydub.silence import split_on_silence

# create a speech recognition object

r = sr.Recognizer()

 The below function loads the audio file, performs speech recognition, and
returns the text:

 # a function to recognize speech in the audio file

 # so that we don't repeat ourselves in in other functions
def transcribe_audio(path):
# use the audio file as the audio source
with sr.AudioFile(path) as source:
audio_listened = r.record(source)
# try converting it to text
text = r.recognize_google(audio_listened)
return text
 Next, we make a function to split the audio files into chunks in silence:

# a function that splits the audio file into chunks on silence

# and applies speech recognition
def get_large_audio_transcription_on_silence(path):
"""
Splitting the large audio file into chunks
and apply speech recognition on each of these chunks
"""
# open the audio file using pydub
sound = AudioSegment.from_file(path)
# split audio sound where silence is 700 miliseconds or more and
get chunks
chunks = split_on_silence(sound,
# experiment with this value for your target audio file
min_silence_len = 500,
# adjust this per requirement
silence_thresh = sound.dBFS-14,
# keep the silence for 1 second, adjustable as well
keep_silence=500,
)
folder_name = "audio-chunks"
# create a directory to store the audio chunks
if not os.path.isdir(folder_name):
os.mkdir(folder_name)
whole_text = ""
# process each chunk
for i, audio_chunk in enumerate(chunks, start=1):
# export audio chunk and save it in
# the `folder_name` directory.
chunk_filename = os.path.join(folder_name, f"chunk{i}.wav")
audio_chunk.export(chunk_filename, format="wav")
# recognize the chunk
with sr.AudioFile(chunk_filename) as source:
audio_listened = r.record(source)
# try converting it to text
try:
text = r.recognize_google(audio_listened)
except sr.UnknownValueError as e:
print("Error:", str(e))
else:
text = f"{text.capitalize()}. "
print(chunk_filename, ":", text)
whole_text += text
# return the text for all chunks detected
return whole_text
print(get_large_audio_transcription_on_silence("7601-291468-0006.wav"))

Implementing Speech Recognition with Python

basic implementation using the SpeechRecognition library involves several steps:

Audio Capture: Capturing audio from the microphone using PyAudio.

Audio Processing: Converting the audio signal into data that the SpeechRecognition library can work
with.

Recognition: Calling the recognize_google() method (or another available recognition method) on
the SpeechRecognition library to convert the audio data into text.

Pro_2

import speech_recognition as sr

# Initialize recognizer class (for recognizing the speech)

r = sr.Recognizer()

# Reading Microphone as source

# listening the speech and store in audio_text variable
with sr.Microphone() as source:
print("Talk")
audio_text = r.listen(source)
print("Time over, thanks")
# recoginze_() method will throw a request
# error if the API is unreachable,
# hence using exception handling

try:
# using google speech recognition
print("Text: "+r.recognize_google(audio_text))
except:
print("Sorry, I did not get that")

Speech Recognition in Python using Google Speech API

sudo pip install SpeechRecognition

PyAudio: Use the following command for Linux users

sudo apt-get install python-pyaudio python3-pyaudio
If the versions in the repositories are too old,
install pyaudio using the following command
sudo apt-get install portaudio19-dev python-all-dev python3-all-dev
&&
sudo pip install pyaudio
pip install pyaudio
USB Device 0x46d:0x825: Audio (hw:1, 0)

Make a note of this as it will be used in the program.

Set Chunk Size: This basically involved specifying how many bytes of data we want to read at once.
Typically, this value is specified in powers of 2 such as 1024 or 2048

Set Sampling Rate: Sampling rate defines how often values are recorded for processing

Set Device ID to the selected microphone : In this step, we specify the device ID of the microphone
that we wish to use in order to avoid ambiguity in case there are multiple microphones. This also
helps debug, in the sense that, while running the program, we will know whether the specified
microphone is being recognized. During the program, we specify a parameter device_id. The
program will say that device_id could not be found if the microphone is not recognized.

Allow Adjusting for Ambient Noise: Since the surrounding noise varies, we must allow the program a
second or two to adjust the energy threshold of recording so it is adjusted according to the external
noise level.

Speech to text translation: This is done with the help of Google Speech Recognition. This requires an
active internet connection to work. However, there are certain offline Recognition systems such as
PocketSphinx, that have a very rigorous installation process that requires several dependencies.
Google Speech Recognition is one of the easiest to use.

SPEECH HINDI
pip install SpeechRecognition
pip install PyAudio
pip install pipwin
pipwin install pyaudio

WAP Speech Hindi

# import required module
import speech_recognition as sr

# explicit function to take input commands

# and recognize them
def takeCommandHindi():

r = sr.Recognizer()
with sr.Microphone() as source:

# seconds of non-speaking audio before

# a phrase is considered complete
print('Listening')
r.pause_threshold = 0.7
audio = r.listen(source)
try:
print("Recognizing")
Query = r.recognize_google(audio, language='hi-In')

# for listening the command in indian english

print("the query is printed='", Query, "'")

# handling the exception, so that assistant can

# ask for telling again the command
except Exception as e:
print(e)
print("Say that again sir")
return "None"
return Query

# Driver Code

# call the function

takeCommandHindi()

Speech Recognition System
No ratings yet
Speech Recognition System
16 pages
Python Speech Recognition Guide
No ratings yet
Python Speech Recognition Guide
25 pages
Python Text To Spesdfssech
No ratings yet
Python Text To Spesdfssech
2 pages
Speech To Text Conversion
No ratings yet
Speech To Text Conversion
7 pages
Python Speech Recognition Guide
No ratings yet
Python Speech Recognition Guide
3 pages
Sphinx Speech Recognition
No ratings yet
Sphinx Speech Recognition
5 pages
Lecture
No ratings yet
Lecture
7 pages
Labs 9
No ratings yet
Labs 9
4 pages
Speech To Text
No ratings yet
Speech To Text
17 pages
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
No ratings yet
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
48 pages
Python SpeechRecognition Guide
No ratings yet
Python SpeechRecognition Guide
23 pages
Python GuiaUser
No ratings yet
Python GuiaUser
23 pages
Jarvis Voice Assistant
No ratings yet
Jarvis Voice Assistant
2 pages
Voice Assistant Report
No ratings yet
Voice Assistant Report
4 pages
Jarvis
No ratings yet
Jarvis
4 pages
Week-8 NLP Lab Program
No ratings yet
Week-8 NLP Lab Program
6 pages
Lesson 7 Speech Recognition Techniques
No ratings yet
Lesson 7 Speech Recognition Techniques
56 pages
Pydub
No ratings yet
Pydub
26 pages
2.5 Automatic Speech Recognition
No ratings yet
2.5 Automatic Speech Recognition
8 pages
Speech Recognition Introduction
No ratings yet
Speech Recognition Introduction
8 pages
Speech to Text Guide Using Python
No ratings yet
Speech to Text Guide Using Python
1 page
Voice Recognition Word Game
No ratings yet
Voice Recognition Word Game
4 pages
7B Sem DL Lab1
No ratings yet
7B Sem DL Lab1
1 page
Methodology To Use in Speech To Text Python - Google Search PDF
No ratings yet
Methodology To Use in Speech To Text Python - Google Search PDF
1 page
Voice Assistant Suggetion
No ratings yet
Voice Assistant Suggetion
3 pages
PBL 2
No ratings yet
PBL 2
5 pages
Assistant
No ratings yet
Assistant
2 pages
Spoken Language Processing in Python Chapter3
No ratings yet
Spoken Language Processing in Python Chapter3
26 pages
Speech Recog
No ratings yet
Speech Recog
5 pages
Speech-To-Text: Python
No ratings yet
Speech-To-Text: Python
10 pages
Python Virtual Assistant Guide
No ratings yet
Python Virtual Assistant Guide
8 pages
Speech Recognition Transcription With Open Source ...
No ratings yet
Speech Recognition Transcription With Open Source ...
2 pages
Speech Recognition Techniques - GUVI
No ratings yet
Speech Recognition Techniques - GUVI
4 pages
TSA Lab 2
No ratings yet
TSA Lab 2
3 pages
Speech Recognition
No ratings yet
Speech Recognition
13 pages
Jarvis
No ratings yet
Jarvis
2 pages
Project Report
No ratings yet
Project Report
58 pages
Speech Understanding Content
No ratings yet
Speech Understanding Content
9 pages
Exercise 8
No ratings yet
Exercise 8
2 pages
Speech Recognition System Using Python Report
No ratings yet
Speech Recognition System Using Python Report
7 pages
DL Proj Rep
No ratings yet
DL Proj Rep
11 pages
Coding The Future: A Comprehensive Guide To AI Development-By Tyler Welch
No ratings yet
Coding The Future: A Comprehensive Guide To AI Development-By Tyler Welch
180 pages
Synopsis
No ratings yet
Synopsis
5 pages
Department of Computer Science and Engineering) : CGB1121/ EGB1122
No ratings yet
Department of Computer Science and Engineering) : CGB1121/ EGB1122
18 pages
Coding The Future: A Comprehensive Guide To AI Development-By Tyler P Welch - The Astral Merchant
No ratings yet
Coding The Future: A Comprehensive Guide To AI Development-By Tyler P Welch - The Astral Merchant
31 pages
Design Lab2
No ratings yet
Design Lab2
22 pages
Desktop Assistant Final
No ratings yet
Desktop Assistant Final
15 pages
Voice M
No ratings yet
Voice M
19 pages
Speech Understanding Content
No ratings yet
Speech Understanding Content
10 pages
Voice Assistant
No ratings yet
Voice Assistant
30 pages
Voice Assistant Report 40 Pages
No ratings yet
Voice Assistant Report 40 Pages
44 pages
#Pip Install Pyttsx3 #Pip Install Speechrecognition #Pip Install Wikipedia
No ratings yet
#Pip Install Pyttsx3 #Pip Install Speechrecognition #Pip Install Wikipedia
3 pages
Raspberry Pi
No ratings yet
Raspberry Pi
16 pages
Py Report
No ratings yet
Py Report
8 pages
Voice Identification GLM4 Guide
No ratings yet
Voice Identification GLM4 Guide
2 pages
Training Project - Pptyx
No ratings yet
Training Project - Pptyx
11 pages
The PC Interfaced Voice Recognition System Is To Implement A Password For Authentication
No ratings yet
The PC Interfaced Voice Recognition System Is To Implement A Password For Authentication
7 pages
Python Based Voice Assistant Presentation
No ratings yet
Python Based Voice Assistant Presentation
8 pages
Digital Guardian User & Entity Behavior Analytics: See, Analyze, and Understand Risky Behaviors
No ratings yet
Digital Guardian User & Entity Behavior Analytics: See, Analyze, and Understand Risky Behaviors
2 pages
Important Questions in Engineering Chemistry - I Cy2111 Result 20
0% (1)
Important Questions in Engineering Chemistry - I Cy2111 Result 20
4 pages
R53 Amd Comal Uma/Muxless System Diagram: ATI AMD Trinity APU
No ratings yet
R53 Amd Comal Uma/Muxless System Diagram: ATI AMD Trinity APU
44 pages
Get 'Total Editing Time' To Work in Word 2010
No ratings yet
Get 'Total Editing Time' To Work in Word 2010
5 pages
Modbus Protocol Overview
No ratings yet
Modbus Protocol Overview
7 pages
Poly Partner Mode Admin 3 7 0
No ratings yet
Poly Partner Mode Admin 3 7 0
137 pages
Catalogo - Ipc2122lb SF28 A
No ratings yet
Catalogo - Ipc2122lb SF28 A
4 pages
Affidavit of Loss Iphone Sample
100% (1)
Affidavit of Loss Iphone Sample
1 page
AJP - Microproject
No ratings yet
AJP - Microproject
22 pages
OPC UA Interoperability For Industrie4 and IoT en
0% (1)
OPC UA Interoperability For Industrie4 and IoT en
56 pages
Jayantika Dev CV World Bank
No ratings yet
Jayantika Dev CV World Bank
4 pages
8085 Assembly Language Guide
No ratings yet
8085 Assembly Language Guide
9 pages
KONNECT Kong Presentation
No ratings yet
KONNECT Kong Presentation
19 pages
Kerberos Authentication in Chat Apps
No ratings yet
Kerberos Authentication in Chat Apps
35 pages
A00 EE4211 Module Info 2024
No ratings yet
A00 EE4211 Module Info 2024
1 page
Towards An Efficient Model For Network Intrusion Detection System (IDS) : Systematic Literature Review
No ratings yet
Towards An Efficient Model For Network Intrusion Detection System (IDS) : Systematic Literature Review
30 pages
Embedded System Interface Guide
No ratings yet
Embedded System Interface Guide
26 pages
Spring JPA & Hibernate Guide
No ratings yet
Spring JPA & Hibernate Guide
98 pages
Part-4-Timer and Interrupt
100% (2)
Part-4-Timer and Interrupt
23 pages
Bar Management System-1
No ratings yet
Bar Management System-1
10 pages
SOU Lecture Handout ADA Unit-8
No ratings yet
SOU Lecture Handout ADA Unit-8
17 pages
Network Services, Virtualization, and Cloud Computing
No ratings yet
Network Services, Virtualization, and Cloud Computing
45 pages
Crucial Nvme Pcie m2 SSD Install Guide
No ratings yet
Crucial Nvme Pcie m2 SSD Install Guide
8 pages
Unit 2
No ratings yet
Unit 2
15 pages
Upload Guidelines for OMPL Recruitment
No ratings yet
Upload Guidelines for OMPL Recruitment
2 pages
Update Bios Msi
100% (1)
Update Bios Msi
4 pages
Forcepoint Cloud Security Administrator Virtual Instructor-Led Training
No ratings yet
Forcepoint Cloud Security Administrator Virtual Instructor-Led Training
4 pages
Release Notes: EMC FAST Cache For VNX OE For Block
No ratings yet
Release Notes: EMC FAST Cache For VNX OE For Block
12 pages
Computer Basic
100% (1)
Computer Basic
7 pages
LTE Power Control
100% (2)
LTE Power Control
34 pages

Speech Recognition

Uploaded by

Speech Recognition

Uploaded by

SpeechRecognition

Speech recognition is a technology that allows computers to understand and

With SpeechRecognition Library

 Before we get started, let's install the required libraries:

Open up a new file named speechrecognition.py, and add the following:

# create a speech recognition object

 # a function to recognize speech in the audio file

# a function that splits the audio file into chunks on silence

Implementing Speech Recognition with Python

basic implementation using the SpeechRecognition library involves several steps:

Audio Capture: Capturing audio from the microphone using PyAudio.

# Initialize recognizer class (for recognizing the speech)

# Reading Microphone as source

Speech Recognition in Python using Google Speech API

sudo pip install SpeechRecognition

PyAudio: Use the following command for Linux users

Make a note of this as it will be used in the program.

WAP Speech Hindi

# explicit function to take input commands

# seconds of non-speaking audio before

# for listening the command in indian english

# handling the exception, so that assistant can

# call the function

You might also like