lOMoARcPSD|33212006
MINI Project Report
Engineering (Rashtrasant Tukadoji Maharaj Nagpur University)
Scan to open on Studocu
Studocu is not sponsored or endorsed by any college or university
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
MINI PROJECT REPORT
ON
TEXT TO SPEECH CONVERTER
VI SEMESTER
COMPUTER ENGINEERING
Submitted by
ANIKET AGHAM
DHAWAL MADANKAR
DIPALI KAMBALE
MANTHAN JADHAV
Under the guidance of
PROF. KOMAL JAISINGHANI
Academic Year 2021-2022
Department of Computer Engineering
ST. VINCENT PALLOTTI COLLEGE OF ENGINEERING AND
TECHNOLOGY
Wardha Road, Gavsi Manapur, Nagpur
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
ST. VINCENT PALLOTTI COLLEGE OF ENGINEERING
AND TECHNOLOGY
Wardha Road, Gavsi Manapur, Nagpur
Department of Computer Engineering
CERTIFICATE
This is to certify that Aniket Agham, Dhawal Madankar, Dipali Kambale and Manthan
Jadhav have completed mini project on “TEXT TO SPEEECH CONVERTER” under my
supervision for partial fulfillment of VI semester, Bachelor of Engineering in COMPUTER
ENGINEERING under RASHTRASANT TUKDOJI MAHARAJ NAGPUR
UNIVEERSITY, NAGPUR.
Dr. S. M. Wanjari Prof. Komal Jaisinghani
Associate Professor Assistant Professor
Head of the Department Project Guide
Computer Engineering Computer Engineering
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
ACKNOWLEDGEMENT
The Title of our Mini project is, “TEXT TO SPEECH CONVERTER”. Any project requires
a lot of hard work, sincerity and systematic work methodologies. We express our gratitude to
our Project Guide, Prof. Komal Jaisinghani, for guiding us in every step of the project.
We would also like to thank Prof. Sunil Wanjari, Head of the Department of Computer
Engineering and all our faculty members who evaluated our project and pointed out the
shortcomings in the projects. They also gave us important feedback for our project which
helped us along the way.
We are also grateful to the Principal, Management of College, Dr. Surendra V. Gole and
Vice-principal, Prof. R. B. Gowardhan for the support in providing us the facilities of
computer lab and other required infrastructure.
Group Members
Aniket Agham
Dhawal Madankar
Dipali Kambale
Manthan Jadhav
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CONTENTS
CHAPTER NO. TITLE PAGE NO
1. INTRODUCTION
1.1 INTRODUCTION
1.2 OBJECTIVES
1.3 PURPOSE
1.4 SCOPE
2. LITERATURE REVIEW & ANALYSIS
2.1 LITERATURE REVIEW
2.2 ANALYSIS
3. PROPOSED SYSTEM
3.1 PLANNING AND SCHEDULING
4. SYSTEM DESIGN
4.1 FLOW OF PROGRAM
4.2 USED MODULES
5. IMPLEMENTATION
6. TESTING & RESULT
7. CONCLUSION & FUTURE SCOPE
8. REFERENCES
PROJECT TEAM MEMBER & PROJECT GUIDE INFORMATION
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
ABSTRACT
A Text-to-speech synthesizer is an application that converts text into spoken word, by
analyzing and processing the text using Natural Language Processing (NLP) and then using
Digital Signal Processing (DSP) technology to convert this processed text into synthesized
speech representation of the text. Here, we developed a useful text-to-speech synthesizer
in the form of a simple application that converts inputted text into synthesized speech and
reads out to the user which can then be saved as an mp3 file. The development of a text to
speech synthesizer will be of great help to people with visual impairment and make making
through large volume of text easier.
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 1
INTRODUCTION
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
1.1 INTRODUCTION
Text-to-speech synthesis -TTS - is the automatic conversion of a text into speech that
resembles, as closely as possible, a native speaker of the language reading that text. Text-to
speech synthesizer (TTS) is the technology which lets computer speak to you. The TTS
system gets the text as the input and then a computer algorithm which called TTS engine
analyses the text, pre-processes the text and synthesizes the speech with some mathematical
models. The TTS engine usually generates sound data in an audio format as the output.
1.2 OBJECTIVES
To convert text to speech
To convert text from one language to another
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
1.3 PURPOSE
Our purpose is text to speech translation. The system is helpful for persons having learning
difficulties or visually challenged. It Prevents eye from strain, and user can sit and listen
comfortably, help avoiding an external human translator. And also avoid sharing of trade
secrets with other translators. It will help in widening the trade market and Travelling to
foreign countries and speaking their native language will be made easy mechanical attempts
dating to the eighteenth century.
1.4 SCOPE
This application is designed to overcome the language barriers.
The application is designed to overcome language difference.
The application designed to help visually impaired people.
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 2
LITERATURE REVIEW AND ANALYSIS
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
2.1 LITERATURE REVIEW
Sr. No. Paper Title and its Author Details of Publication Findings
1 Assistive system for product label detection International Journal Texts written in
with voice output for blind users of Research in objects are converted
Purnima C. and Rubesh Kumar T Engineering & into audio.
Advanced Technology
2014
2 Text-to-speech conversion on a personal ER publication Text converted into
computer audio
El-Imam, Y.A.; Banat, K.
3 Text-to-speech conversion in Spanish a IEEE International Text to converted into
complete rule-based synthesis system," Conference Acoustics, audio in Spanish
Acoustics Santos, J. Nombela Speech, and Signal language
processing
4 English Text to Multilingual Speech Blue Eyes Intelligence Text converted into
Translator Engineering audio with three
Using Android Vivek Hanumante, Rubi & Sciences different language
Debnath, Disha Bhattacharjee, Deepti Publication Pvt. Ltd.
Tripathi, Sahadev Roy
2.2 ANALYSIS
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
2.2.1 FUNCTIONAL REQUIREMENTS
User-definable lexicon
Language switching within the text
Support for multiple document types
Switching among the multiple voices
Ability to create many audio file at once
Text spelling check
2.2.2 NON-FUNCTIONAL REQUIREMENTS
Non-functional requirement specifies how the system should behave and that it is a constraint
upon the system behavior. Non-functional requirements of our system are:
Ease of use
Flexibility of voice control (set pitch, timbre, pause in the speech)
Ease of Mark-up Language setting
Accessibility of technical support
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 3
PROPOSED SYSTEM
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
3.1 PLANNING AND SCHEDULING
TASK NAME START END DURATION (in days)
Project Selection 10-03-2022 11-03-2022 2
Requirement Analysis 12-03-2022 17-03-2022 6
Requirement 18-03-2022 25-03-2022 8
Gathering
Planning 26-03-2022 02-04-2022 8
Design 03-04-2022 10-04-2022 8
Coding 11-04-2022 30-04-2022 20
Testing 01-05-2022 08-05-2022 8
Dispatch 09-05-2022 10-05-2022 2
Table 3.3.1 Planning and Scheduling
Table 3.3.1 Planning and Scheduling
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 4
SYSTEM DESIGN
4.1 FLOW OF PROGRAM
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
Below is mentioned the appropriate and overall flow of program. The flow of program starts
with taking input as text followed by selecting language, downloading module, text
processing and finally the output as voice will generate.
Taking Input
Selecting Language
Downloading
Module
Text Processing
Voice Output
4.2 USED MODULES
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
In programming, a module is a piece of software that has a specific functionality.
We have used two different types of modules in our project
1 - Text to text processing module.
2 – Text to voice processing module.
Firstly, the language is detected and the module is downloaded, initially the text is translated
from one language to another and then the text is converted to speech. The modules are taken
from ML kit.
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 5
IMPLEMENTATION
5.1 SOFTWARE AND HARDWARE USED
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
5.1.1 SOFTWARE USED
Java JDK5 or latest version
Java Runtime Environment (JRE) 6
Android SDK
Android Studio
Eclipse IDE for Java Developers
Android Development Tool kit (ADT kit) / Eclipse
5.1.2 HARDWARE USED
Microsoft Windows 7/8/10 (32 or 64 bit)
2 GB RAM minimum, 8 GB recommended
2 GB of available disk space minimum, 4 GB recommended
(500 MB for
IDE + 1.5 GB for Android SDK and emulator system image)
1280 x 800 minimum screen resolution
Java Development Kit (JDK) 8
5.3 CODE SNIPPET
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 6
TESTING AND RESULTS
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
6.1 TESTING
The aim of the testing process was to determine all defects in our project. This
program was subjected to a set of test inputs and various observations were made and based
on these observations it will be decided whether the program behaves as expected or not. Our
project went through two levels of testing
6.1.1 Unit testing
6.1.2 Integration testing
6.2 Unit testing
Unit testing is undertaking when a module has been created and successfully
reviewed. In order to test a single module we need to provide a complete environment
i.e. besides the module we would require.
The procedures belonging to other modules that the modules under the test calls.
Non-local data structures that module accesses.
A procedure to call functions of the module under test with appropriate parameters
unit testing was done on each and every module that is described under module
description.
6.3 Integration testing
In this type of testing we test various integration of the project module by providing the input.
The primary objective is to test the module interfaces in order to ensure that no errors are
occurring when one module invokes the other module.
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
6.4 White Box Testing
White box testing is a method of testing software that tests internal structure or working of an
application, as we know the internal working of our project code we use white box testing.
Output Screenshots:
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 7
CONCLUSION AND FUTURE SCOPE
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
7.1Conclusion
As per the goal of this project an attempt is made to show how the app speaks out the any
language text. Here the provision is provided to the user to input the text and he can listen to
his text. The “naturalness” of the synthetic speech needs to be improved for implementing the
expressions of the human beings. By developing such systems, relationship between human
and computer becomes much closer. Thus it helps in overcoming the problem of DIGITAL
DIVIDE.
7.2Future Scope
Improvement of the smoothness of the sound.
Inclusion of prosody and the naturalization of the voices like human expressions.
Reading of special cases like date and number.
Inclusion of different kinds of voices and graphical faces.
Import and Export of documents.
Controlling the reading speed.
The system can be further extended to include more languages
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
CHAPTER 8
REFERENCES
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
References :
T. Rubesh Kumar, C. Purnima “Assistive System for Product Label Detection with Voice
Output For Blind Users” International Journal of Research in Engineering & Advanced
Technology 2014.
Santos J. Nombela, "Text-to-speech conversion in Spanish a complete rule-based synthesis
system", Acoustics, Speech, and Signal, Processing, IEEE
International Conference on ICASSP '82, vol.7, no.pp.1593-1596, May 1982.
Programming knowledge -> https://www.youtube.com/watch?v=_Q8wtPCyMdo
Project Guide
Downloaded by Aman Sharma (assharma2425@gmail.com)
lOMoARcPSD|33212006
And
Project Team Member
1. Project Guide
Prof. Komal Jaisinghani
Assistant Professor
2. Project Team Members
Sr. Name Contact Email Roll No
No.
1 Aniket Agham 7083324685 aniketa.ce19@stvincentngp.edu.in
2 Dhawal Madankar 9673534092 dhawalm.ce19@stvincentngp.edu.in
3 Dipali Kambale 9022032822 dipalik.ce19@stvincentngp.edu.in 05
4 Manthan Jadhav 9765251265 manthanj.ce19@stvincentngp.edu.in
Downloaded by Aman Sharma (assharma2425@gmail.com)