International Journal of Research Publication and Reviews, Vol 5, no 5, pp 13141-13147 May 2024
International Journal of Research Publication and Reviews
Journal homepage: www.ijrpr.com ISSN 2582-7421
Personal Voice Assistant
1Mr. Mayur Sharma,2Mr. Ankur Kaushik, 3Rohan Gajrotia, 4Mohd. Sultan, 5Mohd. Farman
1&2
Assistant Professor, 3,4,5 UG Student
Department of Computer Science & Engineering, Shri Ram Group of Colleges, Muzaffarnagar, Uttar Pradesh, India
ABSTRACT:
Individuals presently interface with computers in novel ways much obliged to individual associates, conversational interfacing, and chatbots. an individual virtual
collaborator may indeed perform certain fundamental obligations like propelling apps, perusing out news, taking notes, etc. with fair a voice command. Clients can
inquire request to them in the same way they would to a genuine individual. Individual colleagues like Siri, Google Right hand, and Alexa work on text-to-speech
innovation. Python is being utilized to make a voice right hand that will empower clients to total any movement without requiring a console. This Venture analyzes
the clever conduct of voice associates and how they might be connected to scholastic and every day tasks.By making an individual desktop right hand that combines
comfort, colonization, and customized highlights, this extends points to upgrade users' efficiency and effectiveness in their day-to-day computer tasks.
INTRODUCTION :
A Voice collaborator is an advanced collaborator that employments voice acknowledgment , dialect handling calculations, and voice amalgamation to
tune in to particular voice commands and pertinent data or perform particular capacities as asked by the client. In numerous cases clients can inquire their
virtual colleagues questions, control domestic mechanization gadgets and media playback, and oversee other essential errands such as e-mail, to-do
records, and calendars - all with verbal commands. In later a long time, noticeable virtual collaborators for coordinate buyer utilize have included Apple's
Siri, Amazon Alexa, Google Collaborator, and Samsung's Bixby. The actualized partner can open up the application (on the off chance that it’s introduced
in the framework), look Google, Wikipedia and YouTube approximately the inquiry, calculate any numerical address, etc. by fair giving the voice
command. We can prepare the information as per the requirement or can include the usefulness, depends upon how we code things.
TERATURE SURVEY :
Conversational AI and virtual partners have their roots in talk affirmation advancement dating back decades. Investigators begun testing with
computerized talk affirmation systems in the 1950s and 1960s. It was, in any case, still in its most punctual stages and far off from exact. A basic whole
of development was not fulfilled until the 1980s, much acknowledged in parcel to the change of secured up Markov models (HMMs) and other machine
learning strategies. Voice accomplice has a long history with a few waves of major progressions. The to start with talk affirmation system, named Audrey,
was made by Chime Investigate offices in 1952. Audrey was or perhaps regimental and compelled advancement sharp, understanding as it were ten digits
- talked by particular people (Entering, 2012). Around 10 a long time a short time later, IBM made and outlined their Shoebox Machine. The contraption
recognized and responded to 16 particular talked words, checking all ten digits “0” to “9” as well as calculating commands such as “plus” or “minus”
(IBM, 2018). Shoebox Machine recognized and responded to 16 talked words, checking the ten digits from “0” through “9”, as it were in English by a
relegated speaker
International Journal of Research Publication and Reviews, Vol 5, no 5, pp 13141-13147 May 2024 13142
HISTORY OF VOICE ASSISTANT
METHODOLOGY :
This voice partner works on our voice commands. We donate a command at that point take the command and change over the voice to the content of our
command execute the command and provide output. The calculations utilized in the foundation will interpret the user's talked instruction into content
when the collaborator listens it. Also, the partner will carry out the fitting activity by the catchphrases display in the content (based on the user's arrange).
Much obliged to the capacities found in numerous libraries, this is achievable. We utilizes an assortment of Python installer bundles, such as Discourse
acknowledgment, gets, Pippin, etc., to make virtual associates. In discourse acknowledgment, sound is changed into content. This is as often as possible
utilized by voice colleagues like Siri, Alexa, and others. Python offers a Discourse Acknowledgment API that empowers us to decipher voice or sound
commands into content for afterward handling.
TECHNOLOGY USED :
Python was chosen as the programming dialect for this extend since of its flexibility and openness to numerous libraries. We utilized Python programming
dialect supporting Microsoft Visual Studio Code (IDE) to make the Virtual Right hand. Python has a discourse acknowledgment bundle that incorporates
certain built-in capacities. We will to begin with characterize a work that will turn the content into discourse. We utilize the pyttsx3 library for that. We
will set the library instance's starting esteem to a variable. Moreover, we utilize the say() strategy and supply the content as a contention; The result is a
International Journal of Research Publication and Reviews, Vol 5, no 5, pp 13141-13147 May 2024 13143
vocal reaction. Another work has been characterized to recognize the user's voice command. We have chosen Google's Discourse Acknowledgment
Motor for this venture since it will interpret each analog voice command into a computerized content arrange. The Partner will see for the catchphrase
after getting that content as input.
WORKING MODEL :
To make a straightforward virtual collaborator with Python, certain libraries and modules will be required.
- PYTTSX3: Pyttsx3 is a cross-platform text-to-speech library which is free of stages. The fundamental advantage of utilizing this library is to change
over text-to-speech which works offline as well.
- Speech Recognition: The Discourse Acknowledgment include in Python empowers simple acknowledgment of discourse from an amplifier. It, too,
permits the translation of sound records and sparing sound information into a record.
- Web browser: The web browser module in Python gives a high-level interface for controlling a web browser. It permits clients to show web-based
reports helpfully and associated with them programmatically.
- OS: The OS module in Python gives you get to meddled with your OS such as opening a few desktop apps or playing music or motion pictures from
the PC.
- WolframAlpha: WolframAlpha is a reply motor created by Wolfram Inquire about. Furthermore, it is advertised as an online benefit that answers
genuine inquiries by computing answers from remotely sourced information.
- Ecapture: A spare title input as a string will spare the captured picture with the craved title. Video Capture. Work: Record recordings Number of
Arguments.
-Twilio: Twilio is utilized for making calls and messages.- Excellent Soup: Excellent Soup is a library that makes it simple to rub data from web pages.
- Date time: Date and Time are utilized to appearing Date and Time. This module comes built-in with Python.
Start
Speak
User Command
CMD Say Again
Execute Command
International Journal of Research Publication and Reviews, Vol 5, no 5, pp 13141-13147 May 2024 13144
Start Debugging and Speak
End
Flowchart for a working model
EXECUTION :
These components play critical parts in building an individual desktop right hand that can get it voice input, make API calls, and produce talked reactions
utilizing text-to-speech change.
- Speech Recognition: Since we’re building an Application of voice collaborator, one of the most imperative things in this is that your partner recognizes
your voice.
- Python Backend: The individual desktop assistant's whole program is actualized utilizing Python on the backend. Python gives broad capabilities for
creating web applications, overseeing databases, and producing APIs.
- API Calls: An API, or Application Programming Interface, is a server that you can utilize to recover and send information to utilizing code. APIs give
fundamental devices in the world of manufactured insights (AI) and information science, empowering get to tremendous sums of information and capable
computing capabilities.
- Text-to-Speech: Text-to-Speech (TTS) innovation changes over composed content into talked words. Empowers simple integration of Google content
acknowledgment innovations into designer applications. Send content and get synthesized sound yield from the Cloud Text-to-Speech API benefit. By
coordination these components into your desktop collaborator, you can empower voice input, associated with outside administrations through API calls,
and produce talked reactions utilizing text-to-speech transformation
SCOPE :
Our project is to create based on a voice command that can help to perform various tasks on their personal computers. This assistant will operate through
voice commands, minimizing the need for physical hardware. It will be able to open applications and websites, play media, tell the time and date, and
even greet users based on the current Tim.
Result :
Wish me :- In this program...as soon as the virtual assistant activates it greets the user.
International Journal of Research Publication and Reviews, Vol 5, no 5, pp 13141-13147 May 2024 13145
WISH ME PROGRAME
Opening Website :- Desktop’s virtual assistant can open any websites just by saying its name. As soon as it listens it will open required website and
present on screen.
. Opening Facebook
Opening YouTube
International Journal of Research Publication and Reviews, Vol 5, no 5, pp 13141-13147 May 2024 13146
Opening camera
CONCLUSION :
This report gives a careful portrayal of the plan and execution of a voice-enabled individual computer right hand in Python. In comparison to prior times,
this voice-activated individual right hand will be more productive at sparing time in today's lifestyle.The key characteristic of this Individual Collaborator
is its straightforwardness of utilization. The Collaborator viably completes a few obligations that clients relegate it. The individual voice partner will be
basic to utilize and will dispose of the requirement for manual labor to do an assortment of exercises. The display voice right-hand system's capability is
limited to working online (requires a web association to finish errands) and on desktop computers. Since the voice collaborator framework is secluded,
unused highlights can be included without influencing existing framework usefulness.
REFERENCE :
1. Mr. K. Vikram Reddy, Assistant Professor, Department of Computer Science and Engineering, Matrusri Engineering College, Saidabad,
Hyderabad, Telangana, S. Lahari, A. Naveen, G. Sarath Chandra, B.E Scholars, Department of Computer Science and Engineering, Matrusri
Engineering College, Saidabad, Hyderabad, Telangana.
2. CORTANA-INTELLIGENT PERSONAL DIGITAL ASSISTANT: A REVIEW,Heena Reyaz Bhat, Tanveer Ahmad Lone, Student,
Computer Science Engineering, SSM College of Engineering & Technology, Kashmir, India,( Volume 8, No. 7, July – August 2017, ISSN
No. 0976-5697), DOI: http://dx.doi.org/10.26483/ijarcs.v8i7.4225.
International Journal of Research Publication and Reviews, Vol 5, no 5, pp 13141-13147 May 2024 13147
3. Rabin Joshi, Supriyo Kar, Abenezer Wondimu Bamud and Mahesh T R, (2023). Personal A.I. Desktop Assistant, 2(2), 54-60. ISSN: 2583
5343.( Published by: PRISMA Publications), DOI: 10.59461/ijitra.v2i2.58.
4. Rohit K. Roy1, Megha D. Deshmukh2, Rajat K. Biswas3, Milind V. Lande4 1,2,3Students , 4 I/C HOD, Government Polytechnic, Gadchiroli,
Maharashtra, India, 25 January, 2023. https://doi.org/10.46335/IJIES.2023.8.1.3 Vol. 8, No. 1, 2023, PP. 12-16(-ISSN: 2456-3463).
5. Anusha S, N Vignesh Karthik, Sampada K S. ”Comparative Study on Voice Based Chat Bots”. 2018.
6. Easwara Moorthy, A., Vu, K.-P.L.: Privacy Concerns for Use of Voice Activated Personal Assistant in the Public Space. International Journal
of Human-Computer Interaction 31, 307–335 (2015).
7. Bhargavi Mokashi, Vandana S. Bhat, Jagadeesh D. Pujari, S. Roopashree, T. R. Mahesh, D. Stalin Alex, "Efficient Hybrid Blind Watermarking
in DWT-DCT-SVD with Dual Biometric Features for Images", Contrast Media & Molecular Imaging, vol. 2022, Article ID 2918126, 14
pages, 2022. https://doi.org/10.1155/2022/2918126.
8. K.A. Sharada, KSN Sushma, V. Muthukumaran, T.R. Mahesh, B. Swapna, S. Roopashree, High ECG diagnosis rate using novel machine
learning techniques with Distributed Arithmetic (DA) based gated recurrent units, Microprocessors and Microsystems, Volume
98,2023,104796,ISSN 0141-9331, https://doi.org/10.1016/j.micpro.2023.104796.