Report
Report
Department of Technology
Jodhpur Institute of Engineering & Technology, Jodhpur
2024-25
CANDIDATE’S DECLARATION
I hereby declare that the work, which is being presented in this Seminar, entitled “Development
and Deployment of a Smart Chatbot (AVA) using Machine Learning and Modern Cloud
Technologies” in partial fulfillment for the award of Degree of “Bachelor of Technology” in
Dept. of CSE with specialization in AIML and submitted to the Department of Technology,
Jodhpur Institute of Engineering and Technology, is are cord of my own work carried under the
guidance of Sourav Soni, Faculty, REGEX Software Services, Jaipur.
I have not submitted the matter presented in this report anywhere for the award of another
degree.
ANUJ NEGI
B.Tech (V Semester)
CSE (AIML)
AIML/23/039
i
CERTIFICATE
ii
ACKNOWLEDGEMENT
I express my sincerest and utmost gratitude to my Training supervisor Mr. Saurab Soni, Faculty,
REGEX SOFTWARE SERVICES, Jaipur, for his valuable guidance, constant supervision and
continuous encouragement during all the stages of this work . His vast knowledge base,
innovative vision and detailed guidance helped me get through challenges and successfully
complete this training/project.
I wish to express my deep gratitude to Prof. (Dr.) Pratibha Peshwa Swami, Head (Technology)
and Prof. Seminar in - charged Designation, for their valuable guidance and immense assistance
in preparation of this report.
Lastly I would like to express my sincere gratefulness to GOD, my parents and my dear friends
and all those people who have helped me directly or indirectly for the completion of this work.
iii
ABSTRACT
This project presents the design and development of Chatbot AVA, an AI-powered virtual
assistant capable of performing intelligent conversations and assisting users with daily tasks. The
chatbot is developed using Machine Learning and Natural Language Processing (NLP)
techniques to understand user inputs, generate context-aware responses, and deliver a natural
conversational experience.
The front-end interface is built using Streamlit, providing a clean, interactive UI. The backend
integrates Google Generative AI for enhanced NLP capabilities and speech recognition/text-to-
speech for voice interaction. Deployment is done using Docker containers, ensuring portability
and scalability, and hosted on AWS EC2 for 24/7 accessibility.
The project bridges the gap between academic AI concepts and real-world application by
combining multiple modern technologies into a single intelligent assistant.
iv
List of Figures
v
List of Tables
vi
CONTENTS Page No.
Candidate’s Declaration i
Certificate ii
Acknowledgement iii
Abstract iv
List of Figures v
List of Tables vi
CHAPTER 1: INTRODUCTION 1-3
1.1 Background of The Company 1
1.2 Organizational Structure 1
1.3 Nature of Work/Training Assigned 2
1.4 Objectives of the Training 2
1.5 Scope of the Project 2-3
CHAPTER 2 : COMPANY INFRASTRUCTURE 4-6
2.1 Departmental Structure 4
2.2 Network Structure 4
2.3 Infrastructure Facilities 5
2.4 Software Stack Availability 6
2.5 Learning and Work Culture 6
CHAPTER 3: TRAINING ATTENDED 7 – 10
3.1Introduction 7
3.2 Exposure Level 7-8
3.2.1 Technical Exposure 8
3.2.2 Deployment Exposure 8
3.2.3 Project Development Workflow 8
3.3 Tasks Assigned 9
3.3.1 Phase 1: Setup and Learning 9
3.3.2 Phase 2: Chatbot Development 9
3.3.3 Phase 3: Feature Integration 9
3.3.4 Phase 4: Testing and Deployment 9
3.4 Learning Curve 10
3.5 Team Collaboration and Mentorship 10
3.6 Summary of Outcomes 10
CHAPTER 4 : SYSTEMS/PROJECT DEVELOPMENT 11 – 16
4.1 Project Description 11
4.1.1 Modules Implemented 12
4.1.2 Architecture Overview 12-13
4.2 Role & Responsibilities 14
4.2.1 Development Responsibilities 15
4.2.2 Integration Responsibilities 15
4.2.3 DevOps Responsibilities 15
4.2.4 Team & Communication 16
CHAPTER 5 : CONCLUSION 17 – 25
5.1 Lessons Learned Skills Developed 17
5.2 Knowledge Gained 20
5.3 Career Impact and Future Scope 21
5.4 Comparative Analysis with Industry Projects 22
5.5 Technical Challenges Faced & Solutions 23
5.6 Code Documentation and Versioning 24
5.7 Project Evaluation & Feedback 25
5.8 Final Reflection 25
REFERENCES 26
CHAPTER 1
INTRODUCTION
With a strong emphasis on practical, hands-on learning, Regex has partnered with various
educational institutions to help bridge the gap between academic learning and real-world
industry requirements. Their training modules are industry-oriented, often resulting in students
developing complete software projects by the end of their internships.
Training Department – Handles industrial and technical training programs for students
and professionals.
Development Team – Works on client-based projects and research & development
using modern Software stacks.
Admin & HR Team – Manages student onboarding, project assignments, and
evaluation.
Mentors/Trainers – Professionals with expertise in Python, AI/ML, DevOps, and
Full-Stack technologies.
Project Coordinators – Bridge communication between mentors and trainees, ensuring
smooth workflow and timely project completion.
The company follows a collaborative work environment where interns interact directly with
industry experts and receive continuous mentoring.
1
1.3 Nature of Work/Training Assigned
During the training, students are given hands-on experience in live projects or simulated
industry-grade applications. For this project, the nature of work involved:
This practical exposure helped trainees understand real-world project development cycles,
including design, coding, debugging, and deployment.
To gain hands-on experience with DevOps tools like Docker and cloud platforms like
AWS.
2
A modular platform that can be extended to handle tasks like reminders, weather updates,
news summaries, and more.
3
CHAPTER -2
COMPANY INFRASTRUCTURE
Project Management & Support: Ensures smooth flow of tasks, team coordination,
performance tracking, and reporting. It includes project managers, guides, and reviewers.
This structure ensures that trainees are not just trained, but also experience the lifecycle of a real
software project in an industry-like setting.
4
Key highlights:
LAN & High-Speed Wi-Fi: The facility is equipped with fiber-optic internet
connections with backup support, ensuring uninterrupted access to cloud servers,
repositories, and APIs.
Cloud Access: Trainees are given access to tools like AWS EC2, GitHub, and
DockerHub through secure credentials.
Local and Remote Servers: Projects are developed locally and then tested on remote
EC2 instances via SSH for deployment.
Firewall and Access Control: To ensure cybersecurity, firewalls are enabled and access
rights are provided based on trainee roles (read-only, contributor, admin).
Code Collaboration Tools: Platforms like GitHub, Trello, and Slack are used to
promote team communication and version control.
Regex Software Services provides a professional and modern infrastructure conducive to both
learning and development.
Some of the key facilities include:
Fully Equipped Labs: Systems with the latest software installations (Python, Docker,
Visual Studio Code, etc.) and microphone/speaker support for AI voice projects.
High-Speed Internet: Dedicated leased line with failover ensures uninterrupted
connectivity during API-intensive training and deployments.
AI Training Rooms: Designed for ML/AI experimentation and integrated with cloud
platforms.
Deployment Room: Equipped with local servers and Docker environments for testing
containerized apps.
Seminar and Review Halls: Used for project presentations, code walkthroughs, and live
debugging sessions with trainers.
5
These infrastructure capabilities ensure students experience a simulation of working in a top-tier
tech environment.
APIs & AI Tools: Google Generative AI, Spotify API, OpenAI, HuggingFace
This software stack enables the implementation of full-stack AI projects, including frontend,
backend, APIs, and cloud-based deployment.
Daily Standups and code reviews mimic the agile software development model.
Mentor Sessions are conducted to address doubts and explain core logic.
Peer Learning is encouraged through team-based projects and shared GitHub repos.
6
CHAPTER –3
TRAINING ATTENDED
3.1 Introduction
As a part of the B.Tech curriculum, industrial training plays a vital role in providing students
with hands-on experience and an opportunity to work with current industry tools, environments,
and expectations. I undertook my industrial training at Regex Software Services, Jaipur, for a
duration of 45–60 days, where I worked on a live project titled “Chatbot AVA using Machine
Learning.”
The training was designed to bridge the gap between academic knowledge and real-world
technical applications. It involved a comprehensive curriculum that combined technical theory,
real-time project work, and guided mentorship. The experience helped me strengthen my
understanding of software development life cycles, machine learning workflows, deployment
using cloud platforms, and the integration of third-party APIs for building smart systems.
This period also enabled me to gain insights into professional coding practices, teamwork,
documentation standards, time management, and agile development processes—all under a
professional mentor-supervised environment.
7
Frontend Interface: Developed using Streamlit, enabling fast UI rendering and easy
deployment of machine learning models in web apps.
Spotify API: Used to recommend songs dynamically based on detected user mood.
Location Integration: Implemented interactive maps using folium and geopy for real-
time user location display.
AWS EC2: Gained exposure to launching cloud servers, deploying the application
remotely, and managing public access to the chatbot system.
8
3.3 Tasks Assigned
The training involved structured tasks designed to gradually build up the project. Each phase
included planning, coding, testing, and deployment.
9
3.4 Learning Curve
The training helped me transition from theoretical understanding to practical application. Some
key learnings include:
10
CHAPTER –4
SYSTEMS / PROJECT DEVELOPMENT
This chapter explains the entire development process of the project titled “Chatbot AVA using
Machine Learning”. It covers the system's purpose, architecture, components, technologies
used, and the roles undertaken during its development.
APIs for external functionalities like Spotify (for music) and Folium (for maps).
Voice Support: Allows both speaking to and hearing from the bot using
speechrecognition and pyttsx3.
Cloud Deployment: Fully containerized using Docker and deployed to AWS EC2 for
online access.
11
Deployment Module Google Generative
UI Interface
Music Recommendation
Map Module
Voice Interaction
Built using Streamlit, which renders a simple and interactive web interface.
Uses the speech recognition library to capture user speech and convert it into text.
12
Recommends and plays songs based on user mood or request.
Uses folium and geopy to detect and display the user’s location.
f) Deployment Module
13
4.1.2 Architecture Overview
The system follows a modular, layered architecture:
1. Input Layer:
o Accepts user input (text or voice).
2. Processing Layer:
o Processes the input using NLP and maps it to a corresponding function.
5. Output Layer:
o Displays the final output on the Streamlit UI.
14
Each module is loosely coupled and can be extended or replaced without affecting the entire
system.
Wrote modular, reusable code using Python best practices and documentation.
Linked various modules through proper API routing and error handling.
Deployed the application on AWS EC2 and managed public access settings.
15
4.2.4 Team & Communication
Participated in regular mentor meetings and peer reviews.
16
CHAPTER – 5
CONCLUSION
3. API Integration
o Consuming RESTful APIs like Google Generative AI, Spotify, and Open
Weather.
17
Fig. 5.1 : API Integration
4. Voice Processing
o Managing image builds and running the app on any OS without setup issues.
18
7. Cloud Deployment on AWS
o Hosting the chatbot on a live server using secure credentials and port
management.
o Pushing code commits, collaborating on a shared repo, and using GitHub Issues
for task tracking.
o Documenting code and features to help others understand the project flow.
19
4. Professionalism
From initial planning, writing features, testing modules, to final deployment, I learned the
importance of maintaining a structured development process.
Understanding that project development is not only about writing code but also about
writing maintainable, documented, and testable code.
🔹 AI + API Collaboration
This project taught me how modern chatbots are much more than simple rule-based bots.
🔹 Modular Architecture
Each function in the AVA chatbot — speech recognition, NLP, map display, or music
recommendation — was developed independently, making the system scalable and easy
to debug.
20
The knowledge of structuring applications into modules is something I can carry into
larger future projects.
I now know how to use Docker and AWS EC2 — two of the most demanded DevOps
tools in today’s software industry — to deploy my own applications and share them with
the world.
Unlike classroom projects, training timelines at Regex were strict and deadline-
oriented.
This taught me how to prioritize, optimize my code, and deliver features that are stable
and functional within time constraints.
AI Chatbot Developer
DevOps Engineer
21
🔹 Future Enhancements to the AVA Chatbot
This chatbot is not a finished product — it is a scalable framework that can grow into a full AI
assistant platform.
During the training, we reviewed how actual AI-driven chatbots (like Google Assistant, Siri, or
Alexa) are built in the industry. While our project was academic in nature, many similarities
emerged:
22
This comparison helped me understand what additional features are required to make AVA a
market-ready product and the kind of infrastructure that supports large-scale voice assistants.
Issue: Google Generative AI API had delayed responses during high load.
23
Solution: Implemented asynchronous calls in Streamlit using background threading and
spinners to improve UX.
Solution: Added a fallback using gTTS (Google Text-to-Speech) with MP3 playback.
Issue: Spotify’s OAuth2 setup required proper redirect URI even for testing.
Solution: Used a developer token temporarily and documented the OAuth setup for future
deployment.
Solution: Opened required ports in EC2 security groups and exposed Streamlit’s port
properly in the Dockerfile.
24
Branch-based development for major modules (voice, maps, music).
Final build was zipped and pushed to GitHub for long-term access and sharing.
Mentors appreciated the effective use of Streamlit and Docker, technologies not
commonly mastered at the student level.
Suggestions were made to add emotion detection and user profiling in future versions.
The presentation was well received, with an emphasis on how multi-API integration was
handled effectively.
Handle voice, language, maps, music, and cloud — all in one project.
Gain the confidence to now develop, test, and deploy complete AI applications.
25
REFERENCES
26