0% found this document useful (0 votes)

7 views65 pages

Sasi

The document is a project report titled 'Real-Time Customer Behavior and Satisfaction Insight System in Shopping Malls' submitted for a Bachelor of Technology degree in CSE (Artificial Intelligence and Machine Learning). It outlines a system that utilizes AI and data analytics to analyze shopper behavior, optimize store layouts, and enhance personalized marketing strategies. The project aims to improve customer satisfaction and operational efficiency in shopping malls through advanced data-driven insights and real-time analytics.

Uploaded by

chandinigedela321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views65 pages

Sasi

Uploaded by

chandinigedela321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 65

Real-Time Customer Behavior and Satisfaction Insight System

in Shopping Malls

A PROJECT REPORT
Submitted in partial fulfillment of the requirements for the award of the degree
Of
BACHELOR OF TECHNOLOGY
in
CSE (Artificial Intelligence and Machine Learning)
Submitted by
YAMALA SASI REKHA 21A51A4257
GEDELA CHANDINI 21A51A4239
BODIGI SAI KARTHIK 21A51A4243
ANDHAVARAPU CHARAN 21A51A4223

Under the Guidance of

Sri. KONNI SRINIVASA RAO M.Tech
Assistant Professor

DEPARTMENT OF CSE (Artificial Intelligence and Machine Learning)

ADITYA INSTITUTE OF TECHNOLOGY AND MANAGEMENT

(An Autonomous Institution)
Approved by AICTE, Permanently Affiliated to JNTUGV – Vizianagaram
Accredited by NBA & NAAC with A+, Recognized Under 2(f) 12(B) of UGC
K. Kotturu, Tekkali, Srikakulam - 532201, Andhra Pradesh.
April – 2025
i
DECLARATION

“I hereby declare that the project entitled “Real-Time Customer Behavior and
Satisfaction Insight System in Shopping Malls” Submitted for the award of the
degree of Bachelor of Technology in CSE(Artificial Intelligence and Machine
Learning) is my own work and that, to the best of my knowledge and belief, it
contains no material previously published or written by another person nor
material which has been accepted for the completion for the award of any other
degree, associate ship, fellowship or any other similar titles.

PLACE: Tekkali
DATE:
YAMALA SASI REKHA 21A51A4257
GEDELA CHANDINI 21A51A4239
BODIGI SAI KARTHIK 21A51A4243
ANDHAVARAPU CHARAN 21A51A4223

ii
ADITYA INSTITUTE OF TECHNOLOGY AND MANAGEMENT
(An Autonomous Institution)

Department of CSE (Artificial Intelligence and Machine Learning)

CERTIFICATE

This is to certify that the project report entitled “Real-Time Customer Behavior and
Satisfaction Insight System in Shopping Malls” being submitted by YAMALA SASI
REKHA(21A51A4257),GEDELA CHANDINI (21A51A4239), BODIGI SAI KARTHIK
(21A51A4243), ANDHAVARAPU CHARAN( 21A51A4223)Submitted in partial fulfillment
for the award of Degree of Bachelor of Technology in CSE (Artificial Intelligence and
Machine Learning) during the year 2024-2025 to the JNTUGV, Vizianagaram is a record of
bonafied work carried out by them under my guidance and supervisio

Signature of the Head of the Department Signature of the Project Supervisor

Dr. M. V. B Chandrasekhar, M.Tech, Ph.D K.Srinivasa Rao M.Tech

Professor & Head of the Department Assistant Professor
Department of CSM Department of CSM

iii
ACKNOWLEDGEMENT

We wish to thank Mr. K. Srinivasa Rao for his kind support and his valuable suggestions
and encouragement helped us a lot in carrying out this project work as well as in bringing
this project to this form

We take this opportunity to express our sincere gratitude to our Director Prof. V. V.
Nageswara Rao for his encouragement in all respect.

We take the privilege to thank our principal Dr. A. S. Srinivasa Rao for his encouragement
and support.

We are also very much thankful to Dr. M. V. B. Chandrasekhar, Head of CSE (Artificial
Intelligence and Machine Learning) for his help and valuable support in completing the
project

We are also thankful to all staff members in the Department of of CSE (Artificial Intelligence
and Machine Learning), for their feedback in the reviews and kind help throughout our
project

Last but not the least, we thank all our classmates for their encouragement and help in making
this project a success

It is their help and support, due to which we became able to complete the design and technical
report.

YAMALA SASI REKHA 21A51A4257

GEDELA CHANDINI 21A51A4239
BODIGI SAI KARTHIK 21A51A4243
ANDHAVARAPU CHARAN 21A51A4223

iv
Program Outcomes (PO)

1. ENGINEERING KNOWLEDGE: Apply the knowledge of mathematics, science,

engineering fundamentals, and an engineering specialization to the solution of
complex engineering problems.
2. PROBLEM ANALYSIS: Identify, formulate, review research literature, and
analyze complex engineering problems reaching substantiated conclusions using
first principles of mathematics, natural sciences, and engineering sciences.
3. DESIGN/DEVELOPMENT OF SOLUTIONS: Design solutions for complex
engineering problems and design system components or processes that meet the
specified needs with appropriate consideration for the public health and safety,
and the cultural, societal, and environmental considerations.
4. CONDUCT INVESTIGATIONS OF COMPLEX PROBLEMS: Use
research-based knowledge and research methods including design of
experiments, analysis and interpretation of data, and synthesis of the information
to provide valid conclusions.
5. MODERN TOOL USAGE: Create, select, and apply appropriate techniques,
resources, and modern engineering and IT tools including prediction and
modeling to complex engineering activities with an understanding of the
limitations.
6. THE ENGINEER AND SOCIETY: Apply reasoning informed by the
contextual knowledge to assess societal, health, safety, legal and cultural issues
and the consequent responsibilities relevantto the professional engineering
practice.
7. ENVIRONMENT AND SUSTAINABILITY: Understand the impact of the
professional engineering solutions in societal and environmental contexts, and
demonstrate the knowledge of, and need for sustainable development.
8. ETHICS: Apply ethical principles and commit to professional ethics and
responsibilities and norms of the engineering practice.

v
9. INDIVIDUAL AND TEAM WORK: Function effectively as an individual,
and as a member or leader in diverse teams, and in multidisciplinary settings.
10. COMMUNICATION: Communicate effectively on complex engineering
activities with the engineering community and with society at large, such as,
being able to comprehend and write effective reports and design documentation,
make effective presentations, and give and receive clear instructions.
11. PROJECT MANAGEMENT AND FINANCE: Demonstrate knowledge and
understanding of the engineering and management principles and apply these to
one’s own work, as a member and leader in a team, to manage projects and in
multidisciplinary environments.
12. LIFE-LONG LEARNING: Recognize the need for, and have the preparation and
ability to engage in independent and life-long learning in the broadest context of
technological change.

Program Specific Outcomes

 PSO1: Apply the fundamental knowledge for problem analysis and conduct
investigations in CSE (AIML) for sustainable development.
 PSO2: Design and development of solutions by using modern software for the
purpose of execution of the projects in specialized areas. 
 PSO3: Inculcate effective communication and ethics for lifelong learning with
social awareness.
PO-PSO Mapping

PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2 PSO3
3 3 3 2 3 3 2 3 3 3 2 2 3 2 2

vi
Project Details

Batch No : 12
Project Title : Real-Time Customer Behavior and Satisfaction Insight
System in shopping malls.
Methodology : Deep Learning
Application : Personalized Marketing
Description of the
Project : The “Real Time Customer Behaviour and Satisfaction Insight
Systems in Shopping Malls” project leverages AI and data
analytics to track shopper movements, purchase patterns, and
engagement, enabling retailers to gain a deeper understanding
of customer behavior. By analyzing foot traffic, dwell times,
and product interactions, the system helps optimize store
layouts for better navigation and improved sales. The project
enhances personalized marketing by identifying customer
preferences and delivering targeted promotions, leading to
increased engagement and conversion rates. Additionally, it
supports operational efficiency by improving inventory
management, reducing checkout bottlenecks, and streamlining
staff allocation.
Batch Details :
GEDELA CHANDINI 21A51A4239
YAMALA SASI REKHA 21A51A4257
BODIGI SAI KARTHIK 21A51A4243
ANDHAVARAPU CHARAN 21A51A4223

vii
Objectives : The project “Real Time Customer Behaviour and
Satisfaction Insight Systems in Shopping Malls” aims to
analyze shopper movements and purchase patterns using AI
and data analytics to optimize store layouts, enhance customer
engagement, and improve inventory and queue management.
The project aims to provide predictive insights for boosting
sales, security, and the overall shopping experience.
Additionally, it leverages real-time analytics and machine
learning to enable data-driven decision-making for efficient
store operations and targeted marketing strategies.

viii
ABSTRACT

In this study, we analyzed existing customer behavior analysis systems in shopping malls, such as
Pose Network and MoveNet, which primarily rely on movement tracking and pose estimation.
However, these methods fall short in capturing deeper insights into customer interests, such as
gaze direction, dwell time, and product interactions. To overcome these limitations, we propose an
advanced AI-driven solution integrating YOLO for pose estimation with deep learning
algorithms to classify customer interests more accurately. Our system leverages computer vision
and machine learning to generate real-time heatmaps, sentiment analysis, and predictive
insights, assisting mall operators in optimizing store layouts, enhancing personalized marketing,
and improving customer satisfaction. Additionally, our approach enables real-time crowd density
monitoring, improving queue management and staff allocation for a seamless shopping
experience.

The system differentiates between casual browsers and serious buyers, allowing businesses to
tailor marketing strategies and optimize sales efforts. Furthermore, inventory management
benefits from demand trend analysis based on customer movement patterns near product displays.
By incorporating predictive analytics, mall operators can forecast shopping trends, adjust
promotional campaigns, and refine engagement strategies. To ensure continuous improvement,
customers can provide direct feedback via a QR code, linking to a form that gathers valuable
insights for further system refinement and data-driven decision-making.

Keywords:
YOLO, CNN, DeepSort, Object Detection, Image Recognition, Feature Extraction, Customer
Behavior Analysis, Deep Learning, Computer Vision, Retail Analytics, Predictive Analytics,
Sentiment Analysis, Machine Learning.

ix
TABLE OF CONTENTS

Contents
Page. No

Candidate’s Declaration ii
Supervisor’s Certificate iii
Acknowledgements iv
PO – PSO mapping v-vi
Project Details vii-viii
Abstract ix
Table of Contents x-xii
LIST OF TABLES xiii
LIST OF FIGURES xiv
LIST OF ABBREVATIONS xv

Chapter 1 Introduction 1-7

1.1 Project Objective
1.2 Problem Statement
1.3 Scope of the Project
1.3.1 Technical scope
1.3.2 User Scope
1.4 Operational Scope
1.4.1 Future Scope & Expansion

Chapter 2 Literature survey 8-13

Chapter 3 Problem Statement
14-18
3.1 Introduction

x
3.2 Challenges Faced by Shopping Malls in
Customer Behavior Analysis

3.2.1 Limited Behavioral Insights from Pose-Based

Systems
3.2.2 Inaccurate Tracking in Crowded and Dynamic
Environments
3.2.3 Delayed and Non-Anonymous Feedback Collection
3.2.4 Difficulty in Quantifying Engagement with Mall
3.2.5 Dependence on Fragmented and Manual Analysis
3.3 Need for an AI-Powered Real-Time Solution

Chapter 4 Data Collection And Procurement 19-25

4.1 Introduction
4.2 Types of Data Required
4.3 Sources of Data Acquisition
4.3.1 Primary Data Collection
4.3.2 Secondary Data Sources
4.3.3 Data Augmentation
4.4 Data Annotation and Preprocessing
4.5 Ethical Considerations in Data Collection
4.6 Challenges in Data Collection and Procurement
4.7 Strategies to Overcome Challenges
Chapter 5 Theoretical Background 26-32
5.1 Introduction
5.2 Computer Vision and Image Processing
5.3 Deep Learning and Object Detection Object
5.4 Object Tracking with Deep SORT
5.5 Spatial Analysis for Behavioral Insights
5.6 Natural Language Processing for Feedback Analysis
5.7 QR Code Technology for Data Collection
5.8 Software Frameworks and Technologies Used
5.9 Challenges in Theoretical Implementation
5.10 Theoretical Integration

xi
Chapter 6 Methodology 33-36
6.1 Model Selection and Loading
6.2 Real-Time Object Detection
6.2.1 Video Frame Processing
6.2.2 Object Detection and positioning
6.3 Real-Time Customer Identification
6.4 Estimating Age and Gender of Customers
6.5 Movement Tracking with Deep SORT
6.6 Analyzing Customer Trends
6.6.1 Time Spent in AOI
Chapter 7 Results And Discussions 37-46
7.1 Object Detection Performance
7.2 Detecting and Analyzing Customers
7.3 Analysis of Product Categories
7.4 Collecting Customer Feedback via QR Codes
7.5 User Interface and Usability
7.6 System Limitations and Improvements
7.7 Conclusion
7.8 Appendix (Source Code)

Chapter 8 Conclusion And Future Enhancements 47-48

References 49
Publications 50

xii
LIST OF TABLES

Table Title Page

2.1 Multi-Person Tracking Under Occlusions for 8
Customer Behavior Analysis.
2.2 Big Data-Driven Customer Behavior Analysis. 9

2.3 The role of shopping orientation in variety-seeking 9

behaviour.
2.4 Influence of Facial Feedback on Automatic Detection 10
of Unexpected Emotional Body Expressions.
2.5 Assessing Customer Satisfaction in Omnichannel 10
Fashion Retail.
2.6 Customer Perception, Integration Behavior, and Loyalty 11
in IoT Enterprises.
11
Daylighting in Shopping Malls: Customer
2.7 Perception, Preferences, and Satisfaction.
12
Design and Implementation of an Online Shopping
2.8 Mall Using Collaborative Filtering.
2.9 Sustaining Shopping Momentum in Retail Malls 12
Through Real- Time Messaging.
Lost in a Mall: Effects of Gender, Mall Familiarity, 13
2.10 and Shopping Values on Wayfinding.

xiii
LIST OF FIGURES

Figure Title Page

Fig 6.1 Real-Time Customer Satisfaction Analysis 33

Flowchart

Fig 6.6 Sentiment Analysis and Feedback 36

Processing Workflow

Fig 7.2 Real-Time Detection and Analysis of 38

Customers

Fig 7.3 Customer Preferences on sarees 39

Fig 7.4 Customer Feedback Form Interface 40

Fig 7.5 Dashboard Overview 41

xiv
LIST OF ABBREVIATIONS

YOLO You Only Look Once

CNN Convolutional Neural Network

DeepSORT Deep Simple Online and Realtime Tracking

NLP Natural Language Processing

QR Quick Response (Code)

PO Program Outcome

PSO Program Specific Outcome

SEM Structural Equation Modeling

EEG Electroencephalogram

AR Augmented Reality

GPU Graphics Processing Unit

MVC Model-View-Controller

UTAUT Unified Theory of Acceptance and Use of

Technology

GDPR General Data Protection Regulation

xv
CHAPTER 1

INTRODUCTION
Understanding customer behavior in shopping malls is crucial for enhancing retail experiences and
optimizing business strategies. Traditional methods of customer analysis, such as surveys and
manual observation, are time-consuming and often lack accuracy. In recent years, computer vision-
based approaches, including pose estimation models like Pose Network and Move Net, have been
explored for tracking customer movements and engagement. However, these systems have
limitations in accurately detecting and interpreting nuanced customer behaviors, such as intent and
interest in specific products or areas.

To overcome these limitations, we propose an advanced approach that integrates YOLO (You Only
Look Once), a cutting-edge object detection framework, with deep learning techniques for precise
pose estimation and behavior classification. Unlike conventional pose estimation models, our
system not only detects customer movements but also classifies their interests by analyzing
movement patterns, gaze direction, and dwell time.

By leveraging machine learning and real-time analytics, our solution provides actionable insights
into customer preferences, enabling mall operators to optimize store layouts, improve product
placements, and personalize marketing strategies. The proposed system enhances decision-making
by offering a more accurate and adaptive method for customer behavior analysis in retail
environments.

1.1 Project Objective

Customer behavior analysis plays a crucial role in modern retail environments, helping businesses
optimize store layouts, improve marketing strategies, and enhance customer experiences.
Traditional methods such as surveys and manual observations are often inefficient and fail to
provide real-time insights. Recent advancements in computer vision and machine learning have
enabled automated solutions for tracking and analyzing customer behavior in shopping malls.

Pose estimation struggles with detecting intent and interaction. Our approach integrates YOLO and deep
learning for better accuracy.

1
1.2 Problem Statement

Despite the growing adoption of AI-driven analytics in retail, existing pose estimation models have
the following challenges:

Limited Accuracy: Current systems struggle to distinguish between browsing, product selection,
and purchasing intent.

Lack of Context Awareness: Existing approaches do not consider behavioral cues such as gaze
direction, hesitation, or dwell time near specific products.

Inefficient Real-time Processing: Most traditional models require extensive post-processing,

making them unsuitable for real-time applications.

These limitations reduce the effectiveness of customer behavior analysis in shopping malls,
leading to missed opportunities for personalized marketing, customer engagement, and optimized
store layouts.

1.3 Scope of the Project

The "Real-Time Customer Behaviour and Satisfaction Insight System in Shopping Malls" is
designed to provide mall operators with a sophisticated tool for monitoring and analyzing customer
behavior and satisfaction in real time, leveraging advanced artificial intelligence (AI) and data
collection technologies. The project integrates the YOLO (You Only Look Once) object detection
framework, Deep SORT (Simple Online and Realtime Tracking) algorithm, Natural Language
Processing (NLP), and QR code-based feedback mechanisms to deliver actionable insights. This
system bridges the gap between raw observational data and strategic decision-making to enhance
customer experiences and optimize operations in shopping malls. It targets multiple dimensions,
including technical capabilities, user needs, and operational contexts. The project ensures a
scalable and adaptable solution for modern retail environments.

1.3.1 Technical Scope

The technical foundation of the system is built on a combination of cutting-edge computer vision,
multi-object tracking, and text analysis technologies, designed to operate seamlessly in real-time

2
retail settings. The key technical components include:

 Real-Time Object Detection and Tracking:

The system employs YOLO, a state-of-the-art deep learning model, to detect customers
within live video feeds captured by strategically positioned CCTV cameras across the mall.
YOLO’s single-pass processing ensures high-speed detection, identifying individuals with
bounding boxes and confidence scores even in busy environments. This is complemented
by Deep SORT, which enhances tracking accuracy by associating detections across frames,
using Kalman filters for motion prediction and appearance-based re-identification to handle
occlusions and identity switches. For example, the system can track a customer moving
from a clothing store to a food court, maintaining continuity despite temporary obstructions
like other shoppers.

 BehavioralAnalysis:
Beyond mere detection, the system analyzes customer movement patterns, such as walking
paths, dwell times, and interactions with products or digital touchpoints (e.g., kiosks or
mobile apps). It calculates metrics like the average time spent in specific zones (e.g., 15
minutes in the saree section) or the frequency of interactions with promotional displays,
providing quantitative insights into engagement levels. Heatmaps and flow analysis plots
are generated to visualize high-traffic areas and popular routes, aiding in store layout
optimization.

 Feedback Collection and Sentiment Analysis:

QR codes placed at key locations (e.g., exits, service desks) enable customers to submit
anonymous feedback via mobile devices, capturing preferences, complaints, and
satisfaction levels in real time. The collected textual data is processed using NLP
techniques, including sentiment analysis (e.g., classifying “great deals” as positive) and
topic modeling (e.g., grouping feedback into themes like “pricing” or “service”). Combining
video tracking and text insights provides a holistic view of customer experiences.

 User Interface and Reporting:

A real-time dashboard, developed using a framework like Python’s Tkinter or a web-based

3
platform (e.g., Flask), displays analytical outputs such as heatmaps, customer flow
diagrams, and sentiment trends.

 Scalability and Flexibility:

The system is designed to handle varying data volumes, from a single-store setup with one
camera to a multi-level mall with dozens of feeds. It uses modular architecture, allowing
components (e.g., YOLO detection, NLP processing) to be updated independently as
technology evolves, ensuring long-term relevance.

1.3.2 User Scope

The project targets a diverse range of stakeholders within the retail ecosystem, each benefiting
from its insights in unique ways:

 Mall Operators and Management:

The primary users, who leverage the system to monitor customer behavior, optimize store
layouts, and enhance operational efficiency. For instance, managers can use dwell time data
to reposition underperforming stores or adjust staffing during peak hours based on crowd
density insights.

 Retail Store Owners:

Individual shop owners within the mall can access tailored reports (e.g., time spent in their
store, product category preferences) to refine inventory, promotions, and customer service.
For example, a clothing store might stock more sarees if feedback and dwell times indicate
high interest.

 Marketing Teams:

Marketing professionals use demographic data (e.g., 59.7% male, 40.3% female customers)
and sentiment analysis to design targeted campaigns, such as promotions for adolescents
(48.4% of visitors) or addressing negative feedback about pricing.

 Customers:

4
Indirect beneficiaries, as improved layouts, faster service, and personalized experiences
enhance their shopping satisfaction. The anonymous QR code feedback mechanism
empowers them to voice opinions without pressure, fostering a participatory role.

 Retail Technology Researchers:

Academics and developers can use the system as a case study for advancing AI-driven retail
analytics, potentially integrating it with emerging tools like augmented reality (AR) or
blockchain.

1.3.3 Operational Scope

The system is engineered to function effectively across various operational contexts within
shopping malls, ensuring versatility and practical applicability:

 Indoor Retail Environments:

The primary focus is on indoor malls, where CCTV cameras capture customer movements
across stores, corridors, and common areas like food courts. It excels in tracking
interactions with physical products (e.g., lifting a shirt) and digital touchpoints (e.g., app
usage at a kiosk), providing insights into both shopping and leisure behaviors.

 Crowded and Dynamic Settings:

Designed to handle high-traffic scenarios, such as weekend sales or holiday seasons, the
system uses Deep SORT to maintain tracking accuracy in dense crowds. It identifies busy
zones (e.g., near escalators) and suggests crowd management strategies, like temporary
signage or staff deployment.

 Variable Lighting and Conditions:

While optimized for well-lit indoor environments, the system includes preprocessing
techniques (e.g., noise reduction, contrast adjustment) to adapt to moderate lighting
variations, such as dimmer evening conditions or shadowed areas. Future iterations could
incorporate infrared cameras for low-light performance.

5
 Real-Time Decision Support:

The system delivers immediate insights, enabling operators to respond quickly—e.g.,

reallocating staff to a crowded checkout within minutes of detection. It also supports
longer-term planning, such as monthlylayout adjustments based on aggregated data trends.

 Integration with Existing Infrastructure:

It leverages pre-installed CCTV networks, minimizing deployment costs, and integrates

with mall management software for seamless data sharing. QR codes are strategically
placed at high-traffic points (e.g., exits, restrooms) to maximize feedback collection
without disrupting shopping flow.

1.3.4 Future Scope & Expansion Possibilities

The system’s modular design and forward-looking approach allow for significant enhancements
and broader applications, ensuring its evolution alongside retail trends:

 Integration with Wearable and Mobile Technologies:

Future versions could connect with customer wearables (e.g., smartwatches) or mobile apps
to track behavior beyond CCTV range, offering personalized notifications like “20% off at
the store you lingered at.” This would enhance engagement while maintaining privacy
through opt-in mechanisms.

 Augmented Reality (AR) Enhancements:

QR codes can evolve into AR triggers, enabling interactive experiences like virtual try-ons
and store maps. They also provide behavioral data, such as time spent on AR content. This
transformation enhances customer engagement and system interactivity.

 AI-Driven Personalization:

Advanced machine learning models could analyze historical data to predict preferences,
recommending promotions tailored to individual or demographic trends (e.g., targeting
adolescents with gaming deals). This shifts the system from reactive to proactive insights.

6
 Blockchain for Data Security:

Incorporating blockchain could secure feedback and tracking data, ensuring transparency
and trust. Customers could verify their anonymized contributions, while operators benefit
from tamper-proof records for audits or compliance.

 Expansion to Other Retail Contexts:

The system could adapt to outdoor markets, airports, or supermarkets, adjusting detection
for open spaces or perishable goods. For example, in an airport, it might track passenger
flow to optimize gate assignments, demonstrating cross-industry potential.

 Multi-Language and Cultural Adaptation:

Adding multi-language NLP support (e.g., Hindi, Telugu) and cultural context analysis
could make the system globally deployable, catering to diverse mall demographics in
regions like India or international hubs.

 Enhanced Environmental Robustness:

Future iterations might integrate additional sensors (e.g., audio for crowd noise levels,
thermal for occupancy) to improve performance in challenging conditions, such as noisy
food courts or extreme weather affecting outdoor sections.

The scope of this project is deliberately broad yet focused, balancing immediate applicability with
long-term potential. It targets the core needs of shopping mall management—real-time behavior
tracking, engagement analysis, and satisfaction insights—while laying a foundation for innovative
expansions that could redefine retail analytics. Balancing technical precision with user focus

7
CHAPTER 2

LITERATURE SURVEY
This survey examines key research on real-time customer behavior analysis in retail, highlighting
methodologies, findings, and gaps. It provides context for the proposed system, integrating YOLO,
Deep SORT, NLP, and QR-based feedback to enhance object detection, tracking, and customer
insights. By addressing limitations in existing studies, the system aims to deliver comprehensive,
real-time behavioral analysis. Additionally, it emphasizes the role of AI-driven analytics in
improving customer engagement. The study underscores the need for seamless integration of
technology to refine shopping experiences.

1. Title Tracking multiple persons under partial and global occlusions:

Application to customers’ behavior analysis

DOI https://doi.org/10.1016/j.patrec.2016.04.011
Author Djamal Merad , Kheir-
Eddine Aziz , Rabah Iguernaissi , Bernard Fertil , Pierre Drap
What they have done? Enhanced multiple-object tracking for behavioral marketing by
reducing identity switches using a re-identification strategy.
How they have done? Used a re-identification strategy with pose classification,
integrated with particle filter-based tracking in a mono-camera
setup.
Which method/approach Applied a particle filter-based tracking approach combined with
they followed? re-identification to segment individuals and classify poses.
What they found? Integrating re-identification with particle filter tracking reduces
identity switches and improves trajectory recovery in crowded
spaces.

What they concluded? Re-identification combined with particle filter tracking enhances
multiple-object tracking, improving customer behavior analysis
in dense environments.

Table 2.1 Multi-Person Tracking Under Occlusions for Customer Behavior Analysis

8
2. Title Performing Customer Behavior Analysis using Big Data
Analytics

DOI https://doi.org/10.1016/j.procs.2016.03.125
Author Anindita A. Khade
What they have done? Used OpenPose for customer tracking in malls, integrated with
MapReduce for decision tree analysis, and visualized insights
using D3.js.
How they have done? Applied OpenPose for movement tracking, used MapReduce for
analysis, and generated interactive visualizations with D3.js.
Which method/approach Employed OpenPose for pose estimation, deep learning models
they followed? for behavior classification, and D3.js for data visualization.
What they found? Combining OpenPose with deep learning improves customer
tracking, enabling precise behavior classification and insightful
visualizations.
What they concluded? Integrating pose estimation, deep learning, and visualization
enhances customer behavior analysis, offering valuable business
insights.

Table 2.2 Big Data-Driven Customer Behavior Analysis

3. Title The role of shopping orientation in variety-seeking behaviour

DOI https://doi.org/10.1016/j.jbusres.2022.02.074
Author Scott D. Murray, Hyun Seung Jin, Brett A.S. Martin
What they have done? Explored how shopping orientation influences variety-seeking
behavior in consumer choices.
How they have done? They conducted three studies examining the link between shopping
enjoyment and variety-seeking in decision-making.

Which method/approach They analyzed psychological traits like information-seeking

they followed? and deal-proneness.
What they found? Shopping enjoyment increases variety-seeking, influenced by
information-seeking and deal-proneness.
What they concluded? The study highlights shopping orientation as a key antecedent
to variety-seeking, offering insights for firms to manage and
cater to consumers' variety-seeking tendencies effectively.

Table 2.3 The role of shopping orientation in variety-seeking behavior

9
4. Title Facial feedback manipulation influences the automatic detection
of unexpected emotional body expressions
DOI https://doi.org/10.1016/j.neuropsychologia.2024.108802
Author Jianyi Liu , Yang Liu , Heng Jiang , Jingjing Zhao , Xiaobin Ding
What they have done? Examined whether facial feedback influences bEMMN by
analyzing EEG responses to happy and sad body expressions.
How they have done? Recorded EEG data from 29 participants using a reverse oddball
paradigm with different pen-holding conditions to manipulate
facial feedback.
Which method/approach Applied a nonparametric cluster permutation test to assess
they followed? bEMMN amplitude differences under varying facial feedback
conditions.
What they found? Both happy and sad body expressions triggered significant
bEMMN, with happy expressions evoking stronger negativity,
modulated by facial feedback.
What they concluded? Facial feedback influences neural responses to body expressions,
supporting the facial feedback hypothesis across various
emotional expressions.

Table 2.4 Influence of Facial Feedback on Automatic Detection of Unexpected Emotional

Body Expressions

5. Title Evaluating customer perspectives on omnichannel shopping

satisfaction in the fashion retail sector

DOI https://doi.org/10.1016/j.heliyon.2024.e36027
Author Bilal Khalid
What they have done? They studied omnichannel experiences' impact on customer
satisfaction in Thai fashion retail using the UTAUT model and
SEM.
How they have done? Surveyed 509 omnichannel shoppers and analyzed data using
SEM with Amos software.
Which method/approach Used a quantitative survey design with simple random sampling and
they followed? SEM analysis.
What they found? Ease of use, enjoyment, promotions, service, and transactions enhance
omnichannel satisfaction.
What they concluded? Better coordination across service channels improves customer
satisfaction in fashion retail.

Table 2.5 Assessing Customer Satisfaction in Omnichannel Fashion Retail

10
6. Title Customer perception, integration behavior, and loyalty of
internet of things enterprises

DOI https://doi.org/10.1016/j.techsoc.2024.102600
Author Gaofei Ren , Yaoyao Chen , Maobao Yang
What they have done? Identified key factors influencing customer loyalty in IoT,
including price perception, service perception, and integration
behavior.
How they have done? Collected data from 211 IoT users via an anonymous survey
and analyzed it using structural equation modeling (SEM).
Which method/approach Used a quantitative research approach with SEM to examine
they followed? relationships between key factors.
What they found? Price perception, service perception, and integration behavior
positively impact customer satisfaction and loyalty.
What they concluded? Improving these factors enhances customer satisfaction and
loyalty, providing strategic insights for IoT companies.

Table 2.6 Customer Perception, Integration Behavior, and Loyalty in IoT Enterprises

7. Title Daylighting in shopping malls: Customer’s perception,

preference, and satisfaction

DOI https://doi.org/10.1016/j.enbuild.2021.111691
Author M.S. Mayhoub , Emad H. Rabboh

What they have done? Explored the emotional and functional impacts of daylighting in
shopping malls from customers' perspectives.
How they have done? Conducted a field survey with 552 customers of Carian
shopping malls to analyze preferences and perceptions.
Which method/approach Used quantitative analysis to assess illumination, sunlight
they followed? presence, and connection to outdoor views.
What they found? Daylighting enhances mood more than energy savings, with
illumination quality being the most valued aspect.
What they concluded? Designers should prioritize daylighting’s emotional impact,
focusing on quality over source for better customer
experiences.

Table 2.7 Daylighting in Shopping Malls: Customer Perception, Preferences, and

Satisfaction.
11
8. Title Design and Implementation of Online Shopping Mall
Based on Collaborative Filtering

DOI https://doi.org/10.1016/j.procs.2024.10.024
Author Yujie Wu
What they have done? Designed an online shopping mall to enhance user experience
and functionality.
How they have done? Used Spring Boot, Vue, and a multi-layer Java architecture
with collaborative filtering and sensitive word filtering.
Which method/approach Applied collaborative filtering for recommendations and a
they followed? dynamic sensitive word filter for comment moderation.
What they found? The system improved functionality, personalization, and user
trust.
What they concluded? It serves as a reference model for future e-commerce platform
development.

Table 2.8 Design and Implementation of an Online Shopping Mall Using Collaborative
Filtering.

9. Title Sustaining shopping momentum in retail malls using

real-time messaging

DOI https://doi.org/10.1016/j.jretai.2022.11.002
Author KhadijaAli Vakeel , Morana Fudurić , Vijay Viswanathan ,
Mototaka Sakashita

What they have done? Investigated the impact of real-time mobile promotions
(RTMs) on shopping momentum and spending in retail malls.
How they have done? Used a quasi-experimental design targeting loyalty program
members with RTMs and analyzed buyer responses.
Which method/approach Categorized buyers by spending levels to assess RTMs'
they followed? influence on shopping momentum and spending.
What they found? RTMs increased spending for moderate and heavy buyers but
had no effect or reduced spending for light buyers.
What they concluded? Retailers should focus RTMs on moderate and heavy buyers
for maximum effectiveness in driving sales.

Table 2.9 Sustaining Shopping Momentum in Retail Malls Through Real-Time

Messaging.

12
10. Title Lost in a mall, the effects of gender, familiarity with the
shopping mall and the shopping values on shoppers'
wayfinding processes

DOI https://doi.org/10.1016/j.jbusres.2004.02.006
Author Jean-Charles Chebat , Claire Gélinas-Chebat , Karina Therrien
What they have done? The study explored how shopper characteristics—gender, mall
familiarity, and shopping values—affect wayfinding processes
and information sources used in a shopping mall.
How they have done? Data were collected from 156 real shoppers who recorded their
thoughts and actions during wayfinding, which were then
content-analyzed.
Which method/approach they The study analyzed variations in wayfinding behavior based on
followed? shopper characteristics and examined mediating effects of
hedonistic shopping values.
What they found? Wayfinding processes and information source preferences
significantly differed by gender, mall familiarity, and shopping
values, with notable mediation by hedonistic values.
What they concluded? Understanding these factors can help mall managers improve
wayfinding systems and enhance shopper experiences, aligning
with their diverse needs and values.

Table 2.10 Lost in a Mall: Effects of Gender, Mall Familiarity, and Shopping Values on
Wayfinding.

13
CHAPTER 3

PROBLEM STATEMENT
3.1 Introduction

Shopping malls serve as bustling hubs of commerce and leisure, where understanding customer
behavior and satisfaction is critical to maintaining competitiveness, optimizing operations, and
enhancing the overall shopping experience. However, the dynamic and multifaceted nature of
customer interactions within these environments poses significant challenges to traditional retail
analytics methods. Operators need real-time insights into how customers move, what engages
them, and how they feel about their experiences to make informed decisions about store layouts,
staffing, marketing strategies, and service improvements. Existing systems, such as pose-based
detection frameworks (e.g., Pose Network, Move-Net) and static feedback mechanisms (e.g., paper
surveys), fall short in providing a comprehensive, immediate, and actionable understanding of
customer dynamics. This chapter delves into the specific challenges faced by shopping malls in
this context, highlighting the limitations of current approaches and establishing the urgent need for
an advanced, integrated solution like the proposed "Real-Time Customer Behaviour and
Satisfaction Insight System in Shopping Malls," which leverages YOLO (You Only Look Once),
Deep SORT (Simple Online and Real-time Tracking), Natural Language Processing (NLP), and
QR code-based feedback collection.

3.2 Challenges Faced by Shopping Malls in Customer Behavior Analysis

The inability to accurately and promptly analyze customer behavior and satisfaction in shopping malls
stems from several interconnected challenges. These issues not only hinder operational efficiency but
also impact customer retention and revenue potential.

Limited real-time insights delay strategic decisions, affecting personalized marketing efforts.

Fragmented and unstructured data sources make it difficult to create a unified customer profile.

14
3.2.1 Limited Behavioral Insights from Pose-Based Systems

Traditional systems like Pose Network and Move-Net rely heavily on pose detection to infer
customer behavior, focusing on static postures such as standing, reaching, or bending. While these
methods can identify basic actions, they often misinterpret customer intent and fail to capture the
broader spectrum of behaviors that influence shopping decisions. For example, a customer standing
still near a clothing display might be waiting for a friend rather than showing interest, yet pose-
based systems might classify this as engagement. Similarly, these systems overlook critical
patterns such as movement paths, dwell times, or interactions with multiple products, which are
essential for understanding preferences and engagement levels. This narrow focus limits mall
operators’ ability to discern whether a customer is casually browsing, actively shopping, or simply
passing through, resulting in incomplete or misleading insights that undermine effective decision-
making.

3.2.2 Inaccurate Tracking in Crowded and Dynamic Environments

Shopping malls are inherently crowded and dynamic, especially during peak hours, weekends, or
sales events, where customer density can obscure visibility and complicate tracking. Existing
multi-object tracking systems often struggle with occlusions—when one customer blocks another
from the camera’s view—or identity switches, where a customer’s tracking ID is erroneously
reassigned to someone else as they cross paths. For instance, in a busy food court, a system might
lose track of a customer moving behind a group, skewing data on popular areas or dwell times.
This inaccuracy hampers the ability to map customer flow accurately, identify high-traffic zones,
or assess congestion-related dissatisfaction, leaving operators with unreliable data for layout
planning or crowd management. The lack of robust tracking in such scenarios is a significant
barrier to real-time behavioral analysis.

3.2.3 Delayed and Non-Anonymous Feedback Collection

Customer feedback is a cornerstone of satisfaction analysis, yet traditional methods like paper surveys,
suggestion boxes, or post-visit online questionnaires are slow, cumbersome, and often non-
anonymous, deterring participation. For example, a customer frustrated by a long checkout line might
leave without completing a survey, or one satisfied with a sale might forget to provide feedback later.
These delays mean operators miss the chance to
15
address issues in real-time e.g., adding staff to a busy counter or capitalize on positive experiences
with instant follow-ups, like targeted promotions. Moreover, the lack of anonymity in some systems
(e.g., requiring email addresses) can discourage honest responses, especially about negative
experiences, leading to biased or incomplete data. This gap in timely, candid feedback limits the
ability to gauge true satisfaction levels and respond proactively.

3.2.4 Difficulty in Quantifying Engagement with Mall Services

Understanding how customers interact with mall services—such as digital kiosks, mobile apps,
promotional displays, or in-store products—is crucial for assessing engagement and tailoring
experiences. However, current systems lack standardized metrics to measure these interactions
effectively. For instance, there’s no automated way to determine how long a customer lingers at a
kiosk, how often they use an app for navigation, or whether they pick up and then return an item.
Manual observation is impractical in large malls, and pose-based systems don’t track these subtle
actions across time and space. Without quantifiable engagement data, operators cannot evaluate
the effectiveness of digital touchpoints, optimize product placement, or identify underperforming
services, resulting in missed opportunities to enhance customer experiences and drive sales.

3.2.5 Dependence on Fragmented and Manual Analysis

Mall operators often rely on fragmented data sources—CCTV footage reviewed manually, sporadic
surveys, or sales reports—requiring significant human effort to synthesize into actionable insights. This
manual process is time-consuming, prone to error, and incapable of delivering real- time results. For
example, analyzing hours of video to identify busy zones might take days, by which time customer
patterns have shifted. Similarly, correlating survey feedback with observed behavior is challenging
without integrated tools, leaving operators with disconnected datasets that fail to provide a unified
view. This dependence on labor-intensive, disjointed methods restricts the ability to respond swiftly to
trends, address dissatisfaction, or capitalize on emerging opportunities, ultimately impacting
operational agility and customer satisfaction.

16
3.3 Need for an AI-Powered Real-Time Solution

The challenges outlined above underscore the urgent need for a technologically advanced, real-
time system that overcomes the limitations of existing approaches and delivers comprehensive,
immediate, and actionable insights. Traditional tools—whether pose-based detection, basic
tracking, or static feedback mechanisms—are insufficient for the complex, fast-paced environment
of shopping malls, where customer behaviors and preferences evolve rapidly. The ideal solution
must address the following requirements:

Comprehensive Behavioral Analysis:

Utilize advanced computer vision (e.g., YOLO) to detect customers and a robust tracking
algorithm (e.g., Deep SORT) to capture detailed movement patterns , dwell times, and interactions
across the mall, providing a fuller picture of behavior beyond mere postures.

Accurate Tracking in Crowded Settings:

Ensure reliable tracking in high-density areas by handling occlusions and identity switches,
enabling precise identification of busy zones, popular paths, and congestion points for effective
crowd management and layout optimization.

Immediate and Anonymous Feedback:

Implement a fast, anonymous feedback mechanism (e.g., QR codes) that captures customer
sentiments in real time, allowing operators to address issues like long queues or poor service
instantly and gather honest input without privacy concerns.

Quantifiable Engagement Metrics:

Develop automated methods to measure interactions with mall services such as time spent
at kiosks or product handling—offering concrete data to assess engagement and inform service
enhancements or marketing strategies.

Integrated and Automated Insights:

Combine video-based tracking, engagement metrics, and sentiment analysis into a unified, real-
time dashboard eliminating manual synthesis and enabling operators to make data-driven
decisions swiftly, such as reallocating staff or adjusting promotions on the fly.

17
Scalability and Offline Capability:
Operate efficiently across malls of varying sizes, leveraging existing CCTV infrastructure and
functioning offline to ensure reliability in environments with limited internet connectivity, while
remaining adaptable to future technological integrations.

The system integrates YOLO for detection, Deep SORT for tracking, and NLP for real-time
sentiment analysis of QR feedback. It bridges the gap between fragmented data and immediate
insights, overcoming pose-based misinterpretations, tracking issues in crowds, and delayed
feedback. For example, it cross-references dwell time with feedback to clarify intent.

18
CHAPTER 4

DATA COLLECTION AND PROCUREMENT

4.1 Introduction

The success of the "Real-Time Customer Behaviour and Satisfaction Insight System in Shopping
Malls" hinges on the availability of high-quality, diverse, and well-structured data to train, validate,
and operate its AI-driven components, including YOLO (You Only Look Once) for object detection,
Deep SORT (Simple Online and Realtime Tracking) for movement analysis, and Natural Language
Processing (NLP) for sentiment extraction. Data serves as the foundation for detecting customers,
tracking their behaviors, quantifying engagement, and analyzing satisfaction in real time, enabling
mall operators to derive actionable insights. This chapter outlines the systematic process of data
collection and procurement, detailing the types of data required, their sources, preprocessing
techniques, ethical considerations, and challenges encountered. By ensuring a robust data pipeline,
the system can accurately interpret customer dynamics in complex mall environments, delivering
reliable and scalable analytics to enhance operational efficiency and customer experiences.

4.2 Types of Data Required

To achieve its objectives, the system relies on multiple data categories, each tailored to specific
analytical needs. These types are carefully selected to capture both the physical and perceptual
aspects of customer behavior within shopping malls.

Video Data for Detection and Tracking:

High-resolution video footage is essential for detecting customers and analyzing their movements.
This includes real-time feeds from CCTV cameras capturing actions such as walking, standing,
browsing, or interacting with products. For example, a 1080p video at 30 frames per second (FPS)
provides sufficient detail to identify individuals and track their paths through crowded aisles or
open spaces like food courts. Annotated datasets with bounding boxes around customers and labels
(e.g., “person,” “group”) are required to train the YOLO model, ensuring accurate detection across
diverse mall settings—indoor stores, corridors, and escalator zones.

19
Textual Data for Feedback and Sentiment Analysis:

Textual inputs from customer feedback are critical for understanding satisfaction and preferences.
This includes short responses from QR code surveys (e.g., “great variety”), longer comments (e.g.,
“queues too long, but good deals”), and potentially social media posts or online reviews
mentioning the mall. These texts vary in length and tone, requiring NLP to process positive,
negative, or neutral sentiments and extract themes like “service quality” or “pricing.” The data
must be timestamped to align with video observations, enabling correlation between behavior (e.g.,
lingering in a store) and feedback (e.g., “loved the shirts”).

Engagement Metrics for Behavioral Insights:

Quantitative metrics derived from video and interaction logs measure customer engagement with
mall services. This includes dwell times (e.g., seconds spent near a display), interaction frequencies
(e.g., number of kiosk uses), and movement patterns (e.g., total distance traveled). These metrics
provide concrete data to assess interest—for instance, a 10-minute dwell time in the saree section
might indicate strong engagement—supporting decisions on product placement or promotional
efforts.

4.3 Sources of Data Acquisition

The system draws from a mix of primary and secondary sources to build a comprehensive dataset,
balancing real-world applicability with scalability and cost-efficiency. Primary sources include CCTV
footage, in-mall sensors, and customer feedback forms, which provide direct insights into customer
movement, engagement, and satisfaction. Secondary sources such as social media activity, e-
commerce trends, and industry reports offer broader contextual understanding and trend analysis.
Together, these sources enable a holistic view of customer behavior, supporting data-driven decision-
making for improved mall operations and customer experience.

20
4.3.1 Primary Data Collection

CCTV Video Feeds:

Real-time video is captured from existing CCTV cameras strategically placed throughout the
mall—entrances, store fronts, food courts, and escalators. For instance, a mall with 50 cameras
might provide 24/7 coverage of key areas, generating terabytes of footage weekly. This data is
collected in collaboration with mall management, ensuring alignment with operational needs (e.g.,
monitoring peak hours from 12 PM to 6 PM). Custom recordings in varied conditions—bright
daylight, dim evening lighting, or crowded sales events—enhance robustness.

QR Code Surveys:

Customers provide feedback by scanning QR codes displayed at high-traffic points like exits,
restrooms, or service desks. These codes link to mobile-friendly forms collecting ratings (e.g., 1-
5 stars) and free-text comments, designed for quick input (under 30 seconds) to maximize
participation. During a pilot, 500 daily responses might be gathered in a medium-sized mall,
offering a rich dataset of immediate reactions—e.g., “fast service” after a purchase or “no seating”
near the food court.

4.3.2 Secondary Data Sources

Open-Source Datasets:

Public datasets like COCO (Common Objects in Context) and Open Images provide pre-annotated
images of people in various settings, ideal for initial YOLO training. COCO, with over 330,000
images, includes diverse human poses (e.g., walking, standing), while Open Images offers 9
million annotated instances, ensuring the model generalizes across demographics and
environments. These datasets supplement custom mall footage, reducing the annotation burden.

Retail-Specific Benchmarks:

Datasets like the KITTI Vision Benchmark Suite, though designed for autonomous driving, include
pedestrian tracking examples adaptable to mall corridors.

21
Hypothetical retail datasets (e.g., “MallCrowd 2023”) could provide annotated videos of shopping
behaviors, filling gaps in public resources tailored to indoor retail.

4.3.3 Data Augmentation

To enhance model robustness and prevent overfitting, augmentation techniques are applied

Video Augmentation: Rotations, flips, and brightness adjustments simulate camera angles and lighting
changes (e.g., evening shadows). Noise addition mimics real-world imperfections like lens glare. Frame
skipping and motion blur replicate rapid movements, while color shifts simulate varying environmental
conditions.

Text Augmentation: Synonym replacement (e.g., “great” to “excellent”) and sentence

paraphrasing expand feedback variety, ensuring NLP handles diverse expressions.

Synthetic Data: Tools like Unity or Blender generate simulated mall scenes with virtual customers,
adding controlled variations (e.g., occlusion by groups) to training data.

4.4 Data Annotation and Preprocessing

Raw data must be refined and structured to support AI model training and real-time analysis,
involving several key steps:

Video Annotation:

Tools like Label Img or CVAT are used to manually draw bounding boxes around customers in
sample footage, labeling them as “person” with attributes like position (e.g., “left aisle”). For a 10-
minute clip, 300 frames might be annotated, creating a dataset of 1,000+ labeled instances.
Automated pre-labeling with pre-trained YOLO speeds this process, followed by human
verification for accuracy.

Text Preprocessing:

Feedback text is cleaned by removing typos, emojis, or irrelevant punctuation (e.g., “great!!!” to
“great”), tokenized into words (e.g., “slow service” → [“slow”, “service”]), and labeled for

22
sentiment (positive, negative, neutral) using tools like VADER. Stop words (e.g., “the”) are filtered
to focus on meaningful terms.

Data Normalization:

Video frames are resized to 416x416 pixels (YOLO’s standard input) and normalized to a 0-1
range for faster processing. Engagement metrics are standardized (e.g., dwell times in seconds) for
consistent analysis across datasets.

Dataset Splitting:

The data is divided into training (70%), validation (15%), and test (15%) sets. For example, 7,000
video frames and 3,500 feedback entries might train the models, with 1,500 each for validation
and testing, ensuring balanced evaluation.

4.5 Ethical Considerations in Data Collection

Responsible data handling is paramount to protect customer privacy and ensure fairness, guided
by ethical principles and legal standards:

Informed Consent:

Signage near CCTV cameras (e.g., “Your movements may be recorded for service improvement”)
informs customers of data collection, while QR code participation is voluntary, with clear opt-in
prompts (e.g., “Scan to share your thoughts, no personal data required”).

Privacy and Anonymization:

Faces in video footage are blurred using OpenCV’s face detection algorithms to prevent
identification, and feedback responses exclude personal identifiers (e.g., names, phone numbers).
Data is stored encrypted on secure servers, adhering to regulations like GDPR or India’s Personal
Data Protection Bill.

Bias Mitigation:
Datasets include diverse demographics (age, gender, attire) and mall conditions (busy vs. quiet) to
avoid skewed models—e.g., ensuring YOLO detects both children and adults accurately.

23
Transparency:
Mall operators disclose data usage policies to customers via signage or apps, fostering trust.
Anonymized aggregate insights (e.g., “60% of visitors liked the layout”) are shared, not individual
records.

4.6 Challenges in Data Collection and Procurement

Despite a structured approach, several obstacles complicate data acquisition and preparation:

Variability in Customer Appearances:

Customers vary widely in clothing, size, and posture (e.g., carrying bags, pushing strollers),
requiring extensive training data to ensure YOLO’s detection accuracy.

Crowded Scene Complexity:

High-density areas like food courts or sales events introduce occlusions, making it hard to capture
clean footage or track individuals consistently. This necessitates robust preprocessing and
augmentation to simulate such conditions.

Feedback Participation Rates:

QR code surveys may suffer from low engagement if not promoted effectively—e.g., only 10% of
visitors might scan during a busy day. Incentives (e.g., discount coupons) or strategic placement
(e.g., near exits) are needed to boost responses.

Environmental Factors:

Lighting variations (e.g., dim corners, bright storefronts) and camera angles affect video quality,
potentially reducing detection reliability. Weather or seasonal events (e.g., monsoon crowds)
further complicate data consistency.

Resource Intensity:
Annotating thousands of frames and processing terabytes of video is time- and computationally
expensive. A small team might take weeks to label a week’s footage, requiring efficient tools or
outsourcing to balance cost and quality.

24
4.7 Strategies to Overcome Challenges

To address these issues, the project employs targeted strategies:

Automated Tools: Pre-trained models assist annotation, reducing manual effort by 50%.

Diverse Sampling: Footage from multiple malls and times ensures variety, while synthetic data
fills gaps.

User Incentives: QR codes offer small rewards (e.g., 5% off purchase) to increase feedback rates.

Preprocessing Enhancements: Stabilization and noise filters improve video usability, tested
across lighting conditions.

25
CHAPTER 5

THEORETICAL BACKGROUND

5.1 Introduction

The "Real-Time Customer Behaviour and Satisfaction Insight System in Shopping Malls"
leverages a synergistic combination of advanced technologies to monitor customer behavior,
quantify engagement, and analyze satisfaction in real time. This system integrates computer vision,
deep learning-based object detection and tracking, natural language processing (NLP), and QR
code-based data collection to transform raw mall data into actionable insights. Understanding the
theoretical underpinnings of these components is essential for appreciating how the system detects
customers, tracks their movements, processes feedback, and delivers real-time analytics to mall
operators. This chapter explores the core principles and algorithms—namely, computer vision,
YOLO (You Only Look Once), Deep SORT (Simple Online and Real-time Tracking), NLP, and
QR code technology—providing a detailed foundation for their application in the retail context.
By grounding the system in these theories, we ensure its technical robustness, scalability, and
effectiveness in addressing the complexities of shopping mall environments.

In addition, by utilizing QR codes for voluntary feedback collection, the system ensures customer
participation while maintaining privacy and compliance with data protection standards. This low-
cost, high-engagement method provides direct sentiment analysis opportunities, complementing
the observational data captured by AI tools. The blend of passive observation and active input
allows for a more nuanced understanding of customer journeys, preferences, and pain points. As
shopping environments grow more competitive and data-driven, such intelligent systems play a
crucial role in redefining customer engagement strategies, enabling malls to stay relevant and
responsive in an evolving retail landscape. The real-time nature of the system ensures that mall
administrators can respond instantly to customer needs, optimize operations dynamically, and
personalize experiences at scale, ultimately driving customer satisfaction and loyalty.

26
5.2 Computer Vision and Image Processing

Computer vision is a field of artificial intelligence that enables machines to interpret and analyze
visual data, mimicking human perception. It forms the backbone of the system’s ability to detect
and track customers within video feeds from CCTV cameras.

Core Concepts:

Images and videos are represented as matrices of pixel values—grayscale (intensity) or RGB (red,
green, blue)—where resolution (e.g., 1080p) determines detail. Feature extraction identifies
patterns like edges, shapes, or textures using techniques such as Sobel filters (for edge detection)
or Histogram of Oriented Gradients (HOG) (for object outlines). In a mall, this might mean
detecting a customer’s silhouette against a cluttered background of shelves and signage.

Image Preprocessing:

Preprocessing enhances video quality for analysis. Techniques include noise reduction (e.g.,
Gaussian blur to smooth out graininess from low-light feeds), contrast adjustment (to distinguish
customers in dim corridors), and frame resizing (to 416x416 pixels for YOLO compatibility).
Stabilization via optical flow corrects shaky footage, ensuring consistent tracking across frames.

Application in the System:

Computer vision enables the system to process live mall footage, identifying customers as distinct
objects and extracting spatial-temporal data (e.g., position over time). This lays the groundwork
for subsequent detection and tracking, critical for mapping movement patterns and engagement
zones.

5.3 Deep Learning and Object Detection

Deep learning, a subset of machine learning, uses neural networks to model complex patterns in
data, making it ideal for real-time customer detection in malls.

27
Convolutional Neural Networks (CNNs):

CNNs are specialized neural networks for image analysis, featuring layers that extract hierarchical
features—edges in early layers, shapes in middle layers, and full objects (e.g., people) in deeper
layers. Architectures like VGGNet or ResNet underpin modern detection models, with
convolutional filters scanning images to identify key visual elements. In our system, CNNs process
video frames to recognize customers amidst diverse backgrounds.

YOLO (You Only Look Once):

YOLO is a state-of-the-art object detection algorithm designed for speed and accuracy, making it
perfect for real-time applications. Unlike two-stage detectors (e.g., R-CNN), YOLO processes an
entire image in a single pass, dividing it into a grid (e.g., 13x13) and predicting bounding boxes,
class probabilities (e.g., “person”), and confidence scores simultaneously. Its backbone, often
Darknet-53, balances computational efficiency with precision. In a mall, YOLO might detect
multiple customers in a crowded aisle, outputting boxes with 92% confidence, enabling rapid
identification for tracking.

Role in the System:

YOLO serves as the detection engine, identifying customers in each frame with bounding boxes
(e.g., coordinates [x, y, width, height]). This real-time capability (~30 FPS) ensures the system
keeps pace with dynamic mall activity, providing the raw data for behavioral analysis.

5.4 Object Tracking with Deep SORT

Tracking customers across video frames requires associating detections over time, a task handled
by Deep SORT.

Principles of Multi-Object Tracking:

Tracking involves linking detected objects (e.g., customers) across frames to maintain their
identities despite movement, occlusions, or camera switches. Basic methods use motion prediction
(e.g., Kalman filters), but modern systems add appearance features for robustness.

28
Deep SORT Mechanics:

Deep SORT extends the SORT (Simple Online and Realtime Tracking) algorithm by integrating

deep learning-based appearance descriptors. It uses:

Kalman Filter: Predicts a customer’s next position based on velocity and past trajectory, updating
with new detections to minimize errors. For example, it estimates a customer’s path through a store
even if briefly obscured.

Appearance Model: A pre-trained CNN extracts features (e.g., clothing color, shape) from
bounding boxes, calculating similarity scores (e.g., cosine distance) to re-identify customers post-
occlusion.

Hungarian Algorithm: Matches predicted tracks to new detections, resolving identity switches
in crowds.

Application in the System:

Deep SORT tracks customers from entrance to exit, handling occlusions (e.g., a group blocking
view) with 85% precision. It maps movement and dwell times for heatmaps and flow analysis.

5.5 Spatial Analysis for Behavioral Insights

Understanding where customers are and how they move enhances behavioral analysis beyond
mere detection.

Bounding Box Coordinates:

Each detected customer is enclosed in a bounding box with coordinates (x, y, width, height). The
system calculates the center point (x_center, y_center) to determine spatial position relative to the
frame—e.g., left (<40% frame width), right (>60%), or center (40-60%). This maps customer
locations within stores or corridors.

Engagement Metrics:

Temporal analysis tracks dwell times (e.g., seconds spent near a display) and interaction counts
(e.g., kiosk touches), derived from Deep SORT’s continuous tracking. For instance, a customer
lingering 15 seconds at a shirt rack suggests interest, informing product placement.
29
Role in the System:

Spatial analysis produces actionable data—e.g., “three customers on the left near dresses”—and
visual outputs like heatmaps, showing high-traffic zones for layout adjustments.

5.6 Natural Language Processing for Feedback Analysis

NLP processes textual feedback from QR code surveys, extracting sentiments and preferences to
complement video data.

Text Processing Fundamentals:

NLP involves tokenization (splitting “great service” into “great” and “service”), stop-word
removal (e.g., “the”), and stemming (e.g., “running” to “run”) to prepare text for analysis.
Sentiment analysis tools like VADER score phrases—e.g., “love the deals” (+0.8, positive)—
while topic modeling (e.g., Latent Dirichlet Allocation) groups feedback into themes like “service”
or “pricing.”

Real-Time Sentiment Analysis:

The system uses pyttsx3 or similar offline NLP engines to process feedback instantly, classifying
sentiments (e.g., “queues too long” as negative) and identifying key issues (e.g., “slow checkout”).
This ensures timely insights without internet dependency.

Application in the System:

NLP correlates feedback with behavior—e.g., negative comments about queues align with long
dwell times at checkouts—offering a dual perspective that enhances satisfaction analysis and
guides operational responses.

5.7 QR Code Technology for Data Collection

QR codes enable fast, anonymous feedback collection, bridging physical and digital interactions.

Operational Principles:

QR codes are matrix barcodes encoding URLs, generated with Python libraries like qrcode.

30
Customers scan them with smartphones, accessing forms to rate experiences (e.g., 1-5 stars) or
write comments (e.g., “fast staff”). Data is timestamped and stored locally or on a server.

Advantages:
QR codes offer immediacy (feedback in seconds), anonymity (no login required), and scalability
(deployable mall-wide). A single scan at an exit might yield “great variety,” instantly processed
by NLP.

Role in the System:

QR codes collect real-time customer feedback, enriching video-based insights with subjective
opinions. They provide a non-intrusive way to gauge satisfaction, ensuring customer privacy. This
dual-layered approach enhances accuracy in understanding shopping behaviors. It bridges the gap
between observed actions and customer perceptions. The system thus delivers a more
comprehensive and actionable analysis.

5.8 Software Frameworks and Technologies Used

The system relies on a robust tech stack:

Python: Core language for its AI and data libraries.

OpenCV: Handles video processing (e.g., preprocessing, detection).

TensorFlow: Powers YOLO and Deep SORT training/inference.

NLTK: Supports NLP tasks like sentiment analysis.

Django: Builds dashboards for real-time reporting.

NumPy/Pandas: Manages data arrays and analytics.

5.9 Challenges in Theoretical Implementation

Real-Time Constraints: Achieving <100ms latency for YOLO and Deep SORT requires optimized

31
hardware (e.g., GPUs).

Occlusion Handling: Deep SORT may falter in extreme crowds, needing parameter tuning.

NLP Accuracy: Informal feedback (e.g., slang) challenges sentiment models, requiring robust
training.

Scalability: Processing dozens of camera feeds demands efficient resource allocation.

5.10 Theoretical Integration

The system combines these theories into a cohesive pipeline: computer vision and YOLO detect
customers, Deep SORT tracks them, spatial analysis quantifies behavior, NLP interprets feedback,
and QR codes collect it—all unified in real-time analytics. This integration leverages each
component’s strengths, overcoming individual limitations to deliver a comprehensive retail
solution.

32
CHAPTER 6

METHODOLOGY
The methodology for analyzing customer behavior in shopping malls using state-of-the-art AI
techniques. The proposed system enhances existing methods by integrating YOLO for person
detection, DeepSORT for multi-object tracking, and QR code-based feedback collection. The
framework ensures high accuracy in detecting customer movements, identifying interactions, and
providing real-time insights for data-driven decision-making

6.1 Model Selection and Loading

To analyze customer behavior in malls, the system employs YOLO for real-time object detection.
YOLO is chosen for its speed and accuracy in detecting multiple objects simultaneously. The
model is initialized using the YOLO class from the ultralytics library. The load_model function
loads the pre-trained model for video frame processing. This ensures efficient and accurate
customer tracking in dynamic retail environments.

Fig 6.1 Real-Time Customer Satisfaction Analysis Flowchart

33
6.2 Real-Time Object Detection:

Real-time object detection leverages YOLO to instantly recognize and classify customers within a
shopping environment. Deep SORT enhances this by continuously tracking movement patterns,
distinguishing between first-time and returning visitors. This seamless integration ensures accurate
identification and reduces false positives, improving analytical precision.

6.2.1 Video Frame Processing:

The system captures real-time video frames using a webcam or any connected camera, feeding
them into the YOLO model via OpenCV. Each frame is resized and pre-processed to meet the
input requirements of the YOLO architecture, ensuring accurate object detection. The model then
identifies customers and other relevant objects within the frame, outputting bounding boxes and
confidence scores. These detections are passed on to the tracking algorithm for consistent
identification across frames. This continuous frame-by-frame processing enables real-time
monitoring of customer movement and interaction within the shopping mall environment.

6.2.2 Object Detection and Positioning:

The detect_objects function identifies and classifies objects in each frame, with a primary focus on
detecting people (class 0 in YOLO). Once detected, the system extracts the bounding box coordinates
and calculates the center position of each object. Based on these coordinates, the system determines
the relative positioning of individuals—such as whether they are to the left, right, or directly in front
of the camera. This spatial information is then used to interpret customer behavior and trigger relevant
feedback mechanisms or alerts. The positioning data also aids in mapping customer density across
different zones in the mall. It enables the system to detect crowd formations and unusual movement
patterns. These insights can be used to enhance mall navigation, prevent congestion, and improve
overall safety. Furthermore, the collected positioning data contributes to heatmap generation and
layout optimization over time.

34
6.3 Real-Time Customer Identification

The YOLO model identifies all individuals in the video feed in real time. A predefined Area of
Interest (AOI) is marked (e.g., checkout counter space). Individuals inside the AOI are classified
as customers, while those outside (e.g., sellers behind the counter) are excluded. A pre-trained
Caffe deep learning model analyzes facial features using a CNN to predict age ranges (e.g., 18-25,
26-35) and gender (male, female, or non-binary). Customers are grouped into demographic
categories, enabling personalized marketing strategies and trend analysis.

6.4 Movement Tracking with Deep SORT

Deep SORT is an enhanced version of the SORT (Simple Online and Real-time Tracker)
algorithm, incorporating deep learning-based appearance embedding for robust object tracking. It
is commonly used for tracking objects in video streams. Deep SORT (Simple Online and Real-
time Tracking) assigns a unique ID to each customer detected by YOLO. The system uses Kalman
filtering for motion prediction and appearance features to maintain consistent tracking, even with
occlusions or overlapping paths. Tracks customer movement from the AOI to other store areas,
providing insights into shopping behavior and navigation patterns.

6.5 Analyzing Customer Trends

The system records how long each customer remains in the AOI (e.g., near a product display or
checkout counter). Customers staying beyond a threshold (e.g., 30 seconds) are classified as
"interested" Pose estimation and object detection track specific actions (e.g., picking up items or
interacting with sellers), providing. Aggregated data reveals peak interest periods, frequently
visited areas, and high-demand products, helping optimize store layouts and marketing strategie.

6.6 QR Code Placement:

Unique QR codes are placed at checkout counters, store exits, and key locations. These strategically
placed codes are easily accessible to customers as they move through various areas of the mall. Scanning
these QR codes allows customers to provide instant feedback on their shopping experience, satisfaction,
and service quality.

35
Fig 6.6 Sentiment Analysis and Feedback Processing Workflow

6.6.1 Digital Survey Integration:

Customers scan the QR code to access a short digital survey for feedback on satisfaction levels,
product preferences, and store experience. The collected feedback is processed using Natural
Language Processing (NLP) to classify sentiment as positive, negative, or neutral. Sentiment trends
help identify improvement areas, enhance service quality, and refine store operations.

Additionally, the survey data is analyzed over time to track changes in customer satisfaction and
identify recurring issues. By correlating this feedback with other data sources, such as foot traffic
and purchase behavior, the system can provide actionable recommendations for targeted marketing
strategies and personalized services. Ultimately, this integration of digital surveys and sentiment
analysis empowers mall operators to continuously adapt and enhance the shopping experience.

36
CHAPTER 7

RESULTS AND DISCUSSIONS

7.1 Overview of System Performance

An advanced detection and tracking algorithm provides robust real-time capabilities to monitor
and analyze consumer behavior. This system tracks customer movements, measures time spent in
specific areas, and evaluates engagement levels with products or staff. Complementing this,
Natural Language Processing (NLP) enables businesses to interpret customer feedback efficiently,
extracting sentiment and suggestions that are often challenging to discern manually. Additionally,
QR codes facilitate precise data collection on purchases and shopping preferences, enhancing

customer profiling. These real-time insights empower store managers to optimize layouts, allocate
resources effectively, and deliver personalized customer experiences, thereby accelerating
decision-making processes.

7.2 Detecting and Analyzing Customers

The system employs advanced object detection algorithms to detect and track customers as they
move through the mall. Using real-time video feeds, the system identifies individuals based on
pre-trained deep learning models such as YOLO (You Only Look Once), which allows for rapid
and accurate detection of people in a crowded environment. Once detected, the system tracks the
movement of each customer across video frames using tracking algorithms like Deep SORT,
ensuring consistent identification even as the person moves through different areas of the mall.
The analysis extends beyond mere detection, as the system captures key behavioral data, such as
dwell times at specific locations, movement patterns, and frequency of visits to certain stores.
These insights are used to generate heatmaps and identify areas of high engagement or congestion,
offering valuable data for mall operators to optimize store layouts, manage foot traffic, and
improve customer service.

37
Fig 7.2: Real-Time Detection and Analysis of Customers

This system uses YOLO for real-time customer detection in a retail clothing store. An Area of
Interest (AOI) helps differentiate customers (blue bounding boxes) from sellers (red bounding
boxes). This approach ensures accurate tracking of shopping behavior within defined store zones.

7.3 Analysis of Product Categories

This figure illustrates product arrangement across key clothing categories: sarees, dresses, western
wear, children’s clothing, shirts, pants, and t-shirts. These categories serve as a foundation for
understanding customer behavior, including shopping durations, purchased items, verbal
interactions, and satisfaction levels.

For example, customers who spend more time in a particular category like sarees may have specific
preferences related to fabric, design, or occasion. Understanding these preferences can help retailers
tailor promotions, product displays, and staff training to enhance customer experience. Similarly,
customers purchasing western wear or children's clothing may exhibit different behaviors, such as
prioritizing comfort or style, which may influence their satisfaction levels.

38
Fig 7.3: Customer Preferences on sarees

The system generates reports on consumer interest in saree styles, browsing time, and feedback on
quality and pricing. It analyzes clothing preferences, revealing trends in fit, fabric fading, and
interactivity levels. By processing feedback, businesses can personalize recommendations to
enhance customer experience. Retailers refine inventory by identifying demand patterns and
popular product categories. Browsing duration analysis highlights which items attract the most
attention. QR-based sentiment analysis enables real-time adaptation to customer preferences.
Heatmaps visualize high-traffic zones, optimizing store layouts for better engagement.
Data-driven insights help businesses streamline marketing, pricing, and inventory strategies.

7.4 Collecting Customer Feedback via QR Codes

Collecting customer feedback via QR codes has become an innovative and effective way for
retailers to gather real-time insights. By placing QR codes at strategic points in stores—such as
near product displays, checkout counters, or customer service areas—retailers can make it easy
for customers to scan the code with their smartphones and quickly access an online survey or
feedback form.

39
Fig 7.4: Customer Feedback Form Interface

To facilitate seamless feedback collection, the system implements QR codes placed at strategic
locations within the retail environment, such as checkout counters or store exits. Customers can
scan these QR codes using their smartphones, which redirect them to a user-friendly digital
feedback form, as shown in Figure 8. The form, titled "Customer Feedback Form," prompts users
to input their name, age, product type (e.g., electronics, with a dropdown menu for selection), and
detailed feedback in a text box. Additional features include an "Add Another Product" button for
multiple entries and a "Submit" button to finalize the response.

This QR code-based approach ensures easy access for customers, encouraging higher participation
rates in feedback collection. The form’s simple design minimizes user effort, while the collected
data—such as product preferences and satisfaction levels—provides valuable insights for
businesses.

7.5 User Interface and Usability

The project dashboard, developed using the Django framework and Python programming
language, provides an intuitive interface for retail managers to monitor real-time customer analysis
data.
40
Django’s MVC architecture ensures scalability, while Python enables seamless integration with
system components like YOLO, Deep SORT, NLP, and the Caffe model. The dashboard features
interactive visualizations of customer metrics—such as time spent, movement patterns,
demographics, and feedback sentiment—along with filters and export options for easy data
management. Integrated feedback forms from QR codes allow real-time monitoring of customer
responses. With a responsive design optimized for desktop and mobile, and Django’s security
features like user authentication, the dashboard ensures a user-friendly and secure experience,
empowering retailers to make data-driven decisions efficiently.

Fig 7.5: Dashboard Overview

The dashboard provides real-time insights on customer behavior, including visit trends, gender distribution,
sentiment analysis, and time spent per category. These insights help retailers optimize inventory, store
layout, and marketing strategies for better customer engagement.

7.6. System Limitations and Improvements

While the system performed well under typical conditions, there are several areas for potential
improvement:

41
 Detection Accuracy: The system’s performance may be impacted by low-light conditions
or occlusions, where objects are partially hidden or obscured

● Model Optimization: For a smoother experience, the YOLO model could be optimized
further, especially for devices with limited processing power, to maintain high frame rates
without compromising detection accuracy.
● Person Tracking: occasionally faces challenges in maintaining consistent IDs under
complex scenarios, such as when customers move quickly, change directions abruptly, or
temporarily exit and re-enter the camera’s field of view. These situations can lead to ID
switching (where two individuals’ IDs are swapped) or tracking loss, impacting the
accuracy of time-spent analysis and movement patterns.
● NLP in Feedback Processing: the system encounters limitations in accurately interpreting
nuanced or ambiguous responses. For instance, colloquial language, sarcasm, or mixed
sentiments (e.g., "The fabric is great, but the price is too high") can lead to misclassification
of sentiment as purely positive or negative. Additionally, the system may struggle with
feedback in multiple languages or with grammatical errors, reducing the accuracy of
sentiment analysis
● Caffe Model for Age and Gender Estimation: it faces challenges in achieving high
accuracy under varying conditions. Factors such as poor image quality, non-frontal facial
views, or diverse lighting conditions can lead to incorrect predictions, such as
misclassifying a young adult as an older individual or failing to determine gender
accurately. Additionally, the model may struggle with underrepresented demographic
groups in its training data, leading to biased estimations.

42
7.7 Conclusion
In conclusion, the integration of advanced detection and tracking algorithms with Natural
Language Processing and QR code-based data collection creates a powerful system for real-time
consumer behavior analysis. By accurately monitoring customer movements, dwell time, and
engagement, the system offers valuable insights into in-store dynamics. The addition of NLP to
interpret customer feedback further enhances the depth of understanding, allowing businesses to
uncover sentiments and preferences that may not be explicitly stated.
This holistic approach empowers store managers with actionable intelligence, enabling them to
make swift, informed decisions on layout adjustments, staff deployment, and personalized
marketing strategies. As a result, customer satisfaction and operational efficiency are significantly
improved. The seamless fusion of technology and human behavior analysis positions this system
as a vital tool for modern retail environments aiming to stay competitive and customer-focused.

43
7.8 APPENDIX (SOURCE CODE)
import cv2
from ultralytics import YOLO
import wget
import numpy as np
from deep_sort.deep_sort import DeepSort
import time
import csv
import datetime
import torch

# Check if GPU is availableA

device = 'cuda' if torch.cuda.is_available() else 'cpu'
print(f"Using device: {device}")
model = YOLO('yolov10x.pt')
deep_sort_weights = 'deep_sort/deep/checkpoint/ckpt.t7'

def draw_label(image, text, top_left, bottom_right, color, font_color, font_scale=0.6, font_thickness=2):

# Calculate text size
font = cv2.FONT_HERSHEY_SIMPLEX
text_size = cv2.getTextSize(text, font, font_scale, font_thickness)[0]

# Create a filled rectangle for text background

text_background_top_left = (top_left[0]+18, top_left[1] - text_size[1] - 10)
text_background_bottom_right = (top_left[0] + text_size[0] + 25, top_left[1])

cv2.rectangle(image, text_background_top_left, text_background_bottom_right, color, cv2.FILLED)

# Add text on top of the rectangle

44
text_position = (top_left[0] + 18, top_left[1] - 5)
cv2.putText(image, text, text_position, font, font_scale, font_color, font_thickness)

def draw_rounded_rectangle(image, top_left, bottom_right, color, thickness, radius):

tl = (top_left[0] + radius, top_left[1] + radius)
tr = (bottom_right[0] - radius, top_left[1] + radius)
bl = (top_left[0] + radius, bottom_right[1] - radius)
br = (bottom_right[0] - radius, bottom_right[1] - radius)

# image=cv2.rectangle(image, p1, p2, color, thickness=lw, lineType=cv2.LINE_AA)

cv2.rectangle(image, (tl[0], top_left[1]), (tr[0], bottom_right[1]), color, 2, cv2.LINE_AA)

cv2.rectangle(image, (top_left[0], tl[1]), (bottom_right[0], bl[1]), color, thickness)
cv2.circle(image, tl, radius, color, thickness)
cv2.circle(image, tr, radius, color, thickness)
cv2.circle(image, bl, radius, color, thickness)
cv2.circle(image, br, radius, color, thickness)

def draw_text(image, text, position, background_color, font_color, font_scale=0.5, font_thickness=1):

# Calculate text size
font = cv2.FONT_HERSHEY_SIMPLEX
text_size = cv2.getTextSize(text, font, font_scale, font_thickness)[0]

# Create a filled rectangle for text background

text_background_top_left = (position[0] - 5, position[1] + 5)
text_background_bottom_right = (position[0] + text_size[0] + 5, position[1] - text_size[1] - 5)

cv2.rectangle(image, text_background_top_left, text_background_bottom_right, background_color,

cv2.FILLED)

# Add text on top of the rectangle

text_position = (position[0], position[1] - 5)
45
cv2.putText(image, text, text_position, font, font_scale, font_color, font_thickness)

def get_box_details(boxes):
cls = boxes.cls.tolist() # Convert tensor to list
xyxy = boxes.xyxy
conf = boxes.conf
xywh = boxes.xywh
return cls, xyxy, conf, xywh

46
CHAPTER 8

CONCLUSION AND FUTURE ENHANCEMENTS

The proposed AI-based customer behavior analysis system offers a substantial improvement over
conventional methods like Pose Network and MoveNet by going beyond simple movement
tracking. Our approach incorporates YOLO for pose estimation and deep learning algorithms to
gain deeper insights into customer interests, such as gaze direction, product interactions, and dwell
time. These capabilities allow for a more comprehensive understanding of customer intent and
behavior, helping mall operators optimize store layouts and improve personalized marketing
strategies.
By generating real-time heatmaps and performing sentiment analysis, the system enables mall
operators to identify high-traffic zones, customer preferences, and emotional responses to products
or environments. It further supports operational efficiency through real-time crowd density
monitoring, which plays a crucial role in effective queue management and staff allocation. The
ability to distinguish between casual browsers and serious buyers enhances targeted sales efforts,
boosting conversion rates and overall customer satisfaction.
Another significant contribution of the system is its role in improving inventory management. By
analyzing movement patterns near specific product displays, the system can predict demand trends
and guide stocking decisions. Predictive analytics empowers mall management to stay ahead of
consumer behavior, adjusting promotional campaigns in real time and refining marketing strategies
based on data-driven insights.
Future enhancements could focus on integrating more advanced gaze estimation techniques,
including eye-tracking for better engagement measurement. Combining facial expression
recognition with voice analysis could provide even more accurate sentiment detection.
Implementing multi-camera setups and 3D environment mapping would allow for precise spatial
tracking and further improve the effectiveness of store layout optimization and customer journey
mapping.
Scalability will also be a key focus, with potential extensions to multi-level malls and integration
across various retail locations. Adding features like AI-based virtual assistants and augmented
reality product recommendations could enrich the customer experience. Finally, incorporating
secure and privacy-conscious technologies such as blockchain and federated learning can help
47
ensure responsible data use while expanding the system’s reach and effectiveness in the retail
industry.
To ensure ongoing improvement and adaptability, the system includes a feedback mechanism that
enables customers to share their experiences through a QR code-linked form. This allows mall
operators to gather direct input and continuously refine the system based on real user perspectives.
Such a feedback loop not only empowers customers but also helps businesses stay aligned with
evolving expectations and trends, creating a more responsive and customer-centric retail
environment.
Looking ahead, the integration of this system with broader smart city infrastructure presents
exciting possibilities. By connecting mall behavior analytics with urban transportation data or
external retail ecosystems, stakeholders can plan better logistics, offer seamless shopping journeys,
and create unified experiences across locations. This convergence of AI, IoT, and data science
opens new avenues for revolutionizing the retail landscape and delivering next-level service
personalization and operational excellence.

48
REFERENCES

[1] D. Merad, K.-E. Aziz, R. Iguernaissi, B. Fertil, and P. Drap, ‘Tracking multiple persons under
partial and global occlusions: Application to customers’ behavior analysis’, Pattern Recognit.
Lett., vol. 81, pp. 11–20, Oct. 2016, doi: 10.1016/j.patrec.2016.04.011.

[2] B. Kesari and S. Atulkar, ‘Satisfaction of mall shoppers: A study on perceived utilitarian and
hedonic shopping values’, J. Retail. Consum. Serv., vol. 31, pp. 22–31, Jul. 2016, doi:
10.1016/j.jretconser.2016.03.005.

[3] X. Sheng, ‘The consumer behavior analysis of virtual clothes’, Telemat. Inform. Rep., vol. 10,
p. 100047, Jun. 2023, doi: 10.1016/j.teler.2023.100047.

[4] B. Khalid, ‘Evaluating customer perspectives on omnichannel shopping satisfaction in the

fashion retail sector’, Heliyon, vol. 10, no. 16, p. e36027, Aug. 2024, doi:
10.1016/j.heliyon.2024.e36027.

[5] G. Alves Rezende, A. M. Mariano, M. R. Santos, and A. C. Coelho Constatin, ‘Modeling

Customer Satisfaction in the Food Industry: Insights from a Structural Equation Approach.’,
Procedia Comput. Sci., vol. 242, pp. 130–137, 2024, doi: 10.1016/j.procs.2024.08.250.

[6] M. S. Mayhoub and E. H. Rabboh, ‘Daylighting in shopping malls: Customer’s perception,

preference, and satisfaction’, Energy Build., vol. 255, p. 111691, Jan. 2022, doi:
10.1016/j.enbuild.2021.111691.

[7] Y. Wu, ‘Design and Implementation of Online Shopping Mall Based on Collaborative
Filtering’, Procedia Comput. Sci., vol. 247, pp. 201–210, 2024, doi:
10.1016/j.procs.2024.10.024.

[8] K. A. Vakeel, M. Fudurić, V. Viswanathan, and M. Sakashita, ‘Sustaining shopping

momentum in retail malls using real-time messaging’, J. Retail., vol. 99, no. 1, pp. 102–114,
Mar. 2023, doi: 10.1016/j.jretai.2022.11.002.

[9] D. Oosterlinck, D. F. Benoit, P. Baecke, and N. Van De Weghe, ‘Bluetooth tracking of humans
in an indoor environment: An application to shopping mall visits’, Appl. Geogr., vol. 78, pp.
55–65, Jan. 2017, doi: 10.1016/j.apgeog.2016.11.005.
49
PUBLICATIONS

Srinivasa Rao Konni , Chandini Gedela , Andhavarapu Charan , Sasi Rekha Yamala , Sai Karthik
Bodigi , Dasaradha Arangi “Real-Time Customer Behaviour and Satisfaction Insight System In
Shopping Malls”, Communicated with GIET College , 2025 International Conference on Next
Generation of Green Information and Emerging Technologies.

Charan
No ratings yet
Charan
69 pages
Final Report Indhu
No ratings yet
Final Report Indhu
23 pages
Real-Time Research Project Report: Blue Connect
No ratings yet
Real-Time Research Project Report: Blue Connect
46 pages
RRP Documentation
No ratings yet
RRP Documentation
74 pages
Lmsashu
No ratings yet
Lmsashu
14 pages
Skincare
No ratings yet
Skincare
43 pages
Automated Review Classification ML
No ratings yet
Automated Review Classification ML
61 pages
Miniproject 19
No ratings yet
Miniproject 19
40 pages
B-11 Document-1
No ratings yet
B-11 Document-1
43 pages
Major Project Report BIG MART Final Reedited
No ratings yet
Major Project Report BIG MART Final Reedited
91 pages
Report Batch 10
No ratings yet
Report Batch 10
82 pages
Project Report Final Updated'
No ratings yet
Project Report Final Updated'
37 pages
Bi Report 17 Final
No ratings yet
Bi Report 17 Final
40 pages
Research Paper Recommendation System
No ratings yet
Research Paper Recommendation System
12 pages
Mini Project Report4.2
No ratings yet
Mini Project Report4.2
27 pages
Report Done - Merged
No ratings yet
Report Done - Merged
37 pages
B.Tech Project Report in CSE
No ratings yet
B.Tech Project Report in CSE
13 pages
2403811711421055-Tharun M
No ratings yet
2403811711421055-Tharun M
35 pages
Dbmsnav F2
No ratings yet
Dbmsnav F2
39 pages
Mini Project Report3
No ratings yet
Mini Project Report3
27 pages
p3 Front Pages
No ratings yet
p3 Front Pages
8 pages
Submitted in Partial Fulfillment of The Requirements For The Award of
No ratings yet
Submitted in Partial Fulfillment of The Requirements For The Award of
29 pages
CFMS Synopsis
No ratings yet
CFMS Synopsis
8 pages
WT Mini Project
No ratings yet
WT Mini Project
35 pages
Project File
No ratings yet
Project File
99 pages
JCT UG Project Format
No ratings yet
JCT UG Project Format
14 pages
Team-7 Project Report
No ratings yet
Team-7 Project Report
59 pages
Main Project
No ratings yet
Main Project
43 pages
Mini Project Front Pages Format
No ratings yet
Mini Project Front Pages Format
7 pages
Batch-06 Documentation Akshay
No ratings yet
Batch-06 Documentation Akshay
64 pages
Logbook Sem 8
No ratings yet
Logbook Sem 8
19 pages
Big Sales Prediction Model Using Machine Learning1
No ratings yet
Big Sales Prediction Model Using Machine Learning1
21 pages
A Minor Project Report
No ratings yet
A Minor Project Report
67 pages
B.Tech Sentiment Analysis Report
75% (4)
B.Tech Sentiment Analysis Report
45 pages
Wa0009.
No ratings yet
Wa0009.
27 pages
House Price Prediction Using Linear Regression Updated One
No ratings yet
House Price Prediction Using Linear Regression Updated One
79 pages
Bi Report
No ratings yet
Bi Report
45 pages
Project Report
No ratings yet
Project Report
29 pages
Web Analytics Doc Final
No ratings yet
Web Analytics Doc Final
40 pages
BI REPORT 8 To Rem Pages Final
No ratings yet
BI REPORT 8 To Rem Pages Final
40 pages
IO PROJECT - Final Submit
No ratings yet
IO PROJECT - Final Submit
57 pages
Stock Price Prediction Final Report
No ratings yet
Stock Price Prediction Final Report
50 pages
CORE COURSE PROJECT - Report - Format - For - All - Department
No ratings yet
CORE COURSE PROJECT - Report - Format - For - All - Department
34 pages
Project Merged-1-66 Merged
No ratings yet
Project Merged-1-66 Merged
68 pages
Final Project
No ratings yet
Final Project
43 pages
Mini Project
67% (3)
Mini Project
64 pages
Newsportal Report
No ratings yet
Newsportal Report
48 pages
EDE Taniskha
No ratings yet
EDE Taniskha
24 pages
BI REPORT First - 1-8
No ratings yet
BI REPORT First - 1-8
8 pages
Design Report 1 (Repaired)
No ratings yet
Design Report 1 (Repaired)
50 pages
Proj Report 1
No ratings yet
Proj Report 1
54 pages
DOCUMENTATION
No ratings yet
DOCUMENTATION
104 pages
Store Sales Documentation
No ratings yet
Store Sales Documentation
9 pages
A20 Doc
No ratings yet
A20 Doc
41 pages
Karuppu Intern Report
No ratings yet
Karuppu Intern Report
31 pages
Full Report - Merged
No ratings yet
Full Report - Merged
62 pages
Internship Report Shourish - 19202A0047
No ratings yet
Internship Report Shourish - 19202A0047
21 pages
DBMS 2 MCQ
No ratings yet
DBMS 2 MCQ
6 pages
Methods of Data Collection July 2023
No ratings yet
Methods of Data Collection July 2023
52 pages
JDBC Questions and Answers
No ratings yet
JDBC Questions and Answers
18 pages
Hibernate
No ratings yet
Hibernate
29 pages
Test 4
No ratings yet
Test 4
2 pages
Course Outline
No ratings yet
Course Outline
6 pages
Analyzing The Public Transportation in Amman: The Case of The Bus Rapid Transit (BRT)
No ratings yet
Analyzing The Public Transportation in Amman: The Case of The Bus Rapid Transit (BRT)
156 pages
AI in Language in Learning and Teaching
No ratings yet
AI in Language in Learning and Teaching
15 pages
How Does Mongodb Differ From Traditional Relational Databases?
No ratings yet
How Does Mongodb Differ From Traditional Relational Databases?
6 pages
Consumer Behaviour & Marketing Research - Bba - IV Sem
No ratings yet
Consumer Behaviour & Marketing Research - Bba - IV Sem
9 pages
Chapter 18 Data Manipulation - Database Access - Ms
No ratings yet
Chapter 18 Data Manipulation - Database Access - Ms
19 pages
Architecting A Platform For Big Data Analytics
No ratings yet
Architecting A Platform For Big Data Analytics
23 pages
Linked Lists: CENG 213 Metu/Odtü Data Structures Yusuf Sahillioğlu
No ratings yet
Linked Lists: CENG 213 Metu/Odtü Data Structures Yusuf Sahillioğlu
63 pages
Module I - 1
No ratings yet
Module I - 1
23 pages
Section A Objective Questions (50 Marks) Instruction:: Confidential
No ratings yet
Section A Objective Questions (50 Marks) Instruction:: Confidential
19 pages
The Exodus of Filipino Teachers To Uzbekistan - A Hermeneutic Phen
No ratings yet
The Exodus of Filipino Teachers To Uzbekistan - A Hermeneutic Phen
12 pages
Assignment 1: Data Analytics For Business Decision
No ratings yet
Assignment 1: Data Analytics For Business Decision
6 pages
Lecture 2
No ratings yet
Lecture 2
16 pages
KSSC First Phase Report Summary For Districts, 18-6-2018
No ratings yet
KSSC First Phase Report Summary For Districts, 18-6-2018
9 pages
Functional Dependencies in DBMS
No ratings yet
Functional Dependencies in DBMS
8 pages
Manuscript Format To Research (Word)
No ratings yet
Manuscript Format To Research (Word)
3 pages
Project Report Phase I - Template - August 2025
No ratings yet
Project Report Phase I - Template - August 2025
14 pages
AI Professional 6 Week Course in IIIT
No ratings yet
AI Professional 6 Week Course in IIIT
2 pages
Chapter 11 Multiple Choice (Database) : True/False Questions
50% (2)
Chapter 11 Multiple Choice (Database) : True/False Questions
2 pages
Power Bi Project
No ratings yet
Power Bi Project
7 pages
Unit 2
No ratings yet
Unit 2
25 pages
Data Analytics Quiz
100% (1)
Data Analytics Quiz
8 pages
Guide To Transaction Processing
No ratings yet
Guide To Transaction Processing
14 pages
Bitmap Join Indexes
No ratings yet
Bitmap Join Indexes
15 pages
Fulltext01 28
No ratings yet
Fulltext01 28
1 page