Statistics

Uploaded by

Muskan Sikarwar

Hypothesis Testing for Categorical Data (Chi-Square Test)
By: Kingshuk Roy 26
Kunal Ahirwar 27
Manoj Singh 28
Mohan Singh Bhadoriya 29
Muskan Sikarwar 30
Roadmap
Introduction to Hypothesis Testing

Understanding the Chi-Square Test

Defining the Chi-Square Test Statistic (x²)

Chi-Square Goodness-of-Fit Test

Hypothesis Testing for a Population Proportion Using Chi-Square Goodness-of-Fit Test

Chi-Square Test of Independence

Chi-Square Test of Homogeneity

Introduction to Hypothesis Testing for Categorical Data
Meaning:
Hypothesis testing for categorical data helps
determine if there are meaningful patterns or
relationships in data that fall into categories (like
survey responses, customer preferences, or
demographic characteristics).
• What is Categorical Data? Data grouped into categories
without inherent numeric meaning (e.g., gender, brand
preference).
• Chi-Square Test Purpose: Used to evaluate relationships
and test hypotheses on categorical data without requiring
the data to be normally distributed.
• Applications: Commonly used in fields like market
research, healthcare studies, and social sciences.
Defining the Chi-Square Test Statistic (x²)
Chi-Square Test Overview:
The Chi-Square (x²) test is a statistical method used to
analyze categorical data. It assesses whether observed
frequencies in categories significantly differ from expected
frequencies.
Formula:
x² = Σ((O - E)² / E)
• O = Observed Frequency, E = Expected Frequency
Interpretation:
A higher x² value suggests a greater difference between
observed and expected frequencies, potentially indicating a
significant relationship or difference within the data.
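As a minimal sketch of the formula above, the statistic can be computed in a few lines of Python. The function name `chi_square_statistic` and the toy counts are illustrative assumptions, not part of the slides.

```python
def chi_square_statistic(observed, expected):
    """Return the sum over categories of (O - E)^2 / E."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    if any(e <= 0 for e in expected):
        raise ValueError("expected frequencies must be positive")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Illustrative counts (assumed, not from the slides): 60 observations, 3 categories.
toy_observed = [18, 22, 20]
toy_expected = [20, 20, 20]
toy_stat = chi_square_statistic(toy_observed, toy_expected)
print(round(toy_stat, 3))  # (4 + 4 + 0) / 20 = 0.4
```

A value of 0 means the observed counts match the expected counts exactly; the statistic grows as the two sets of counts diverge.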
Conditions for Applying the Chi-Square Test
Key Conditions:
• Categorical Data Required: The test is designed for
categorical variables, such as survey responses or
demographic groups.
• Sample Size: Expected frequencies in each category
should be at least 5 to ensure reliable results.
• Independence of Observations: Each observation
should be independent, meaning that one outcome
doesn’t influence another.
Importance:
Meeting these conditions is essential for the validity of the
Chi-Square test, as it helps produce accurate and meaningful
results.
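The sample-size condition can be checked before running a test. The sketch below flags expected counts under 5; the helper name `small_expected_cells` and the example counts are assumptions made for illustration.

```python
def small_expected_cells(expected, minimum=5):
    """Return (index, value) pairs for expected counts below `minimum`."""
    return [(i, e) for i, e in enumerate(expected) if e < minimum]


# Illustrative expected counts (assumed, not from the slides).
ok_expected = [50, 50, 50, 50]
thin_expected = [12.0, 4.5, 3.5]

print(small_expected_cells(ok_expected))    # []
print(small_expected_cells(thin_expected))  # [(1, 4.5), (2, 3.5)]
```

An empty list means the rule of thumb is met; flagged categories are often merged with neighbors or handled with more data.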
Chi-Square for Goodness-of-Fit
Purpose:
The Goodness-of-Fit test checks if the observed frequency distribution of a single
categorical variable matches an expected distribution (e.g., testing if preferences are
equally spread across options).
• The chi-square test for goodness-of-fit uses frequency data from a sample to
test hypotheses about the shape or proportions of a population.
• The data, called observed frequencies, simply count how many individuals from
the sample are in each category.

Example: Candy Color Preference

Objective:
• To determine if customers have a uniform preference for candy colors.
Hypotheses:
• Null Hypothesis (H0): Customers prefer candy colors uniformly.
• Alternative Hypothesis (H1): Customers do not prefer candy colors uniformly
Data Collection:
A survey of 200 customers yields the following observed preferences:
• Red: 50
• Blue: 60
• Green: 30
• Yellow: 60
Expected Distribution:
Assuming equal preference, the expected count for each color is:
Expected count = 200 / 4 = 50
Chi-Square Calculation: Using the formula x² = Σ((O - E)² / E):
For Red: (50 - 50)²/50 = 0
For Blue: (60 - 50)²/50 = 2
For Green: (30 - 50)²/50 = 8
For Yellow: (60 - 50)²/50= 2
Total x²:
x² = 0 + 2 + 8 + 2 = 12
Degrees of Freedom:
df=k−1=4−1=3
Critical Value (α = 0.05):
Approximately 7.815 for df=3
Conclusion:
• Since 12 > 7.815, we reject the null hypothesis.
• Interpretation: There is significant evidence that customer preferences for candy colors are not uniform.
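The candy example can be reproduced with a short Python sketch. It uses only the counts and the critical value quoted above; the variable names are assumptions chosen for readability.

```python
# Observed survey counts from the example.
observed = {"Red": 50, "Blue": 60, "Green": 30, "Yellow": 60}

n = sum(observed.values())       # 200 customers
expected = n / len(observed)     # 50 per color under H0 (uniform preference)

# Per-color contribution (O - E)^2 / E.
contributions = {color: (o - expected) ** 2 / expected
                 for color, o in observed.items()}
chi2 = sum(contributions.values())   # 0 + 2 + 8 + 2
df = len(observed) - 1               # k - 1 = 3

CRITICAL_0_05_DF3 = 7.815            # table value quoted in the example
reject_h0 = chi2 > CRITICAL_0_05_DF3

print(contributions)
print(chi2, df, reject_h0)
```

Green contributes 8 of the 12 points, so most of the departure from uniformity comes from that one color.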
Case Study for Goodness-of-Fit Test: Survey on Preferred Coffee Flavors
Objective: Test if customer preference for coffee flavors (e.g., Vanilla, Mocha, Hazelnut) is
uniformly distributed.

Case Study Details:

• Hypotheses:
⚬ Null (H0): Preferences are uniformly distributed across flavors.
⚬ Alternative (H1): Preferences are not uniformly distributed.
• Data Collection:
⚬ Survey results: 150 respondents, with observed preferences:
￭ Vanilla: 40
￭ Mocha: 55
￭ Hazelnut: 55
• Expected Distribution (Assume Equal Preference):
⚬ Expected count for each flavor = 150 / 3 = 50.
Chi-Square Calculation:
• For Vanilla: (40 - 50)² / 50 = 2
• For Mocha: (55 - 50)² / 50 = 0.5
• For Hazelnut: (55 - 50)² / 50 = 0.5
• Total x² = 2 + 0.5 + 0.5 = 3
Interpretation: The calculated x² value for this test is 3. To determine whether this value is
statistically significant, we compare it to the critical chi-square value for the specified degrees of
freedom (df) and significance level (commonly set at 0.05).
• Degrees of Freedom (df): Calculated as the number of categories minus one, so df = 3 - 1 = 2.
• Critical Value: For df = 2 at a 0.05 significance level, the critical chi-square value is
approximately 5.99.
Decision Rule:
• If x² ≤ 5.99: Fail to reject the null hypothesis (H0), suggesting that any differences in flavor
preferences could be due to random chance.
• If x² > 5.99: Reject the null hypothesis (H0), indicating that the differences in observed
preferences are statistically significant, i.e. unlikely to arise from random variation alone.
Conclusion for This Case Study: Since the calculated x² value (3) is less than the critical value
(5.99), we fail to reject the null hypothesis. The survey does not provide significant evidence that
preferences across Vanilla, Mocha, and Hazelnut flavors differ from a uniform distribution; the
observed variation is consistent with random chance.
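The coffee case study can also be checked with a p-value. The sketch below relies on a property that holds only for df = 2: the upper-tail probability of the chi-square distribution is exp(-x/2), so the 0.05 critical value is -2·ln(0.05). Variable names are assumptions; for other degrees of freedom a statistics library or a table would be needed.

```python
import math

observed = [40, 55, 55]                     # Vanilla, Mocha, Hazelnut
expected = sum(observed) / len(observed)    # 150 / 3 = 50 under H0

chi2 = sum((o - expected) ** 2 / expected for o in observed)  # 2 + 0.5 + 0.5

# Valid for df = 2 only: P(X > x) = exp(-x / 2).
p_value = math.exp(-chi2 / 2)
critical = -2 * math.log(0.05)              # 0.05 critical value for df = 2

print(chi2, round(p_value, 4), round(critical, 3))  # 3.0 0.2231 5.991
print(chi2 > critical)                               # False: fail to reject H0
```

The p-value of about 0.22 is well above 0.05, which matches the critical-value comparison.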
Hypothesis Testing for a Population Proportion Using Chi-Square Goodness-of-Fit Test
Objective:
Use the Chi-Square (x²) test as an alternative to the z-test for testing if an observed proportion matches an expected population
proportion.
Example Scenario:
A retailer believes 60% of its customers prefer online shopping, while 40% prefer in-store shopping. To verify this, a survey is conducted
with 200 customers.
Steps:
1. Formulate Hypotheses:
⚬ Null Hypothesis (H0): The observed distribution matches the expected proportions (60% online, 40% in-store).
⚬ Alternative Hypothesis (H1): The observed distribution does not match the expected proportions.
2. Collect Observed and Expected Frequencies:
⚬ Observed (from Survey):
￭ Online: 120 customers, In-store: 80 customers
⚬ Expected (based on Assumption):
￭ Online: 200 × 0.60 = 120, In-store: 200 × 0.40 = 80
3. Apply x² Formula:
⚬ x² = Σ((Observed - Expected)² / Expected) for each category
⚬ Here, x² = ((120 - 120)² / 120) + ((80 - 80)² / 80) = 0
4. Interpret Result (p-value or Critical Value):
⚬ Degrees of Freedom (df): 1 (number of categories - 1)
⚬ Decision Rule:
￭ If x² is less than or equal to the critical value for df = 1 (e.g., 3.84 at a 0.05 significance level), we fail to reject H0, suggesting that the observed distribution is consistent with the expected proportions. Here x² = 0 ≤ 3.84, so we fail to reject H0.
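The link to the z-test can be illustrated numerically: with two categories, the chi-square statistic equals the square of the one-proportion z statistic. In the sketch below the function names and the second data set (130 online out of 200) are hypothetical, added only because the survey above gives a statistic of exactly 0.

```python
import math


def chi_square_two_categories(successes, n, p0):
    """Goodness-of-fit statistic for two categories with hypothesized share p0."""
    observed = [successes, n - successes]
    expected = [n * p0, n * (1 - p0)]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def z_statistic(successes, n, p0):
    """One-proportion z statistic using the standard error under H0."""
    p_hat = successes / n
    return (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)


# Survey from the example: observed equals expected, so the statistic is zero.
chi2_survey = chi_square_two_categories(120, 200, 0.60)

# Hypothetical alternative outcome (assumed, not from the slides).
chi2_alt = chi_square_two_categories(130, 200, 0.60)
z_alt = z_statistic(130, 200, 0.60)

print(chi2_survey)
print(round(chi2_alt, 4), round(z_alt ** 2, 4))  # both about 2.0833
print(chi2_alt > 3.84)                           # False: fail to reject H0
```

Because the two statistics agree, the chi-square test with df = 1 and the two-sided z-test lead to the same decision.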
Case Study for Test of Independence: Customer Satisfaction and Brand Loyalty
Objective:
Investigate if customer satisfaction levels are associated with brand loyalty
status.
Case Study Details:
Hypotheses:
• Null Hypothesis (H0): Satisfaction and loyalty are independent.
• Alternative Hypothesis (H1): Satisfaction and loyalty are not independent.
Data Collection: Survey results show:
• Satisfied: 70 loyal, 30 not loyal
• Neutral: 40 loyal, 60 not loyal
• Dissatisfied: 10 loyal, 90 not loyal
Contingency Table:

                Loyal   Not Loyal   Row Total
Satisfied          70          30         100
Neutral            40          60         100
Dissatisfied       10          90         100
Total             120         180         300

Expected Frequencies Calculation:
For each cell, calculate the expected frequency using the formula:
Expected Frequency = (Row Total × Column Total) / Grand Total
• For Satisfied & Loyal: (100 × 120) / 300 = 40
• For Satisfied & Not Loyal: (100 × 180) / 300 = 60
• For Neutral & Loyal: (100 × 120) / 300 = 40
• For Neutral & Not Loyal: (100 × 180) / 300 = 60
• For Dissatisfied & Loyal: (100 × 120) / 300 = 40
• For Dissatisfied & Not Loyal: (100 × 180) / 300 = 60

Chi-Square Calculation:
Using the formula:
x² = Σ((Observed - Expected)² / Expected)
• For Satisfied & Loyal: (70 - 40)² / 40 = 22.5
• For Satisfied & Not Loyal: (30 - 60)² / 60 = 15
• For Neutral & Loyal: (40 - 40)² / 40 = 0
• For Neutral & Not Loyal: (60 - 60)² / 60 = 0
• For Dissatisfied & Loyal: (10 - 40)² / 40 = 22.5
• For Dissatisfied & Not Loyal: (90 - 60)² / 60 = 15
Total Chi-Square Value:
x² = 22.5 + 15 + 0 + 0 + 22.5 + 15 = 75

Interpretation:
Compare the calculated x² value (75) to the critical value with degrees of freedom (df
= (rows - 1) × (columns - 1) = 2) at a significance level of 0.05 (critical value ≈ 5.99).
Conclusion:
Since 75 > 5.99, we reject the null hypothesis (H0). This indicates that customer
satisfaction levels are associated with brand loyalty status.
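The independence calculation can be sketched in Python using only the survey counts. The helper name `chi_square_from_table` is an assumption; expected counts follow the (Row Total × Column Total) / Grand Total rule from the case study.

```python
import math

# Rows are satisfaction levels; columns are [Loyal, Not Loyal].
table = {
    "Satisfied":    [70, 30],
    "Neutral":      [40, 60],
    "Dissatisfied": [10, 90],
}


def chi_square_from_table(rows):
    """Return (statistic, df, expected) for a table given as rows of counts."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    grand = sum(row_totals)
    expected = [[rt * ct / grand for ct in col_totals] for rt in row_totals]
    stat = sum((o - e) ** 2 / e
               for obs_row, exp_row in zip(rows, expected)
               for o, e in zip(obs_row, exp_row))
    df = (len(rows) - 1) * (len(col_totals) - 1)
    return stat, df, expected


stat, df, expected = chi_square_from_table(list(table.values()))
print(expected)         # every row: [40.0, 60.0]
print(stat, df)         # 75.0 2
print(stat > 5.99)      # True: reject H0

# For df = 2 only, the p-value is exp(-x / 2); here it is far below 0.05.
p_value = math.exp(-stat / 2)
print(p_value < 0.05)   # True
```

The Satisfied and Dissatisfied rows account for the whole statistic; the Neutral row matches its expected counts exactly.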
Chi-Square Test of Homogeneity
Objective:
To test if distributions of preferences are the same across different groups.
Example Use Case:
Testing if beverage preferences differ among various age groups (18-25, 26-35, 36-45).
Case Study Details:
• Hypotheses:
⚬ Null Hypothesis (H0): Beverage preferences are the same across age groups.
⚬ Alternative Hypothesis (H1): Beverage preferences differ across age groups.

Data Collection:

Age Group   Beverage A   Beverage B   Beverage C   Row Total
18-25               30           20           50         100
26-35               25           35           40         100
36-45               20           30           50         100
Total               75           85          140         300

Steps Involved:
Calculate Expected Frequencies:
For example, for Age Group 18-25 and Beverage A:
Expected Count=100×75/300=25
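Chi-Square Calculation:
For each cell, calculate (O - E)²/E, then sum over all cells:
x² = Σ((O - E)²/E)
Calculated values:
Beverage A (18-25): (30 - 25)²/25 = 1
Beverage B (18-25): (20 - 28.33)²/28.33 ≈ 2.45
Beverage C (18-25): (50 - 46.67)²/46.67 ≈ 0.24
Beverage A (26-35): (25 - 25)²/25 = 0
Beverage B (26-35): (35 - 28.33)²/28.33 ≈ 1.57
Beverage C (26-35): (40 - 46.67)²/46.67 ≈ 0.95
Beverage A (36-45): (20 - 25)²/25 = 1
Beverage B (36-45): (30 - 28.33)²/28.33 ≈ 0.10
Beverage C (36-45): (50 - 46.67)²/46.67 ≈ 0.24
Total Chi-Square Value:
x² ≈ 1 + 2.45 + 0.24 + 0 + 1.57 + 0.95 + 1 + 0.10 + 0.24 ≈ 7.55
Degrees of Freedom:
df = (r - 1)(c - 1) = (3 - 1)(3 - 1) = 4
Critical Value (α = 0.05):
From the Chi-Square table, the critical value for df = 4 is approximately 9.488.
Conclusion:
Decision: Since 7.55 < 9.488, we fail to reject H0.
Interpretation: These data do not provide significant evidence that beverage preferences differ across age groups,
so they do not by themselves justify separate marketing strategies for each age group.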
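As a hedged sketch, the homogeneity statistic can be computed directly from the counts in the data table rather than taken as given. The helper name `chi_square_from_table` is an assumption; 9.488 is the critical value quoted above for df = 4.

```python
# Rows are age groups; columns are [Beverage A, Beverage B, Beverage C].
counts = {
    "18-25": [30, 20, 50],
    "26-35": [25, 35, 40],
    "36-45": [20, 30, 50],
}


def chi_square_from_table(rows):
    """Return (statistic, df) for a table given as rows of counts."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    grand = sum(row_totals)
    stat = 0.0
    for row, rt in zip(rows, row_totals):
        for o, ct in zip(row, col_totals):
            e = rt * ct / grand          # expected count for this cell
            stat += (o - e) ** 2 / e
    df = (len(rows) - 1) * (len(col_totals) - 1)
    return stat, df


stat, df = chi_square_from_table(list(counts.values()))
CRITICAL_0_05_DF4 = 9.488                # table value quoted above

print(round(stat, 2), df)                # 7.55 4
print(stat > CRITICAL_0_05_DF4)          # False
```

Computed from these counts, the statistic is about 7.55, which lies below the 9.488 critical value for df = 4.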
Conclusion
In summary, the Chi-Square test serves as a robust method for hypothesis
testing involving categorical data, making it invaluable across diverse fields
such as marketing, healthcare, and social sciences. The case studies discussed
highlight its practical applications, particularly in analyzing customer
preferences and behaviors. By enabling researchers and analysts to assess the
relationships and distributions of categorical variables, the Chi-Square test aids
in informed decision-making and strategic planning, ultimately enhancing the
understanding of market trends and consumer choices.
THANK YOU!
