0% found this document useful (0 votes)

18 views7 pages

Lecture 5

The document discusses bivariate distribution, which involves analyzing the relationship between two variables, X and Y, using methods like scatter diagrams and correlation coefficients. It explains how to calculate the correlation coefficient to measure the strength and direction of the linear relationship between the variables, as well as introducing rank correlation coefficients for ranked data. Additionally, it covers normal probability distributions, their characteristics, formulas, and real-world applications.

Uploaded by

shaheenchoudhary647

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views7 pages

Lecture 5

Uploaded by

shaheenchoudhary647

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Correlation

Bivariate Distribution: Bivariate distribution refers to distribution that consists of two

variables, typically denoted as X and Y, which are measured or observed together. The objective
of analysing bivariate distribution is to under understand the relationship between these two
variables. For example, if we measure the heights and weights of a certain group of persons,
we shall get what is known as Bivariate distribution, one variable relating to height and other
variable relating to weight.

Scatter Diagram: It is the simplest way of the diagrammatic representation of bivariate data.
Thus, for the bivariate distribution (𝑥𝑖 , 𝑦𝑖 ); 𝑖 = 1,2,3, … , 𝑛, if the values of the variables X and
Y be plotted along the x-axis and y-axis respectively in the 𝑥𝑦-plane, the diagram of dots so
obtained is known as scatter diagram.
Scatter Plot example

Coefficient of Correlation: As a measure of intensity or degree of linear relationship between

two variables, Karl Pearson (1867-1936), a British Biometrician, developed a formula called
correlation Coefficient.
Correlation coefficient between two random variables X and Y, usually denoted by
𝑟(𝑋, 𝑌) or simply 𝑟𝑋𝑌 is a numerical measure of linear relationship between them and is defined
as
𝐶𝑜𝑣(𝑋, 𝑌)
𝑟(𝑋, 𝑌) =
𝜎𝑋 𝜎𝑌
If (𝑥𝑖 , 𝑦𝑖 ); 𝑖 = 1,2,3, … , 𝑛, is the bivariate distribution, then
𝑛
1
𝐶𝑜𝑣(𝑋, 𝑌) = ∑(𝑥𝑖 − 𝑥̅ ) (𝑦𝑖 − 𝑦̅)
𝑛
𝑖=1
𝑛
1
𝜎𝑋 = √ ∑(𝑥𝑖 − 𝑥̅ )2
𝑛
𝑖=1

𝑛
1
𝜎𝑌 = √ ∑(𝑦𝑖 − 𝑦̅)2
𝑛
𝑖=1

Where, 𝑟(𝑋, 𝑌) = Correlation Coefficient between variables X and Y.

𝐶𝑜𝑣(𝑋, 𝑌) = Covariance between the variables X and Y
𝜎𝑋 = Standard deviation of variable X
and 𝜎𝑌 = Standard deviation of variable Y
Correlation Coefficient between X and Y can be written as

∑𝑛𝑖=1(𝑥𝑖 − 𝑥̅ ) (𝑦𝑖 − 𝑦̅)

𝑟(𝑋, 𝑌) =
√∑𝑛𝑖=1(𝑥𝑖 − 𝑥̅ )2 √∑𝑛𝑖=1(𝑦𝑖 − 𝑦̅)2

Here, 𝑥̅ , 𝑦̅ are the means of data X and Y respectively.

Example: Calculate the correlation coefficient for the following bivariate data as
X 3 4 5 6 7
Y 12 14 16 18 20

Solution: Make table to obtain the required quantities which will help in calculating the
Correlation coefficient.
X Y (𝑥 − 𝑥̅ ) (𝑥 − 𝑥̅ )2 (𝑦 − 𝑦̅) (𝑦 − 𝑦̅)2 (𝑥 − 𝑥̅ )(𝑦 − 𝑦̅)
3 12 -2 4 -4 16 8
4 14 -1 1 -2 4 2
5 16 0 0 0 0 0
6 18 1 1 2 4 2
7 20 2 4 4 16 8
𝑥̅ = 5 𝑦̅ = 16 10 44 20

From the above table, we get

∑(𝑥𝑖 − 𝑥̅ )2 = 10

∑(𝑦𝑖 − 𝑦̅)2 = 44
And ∑(𝑥𝑖 − 𝑥̅ )(𝑦𝑖 − 𝑦̅) = 20
Then the correlation coefficient between X and Y is calculated as

∑𝑛𝑖=1(𝑥𝑖 − 𝑥̅ ) (𝑦𝑖 − 𝑦̅)

𝑟(𝑋, 𝑌) =
√∑𝑛𝑖=1(𝑥𝑖 − 𝑥̅ )2 √∑𝑛𝑖=1(𝑦𝑖 − 𝑦̅)2
20
=
√10√44
20
=
√440
= 0.95
Hence, the correlation coefficient between X and Y as 𝑟(𝑋, 𝑌) = 0.95
Range of correlation coefficient: The range of correlation coefficient 𝑟(𝑋, 𝑌) lies between -1
and 1, Mathematically it can be denoted as −1 ≤ 𝑟(𝑋, 𝑌) ≤ 1.
Interpretation
1. Strength: Coefficient value means 0 ≤ 𝑟(𝑋, 𝑌) ≤ 1.
2. Direction: Positive (direct), negative (inverse).

Correlation
Rank Correlation Coefficient: Let us suppose that a group of n individuals is arranged in
order of merits or proficiency in possession of two characteristics A and B. These ranks in the
two characteristics will, in general, be different. For example, if we consider the relation
between intelligence and beauty, it is not necessary that a beautiful individual is intelligent also.
Let (𝑥𝑖 , 𝑦𝑖 ); 𝑖 = 1,2,3, … , 𝑛 be the ranks of the 𝑖 𝑡ℎ individual in two characteristics A
and B respectively. Then, correlation coefficient between the ranks x’s and y’s is called the
rank correlation coefficient between A and B for the group of individuals.
Assuming that no two individuals are bracketed equal (means have same rank) in either
classification, each of the variable X and Y takes the values 1, 2, 3, …, n.
Then, the Rank correlation coefficient is defined as

6 ∑𝑛𝑖=1 𝑑𝑖 2
𝑟(𝑋, 𝑌) = 1 −
𝑛 (𝑛 2 − 1)
Where, 𝑑𝑖 = 𝑥𝑖 − 𝑦𝑖 ; 𝑖 = 1,2,3, … , 𝑛

Example: The ranks of same 10 students in Mathematics and Physics are as follows. Two
numbers within bracket denotes the ranks of the students in Mathematics and Physics.
(1, 10) (2, 9) (3, 4) (5, 3) (4, 6)
(7, 2) (8, 1) (6, 8) (10, 9) (9, 7)
Calculate the rank correlation coefficient for proficiencies of this group in Mathematics and
Physics.
Solution:
Ranks in Ranks in d=x-y 𝑑2
Maths(X) Physics(Y)
1 10 -9 81
2 9 -7 49
3 4 -1 1
5 3 2 4
4 6 -2 4
7 2 5 25
8 1 7 49
6 8 -2 4
10 9 1 1
9 7 2 4
∑ 𝑑 2 = 222

Thus, the rank correlation coefficient is calculated as

6 ∑𝑛𝑖=1 𝑑𝑖 2
𝑟(𝑋, 𝑌) = 1 −
𝑛 (𝑛 2 − 1)
Using the table, we get 𝑛 = 10 and ∑𝑛𝑖=1 𝑑𝑖 2 = 222
6 ∗ 222
𝑟(𝑋, 𝑌) = 1 −
10 ∗ (102 − 1)
1332
=1−
10 ∗ (100 − 1)
1332
=1−
10 ∗ 999
1332
=1−
9990
= 1 − 0.13
= 0.87
Hence, the rank correlation coefficient as 𝑟(𝑋, 𝑌) = 0.87.

Uses of correlation coefficient: Correlation coefficient measure the strength and direction of
linear relationship between two variables. Here are some uses:
Statistical Analysis
1. Hypothesis Testing: Test significance of correlation
2. Regression Analysis: Identify predictor variables.
3. Inference: make predictions based on correlations
Real World application
1. Finance: Analyse stock prices, trading volumes.
2. Medicine: Study disease relationship, treatment outcomes.
3. Social Sciences: Examine social, economic factors.
Normal Probability
Normal probability, also known as gaussian probability, is a continuous probability distribution
that is symmetric about the mean. It is widely used in statistics, mathematics, and science to
model real-valued random variables.
Some key characteristics:
1. Symmetry: The normal distribution is symmetric about its mean.
2. Bell-shaped: The normal distribution has a bell-shaped curve.
3. Mean (𝜇): The average value of the distribution.
4. Standard Deviation (𝜎): It measures the spread or dispersion of the distribution.
5. Total Area: The total area under the curve is 1.

Normal Distribution formula

The probability density function of a normal distribution is given by
1 1 𝑥−𝜇 2
𝑒 −2( )
𝑓 (𝑥 ) = 𝜎
√2𝜋𝜎 2
Where 𝑥 is the random variable, 𝜇 is the mean, 𝜎 is the standard deviation, and 𝑒 is the base
of the natural logarithm.
Types of Normal Distributions
1. Standard Normal Distribution: A normal distribution with a mean of 0 and a standard
deviation of 1 is called standard normal distribution.
2. Non-standard Normal Distribution: A normal distribution with a mean and standard
deviation other than 0 and 1 is called non-standard normal distribution.
Real-World Applications
1. Finance: Modelling stock prices and returns
2. Medicine: Modelling the distribution of blood pressure and other health metrics.
3. Engineering: Modelling measurement errors and tolerances.

L3 - Correlation & Rank Correlation
No ratings yet
L3 - Correlation & Rank Correlation
11 pages
Unit 1 Correlation, Regression and Curve Fitting 2024-25-1
No ratings yet
Unit 1 Correlation, Regression and Curve Fitting 2024-25-1
23 pages
Correlaton Stats
No ratings yet
Correlaton Stats
8 pages
Ch.-1 Correlation, Regression and Curve Fitting
No ratings yet
Ch.-1 Correlation, Regression and Curve Fitting
22 pages
Understanding Correlation Coefficients
No ratings yet
Understanding Correlation Coefficients
19 pages
Unit III Notes
No ratings yet
Unit III Notes
9 pages
How To Calculate A Correlation
No ratings yet
How To Calculate A Correlation
5 pages
Correlation and Regression Unit 1
No ratings yet
Correlation and Regression Unit 1
16 pages
IV - Measures of Relationship
100% (1)
IV - Measures of Relationship
4 pages
Correlation Analysis Overview
No ratings yet
Correlation Analysis Overview
40 pages
ACC 107 - Regression and Correlation
No ratings yet
ACC 107 - Regression and Correlation
24 pages
PSNM - Ch. 1
No ratings yet
PSNM - Ch. 1
16 pages
Chapter 3 Stat
No ratings yet
Chapter 3 Stat
66 pages
Correlation Rank - Correlation Curve - Fitting For Student
No ratings yet
Correlation Rank - Correlation Curve - Fitting For Student
26 pages
8 Correlation
No ratings yet
8 Correlation
22 pages
Correlation & Regression (Complete) .PDF Theory Module-6-B
100% (1)
Correlation & Regression (Complete) .PDF Theory Module-6-B
9 pages
Mathematics III (Prob&Stats) - Unit 4 5
No ratings yet
Mathematics III (Prob&Stats) - Unit 4 5
122 pages
Rank Correlation Explained
No ratings yet
Rank Correlation Explained
21 pages
Correlation Notes
No ratings yet
Correlation Notes
15 pages
12 Correlation and Rank Correlation 05-02-2024
No ratings yet
12 Correlation and Rank Correlation 05-02-2024
19 pages
Lecture 11-Correlation and Linear Regression
No ratings yet
Lecture 11-Correlation and Linear Regression
7 pages
8.3 Correlation
No ratings yet
8.3 Correlation
11 pages
Correlation Regression Theory
No ratings yet
Correlation Regression Theory
8 pages
MRS - Diana-Correlation Analysis-Notes
No ratings yet
MRS - Diana-Correlation Analysis-Notes
16 pages
Understanding Correlation Measures
100% (1)
Understanding Correlation Measures
6 pages
Correlation
No ratings yet
Correlation
31 pages
Understanding Correlation & Graphs
No ratings yet
Understanding Correlation & Graphs
7 pages
Correlation and Regression
No ratings yet
Correlation and Regression
167 pages
Correlation
No ratings yet
Correlation
6 pages
Lecture Notes #4 Correlation
No ratings yet
Lecture Notes #4 Correlation
8 pages
Introduction To Correlationand Regression Analysis BY Farzad Javidanrad PDF
No ratings yet
Introduction To Correlationand Regression Analysis BY Farzad Javidanrad PDF
52 pages
Correlation and Regression
No ratings yet
Correlation and Regression
39 pages
Correlation
No ratings yet
Correlation
5 pages
Correlation Coefficient in Medical Research
No ratings yet
Correlation Coefficient in Medical Research
6 pages
Coo Relation
No ratings yet
Coo Relation
16 pages
Correction
No ratings yet
Correction
10 pages
Probability and Statistics - Session 4
No ratings yet
Probability and Statistics - Session 4
35 pages
Lecture VII Bivariate Data
No ratings yet
Lecture VII Bivariate Data
8 pages
Understanding Correlation Basics
No ratings yet
Understanding Correlation Basics
9 pages
Correlation Coefficient Guide
No ratings yet
Correlation Coefficient Guide
7 pages
Correlation and Regression Guide
No ratings yet
Correlation and Regression Guide
42 pages
Correlation and Regression Analysis
100% (1)
Correlation and Regression Analysis
59 pages
r23 P & S Unit 2 Material
No ratings yet
r23 P & S Unit 2 Material
14 pages
Using Statistical Techniq Ues in Analyzing Data
100% (1)
Using Statistical Techniq Ues in Analyzing Data
40 pages
Correlation
No ratings yet
Correlation
5 pages
MANSCI Midterm Correlation
No ratings yet
MANSCI Midterm Correlation
27 pages
CORRELATION
No ratings yet
CORRELATION
5 pages
Correlation and Regression
No ratings yet
Correlation and Regression
4 pages
Oe Statistics Notes
No ratings yet
Oe Statistics Notes
32 pages
5-Correlation, Regression and Rank Correlation-08-03-2024
No ratings yet
5-Correlation, Regression and Rank Correlation-08-03-2024
29 pages
Chapter 8 - PSYC 284
No ratings yet
Chapter 8 - PSYC 284
7 pages
Correlation
No ratings yet
Correlation
33 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
17 pages
ECN 652 Handout 9 Student
No ratings yet
ECN 652 Handout 9 Student
46 pages
Statistics & Probability Q4 - Week 7-8
No ratings yet
Statistics & Probability Q4 - Week 7-8
15 pages
Linear Correlation Analysis Guide
No ratings yet
Linear Correlation Analysis Guide
11 pages
AMDS - 3 - Statistical Averages
No ratings yet
AMDS - 3 - Statistical Averages
18 pages
Demo Lesson Plan For in Math 11 - Pearson Product Moment Correlation Coefficient 1
100% (5)
Demo Lesson Plan For in Math 11 - Pearson Product Moment Correlation Coefficient 1
10 pages
ARTICLE 2, Vol 1, No 4, Correlation Coefficient For Continuous and Discrete Data 2
No ratings yet
ARTICLE 2, Vol 1, No 4, Correlation Coefficient For Continuous and Discrete Data 2
26 pages
Chapter 11
No ratings yet
Chapter 11
22 pages
Bayesian Statistics: Thomas Bayes
No ratings yet
Bayesian Statistics: Thomas Bayes
22 pages
Statistics Using IBM SPSS: An Integrative Approach - Ebook PDF Version Instant Download
100% (3)
Statistics Using IBM SPSS: An Integrative Approach - Ebook PDF Version Instant Download
61 pages
(Ebook) Introduction To Machine Learning by Ethem Alpaydin ISBN 9780262043793, 0262043793 Download PDF
No ratings yet
(Ebook) Introduction To Machine Learning by Ethem Alpaydin ISBN 9780262043793, 0262043793 Download PDF
53 pages
Anomaly Detection
No ratings yet
Anomaly Detection
49 pages
Statistical Software Engineering
No ratings yet
Statistical Software Engineering
160 pages
Unit-I Introduction To Human Resource Analytics
No ratings yet
Unit-I Introduction To Human Resource Analytics
41 pages
Math Modeling With Mat Lab
No ratings yet
Math Modeling With Mat Lab
214 pages
Business Statistics II
100% (2)
Business Statistics II
100 pages
ASTM E2862-12 Standard Practice For Probability of Detection Analysis For Hit-Miss Data PDF
No ratings yet
ASTM E2862-12 Standard Practice For Probability of Detection Analysis For Hit-Miss Data PDF
6 pages
Chapter 1. Introduction of Environmental Modelling QK
No ratings yet
Chapter 1. Introduction of Environmental Modelling QK
37 pages
PDVSA Facimage Final
No ratings yet
PDVSA Facimage Final
4 pages
Credit Risk Management at Icici Bank
86% (7)
Credit Risk Management at Icici Bank
128 pages
Neural Networks in Stock Trading
No ratings yet
Neural Networks in Stock Trading
14 pages
Chap 5 MCQ
No ratings yet
Chap 5 MCQ
12 pages
Financial Econometrics Guide
100% (1)
Financial Econometrics Guide
483 pages
A Machine Learning Based Crop Yield Prediction
No ratings yet
A Machine Learning Based Crop Yield Prediction
25 pages
CMO-No.-89-Series-of-2017 - Policies Standards and Guidelines For The Bachelor of Science in Geodetic Engineering - BSGE Program Effective Academic Year - AY-2018-2019
No ratings yet
CMO-No.-89-Series-of-2017 - Policies Standards and Guidelines For The Bachelor of Science in Geodetic Engineering - BSGE Program Effective Academic Year - AY-2018-2019
84 pages
Lec-1 Probabilistic Models
No ratings yet
Lec-1 Probabilistic Models
29 pages
Radar Target Fluctuation Models
No ratings yet
Radar Target Fluctuation Models
65 pages
Introduction To Statistical Modelling PDF
100% (1)
Introduction To Statistical Modelling PDF
133 pages
Fundamentals of Statistics Notes
No ratings yet
Fundamentals of Statistics Notes
77 pages
ML Notes-1
No ratings yet
ML Notes-1
59 pages
Descriptive Sta-WPS Office
No ratings yet
Descriptive Sta-WPS Office
3 pages
Cost 336 - FWD
No ratings yet
Cost 336 - FWD
66 pages
SPE-181049-MS Reservoir Uncertainty Analysis: The Trends From Probability To Algorithms and Machine Learning
No ratings yet
SPE-181049-MS Reservoir Uncertainty Analysis: The Trends From Probability To Algorithms and Machine Learning
5 pages
PINHEIRO and BATES - 2000 - Mixed Effects Model in S and S-Plus
No ratings yet
PINHEIRO and BATES - 2000 - Mixed Effects Model in S and S-Plus
535 pages
Bayesian Methods in Actuarial Science
100% (1)
Bayesian Methods in Actuarial Science
22 pages
SLS Corrected 1.4.16 PDF
No ratings yet
SLS Corrected 1.4.16 PDF
362 pages
MTH6134 Notes11
No ratings yet
MTH6134 Notes11
77 pages

Lecture 5

Uploaded by

Lecture 5

Uploaded by

Correlation

Bivariate Distribution: Bivariate distribution refers to distribution that consists of two

Coefficient of Correlation: As a measure of intensity or degree of linear relationship between

Where, 𝑟(𝑋, 𝑌) = Correlation Coefficient between variables X and Y.

∑𝑛𝑖=1(𝑥𝑖 − 𝑥̅ ) (𝑦𝑖 − 𝑦̅)

Here, 𝑥̅ , 𝑦̅ are the means of data X and Y respectively.

From the above table, we get

∑𝑛𝑖=1(𝑥𝑖 − 𝑥̅ ) (𝑦𝑖 − 𝑦̅)

Thus, the rank correlation coefficient is calculated as

Normal Distribution formula

You might also like