Unit-13 (1) - Normal Distribution
Unit-13 (1) - Normal Distribution
13.1 INTRODUCTION
In Units 9 to 12, we have studied standard discrete distributions. From this unit
onwards, we are going to discuss standard continuous univariate distributions.
This unit and the next unit deal with normal distribution. Normal distribution
has wide spread applications. It is being used in almost all data-based research
in the field of agriculture, trade, business, industry and the society. For
instance, normal distribution is a good approximation to the distribution of
heights of randomly selected large number of students studying at the same
level in a university.
The normal distribution has a unique position in probability theory, and it can
be used as approximation to most of the other distributions. Discrete
distributions occurring in practice including binomial, Poisson,
hypergeometric, etc. already studied in the previous block (Block 3) can also
be approximated by normal distribution. You will notice in the subsequent
courses that theory of estimation of population parameters and testing of
hypotheses on the basis of sample statistics have also been developed using
the concept of normal distribution as most of the sampling distributions tend to
normality for large samples. Therefore, study of normal distribution is very
important.
Due to various properties and applications of the normal distribution, we have
covered it in two units – Units 13 and 14. In the present unit, normal
distribution is introduced and explained in Sec. 13.2. Chief characteristics of
normal distribution are discussed in Sec. 13.3. Secs. 13.4, 13.5 and 13.6
describes the moments, mode, median and mean deviation about mean of the
distribution.
Objectives
After studing this unit, you would be able to:
introduce and explain the normal distribution;
5
Continuous Probability
Distributions know the conditions under which binomial and Poisson distributions
tend to normal distribution;
state various characteristics of the normal distribution;
compute the moments, mode, median and mean deviation about mean
of the distribution; and
solve various practical problems based on the above properties of
normal distribution.
q p 1 2p
1 1 , and
npq npq
1 6pq
2 2 3 .
npq
6
From the above results, it may be noticed that if n , then moment Normal Distribution
coefficient of skewness ( 1 ) 0 and the moment coefficient of kurtosis i.e.
2 3 or 2 0 . Hence, as n , the distribution becomes symmetrical
and the curve of the distribution becomes mesokurtic, which is the main
feature of normal distribution.
Normal Distribution as a Limiting Case of Poisson Distribution
You have already studied in Unit 10 of this course that Poisson distribution is a
limiting case of binomial distribution under the following conditions:
i) n, the number of trials is indefinitely large i.e. n
ii) p, the constant probability of success for each trial is very small i.e. p 0.
iii) np is a finite quantity say ‘’.
As we have discussed above that there is a relation between the binomial and
normal distributions. It can, in fact, be shown that the Poisson distribution
approaches a normal distribution with standardized variable given by
X
Z as λ increases indefinitely.
For Poisson distribution, you have already studied in Unit 10 of the course that
32 2 1 1
1 3
3 1 1 ; and
2
4 3 2 1 1
2 2
2 3 2 2 3 .
2
Like binomial distribution, here in case of Poisson distribution also it may be
noticed from the above results that the moment coefficient of skewness
( 1 ) 0 and the moment coefficient of kurtosis i.e. 2 3 or 2 0 as
λ . Hence, as λ , the distribution becomes symmetrical and the curve
of the distribution becomes mesokurtic, which is the main feature of normal
distribution.
Under the conditions discussed above, a random variable following a binomial
distribution or following a Poisson distribution approaches to follow normal
distribution, which is defined as follows:
Definition: A continuous random variable X is said to follow normal
distribution with parameters ( ) and 2(>0) if it takes on any real
value and its probability density function is given by
2
1 x
1
f x e 2
, x ;
2
which may also be written as
1 1 x 2
= exp , x .
2 2
7
Continuous Probability
Distributions Remark
i) The probability function represented by f x may also be written as
f x; , 2 .
ii) If a random variable X follows normal distribution with mean and
variance 2, then we may write, “X is distributed to N(, 2)” and is
expressed as X N(, 2).
iii) No continuous probability function and hence the normal distribution
can be used to obtain the probability of occurrence of a particular value
of the random variable. This is because such probability is very small,
so instead of specifying the probability of taking a particular value by
the random variable, we specify the probability of its lying within
interval. For detail discussion on the concept, Sec. 5.4 of Unit 5 may be
referred to.
X
iv) If X ~ N , 2 , then Z is standard normal variate having
mean ‘0’ and variance ‘1’. The values of mean and variance of standard
normal variate are obtained as under, for which properties of
expectation and variance are used (see Unit 8 of this course).
X 1
Mean of Z i.e. E Z E = E X
1
E X
1
= 0 [ E(X) = Mean of X = ]
X
Variance of Z i.e. V(Z) = V
1 1
V X 2 V X
2
1 2
2
[ variance of X is 2]
= 1.
X
v) The probability density function of standard normal variate Z
1 12 z2
is given by z e , z .
2
8
vi) The graph of the normal probability function f x with respect to x is Normal Distribution
famous ‘bell-shaped’ curve. The top of the bell is directly above the
mean . For large value of , the curve tends to flatten out and for
small values of , it has a sharp peak as shown in (Fig. 13.1):
Fig. 13.1
μ = 40, σ 2 = 25 σ = ± 25
5 0always
Now, the p.d.f. of random variable X is given by
2
1 x
1
2 σ
f(x) = e
σ 2π
2
1 x 40
1
2 5
= e , x
5 2π
9
Continuous Probability
Distributions
(ii) Here we are given X ~ N ( 36, 20).
in usual notations, we have
μ 36, σ 2 20 σ 20
Now, the p.d.f. of random variable X is given by
2
1 x µ
1
2 σ
f(x) = e
σ 2π
2
1 x ( 36) 1
1 1 (x +36) 2
2 20
= e = e 40
20 2π 40π
1
1 (x +36) 2
= e 40 , x
2 10π
(iii) Here we are given X ~ N (0, 2).
in usual notations, we have
μ 0, σ 2 2 σ 2
Now, the p.d.f. of random variable X is given by
2 2
1 x µ 1 x 0
1
2 σ
1
2 2
f(x) = e = e
σ 2π 2 2π
1
1 x2
4
e , x
2 π
Example 2: Below, in each case, there is given the p.d.f. of a normally
distributed random variable. Obtain the parameters (mean and variance)
of the variable.
2
1 x 46
1
2 6
(i) f(x) = e , x
6 2π
1
1 (x 60) 2
(ii) f(x) = e 32 , x
4 2π
2
1 x 46
1
2 6
Solution: (i) f(x) = e , x
6 2π
Comparing it with,
2
1 x µ
1
2 σ
f(x) = e
σ 2π
we have
µ 46, 6
10
1
1 (x 60) 2 Normal Distribution
(ii) f(x) = e 32 , x
4 2π
2
1 X 60
1
2 4
= e
4 2π
Comparing it with,
2
1 x µ
1
2 σ
f(x) = e
σ 2π
we get
µ = 60, 4
11
Continuous Probability
Distributions iv) f x , being the probability, can never be negative and hence no portion
of the curve lies below x-axis.
v) Though x-axis becomes closer and closer to the normal curve as the
magnitude of the value of x goes towards or , yet it never touches it.
µ 32
β1 = 3
0, 2 24 3
µ2 2
i.e. the distribution is symmetrical and curve is always mesokurtic.
Note: Not only µ1 and µ 3 are 0 but all the odd order central moments
are zero for a normal distribution.
P X f x dx 0.6827,
1
Or P 1 Z 1 z dz = 0.6827,
1
2
P 2 X 2 f x dx 0.9544,
2
2
Or P 2 Z 2 z dz = 0.9544, and
2
3
P 3 X 3 f x dx 0.9973.
3
3
Or P 3 Z 3 z dz = 0.9973.
3
This property and its applications will be discussed in detail in Unit 14.
Let us now establish some of these properties.
n .
2 x 31 x
e.g. x e dx x e dx 3
0 0
1
1/ 2 x
1
x 1
and 0 x e dx 0 x 2 e dx 2
Some properties of the gamma function are
i) If n > 1, n n 1 n 1
ii) If n is a positive integer, then n n 1
1
iii) .
2
13
Continuous Probability
Distributions
Now, the first four central moments of normal distribution are obtained as
follows:
First Order Central Moment
As first order central moment (1) of any distribution is always zero [see Unit
3 of MST-002], therefore, first order central moment (1) of normal
distribution = 0.
Second Order Central Moment
2
2 = x f x dx
[See Unit 8 of MST-003]
2
1 x
2 1
x
2
. e dx
2
x
Put z x z
Differentiating
dx
dz
dx dz
Also, when x , we have z and
and when x , z
1
2 1 z2
2 z
2
e 2 dz
1
2 2
z2
2
ze dz
2
2
z
2 2 2
2
z e dz
2 0
z2
2
on changing z to – z, the integrand i.e. z e does not get changed 2
f z dz 2 f z dz if f z is even function of z
0
1
1 12
Now, put z 2 2t z 2 t 2 t 2 dz 2 t dt
2
14
1 Normal Distribution
2 1 2
2 2 (2t)e t 1
dt 2 2 t 2 e t dt
0 0
2t 2
2 2 32 1 t
t e dt
0
2 2 3
[By def. of gamma function]
2
2 2 1 1
[By Property (i) of gamma function]
2 2
2
[By Property (iii) of gamma function]
2
Third Order Central Moment
3
3 = x f x dx
2
1 x
3 1
x
2
e dx
2
x
Put z x z dx dz
and hence
3 1 12 z 2
3
z .
2
e dz
1
1 z2
3 z 3e 2
dz
2
1 1 1
z2 z2 z2
3 3 3
Now, as integrand z e 2 changes to z e 2 on changing z to – z i.e. z e 2
is an odd function of z.
Therefore, using the following property of definite integral:
a
we have,
1
3 3 0 = 0
2
15
Continuous Probability
Distributions Fourth Order Central Moment
2
1 x
4 4 1
x f x dx x
2
4 = e dx
2
x
Putting z
dx dz
4 1 12 z 2
4 z
2
e dz
1 1
4 z2 2 4 z2
z 4e 2
dz z 4
e 2
dz
2 2 0
z2
4
integrand z . e does not get changed on changing z to – z and hence it is
2
z2
Put t z2 = 2t
2
2zdz = 2dt
z dz = dt
dt dt
dz =
z 2t
2 4 1
4 (2t) 2 e t dt
2 0 2t
24 .4 2 t 1 4 4 32 t
= t e dt t e dt
2 2 0 t 0
4 4 52 1 t
t e dt
0
4 4 5
[By definition of gamma function]
2
4 4 3 3
[By Property (i) of gamma function]
2 2
4 4 3 1 1
[By Property (i) of gamma function]
22 2
3 4 1
[Using (Property (iii) of gamma function)]
2
3 4
16
Thus, the first four central moments of normal distribution are Normal Distribution
1 0, 2 2 , 3 0, 4 3 4 .
32 4 34 3 4
1 = 0, 2 = 3
23 22 2
2
4
Therefore, moment coefficient of skewness ( 1 ) 0
Now, let us obtain the mode and median for normal distribution in the next
section.
(x )
f(x) 0
2
x 0 as f (x) 0
17
Continuous Probability x
Distributions
Now differentiating (2) w.r.t. x, we have
x 1
f (x) f '(x) f (x)
2
f () f ( )
f (x) at x 0 2
2 0
x = is point where function has a maximum value.
Mode of X is .
Median
Let M denote the median of the normally distributed random variable X.
We know that median divides the distribution into two equal parts
M
1
f (x)dx f (x)dx
M
2
M
1
f (x)dx 2
2
µ 1 x M
1
1
e 2 dx f (x)dx
σ 2π
2
x
In the first integral, let us put z
Therefore, dx dz
Also when x z 0 , and
when x z .
Thus, we have
0 M
1 12 z 2 1
e dz f (x)dx
2
2
0 1 M
1 z2 1
e 2 dz f (x)dx
2
2
1 12 z2
M Z is s.n.v.with p.d.f. (z) e
1 1 2
+ f (x) dx
2 2 0
1
So (z)dz 1 (z)dz
2
M
f(x)dx = 0
µ
M as f (x) 0
18
Median of X Normal Distribution
2
1 x
1
2
x e dx
2
x
Put z x z
dx
dz
1
1 z2
M.D. about mean = | z | e 2 dz
2
1
z2
2
| z |e dz
2
1
z2
Now, | z | e 2 (integrand) is an even function z as it does not get changed on
changing z to –z, by the property,
a a
“ f (x)dx 2 f (x)dx, if f x is an even function of x ”, we have
a 0
1
z2
M.D. about mean = 2 z e 2 dz
2 0
Now, as the range of z is from 0 to ∞ i.e. z takes non-negative values,
z = z and hence
1
2 z2
2
M.D. about mean ze dz
2 0
z2
Put t z 2 2t 2zdz = 2dt zdz = dt
2
2 e t 2 2
M.D. about mean = e t
dt 2 0 1 =
2 0 1 0
19
Continuous Probability
Distributions 2
In practice, instead of , its approximate value is mostly used and that is
4
.
5
2 2 7 7 4
0.6364 0.7977 0.08 or (approx.)
22 11 5
Let us now take up some problems based on properties of Normal Distribution
in the next section.
X1 ~ N 1 , 12 and X 2 ~ N 2 , 22
then
X1 X 2 ~ N 1 2 , 12 22 , and
X1 X 2 ~ N 1 2 , 12 22 [See Property xiii (Section 13.3)]
i) X1 X 2 ~ N 0 0, 1 1
i.e. X1 X 2 ~ N 0, 2 , and
ii) X1 X 2 ~ N 0 0, 1 1
i.e. X1 X 2 ~ N 0, 2
2 2 2
Mean deviation about mean = = .5 = 5
Solution: Here = 0, 2 = 1 σ = 1.
first four central moments are:
1 0, 2 2 1, 3 0, 4 3 4 3.
Mean of X1 = E(X1 ) 40
Variance of X1 Var(X1 ) 25
Mean of X 2 = E(X 2 ) 60
Variance of X 2 V ar(X 2 ) 36
Now,
(i) Mean of X = E(X) E(2X1 3X 2 ) E(2X1 ) E(3X 2 )
4Var(X1 ) + 9Var(X 2 )
E(3X1 ) E( 2X 2 )
= 3E(X1 ) + ( 2)E(X 2 )
21
Continuous Probability
Distributions 13.8 SUMMARY
The following main points have been covered in this unit:
1) A continuous random variable X is said to follow normal distribution with
parameters ( ) and 2(>0) if it takes on any real value and its
probability density function is given by
2
1 x
1
f x e 2
, x
2
X
2) If X
N , 2 , then Z
is standard normal variate.
13.9 SOLUTIONS/ANSWERS
1 4
E 1) (i) Here we are given X ~ N ,
2 9
in usual notations, we have
1 4 2
, 2
2 9 3
Now, p.d.f. of r.v. X is given by
2
1 x
1
2
f(x) = e , x
2
2
1 x 1/2
1
2 2/3
= e
2
2π
3
22
2
9 2x 1 Normal Distribution
3
2 4
= e , x
2 2π
(ii) Here we are given X ~ N(40, 16)
in usual notations, we have
40, 2 16 4
Now, p.d.f. of r.v. X is given by
2
1 x
1
2
f(x) = e , x
σ 2π
2
1 x ( 40)
1
2 4
= e
4 2π
2
1 x + 40
1
2 4
= e , x
4 2π
x2
1
E 2) (i) f(x) = e 8, x
2 2π
2
1x
1
= e 24
2 2π
2
1 x 0
1
2 2
= e ...(1) , x
2 2π
Comparing (1) with,
2
1 x
1
2
f(x) = e , x
σ 2π
we get
0, 2
2, 2
30 = x 5 x 35 Mean = 35
Given that fourth moment about 35 is 768. But mean is 35, and hence the
fourth moment about mean = 768.
4 = 768
34 = 768
768
4 = 4 3 4
3
4 = 256 = (4)4 = 4.
24