Unit 3
Unit 3
SAMPLING
Structure
3.1 Introduction
Objectives
3.2 Principles of Stratification
Notations and Terminology
3.3 Properties of Stratified Random Sampling
3.4 Mean and Variance for Proportions
3.5 Allocation of Sample Size
Equal Number of Units from Each Stratum
Proportional Allocation
Neyman’s Allocation
Optimum Allocation
3.6 Stratified Sampling versus Simple Random Sampling
Proportional Allocation Versus Simple Random Sampling
Neyman’s Allocation Versus Proportional Allocation
Neyman’s Allocation Versus Simple Random Sampling
Merits and Demerits of Stratified Random Sampling
3.7 Summary
3.8 Solutions/Answers
3.1 INTRODUCTION
When the units of the population are scattered and not completely
homogeneous in nature, then simple random sample does not give proper
representation of the population. So if the population is heterogeneous the
simple random sampling is not found suitable. In simple random sampling the
variance of the sample mean is proportional to the variability of the sampling
units in the population. So, in spite of increasing the sample size n or
sampling fraction n/N, the only other way of increasing the precision is to
device a sampling which will effectively reduce the variability of the sample
units, the population heterogeneity. One such method is stratified sampling
method.
Thus, all strata would comprise the population. Then from each stratum
sample would be drawn and lastly all samples would be combined to get the
ultimate sample. For example, let us consider that population consists of N
units and these are distributed in a heterogeneous structure. Now first of all
45
Statistical Techniques
we divide the population into ‘k’ non overlapping strata of sizes N1, N2, N3,
..., Nk such that each stratum becomes homogeneous. Evidently N = N1 + N2 +
N3 + ... + Nk. Then from first stratum a sample of size n1 would be drawn by
simple random sampling method. Similarly, from the second stratum a sample
of n2 units would be drawn and so on, up to kth stratum. Now all these k
samples would be combined to get the ultimate sample. So, the ultimate size
of sample would be n n1 n 2 n 3 ... n k . This method of sampling is
known as Stratified random sampling because here stratification is done first
to make population homogeneous and then samples are drawn randomly by
simple random sampling from each stratum.
The principles of stratification are explained in Section 3.2. The properties of
stratified random sampling are described in Section 3.3, whereas Section 3.4
provides the derivation of the mean and variance of proportions in stratified
random sampling. The allocation of sample size with the help of different
techniques is described in Section 3.5. The comparative study between
stratified random sampling and simple random sampling is given in Section
3.6.
Objectives
After studying this unit, you would be able to
define the stratified random sampling;
explain the principles of stratification;
describe the properties of stratified random sampling;
derive the mean and variance of proportions in stratified random
sampling;
describe the allocation of sample size with the help of different
techniques; and
calculate the estimate of population mean and variance of sample mean.
1. The strata should not be overlapping and should together comprise the
whole population.
46
3.2.1 Notations and Terminology Stratified Random
Sampling
N = Population size
n = Sample size
k = Number of strata
Ni = Size of ith stratum
k
Then N N i
i 1
1 k Ni
X Population Mean X ij
N i1 j1
k k
1
i i
N i1
N X
i 1
Wi X i
Ni
where, Wi is called the weight of i th stratum
N
S i2 = Population mean square of the i th stratum
Ni
1
X , j 1, 2, ... , N
2
ij Xi i & i = 1, 2, . .., k
Ni 1 j1
ni
1
s i2
n i 1
x
j1
ij xi ,
2
i 1, 2, ..., k
where, x st is the weighted mean of the strata sample means, weights being
equal to strata sizes. These two will be identical if n i Ni
47
Statistical Techniques
3.3 PROPERTIES OF STRATIFIED RANDOM
SAMPLING
1 k Ni
= X ij X
N i1 j1
Hence proved
Theorem 2: Prove that
k k
1 Si 1 1 2 2
Var x st N N n n n Wi Si
N i
i i i
N2 i1 i i1 i
Proof: We have
k
Var x st Var Wi x i
i 1
k
W i
2
Var x i
i1
The covariance term vanish since the samples from different strata are
independent and the sample units in each stratum are the simple random
sample without replacement, we have
48
1 1 2 Stratified Random
Var x S Sampling
n N
1 1 2
or Var ( x i ) = Si
n i Ni
Therefore,
k
2
Var x st Wi2 1 1 Si
i 1 n N i i
1 k
Si2
N2
N i N i n i
i 1 ni
From the above result the variance depends on Si2 the heterogeneity within the
strata. Thus, if Si2 are small i.e. strata are homogeneous then stratified
sampling schemes provides estimates with greater precision.
Theorem 3: If Si2 is not known then prove that estimate of the variance of the
sample mean of the stratified random sample is given by
k
1 1
E Var x st Wi2Si2
i 1 n i Ni
Proof: In general Si2are not known. A simple random sample is drawn from
each stratum. If we assume a individual stratum as a population then the
sample, drawn from it, would be a simple random sample. If the sample is
drawn from ith stratum, the sample mean square si2would be an estimate of
population mean square Si2
i. e. E s i2 Si2 i 1, 2, ..., k … (1)
Accordingly an unbiased estimate of the variance is given by
k
1 1 2 2
Var x st Wi s i
i 1 n i N i
Therefore,
n 1 1
EVar x st E Wi2 s i2
i1 n i N i
k
1 1
Wi2 E s i2
i 1 n i Ni
Substituting from equation (1), we get
k
1 1
EVar x st Wi2 Si2
i1 n i Ni
49
Statistical Techniques
3.4 MEAN AND VARIANCE FOR PROPORTIONS
As in simple random sampling, we can divide a population into two classes
with respect to a attribute. Hence the units in the population are classified in
these two classes accordingly as it possesses or does not possess the given
attribute. After taking a sample of size n, we may be interested in estimating
the population proportion of the defined attribute.
If a unit possesses the attribute, it receives the code value 1 and if an unit does
not possesses the attribute, it receives the value 0. Let the number of units
belonging to A in the ith stratum of size Ni be Mi and if the sample of size ni
taken from ith stratum, the number of units belonging to A be mi. Denoting
the proportion of units belonging to A in the population, in the ithstratum and
sample from the ith stratum by , i and pi respectively, various formula for
mean and variance are as follows:
Mi m
πi and pi i
Ni ni
k k
N
and π i π i Wi π i
i 1 N i 1
for i =1, 2,…, k
The estimated proportion pst under stratified sampling for the units belonging
to A is
k
p st Wi p i
i 1
k k
Mean p st E Wi p i Wi E p i
i1 i 1
since we draw SRS from each stratum so by Theorem 10 of Section 2.4 of
Unit 2 we have
E p i π i
50
N i n i π i 1 πi Stratified Random
Var pi . Sampling
Ni 1 ni
N i N i n i π i 1 π i
k 2
1
Var p st
N2 i 1 N i 1 n i
If Ni is large enough, consider 1/ Ni as negligible and Ni-1~Ni, formula for
Var (pst ) reduces to
1 k 1 i
Var pst 2
N i N i n i i
N i 1 ni
where, qi = 1− p i
51
Statistical Techniques
n
ni for all i =1, 2, …, k
k
n k W S2
Var x st PROP 1 i i
N i 1 n
… (3)
1 1 k
Wi Si2
n N i 1
52
WiS i N i Si Stratified Random
ni n k
n k
Sampling
WS
i 1
i i N
i 1
i Si … (5)
In any stratum the cost of survey per sampling unit cannot be the same. That
is, in one stratum the cost of transportation may be different from the other.
Hence, it would not be wrong to allocate the cost of the survey in each stratum
differently.
Let ci be the cost per unit of survey in the ithstratum from which a sample of
size ni is stipulated. Also suppose c0 as the over head fixed cost of the survey.
In this way the total cost C of the survey comes out to be
k
C c 0 ci n i
i 1
… (7)
c0 and ci are beyond our control. Hence we will determine the optimum value
of ni which minimizes the variance of stratified sample mean.
To determine the optimum value of ni, we consider the function
Var x st C
k k
1 1
Wi2 Si2 λ c0 ci n i
n
i 1 i N i i 1 ... (8)
where is constant and known as Lagrange’s multiplier.
53
Statistical Techniques
Wi2S2i
λ ci 0 … (9)
n i2
or
Wi Si
ni … (10)
ci
1 k Wi Si
or n
i1 ci
1 k Wi Si
n i1 ci
ni n
W S c n N S c
i i i i i i
… (11)
k k
W S c N S c
i 1
i i i
i 1
i i i
Thus, the relation (11) leads to the following important conclusions that we
have to take a larger sample in a given stratum if
1. The stratum size Ni is larger;
2. The stratum has larger variability (Si); and
3. The cost per unit is lower in the stratum.
54
Ni Stratified Random
1
where, Si2
N i 1
Xij Xi 2 Sampling
j1
1 1
and Var x SRSWOR S2
n N … (13)
k Ni
1
where, S2 Xij X 2
N 1 i1 j1
In order to comparing (12) and (13) we shall first express S2 in terms of S2i
we have
1 k Ni
S2 X ij X i X i X
2
N 1 i 1 j 1
k Ni k Ni
N 1S2 Xij Xi 2 Xi X 2
i 1 j1 i 1 j 1
k Ni
2 Xi X Xij Xi
i 1 j1
k k
N 1S2 N i 1 Si2 N i Xi X 2
i 1 i 1
Ni
X
j1
ij Xi 0
being the sum of square of deviation from the stratum mean. If we assume that
Ni and consequently N are sufficiently large so that we can put Ni-1= Ni and
N-1 = N, then we get
k k
N S 2 N i Si2 N i X i X
2
i 1 i 1
k k
S2 Wi S2i Wi Xi X
2
… (14)
i 1 i 1
1 1 k 1 1 k
Var x Wi Si2 Wi Xi X
2
SRSWOR
n N i 1 n N i 1
1 1 k
Var x SRSWOR
Var x st PROP Wi Xi X 2
n N i1
Var x
SRSWOR
Var x st PROP … (15)
and
2
1 k 1 k
Var x st NEY Wi Si Wi S2i
n i1 N i1 … (17)
By subtracting equation (17) from equation (16) we get
1 1 k
Var x st PROP Var x st NEY Wi Si
2
n N i1
2
1 k 1 k
2
Wi Si Wi Si
n i1 N i1
2
1 k k
Wi S2i WiSi
n i 1 i 1
k
1
Wi (Si S) 2
n i 1 … (18)
k
1 k
where, S Wi Si Ni Si is the weighted mean of the stratum sizes Ni
i 1 N i 1
Hence from equation (18) we can say
From the relationship between the proportional allocation and simple random
sampling and the relation between proportional and Neyman allocation we
have
1 1 k
Var x SRSWOR Var x st PROP Wi Xi X
2
n N i 1 … (19)
k
1
W S 2
and Var x st PROP Var x st NEY i i S ... (20)
n i 1
1. More Representative
Stratified random sampling ensures any desired representation in the
sample of the various strata in the population. It overruled the probability
of any essential group of the population being completely excluded in the
sample.
2. Greater Accuracy
Stratified random sampling provides estimate of parameters with
increased precision in comparison to simple random sampling. Stratified
random sampling also enables us to obtain the results of known precision
for each of the stratum.
3. Administrative Convenience
The stratified random samples would be more concentrated geographically
in comparison to simple random samples. Therefore, this method needs
less time and money involved in interviewing the supervision of the field
work can be done with greater case and convenience.
Demerits
However, stratified random sampling has some demerits too, which are:
2. Lower Efficiency
If the sizes of samples from different stratum are not properly determined
then stratified random sampling may yield a larger variance that means
lower efficiency.
Example 1: A sample of 60 persons is to be drawn from a population
consisting of 600 belonging to two villages A and B. The means and standard
deviations of their marks are given below:
57
Statistical Techniques
Solution: If we regard the villages A and B as representing two different
strata then the problem is to draw a stratified random sample of size 30 using
technique of proportional allocation. In proportional allocation, we have
n
ni Ni
N
Therefore,
60
n1 400 40
600
60
n2 200 20
600
Thus, the required sample sizes for the villages A and B are 40 and 20
respectively.
Collage A 300 25 25
E2) Obtain the sample mean and estimate of the population mean for the
given information in Example 1 discussed above.
3.7 SUMMARY
In this unit, we have discussed:
1. The definition and procedure of stratified random sampling;
2. The principles of stratification;
3. The properties of stratified random sampling;
4. The mean and variance of proportions in stratified random sampling;
5. The allocation of sample size with the help of different techniques; and
6. Calculation of the estimate of population mean and variance of sample
mean.
E1) If we regard the collages A and B representing two different strata then
the problem is to draw as stratified sample of 100 employees using
technique of proportional allocation and Neyman’s allocation.
58
In proportional allocation we have Stratified Random
Sampling
n 100
ni Ni Ni
N 500
100
n1 300 60
500
100
n2 200 40
500
In Neyman’s allocation, we have
N iSi
ni n k
NS
i 1
i i
NS
i 1
i i = 300 5 + 200 10
200 5
n 2 100 57 .14 57
3500
Therefore, the samples regarding the colleges A and B for both
allocations are obtained as:
Proportional Neyman
Collage A 60 43
Collage B 40 57
Total 100 100
2 2
Village Ni Xi σi
S 2i
N 2
σi N i Si2 Ni σ i2 Xi N i Xi
N 1
1 k
X Ni Xi
N i1
59
Statistical Techniques
1
400 60 200 120
600
1 48000
24000 24000 80
600 600
k
2
1 1
N S i i
Var x PROP
i 1
n N N
= 36.1675
1 1
Var x SRSWOR S2
n N
where,
k ni
1
S2
N 1 i 1 j1
Xij X
2
1 k k 2
N 1 i1
N i 2
i
i 1
N i X i
X
1 k 2
k
N N i X i2 NX 2
N 1 i1 i i
i1
1
1440000 4320000 600 80 80
599
1
5760000 3840000 1920000
599 599
600 60 1920000
Var x SRSWOR
600 60 599
Then the conclusion is
Var x st PROP 36.1675
Var x SRSWOR 48.08
60