Introduction To Business Statistics
Introduction To Business Statistics
STRUCTURE:
        1.1    Introduction
        1.2    Meaning and Definitions of Statistics
        1.3    Types of Data and Data Sources
        1.4    Types of Statistics
        1.5    Scope of Statistics
        1.6    Importance of Statistics in Business
        1.7    Limitations of statistics
        1.8    Summary
        1.9    Self-Test Questions
        1.10   Suggested Readings
1.1 INTRODUCTION
reference, coverage, and scope. At the macro level, these are data on gross national
Domestic Product).
At the micro level, individual firms, howsoever small or large, produce extensive
statistics on their operations. The annual reports of companies contain variety of data
These data are often field data, collected by employing scientific survey techniques.
Unless regularly updated, such data are the product of a one-time effort and have
limited use beyond the situation that may have called for their collection. A student
chemistry, physics, and others. It is a discipline, which scientifically deals with data,
and is often described as the science of data. In dealing with statistics as data,
                                                       -1 -
Caritas Business College             BUSINESS STATISTICS – OAD 221
In the beginning, it may be noted that the word ‘statistics’ is used rather curiously in
two senses plural and singular. In the plural sense, it refers to a set of figures or data.
In the singular sense, statistics refers to the whole body of tools that are used
to collect data, organise and interpret them and, finally, to draw conclusions from
them. It should be noted that both the aspects of statistics are important if the
and consists of poor methodology, we could not know the right procedure to extract
from the data the information they contain. Similarly, if our data are defective or that
they are inadequate or inaccurate, we could not reach the right conclusions even
A.L. Bowley has defined statistics as: (i) statistics is the science of counting, (ii)
Statistics may rightly be called the science of averages, and (iii) statistics is
probabilities. Further, W.I. King has defined Statistics in a wider context, the science
of Statistics is the method of judging collective, natural or social phenomena from the
Seligman explored that statistics is a science that deals with the methods of
collected to throw some light on any sphere of enquiry. Spiegal defines statistics
statistics is concerned with scientific method for collecting, organising, summa rising,
presenting and analyzing data as well as drawing valid conclusions and making
From the above definitions, we can highlight the major characteristics of statistics as
follows:
(i) Statistics are the aggregates of facts. It means a single figure is not statistics.
For example, national income of a country for a single year is not statistics but
(ii) Statistics are affected by a number of factors. For example, sale of a product
(iii) Statistics must be reasonably accurate. Wrong figures, if analysed, will lead to
haphazard manner, they will not be reliable and will lead to misleading
conclusions.
(vi) Lastly, Statistics should be placed in relation to each other. If one collects
data unrelated to each other, then such data will be confusing and will not
lead to any logical conclusions. Data should be comparable over time and
over space.
Statistical data are the basic raw material of statistics. Data may relate to an activity
variable is one that shows a degree of variability when successive measurements are
recorded. In statistics, data are classified into two broad categories: quantitative data
                                                      -3 -
Caritas Business College              BUSINESS STATISTICS – OAD 221
and qualitative data. This classification is based on the kind of characteristics that are
measured.
Quantitative data are those that can be quantified in definite units of measurement.
continuous variable is the one that can assume any value between any two
values are quite precise and close to each other, yet distinguishably different.
the data recorded on these and similar other characteristics are called
(ii) Discrete data are the values assumed by a discrete variable. A discrete
variable is the one whose outcomes are measured in fixed numbers. Such
data are essentially count data. These are derived from a process of
characteristic is qualitative in nature when its observations are defined and noted in
                                                       -4 -
Caritas Business College             BUSINESS STATISTICS – OAD 221
(i) Nominal data are the outcome of classification into two or more categories of
undergraduates, and post-graduates), all result into nominal data. Given any
particular class and make a summation of items belonging to each class. The
(ii) Rank data, on the other hand, are the result of assigning ranks to specify
order in terms of the integers 1,2,3, ..., n. Ranks may be assigned according
characteristic.
Data sources could be seen as of two types, viz., secondary and primary. The two
(i) Secondary data: They already exist in some form: published or unpublished -
(ii) Primary data: Those data which do not already exist in any form, and thus
have to be collected for the first time from the primary source(s). By their very
nature, these data require fresh and first-time collection covering the whole
There are two major divisions of statistics such as descriptive statistics and inferential
                                                      -5 -
Caritas Business College             BUSINESS STATISTICS – OAD 221
and simplifying data, which are otherwise quite unwieldy and voluminous. It seeks to
achieve this in a manner that meaningful conclusions can be readily drawn from the
data. Descriptive statistics may thus be seen as comprising methods of bringing out
not only facilitates an understanding of the data and systematic reporting thereof in a
manner; and also makes them amenable to further discussion, analysis, and
interpretations.
The first step in any scientific inquiry is to collect data relevant to the problem in
hand. When the inquiry relates to physical and/or biological sciences, data
collection is normally an integral part of the experiment itself. In fact, the very manner
and/or generate. The problem of identifying the nature and the kind of the relevant
is possible in the case of physical sciences. In the case of social sciences, where the
required data are often collected through a questionnaire from a number of carefully
selected respondents, the problem is not that simply resolved. For one thing,
designing the questionnaire itself is a critical initial problem. For another, the number
of respondents to be accessed for data collection and the criteria for selecting
them has their own implications and importance for the quality of results obtained.
Further, the data have been collected, these are assembled, organized, and
needed, figures, diagrams, charts, and graphs are also used for better presentation
of the data. A useful tabular and graphic presentation of data will require that the raw
data be properly classified in accordance with the objectives of investigation and the
                                                      -6 -
Caritas Business College            BUSINESS STATISTICS – OAD 221
A well thought-out and sharp data classification facilitates easy description of the
constitute the essential scope of descriptive statistics. These form a large part of
the subject matter of any basic textbook on the subject, and thus they are being
presenting the related data. Instead, it consists of methods that are used for drawing
basis of knowledge about a part of that totality. The totality of observations about
or a universe. The part of totality, which is observed for data collection and analysis
The desired information about a given population of our interest; may also be
collected even by observing all the units comprising the population. This total
coverage is called census. Getting the desired value for the population through
census is not always feasible and practical for various reasons. Apart from time and
individual unit of the population with reference to any data characteristic may at times
involve even destructive testing. In such cases, obviously, the only recourse available
is to employ the partial or incomplete information gathered through a sample for the
purpose. This is precisely what inferential statistics does. Thus, obtaining a particular
value from the sample information and using it for drawing an inference about the
                                                       -7 -
Caritas Business College              BUSINESS STATISTICS – OAD 221
situation in which one is required to know the average body weight of all the college
students in a given cosmopolitan city during a certain year. A quick and easy way to
do this is to record the weight of only 500 students, from out of a total strength of,
say, 10000, or an unknown total strength, take the average, and use this average
based on incomplete weight data to represent the average body weight of all the
college students. In a different situation, one may have to repeat this exercise for
some future year and use the quick estimate of average body weight for a
comparison. This may be needed, for example, to decide whether the weight of the
college students has undergone a significant change over the years compared.
example, an inspection of a sample of five battery cells drawn from a given lot may
reveal that all the five cells are in perfectly good condition. This information may be
used to conclude that the entire lot is good enough to buy or not.
cells, it is equally likely that all the cells in the lot are not in order. It is also possible
that all the items that may be included in the sample are unsatisfactory. This may be
used to conclude that the entire lot is of unsatisfactory quality, whereas the fact may
limited sample. The rescue in such situations lies in evaluating such risks. For this,
probabilistic term the chances of decisions taken on the basis of sample information
being incorrect. This requires an understanding of the what, why, and how of
                                                        -8 -
Caritas Business College               BUSINESS STATISTICS – OAD 221
Apart from the methods comprising the scope of descriptive and inferential branches
of statistics, statistics also consists of methods of dealing with a few other issues of
specific nature. Since these methods are essentially descriptive in nature, they have
been discussed here as part of the descriptive statistics. These are mainly concerned
(i) It often becomes necessary to examine how two paired data sets are related.
For example, we may have data on the sales of a product and the
Given that sales and advertisement expenditure are related to each other, it is
useful to examine the nature of relationship between the two and quantify the
methods, these falls under the purview of what we call regression and
correlation analysis.
(ii) Situations occur quite often when we require averaging (or totalling) of data
example, price of cloth may be quoted per meter of length and that of wheat
not apply to such price/quantity data, special techniques needed for the
activity with a view to determining its future behaviour. For example, when
and analysis of relevant sales data over time. The more complex the
activity, the
                                                     -9 -
Caritas Business College            BUSINESS STATISTICS – OAD 221
more varied the data requirements. For profit maximising and future sales
planning, forecast of likely sales growth rate is crucial. This needs careful
collection and analysis of past sales data. All such concerns are taken care of
(iv) Obtaining the most likely future estimates on any aspect(s) relating to a
business or economic activity has indeed been engaging the minds of all
planning. The regression, correlation, and time series analyses together help
develop the basic methodology to do the needful. Thus, the study of methods
Keeping in view the importance of inferential statistics, the scope of statistics may
making under conditions of uncertainty. While the term statistical methods is often
which statistical data are analysed, interpreted, and the inferences drawn for
decision- making.
Though generic in nature and versatile in their applications, statistical methods have
economics. These are also being increasingly used in biology, medicine, agriculture,
psychology, and education. The scope of application of these methods has started
political scientist finds them of increasing relevance for examining the political
behaviour and it is, of course, no surprise to find even historians statistical data, for
                                                     -10
                                                     -
Caritas Business College             BUSINESS STATISTICS – OAD 221
There are three major functions in any business enterprise in which the statistical
(i) The planning of operations: This may relate to either special projects or to
(ii) The setting up of standards: This may relate to the size of employment,
achieved against the norm or target set earlier. In case the production
has fallen short of the target, it gives remedial measures so that such a
setting standards, and control-are separate, but in practice they are very much
interrelated.
instance, Croxton and Cowden give numerous uses of Statistics in business such as
project planning, budgetary planning and control, inventory planning and control,
quality control, marketing, production and personnel administration. Within these also
they have specified certain areas where Statistics is very relevant. Another author,
specifies a number of areas where statistics is extremely useful. These are: customer
                                                       -11
                                                       -
Caritas Business College            BUSINESS STATISTICS – OAD 221
production, inspection, packaging and shipping, sales and complaints, inventory and
As such, one may do no more than highlight some of the more important ones to
Statistical quality control methods are used to ensure the production of quality goods.
Identifying and rejecting defective or substandard goods achieve this. The sale
targets can be fixed on the basis of sale forecasts, which are done by using varying
methods of forecasting. Analysis of sales affected against the targets set earlier
causes: (i) targets were too high and unrealistic (ii) salesmen's performance has
been poor (iii) emergence of increase in competition (iv) poor quality of company's
management. Here, one is concerned with the fixation of wage rates, incentive norms
Statistical methods could also be used to ascertain the efficacy of a certain product,
                                                     -12
                                                     -
Caritas Business College               BUSINESS STATISTICS – OAD 221
patients. One group is given this new medicine for a specified period and the
other one is treated with the usual medicines. Records are maintained for the two
groups for the specified period. This record is then analysed to ascertain if there is
any significant difference in the recovery of the two groups. If the difference is really
(i) There are certain phenomena or concepts where statistics cannot be used.
quantified. Statistics has no place in all such cases where quantification is not
possible.
(ii) Statistics reveal the average behaviour, the normal or the general trend. An
For example, one may be misguided when told that the average depth of
a river from one bank to the other is four feet, when there may be some points
in between where its depth is far more than four feet. On this understanding,
one may enter those points having greater depth, which may be hazardous.
(iii) Since statistics are collected for a particular purpose, such data may not be
data (i.e., data originally collected by someone else) may not be useful for the
other person.
(iv) Statistics are not 100 per cent precise as is Mathematics or Accountancy.
                                                     -13
                                                     -
Caritas Business College             BUSINESS STATISTICS – OAD 221
possible to cover all the units or elements comprising the universe. The
different surveys based on the same size of sample but different sample units
in statistics, but such a relationship does not indicate cause and effect'
the two variables. In such cases, it is the user who has to interpret the results
(vii) A major limitation of statistics is that it does not reveal all pertaining to a
does not cover. Similarly, there are some other aspects related to the problem
on hand, which are also not covered. The user of Statistics has to be well
informed and should interpret Statistics keeping in mind all other aspects
Apart from the limitations of statistics mentioned above, there are misuses of it. Many
people, knowingly or unknowingly, use statistical data in wrong manner. Let us see
what the main misuses of statistics are so that the same could be avoided when one
has to use statistical data. The misuse of Statistics may take several forms some of
(i) Sources of data not given: At times, the source of data is not given. In the
absence of the source, the reader does not know how far the data are
so.
                                                      -14
                                                      -
Caritas Business College              BUSINESS STATISTICS – OAD 221
(ii) Defective data: Another misuse is that sometimes one gives defective data.
unem- ployed persons, the definition may include even those who are
One may choose a sample just on the basis of convenience. He may collect
the desired information from either his friends or nearby respondents in his
representative sample.
sample. For example, in a city we may find that there are 1, 00,000
sample of merely 100 households comprising only 0.1 per cent of the
universe. A survey based on such a small sample may not yield right
information.
comparisons from the data collected. For instance, one may construct an
index of production choosing the base year where the production was much
less. Then he may compare the subsequent year's production from this
low base.
                                                     -15
                                                     -
Caritas Business College            BUSINESS STATISTICS – OAD 221
attempted. Such a comparison is wrong. Likewise, when data are not properly
are not taken into consideration, comparisons of such data would be unfair as
next five years, one may assume a lower rate of growth though the past two
years indicate otherwise. Sometimes one may not be sure about the changes
in business environment in the near future. In such a case, one may use an
conclusion may be the use of wrong average. Suppose in a series there are
extreme values, one is too high while the other is too low, such as 800
and 50. The use of an arithmetic average in such a case may give a wrong
has to examine the relationship between two variables. A close relationship between
that one
                                                     -16
                                                     -
Caritas Business College            BUSINESS STATISTICS – OAD 221
variable is the cause and the other is the effect. It should be taken as something that
measures degree of association rather than try to find out causal relationship..
1.8 SUMMARY
coverage, and scope. At the macro level, these are data on gross national product
Product). At the micro level, individual firms, howsoever small or large, produce
other activities. These data are often field data, collected by employing scientific
survey techniques. Unless regularly updated, such data are the product of a one-time
effort and have limited use beyond the situation that may have called for their
scientifically deals with data, and is often described as the science of data. In dealing
presenting, summarizing, and analysing data, and thus consists of a body of these
methods.
and business.
4. What are the major limitations of Statistics? Explain with suitable examples.
                                                     -17
                                                     -
Caritas Business College           BUSINESS STATISTICS – OAD 221
2. Hooda, R. P.: Statistics for Business and Economics, Macmillan, New Delhi.
NJ.
NY.
                                                   -18
                                                   -
Caritas Business College             BUSINESS STATISTICS – OAD 221
STRUCTURE:
        2.1    Introduction
        2.2    Arithmetic Mean
        2.3    Median
        2.4    Mode
        2.5    Relationships of the Mean, Median and Mode
        2.6    The Best Measure of Central Tendency
        2.7    Geometric Mean
        2.8    Harmonic Mean
        2.9    Quadratic Mean
        2.10   Summary
        2.11   Self-Test Questions
        2.12   Suggested Readings
2.1 INTRODUCTION
The description of statistical data may be quite elaborate or quite brief depending on
two factors: the nature of data and the purpose for which the same data have been
collected. While describing data statistically or verbally, one must ensure that the
description is neither too brief nor too lengthy. The measures of central tendency
enable us to compare two or more distributions pertaining to the same time period or
within the same distribution over time. For example, the average consumption of
tea in two different territories for the same period or in a territory for two years, say,
                                                      -19
                                                      -
Caritas Business College                BUSINESS STATISTICS – OAD 221
Adding all the observations and dividing the sum by the number of
observations:
These are seven observations. Symbolically, the arithmetic mean, also called simply
mean is
                                   10  15  30  7  42  79  83
                               =                 7
                                    266
                               =          = 38
                                    7
It may be noted that the Greek letter  is used to denote the mean of the
population and n to denote the total number of observations in a population. Thus the
population mean  = x/n. The formula given above is the basic formula that forms
the definition of arithmetic mean and is used in case of ungrouped data where
In case of ungrouped data where weights are involved, our approach for calculating
Example 2.1: Suppose a student has secured the following marks in three
Laboratory 25
Final 20
                                                      -20
                                                      -
Caritas Business College             BUSINESS STATISTICS – OAD 221
However, this will be wrong if the three tests carry different weights on the basis of
their relative importance. Assuming that the weights assigned to the three tests are:
Laboratory 3 points
Final 5 points
Solution: On the basis of this information, we can now calculate a weighted mean
as shown below:
Laboratory 3 25 75
Final 5 20 100
                     wx w1 x1  w2 x2  w3 x3
               x       
                    w     w1  w2  w3
                           60  75  100
                       =                    23.5 marks
                             235
It will be seen that weighted mean gives a more realistic picture than the simple or
unweighted mean.
falling prices in the stock exchange, a stock is sold at Rs 120 per share on one day,
Rs 105 on the next and Rs 90 on the third day. The investor has purchased 50
shares on the first day, 80 shares on the second day and 100 shares on the third'
day. What average price per share did the investor pay?
                                                       -21
                                                       -
Caritas Business College                  BUSINESS STATISTICS – OAD 221
Solution:
2 105 80 8400
3 90 100 9000
                                                  w1 x1  w2 x2      wx
               Weighted average           =
                                                     w3 x3 w1  w2   w
                                                      w3
It will be seen that if merely prices of the shares for the three days (regardless of the
number of shares purchased) were taken into consideration, then the average price
would be
                     120  105  90
               Rs.         3         105
also a weighted average where weight in each case is the same, that is, only 1.
When we use the term average alone, we always mean that it is an unweighted or
simple average.
For grouped data, arithmetic mean may be calculated by applying any of the
following methods:
                                                         -22
                                                         -
Caritas Business College               BUSINESS STATISTICS – OAD 221
In the case of direct method, the formula x = fm/n is used. Here m is mid-point of
various classes, f is the frequency of each class and n is the total number of
frequencies. The calculation of arithmetic mean by the direct method is shown below.
Example 2.3: The following table gives the marks of 58 students in Statistics.
Solution:
                                                      No. of
                                Mark    Mid-point            Stude      fm
                                  s        m                 nts f
                                0-10        5                4           20
                                10-        15                8          120
                                 20
                                20-        25                11         275
                                 30
                                30-        35                15         525
                                 40
                                40-        45                12         540
                                 50
                                50-        55                6          330
                                 60
                                60-        65                2          130
                                 70
                                                                      fm =
                                                                      1940
               Where,
                                                       -23
                                                       -
Caritas Business College                BUSINESS STATISTICS – OAD 221
n 58
It may be noted that the mid-point of each class is taken as a good approximation of
the true mean of the class. This is based on the assumption that the values are
distributed fairly evenly throughout the interval. When large numbers of frequency
The formula for calculation of the arithmetic mean by the short-cut method is given
below:
                             fd
               xA
                             n
f = frequency
When the values are extremely large and/or in fractions, the use of the direct method
would be very cumbersome. In such cases, the short-cut method is preferable. This
particularly for calculation of the product of values and their respective frequencies.
However, when calculations are not made manually but by a machine calculator, it
may not be necessary to resort to the short-cut method, as the use of the direct
As can be seen from the formula used in the short-cut method, an arbitrary or
assumed mean is used. The second term in the formula (fd  n) is the correction
factor for the difference between the actual mean and the assumed mean. If the
assumed mean turns out to be equal to the actual mean, (fd  n) will be zero. The
use of the short-cut method is based on the principle that the total of deviations taken
from an actual mean is equal to zero. As such, the deviations taken from any other
figure will depend on how the assumed mean is related to the actual mean. While
one may choose any value as assumed mean, it would be proper to avoid extreme
                                                     -24
                                                     -
Caritas Business College             BUSINESS STATISTICS – OAD 221
values, that is, too small or too high to simplify calculations. A value apparently close
Example 2.4:
                                                Mid-
                                   Marks           point   f     d           fd
                                                   m
                                    0-10           5       4 -30            -120
                                   10-20           1       8 -20            -160
                                                   5
                                   20-30           2       11 -10           -110
                                                   5
                                   30-40           3       15 0              0
                                                   5
                                   40-50           4       12 10            120
                                                   5
                                   50-60           5       6 20             120
                                                   5
                                   60-70           6       2 30             60
                                                   5
                                                                      fd = -90
It may be noted that we have taken arbitrary mean as 35 and deviations from
midpoints. In other words, the arbitrary mean has been subtracted from each value of
                                 fd
                       xA
                               n
                            
                        35     90 
                             58 
Now we take up the calculation of arithmetic mean for the same set of data using
                           40        5
                           40-       4    12     10   1          12
                           50        5
                           50-       5    6      20   2          12
                           60        5
                           60-       6    2      30   3          6
                           70        5
                                                            fd’ =-
                                                              9
                                               -26
                                               -
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                   fd '
                       xA                
                       C
                                   n           = 33.45 or 33 marks approximately.
                               9
                        35
                       10 
                                          
                              58          
It will be seen that the answer in each of the three cases is the same. The step-
also be noted that if we select a different arbitrary mean and recalculate deviations
Now that we have learnt how the arithmetic mean can be calculated by using
Example 2.6: The mean of the following frequency distribution was found to be 1.46.
Solution:
Here we are given the total number of frequencies and the arithmetic mean. We have
to determine the two frequencies that are missing. Let us assume that the frequency
200
                                                      -27
                                                      -
Caritas Business College               BUSINESS STATISTICS – OAD 221
                        x  2y  140
               1.46 =
                            200
x + 2y = 152
x + y = 200 - 86
x + y = 114
                        x + 2y =       152
                        x+y =          114
                         - -            -
                        y      =       38
Therefore, x = 114 - 38 = 76
Against accident 1 : 76
Against accident 2 : 38
1. The sum of the deviations of the individual items from the arithmetic mean is
is the arithmetic mean. Since the sum of the deviations in the positive
direction
is equal to the sum of the deviations in the negative direction, the arithmetic
2. The sum of the squared deviations of the individual items from the arithmetic
mean is always minimum. In other words, the sum of the squared deviations
taken from any value other than the arithmetic mean will be higher.
                                                     -28
                                                     -
Caritas Business College            BUSINESS STATISTICS – OAD 221
3. As the arithmetic mean is based on all the items in a series, a change in the
value of any item will lead to a change in the value of the arithmetic mean.
4. In the case of highly skewed distribution, the arithmetic mean may get
2.3 MEDIAN
Median is defined as the value of the middle item (or the mean of the values of
the two middle items) when the data are arranged in an ascending or descending
value if n is odd. When n is even, the median is the mean of the two middle values.
We have to first arrange it in either ascending or descending order. These figures are
5,7,10,15,18,19,21,25,33
Now as the series consists of odd number of items, to find out the value of the
               Where                   n
                                        1
                                                                              n
               Where n is the number of items. In this case, n is 9, as        1 = 5, that is, the size
               such
                                                                               2
Suppose the series consists of one more items 23. We may, therefore, have to
include 23 in the above series at an appropriate place, that is, between 21 and 25.
Thus, the series is now 5, 7, 10, 15, 18, 19, and 21,23,25,33. Applying the above
                                                     -29
                                                     -
Caritas Business College      BUSINESS STATISTICS – OAD 221
formula, the
                                          -30
                                          -
Caritas Business College               BUSINESS STATISTICS – OAD 221
median is the size of 5.5th item. Here, we have to take the average of the values of
5th and 6th item. This means an average of 18 and 19, which gives the median as
18.5.
merely indicates the position of the median, namely, the number of items we have to
count until we arrive at the item whose value is the median. In the case of the even
number of items in the series, we identify the two items whose values have to be
averaged to obtain the median. In the case of a grouped series, the median is
items
c = the cumulative frequency of the class preceding the one in which the median lies
Example 2.7:
Total 143
                                                      -31
                                                      -
Caritas Business College            BUSINESS STATISTICS – OAD 221
frequency to the table. Thus, the table with the cumulative frequency is written as:
                                                     -32
                                                     -
Caritas Business College            BUSINESS STATISTICS – OAD 221
                                                             Cumulative
                                   Monthly    Frequen        Frequency
                                    Wages         cy
                                  800 -1,000     18              18
                                 1,000 -1,200    25              43
                                 1,200 -1,400    30              73
                                 1,400 -1,600    34             107
                                 1,600 -1,800    26             133
                                 1.800 -2,000    10             143
                                                    l2  l1
                                             M = l1         (m 
                                                      c) f
                                                 n  1 143  1
                                           M=                  = 72
                                                    2  2
                               200
                       1200      (29)
                             30
= Rs 1393.3
At this stage, let us introduce two other concepts viz. quartile and decile. To
understand these, we should first know that the median belongs to a general class of
statistical descriptions called fractiles. A fractile is a value below that lays a given
fraction of a set of data. In the case of the median, this fraction is one-half (1/2).
Likewise, a quartile has a fraction one-fourth (1/4). The three quartiles Q 1, Q2 and Q3
are such that 25 percent of the data fall below Q1, 25 percent fall between Q1 and Q2,
25 percent fall between Q2 and Q3 and 25 percent fall above Q3 It will be seen that Q 2
is the median. We can use the above formula for the calculation of quartiles as well.
The only difference will be in the value of m. Let us calculate both Q 1 and Q3 in
                                                     -33
                                                     -
Caritas Business College                BUSINESS STATISTICS – OAD 221
                                                              n   143  1
                      Here, m will be             =            1 =         = 36
                                                                     4
                                                       1200  1000
                                        Q  1000                     (36  18)
                                          1
                                                               25
                                                         200
                                               1000          (18)
                                                         25
= Rs. 1,144
                                                      n1       3144
               In the case of Q3, m will be 3 =          4            = 108
                                                       =          4
                                1800  1600
               Q  1600                       (108  107)
                 1
                                   26
                           200
                1600           (1)
                           26
In the same manner, we can calculate deciles (where the series is divided into
10 parts) and percentiles (where the series is divided into 100 parts). It may be noted
that unlike arithmetic mean, median is not affected at all by extreme values, as it is a
happens to be skewed. Another point that goes in favour of median is that it can be
computed when a distribution has open-end classes. Yet, another merit of median is
that when a distribution contains qualitative data, it is the only average that can be
                                                          -34
                                                          -
Caritas Business College                 BUSINESS STATISTICS – OAD 221
Example 2.8:Calculate the most suitable average for the following data:
Size of the Item Below 50 50-100 100-150 150-200 200 and above
Frequency 15 20 36 40 10
Solution: Since the data have two open-end classes-one in the beginning (below 50) and
the other at the end (200 and above), median should be the right choice as a measure of
central tendency.
               Below 50                            15                             15
               50-100                    20                                35
               100-150                   36                                71
               150-200                             40                             111
               200 and above                       10                             121
                   121  1
               =           = 61st
                     item 2
                                         l2  l1
              Median =         11 = l1             (m  c)
                                            f
                         150  100
               = 100 +               (61  35)
                              36
Example 2.9: The following data give the savings bank accounts balances of nine
(a) Find the mean and the median for these data; (b) Do these data contain an outlier? If
so, exclude this value and recalculate the mean and median. Which of these summary
measures
                                                             -35
                                                             -
Caritas Business College                      BUSINESS STATISTICS – OAD 221
has a greater change when an outlier is dropped?; (c) Which of these two summary
Solution:
                                      Rs
                          =
                          83,600                        = Rs 9,289
                                              9
                                                      n  1
               Median =               Size                      th item
                                                        2
               of
                                      9       1
                          =
                                                 = 5th item
                                          2
Arranging the data in an ascending order, we find that the median is Rs 1,800.
exclude this figure and recalculate both the mean and the median.
               Mean =                                 83,600 
                                                       68,000
               Rs.
                                                            8
                          =
                          15,600 Rs                     = Rs. 1,950
                                                  8
                                           n  1
               Median = Size of                         th item
                                              2
                   8 
               =
               1                         item.
                     2        4.5th
                         1,500 
               = Rs.
               1,800                          = Rs. 1,650
                                 2
It will be seen that the mean shows a far greater change than the median when
(c) As far as these data are concerned, the median will be a more appropriate
                                                   -37
                                                   -
Caritas Business College             BUSINESS STATISTICS – OAD 221
               Frequency        6      12      22     37       17      8     5
               We are asked to draw both types of ogive from these data and to determine
the median.
Solution:
First of all, we transform the given data into two cumulative frequency distributions,
                                                             Table A
                                                                            Frequenc
                                                                            y
                                      Less than 10                          6
                                      Less than 20                          18
                                      Less than 30                          40
                                      Less than 40                          77
                                      Less than 50                          94
                                      Less than 60                          102
                                      Less than 70                          107
Table B
                                                                       Frequency
                                      More than 0                           107
                                      More than 10                          101
                                      More than 20                          89
                                      More than 30                          67
                                      More than 40                          30
                                      More than 50                          13
                                      More than 60                          5
                                                     -38
                                                     -
Caritas Business College              BUSINESS STATISTICS – OAD 221
meet the X-axis at M. Thus, from the point of origin to the point at M gives the value
applying the formula, then the answer comes to 33.8, or 34, approximately. It may be
pointed out that even a single ogive can be used to determine the median. As we
have determined the median graphically, so also we can find the values of quartiles,
{3(n + 1)} /4 = 81st item. From this point on the Y-axis, we can draw a perpendicular
to meet the 'less than' ogive from which another straight line is to be drawn to meet
the X-axis. This point will give us the value of the upper quartile. In the same manner,
1. Unlike the arithmetic mean, the median can be computed from open-ended
2. The median can also be determined graphically whereas the arithmetic mean
4. In case of the qualitative data where the items are not counted or measured
tendency.
2.4 MODE
The mode is another measure of central tendency. It is the value at the point around
which the items are most heavily concentrated. As an example, consider the
                                                      -39
                                                      -
Caritas Business College              BUSINESS STATISTICS – OAD 221
There are ten observations in the series wherein the figure 15 occurs maximum
number of times three. The mode is therefore 15. The series given above is a
discrete series; as such, the variable cannot be in fraction. If the series were
continuous, we could say that the mode is approximately 15, without further
computation.
               Mode= l1
                                    f1  f 0
                                                    i
                           ( f 1  f 0 )  ( f1  f 2 )
Where, l1 = the lower value of the class in which the mode lies
While applying the above formula, we should ensure that the class-intervals are
uniform throughout. If the class-intervals are not uniform, then they should be made
uniform on the assumption that the frequencies are evenly distributed throughout the
class. In the case of inequal class-intervals, the application of the above formula will
Solution: We can see from Column (2) of the table that the maximum frequency
of 12 lies in the class-interval of 60-70. This suggests that the mode lies in this class-
                                                          -40
                                                          -
Caritas Business College              BUSINESS STATISTICS – OAD 221
               Mode = 60 +              12 - 8
                                                         10
                              12 - 8 (12 - 8)  (12 -
                                        9)
                         4
               = 60 +       10
                        43
= 65.7 approx.
In several cases, just by inspection one can identify the class-interval in which the
mode lies. One should see which the highest frequency is and then identify to which
class-interval this frequency belongs. Having done this, the formula given for
At times, it is not possible to identify by inspection the class where the mode lies. In
such cases, it becomes necessary to use the method of grouping. This method
(i) Preparation of a grouping table: A grouping table has six columns, the first
frequencies grouped in two's, starting from the top. Leaving the first
the frequencies of the first three items, then second to fourth item and so on.
Column 5 leaves the first frequency and groups the remaining items in
three's. Column 6 leaves the first two frequencies and then groups the
remaining in three's. Now, the maximum total in each column is marked and
analysis table is prepared. On the left-hand side, provide the first column for
column numbers and on the right-hand side the different possible values of
mode. The highest values marked in the grouping table are shown here by
values
                                                        -41
                                                        -
Caritas Business College               BUSINESS STATISTICS – OAD 221
they represent. The last row of this table will show the number of times a
particular value has occurred in the grouping table. The highest value in the
analysis table will indicate the class-interval in which the mode lies. The
procedure of preparing both the grouping and analysis tables to locate the
                               10-20                          10
                               20-30                          18
                               30-40                          25
                               40-50                          26
                               50-60                          17
                               60-70                          4
               Solution:
                                              Grouping Table
               Size of item    1       2      3     4     5             6
                10-20          10
                                       28
               20-30           18                      53
                                              43
                30-40          25                            69
                                       51
                40-50          26                                  68
                                              43
               50-60           17                      47
                                       21
                60-70          4
Analysis table
                                                 Size of item
           Col. No.            10-20          20-30          30-40          40-50        50-60
               1                                                            1
               2                                              1             1
               3                              1               1             1            1
               4               1              1               1
               5                              1               1             1
                                                       -42
                                                       -
Caritas Business College              BUSINESS STATISTICS – OAD 221
6 1 1 1
Total 1 3 5 5 2
This is a bi-modal series as is evident from the analysis table, which shows that the
two classes 30-40 and 40-50 have occurred five times each in the grouping. In such
formula:
Median = Size of (n + l)/2th item, that is, 101/2 = 50.5th item. This lies in the class 30-
40. Applying the formula for the median, as given earlier, we get
                                     40 - 30
                       =      30 +             (50.5  28)
                                       25
= 30 + 9 = 39
                                    fd '
               Mean =         A+            i
                                      n
                                     34
                       =      35 +         10
                                     100
= 38.4
= (3 x 39) - (2 x 38.4)
= 117 -76.8
                                                        -43
                                                        -
Caritas Business College              BUSINESS STATISTICS – OAD 221
= 40.2
This formula, Mode = 3 Median-2 Mean, is an empirical formula only. And it can
give only approximate results. As such, its frequent use should be avoided. However,
when mode is ill defined or the series is bimodal (as is the case in the
Having discussed mean, median and mode, we now turn to the relationship amongst
(i) When a distribution is symmetrical, the mean, median and mode are the
In case, a distribution is
bution is skewed to the right where a large number of families have relatively
low income and a small number of families have extremely high income. In
such a case, the mean is pulled up by the extreme high incomes and the
relation among these three measures is as shown in Fig. 6.3. Here, we find
This is
                                                      -44
                                                      -
Caritas Business College             BUSINESS STATISTICS – OAD 221
(iii) Given the mean and median of a unimodal distribution, we can determine
is skewed to the left. It may be noted that the median is always in the middle
At this stage, one may ask as to which of these three measures of central tendency
the best is. There is no simple answer to this question. It is because these three
measures are based upon different concepts. The arithmetic mean is the sum of the
values divided by the total number of observations in the series. The median is the
value of the middle observation that divides the series into two equal parts. Mode is
the value around which the observations tend to concentrate. As such, the use of a
particular measure will largely depend on the purpose of the study and the nature
of the data; For example, when we are interested in knowing the consumers
preferences for different brands of television sets or different kinds of advertising, the
choice should go in favour of mode. The use of mean and median would not be
proper. However, the median can sometimes be used in the case of qualitative data
when such data can be arranged in an ascending or descending order. Let us take
company. A large number of candidates apply for that post. We are now interested to
know as to which age or age group has the largest concentration of applicants.
Here, obviously the mode will be the most appropriate choice. The arithmetic mean
                                                      -45
                                                      -
Caritas Business College             BUSINESS STATISTICS – OAD 221
be influenced by some extreme values. However, the mean happens to be the most
commonly used measure of central tendency as will be evident from the discussion in
Apart from the three measures of central tendency as discussed above, there are two
other means that are used sometimes in business and economics. These are the
geometric mean and the harmonic mean. The geometric mean is more important
than the harmonic mean. We discuss below both these means. First, we take up the
geometric mean. Geometric mean is defined at the nth root of the product of n
observations of a distribution.
Symbolically, GM = n x1....x2 .....xn ... If we have only two observations, say, 4 and
               16 then GM =
                               4 16                    Similarly, if there are three observations, then we
                                          8. 64
have to calculate the cube root of the product of these three observations; and so on.
When the number of items is large, it becomes extremely difficult to multiply the
numbers and to calculate the root. To simplify calculations, logarithms are used.
Example 2.13: If we have to find out the geometric mean of 2, 4 and 8, then we find
Log GM = log xi n
                                                  Log 2  Log 4 
                                      =                    Log8 3
                                                      0.3010  0.6021 
                                      =                    0.9031
                                                                 3
                                      =
                                                  1.8062
                                                             0.60206
                                                      3
GM = Antilog 0.60206
=4
                                                           -46
                                                           -
Caritas Business College               BUSINESS STATISTICS – OAD 221
When the data are given in the form of a frequency distribution, then the
                                       f .log x
                                = f1  f 2 ..........fn
Then, GM = Antilog n
3. Discounting, capitalization.
Example 2.14: A person has invested Rs 5,000 in the stock market. At the end of the
first year the amount has grown to Rs 6,250; he has had a 25 percent profit. If at the
end of the second year his principal has grown to Rs 8,750, the rate of increase is 40
percent for the year. What is the average rate of increase of his investment during
Solution:
               GM =
                       1.25 1.40               = 1.323
                                     1.75.
0.323, which if multiplied by 100, gives the rate of increase as 32.3 percent.
Example 2.15: We can also derive a compound interest formula from the above set
Solution: Now, 1.25 x 1.40 = 1.75. This can be written as 1.75 = (1 + 0.323)2.
Let P2 = 1.75, P0 = 1, and r = 0.323, then the above equation can be written as P2 = (1
+ r)2 or P2 = P0 (1 + r)2.
                                                           -47
                                                           -
Caritas Business College             BUSINESS STATISTICS – OAD 221
Where P2 is the value of investment at the end of the second year, P0 is the initial
investment and r is the rate of increase in the two years. This, in fact, is the familiar
r)n. In our case Po is Rs 5,000 and the rate of increase in investment is 32.3 percent.
Let us apply this formula to ascertain the value of Pn, that is, investment at the end of
Pn = 5,000 (1 + 0.323)2
= 5,000 x 1.75
= Rs 8,750
It may be noted that in the above example, if the arithmetic mean is used, the resultant
                                                                                         25  40
               figure will be wrong. In this case, the average rate for the two years              percent
               is                                                                           2
                                                                                165
               per year, which comes to 32.5. Applying this rate, we get Pn =         x 5,000
                                                                                100
= Rs 8,250
Example 2.16: An economy has grown at 5 percent in the first year, 6 percent in the
second year, 4.5 percent in the third year, 3 percent in the fourth year and 7.5
percent in the fifth year. What is the average rate of growth of the economy during
the five
years?
Solution:
                                                     -48
                                                     -
Caritas Business College             BUSINESS STATISTICS – OAD 221
 log x 
                                
               GM = Antilog                
                                    n       
                               
                            10.10987 
               = Antilog
                                       
                               5       
= Antilog 2.021974
= 105.19
Hence, the average rate of growth during the five-year period is 105.19 - 100 = 5.19
percent per annum. In case of a simple arithmetic average, the corresponding rate of
2.7.1 DISCOUNTING
If the future income is Pn rupees and the present rate of interest is 100 r percent, then
the present value of P n rupees will be P0 rupees. For example, if we have a machine
that has a life of 20 years and is expected to yield a net income of Rs 50,000 per
year, and at the end of 20 years it will be obsolete and cannot be used, then the
This process of ascertaining the present value of future income by using the interest
In conclusion, it may be said that when there are extreme values in a series,
geometric mean should be used as it is much less affected by such values. The
                                                     -49
                                                     -
Caritas Business College            BUSINESS STATISTICS – OAD 221
                                                    -50
                                                    -
Caritas Business College              BUSINESS STATISTICS – OAD 221
Before we close our discussion on the geometric mean, we should be aware of its
2.7.2 ADVANTAGES OF G. M.
1. Geometric mean is based on each and every observation in the data set.
2. It is rigidly defined.
4. As compared to the arithmetic mean, it gives more weight to small values and
mean, it is generally less than the arithmetic mean. At times it may be equal
understand.
2. Both computation of the geometric mean and its interpretation are rather
difficult.
In view of the limitations mentioned above, the geometric mean is not frequently
used.
                                                     -51
                                                     -
Caritas Business College             BUSINESS STATISTICS – OAD 221
The harmonic mean is defined as the reciprocal of the arithmetic mean of the
The calculation of harmonic mean becomes very tedious when a distribution has a
large number of observations. In the case of grouped data, the harmonic mean is
or
n n 1 
                                         f  
                                        i 1 i i 
                                                x
               Where n is the total number of observations.
frequency (f).
The main advantage of the harmonic mean is that it is based on all observations in a
greater weight to smaller observations and less weight to the larger observations,
then the use of harmonic mean will be more suitable. As against these
advantages, there are certain limitations of the harmonic mean. First, it is difficult to
the observations is zero or negative. Third, it is only a summary figure, which may not
It is worth noting that the harmonic mean is always lower than the geometric mean,
which is lower than the arithmetic mean. This is because the harmonic mean
assigns
                                                       -52
                                                       -
Caritas Business College                   BUSINESS STATISTICS – OAD 221
reciprocals, it becomes clear that as reciprocals of higher values are lower than those
of lower values, it is a lower average than the arithmetic mean as well as the
geometric mean.
Example 2.17: Suppose we have three observations 4, 8 and 16. We are required to
                                                                                                           1
               calculate the harmonic mean. Reciprocals of 4,8 and 16                      1       1           respectively
                                                                                               ,       ,
               are:
                                                                                           4       8 16
Since HM = n
                                1/ x1  1/ x 2  1/ x
                         =      3
                                            3
                         =
                                     1/ 4  1/ 8 
                                         1/ 16
                                        0.25  0.125 
                                            0.0625
= 6.857 approx.
Frequency 20 40 30 10
Solution:
                           n  1
                          f i  
                                 x
                         i 1   i 
               =              n
                                                                -53
                                                                -
Caritas Business College            BUSINESS STATISTICS – OAD 221
                           100
               =                 = 4.984 approx.
                      20.0641
                                                   -54
                                                   -
Caritas Business College           BUSINESS STATISTICS – OAD 221
Example 2.19: In a small company, two typists are employed. Typist A types one
page in ten minutes while typist B takes twenty minutes for the same. (i) Both are
asked to type 10 pages. What is the average time taken for typing one page? (ii)
Both are asked to type for one hour. What is the average time taken by them for
                                      10  2(
                                      pages)
= 15 minutes
                                         60  (min utes)
                      HM     =
                                        60 / 10  60 / 20(
                                              pages)
                                        120         40
                             =                           13 min utes and 20 seconds.
                                     120 
                                                    3
                             60
20
Example 2.20: It takes ship A 10 days to cross the Pacific Ocean; ship B takes
15 days and ship C takes 20 days. (i) What is the average number of days taken by a
ship to cross the Pacific Ocean? (ii) What is the average number of days taken by a
cargo to cross the Pacific Ocean when the ships are hired for 60 days?
Solution: Here again Q-(i) pertains to simple arithmetic mean while Q-(ii) is
               (i)    M      =       10  15  20
                                          3       = 15 days
(ii) HM = 60  3(days) _
                                      60 / 10  60 / 15  60 /
                             =                   20
                                                        -55
                                                        -
Caritas Business College   BUSINESS STATISTICS – OAD 221
     1
     8
     0
     3
     6
     0
     2
     4
     0
     1
     8
     0
     6
     0
                                       -56
                                       -
Caritas Business College             BUSINESS STATISTICS – OAD 221
We have seen earlier that the geometric mean is the antilogarithm of the arithmetic
mean of the logarithms, and the harmonic mean is the reciprocal of the arithmetic
mean of the reciprocals. Likewise, the quadratic mean (Q) is the square root of the
               Q=      x 2  x 12n
                               2  2
                               n
               Instead of using original values, the quadratic mean can be used while averaging
Q> x >G>H provided that all the individual observations in a series are positive and
cases, we should use the same method of averaging that was employed in
calculating the original averages. Thus, we should calculate the arithmetic mean of
several values of x, the geometric mean of several values of GM, and the harmonic
mean of several values of HM. It will be wrong if we use some other average in
averaging of means.
2.10 SUMMARY
It is the most important objective of statistical analysis is to get one single value that
describes the characteristics of the entire mass of cumbersome data. Such a value is
                                                      -57
                                                      -
Caritas Business College            BUSINESS STATISTICS – OAD 221
mean, the median and the mode in the light of these desiderata? Why
2. "Every average has its own peculiar characteristics. It is difficult to say which
3. What do you understand .by 'Central Tendency'? Under what conditions is the
4. The average monthly salary paid to all employees in a company was Rs 8,000.
The average monthly salaries paid to male and female employees of the
Frequency 2 4 9 11 12 6 4 2
6. Calculate the mean, median and mode from the following data:
Persons
                                     62-63                                  2
                                     63-64                                  6
                                     64-65                                  14
                                     65-66                                  16
                                     66-67                                  8
                                     67-68                                  3
                                     68-69                                  1
                                     Total                                  50
After drying for two weeks, the same articles have again been weighed and
similarly classified. It is known that the median weight in the first weighing
                                                     -58
                                                     -
Caritas Business College            BUSINESS STATISTICS – OAD 221
was 20.83 gm while in the second weighing it was 17.35 gm. Some
frequencies a and b in the first weighing and x and y in the second are
missing. It is known that a = 1/3x and b = 1/2 y. Find out the values of the
missing frequencies.
Class Frequencies
0- 5 a z
5-10 b y
10-15 11 40
15-20 52 50
20-25 75 30
25-30 22 28
8 Cities A, Band C are equidistant from each other. A motorist travels from A to
Frequency 20 40 30 10
While coming down it runs 12 km per litre. Find its average consumption for
to and fro travel between two places situated at the two ends of 25 Ian long
gradient.
                                                     -59
                                                     -
Caritas Business College           BUSINESS STATISTICS – OAD 221
3. Hooda, R. P.: Statistics for Business and Economics, Macmillan, New Delhi.
NJ.
                                                   -60
                                                   -
Caritas Business College            BUSINESS STATISTICS – OAD 221
 Module 3:      VARIATION
 Topic :        Standard Deviation and Variance
        OBJECTIVE: The objective of the present lesson is to impart the knowledge of measures
                              of dispersion and skewness and to enable the students to distinguish
                              between average, dispersion, skewness, moments and kurtosis.
        STRUCTURE:
        3.1    Introduction
        3.2    Meaning and Definition of Dispersion
        3.3    Significance and Properties of Measuring Variation
        3.4    Measures of Dispersion
        3.5    Range
        3.6    Interquartile Range or Quartile Deviation
        3.7    Mean Deviation
        3.8    Standard Deviation
        3.9    Lorenz Curve
        3.10   Skewness: Meaning and Definitions
        3.11   Tests of Skewness
        3.12   Measures of Skewness
        3.13   Moments
        3.14   Kurtosis
        3.15   Summary
        3.16   Self-Test Questions
        3.17   Suggested Readings
3.1 INTRODUCTION
may be noted that these measures do not indicate the extent of dispersion or
increasing our understanding of the pattern of the data. Further, a high degree of
is a high degree of variability in the raw material, then it could not find mass
production economical.
                                                     -61
                                                     -
Caritas Business College             BUSINESS STATISTICS – OAD 221
Suppose an investor is looking for a suitable equity share for investment. While
examining the movement of share prices, he should avoid those shares that are
highly fluctuating-having sometimes very high prices and at other times going
very low. Such extreme fluctuations mean that there is a high risk in the investment
in shares. The investor should, therefore, prefer those shares where risk is not so
high.
The various measures of central value give us one single figure that represents the
entire data. But the average alone cannot adequately describe a set of
observations, unless all the observations are the same. It is necessary to describe
central value may be the same but still there can be wide disparities in the
formation of distribution.
distribution.
It is clear from above that dispersion (also known as scatter, spread or variation)
measures the extent to which the items vary from some central value. Since
an average, they are also called averages of the second order. An average is
more meaningful when it is examined in the light of dispersion. For example, if the
                                                       -62
                                                       -
Caritas Business College                  BUSINESS STATISTICS – OAD 221
workers of factory A is Rs. 3885 and that of factory B Rs. 3900, we cannot
necessarily conclude that the workers of factory B are better off because in factory B
there may be much greater dispersion in the distribution of wages. The study of
following example:
100 102 2
100 103 3
100 90 5
Since arithmetic mean is the same in all three series, one is likely to conclude that
perfectly represented by
from the
                                                       -63
                                                       -
Caritas Business College           BUSINESS STATISTICS – OAD 221
arithmetic mean and hence there is no dispersion. In series B, only one item is
perfectly represented by the arithmetic mean and the other items vary but the
represented by the arithmetic mean and the items vary widely from one another. In
two groups of labourers with the same mean salary and yet their distributions may
differ widely. The mean salary may not be so important a characteristic as the
variation of the items from the mean. To the student of social affairs the mean
income is not so vitally important as to know how this income is distributed. Are a
large number receiving the mean income or are there a few with enormous incomes
and millions with incomes far below the mean? The three figures given in Box 3.1
distractions with the same mean X , but with different dispersions. The two curves in
(b) represent two distributions with the same dispersion but with unequal means X l
and X 2, (c) represents two distributions with unequal dispersion. The measures of
measures the extent to which there are differences between individual observation
the amount of the variation or its degree but not in the direction. For example, a
measure of 6 inches below the mean has just as much dispersion as a measure of
                                                   -64
                                                   -
Caritas Business College            BUSINESS STATISTICS – OAD 221
tendency gives us an idea of the concentration of the observations about the central
part of the distribution. If we know the average alone, we cannot form a complete
idea about the distribution. But with the help of dispersion, we have an idea about
VARIATION
of the mass. When dispersion is small, the average is a typical value in the
sense that it closely represents the individual value and it is reliable in the
On the other hand, when dispersion is large, the average is not so typical, and
unless the sample is very large, the average may be quite unreliable.
in body temperature, pulse beat and blood pressure are the basic guides to
the causes of which are sought through inspection is basic to the control of
series with regard to their variability. The study of variation may also be
looked
                                                     -65
                                                     -
Caritas Business College              BUSINESS STATISTICS – OAD 221
Deviation, Mean deviation, Standard Deviation, and Lorenz curve. Among them, the
first four are mathematical methods and the last one is the graphical method.
3.5 RANGE
The simplest measure of dispersion is the range, which is the difference between the
Example 3.1: Find the range for the following three sets of data:
Set 1: 05 15 15 05 15 05 15 15 15 15
Set 2: 8 7 15 11 12 5 13 11 15 9
                                                        -66
                                                        -
Caritas Business College             BUSINESS STATISTICS – OAD 221
Set 3: 5 5 5 5 5 5 5 5 5 5
Solution: In each of these three sets, the highest number is 15 and the lowest
number is 5. Since the range is the difference between the maximum value and the
minimum value of the data, it is 10 in each case. But the range fails to give any idea
about the dispersal or spread of the series between the highest and the lowest value.
upper limit of the highest class and the lower limit of the lowest class.
Example 3.2: Find the range for the following frequency distribution:
                                      Size of                    Frequen
                                        Item                        cy
                                       20- 40                       7
                                       40- 60                       11
                                       60- 80                       30
                                       80-100                       17
                                      100-120                       5
                                        Total                       70
Solution: Here, the upper limit of the highest class is 120 and the lower limit of the
lowest class is 20. Hence, the range is 120 - 20 = 100. Note that the range is not
S, where L is the largest value and S is the smallest value in a distribution. The
coefficient of range is calculated by the formula: (L-S)/ (L+S). This is the relative
measure. The coefficient of the range in respect of the earlier example having three
sets of data is: 0.5.The coefficient of range is more appropriate for purposes of
Example 3.3: Calculate the coefficient of range separately for the two sets of data
given below:
               Set 1          8       10      20      9      15      10      13      28
               Set 2          30      35      42      50     32      49      39      33
                                                      -67
                                                      -
Caritas Business College             BUSINESS STATISTICS – OAD 221
Solution: It can be seen that the range in both the sets of data is the
same: Set 1 28 - 8 = 20
Set 2 50 - 30 = 20
               28 – 8 = 0.55
               28+8
               Coefficient of range in set 2 is:
               50 – 30
                          = 0.25
               50 +30
1. It is based only on two items and does not cover all the items in a distribution.
same population.
3. It fails to give any idea about the pattern of distribution. This was evident from
Despite these limitations of the range, it is mainly used in situations where one wants
to quickly have some idea of the variability or' a set of data. When the sample size is
very small, the range is considered quite adequate measure of the variability. Thus,
it is widely used in quality control where a continuous check on the variability of raw
weather forecast. The meteorological department uses the range by giving the
maximum and the minimum temperatures. This information is quite useful to the
common man, as he can know the extent of possible variation in the temperature on
a particular day.
                                                      -68
                                                      -
Caritas Business College            BUSINESS STATISTICS – OAD 221
distribution than the range. Here, avoiding the 25 percent of the distribution at both
the ends uses the middle 50 percent of the distribution. In other words, the
interquartile range denotes the difference between the third quartile and the first
quartile.
Many times the interquartile range is reduced in the form of semi-interquartile range
When quartile deviation is small, it means that there is a small deviation in the central
50 percent items. In contrast, if the quartile deviation is high, it shows that the
central
distribution, the two quartiles, that is, Q3 and QI are equidistant from the median.
Symbolically,
M-QI = Q3-M
However, this is seldom the case as most of the business and economic data are
              Coefficient of QD =     Q3 –Q1
                                      Q3 +Q1
upper and lower quartiles. As the computation of the two quartiles has already been
                                                       -69
                                                       -
Caritas Business College               BUSINESS STATISTICS – OAD 221
The mean deviation is also known as the average deviation. As the name implies, it
is the average of absolute amounts by which the individual items deviate from the
mean. Since the positive deviations from the mean are equal to the negative
deviations, while computing the mean deviation, we ignore positive and negative
signs.
Symbolically,
from the mean ignoring positive and negative signs, n = the total number of
observations.
                                                         -70
                                                         -
Caritas Business College             BUSINESS STATISTICS – OAD 221
Example 3.4:
Solution:
                            2-4                   3                    20    60   -2.6   52
                            4-6                   5                    40   200   -0.6    24
                            6-8                   7                    30   210    1.4    42
                           8-10                   9                    10    90    3.4    34
                                                 Total                100   560          152
                                   fm           560
                      x       =      n                 5.6
                                                 100
                                   f |d |             152
                                                             1.52
                      MD ( x ) =         n             100
result, a change in the value of any item will have its effect on the magnitude
of mean deviation.
3. The values of extreme items have less effect on the value of the mean
deviation.
                                                               -71
                                                               -
Caritas Business College            BUSINESS STATISTICS – OAD 221
2. At times it may fail to give accurate results. The mean deviation gives best
results when deviations are taken from the median instead of from the mean.
But in a series, which has wide variations in the items, median is not a
satisfactory measure.
algebraic signs when the deviations are taken from the mean.
The standard deviation is similar to the mean deviation in that here too the deviations
are measured from the mean. At the same time, the standard deviation is preferred
to the mean deviation or the quartile deviation or the range because it has desirable
mathematical properties.
Before defining the concept of the standard deviation, we introduce another concept
viz. variance.
Example 3.5:
                                     X               X-           (X-)2
                                     20           20-18=12            4
                                     15           15-18= -3           9
                                     19           19-18 = 1           1
                                     24           24-18 = 6          36
                                     16           16-18 = -2          4
                                     14           14-18 = -4         16
                                    108             Total            70
               Solution:
                                         108
                                Mean =         = 18
                                          6
                                                       -72
                                                       -
Caritas Business College                BUSINESS STATISTICS – OAD 221
The second column shows the deviations from the mean. The third or the last column
shows the squared deviations, the sum of which is 70. The arithmetic mean of the
                                      = 70/6=11.67 approx.
                             N
This mean of the squared deviations is known as the variance. It may be noted
that this variance is described by different terms that are used interchangeably: the
variance of the distribution X; the variance of X; the variance of the distribution; and
               It is also written as  2        x i   2
               
                                                   N
(points). If a distribution relates to income of families then the variance is (Rs) 2 and
not rupees. Similarly, if another distribution pertains to marks of students, then the
variance is taken, which yields a better measure of dispersion known as the standard
deviation. Taking our earlier example of individual observations, we take the square
In applied Statistics, the standard deviation is more frequently used than the
                                                           -73
                                                           -
Caritas Business College                     BUSINESS STATISTICS – OAD 221
                                                                      
                                                                       x
                                                                                2
                                                             x 2           i
                                                               i    
                                                       =               N
                                                                    N
We use this formula to calculate the standard deviation from the individual
Example 7.6:
                                                       X                  X
                                                                          2
                                                       20                400
                                                       15                225
                                                       19                361
                                                       24                576
                                                       16                256
                                                       14                196
                                                      108               2014
               Solution:
                                                      xi
                      x             2014
                            2
                                                                       N=6
                                i                            108
                       = 70                           Or,     = 11.67
                            6
 = 3.42
Example 3.7:
                                              Marks                Number of
                                                                    Students
                                              0- 10                     1
                                              10- 20                    3
                                              20- 30                    6
                                              30- 40                   10
                                              40- 50                   12
                                                              -74
                                                              -
Caritas Business College   BUSINESS STATISTICS – OAD 221
50- 60 11
                                       -75
                                       -
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                      60- 70                         6
                                      70- 80                         3
                                      80- 90                         2
                                      90-100                         1
               Solution:
= K
                                                     fim
                                                         i
                                                               i1
                                                                       2
                                                         N
               Where mi is the mid-point of the class intervals  is the mean of the distribution, fi is
the frequency of each class; N is the total number of frequency and K is the number
of classes. This formula requires that the mean  be calculated and that deviations
(mi -
) be obtained for each class. To avoid this inconvenience, the above formula can be
modified as:
                                                                     2             
                                                                K         K
                                                                      N
               Where C is the class interval: fi is the frequency of the ith class and di is the
deviation of the of item from an assumed origin; and N is the total number of
observations.
                                                     45 2
                                       = 10 231        
                                             55 55 
                                                         -76
                                                         -
Caritas Business College            BUSINESS STATISTICS – OAD 221
=104.2  0.669421
=18.8 marks
        When it becomes clear that the actual mean would turn out to be in fraction, calculating
               deviations from the mean would be too cumbersome. In such cases, an assumed
               mean is used and the deviations from it are calculated. While mid- point of any class
               can be taken as an assumed mean, it is advisable to choose the mid-point of that
               class that would make calculations least cumbersome. Guided by this consideration,
               in Example 3.7 we have decided to choose 55 as the mid-point and, accordingly,
               deviations have been taken from it. It will be seen from the calculations that they are
               considerably simplified.
               3.8.1   USES OF THE STANDARD DEVIATION
determine as to how far individual items in a distribution deviate from its mean. In a
(i) About 68 percent of the values in the population fall within: + 1 standard
(ii) About 95 percent of the values will fall within +2 standard deviations from the
mean.
(iii) About 99 percent of the values will fall within + 3 standard deviations from
the mean.
in the same units as the original data. As such, it cannot be a suitable measure while
comparing two or more distributions. For this purpose, we should use a relative
variation, which relates the standard deviation and the mean such that the standard
deviation is expressed as a percentage of mean. Thus, the specific unit in which the
standard deviation is measured is done away with and the new unit becomes
percent.
                                                     -77
                                                     -
Caritas Business College              BUSINESS STATISTICS – OAD 221
                                                                
                Symbolically, CV (coefficient of variation) =        x 100
                                                                    
Example 3.8: In a small business firm, two typists are employed-typist A and typist
B. Typist A types out, on an average, 30 pages per day with a standard deviation of 6.
                                                                Or A  6
                                                                                   x 100
                                                                              30
Or 20% and
                                                                     
                               Coefficient of variation for B           x 100
                                                                     
                                                                         10
                                                                B            x 100
                                                                         45
or 22.2 %
These calculations clearly indicate that although typist B types out more pages,
there is a greater variation in his output as compared to that of typist A. We can say
this in a different way: Though typist A's daily output is much less, he is more
clear in comparing two groups of data having different means, as has been the case
The variable Z = (x - x )/s or (x - )/, which measures the deviation from the
mean in units of the standard deviation, is called a standardised variable. Since both
the numerator and the denominator are in the same units, a standardised variable is
independent of units used. If deviations from the mean are given in units of the
scores.
                                       -79
                                       -
Caritas Business College              BUSINESS STATISTICS – OAD 221
compositions differ.
Example 3.9: A student has scored 68 marks in Statistics for which the
average marks were 60 and the standard deviation was 10. In the paper on
Marketing, he scored 74 marks for which the average marks were 68 and the
15. In which paper, Statistics or Marketing, was his relative standing higher?
from the mean x in terms of standard deviation s. For Statistics, Z = (68 - 60)  10 =
Example 3.10: Convert the set of numbers 6, 7, 5, 10 and 12 into standard scores:
Solution:
                                          X                             X2
                                          6                          36
                                          7                          49
                                          5                          25
                                          10                        100
                                          12                        144
                                                                       2
                                        X = 40                    X =
                                                                        354
x   x  N  40  5  8
                                       X   2                     402
                             
                             NN
                               x2                             354 
                                                                    5
                       =                         or,     =
                                                                    5
                                                         -80
                                                         -
Caritas Business College                      BUSINESS STATISTICS – OAD 221
                                xx          68
                           Z=                      = -0.77 (Standard score)
                                            2.61
                           78
                    (i)                = -0.38
                           2.61
                    (ii)
                            5  8 = -1.15
                    (iii) 2.61
                                       = 0.77
                           10 
                           8
(iv)                       (
                           i       = 1.53
                           v  2.61
                           )
                             12 
                             8
2.61
Thus the standard scores for 6,7,5,10 and 12 are -0.77, -0.38, -1.15, 0.77 and 1.53,
respectively.
This measure of dispersion is graphical. It is known as the Lorenz curve named after
Dr. Max Lorenz. It is generally used to show the extent of concentration of income
and wealth. The steps involved in plotting the Lorenz curve are:
2. Calculate percentage for each item taking the total equal to 100.
3. Choose a suitable scale and plot the cumulative percentages of the persons
4. Show the line of equal distribution, which will join 0 of X-axis with 100 of Y-
axis.
5. The curve obtained in (3) above can now be compared with the straight line of
equal distribution obtained in (4) above. If the Lorenz curve is close to the line
                           of equal distribution, then it implies that the dispersion is much less. If, on the
                                                              -81
                                                              -
Caritas Business College             BUSINESS STATISTICS – OAD 221
contrary, the Lorenz curve is farther away from the line of equal distribution,
The Lorenz curve is a simple graphical device to show the disparities of distribution
Figure 3.1 shows two Lorenz curves by way of illustration. The straight line AB is a
line of equal distribution, whereas AEB shows complete inequality. Curve ACB and
A F
As curve ACB is nearer to the line of equal distribution, it has more equitable
distribution of income than curve ADB. Assuming that these two curves are for the
same company, this may be interpreted in a different manner. Prior to taxation, the
curve ADB showed greater inequality in the income of its employees. After the
taxation, the company’s data resulted into ACB curve, which is closer to the line of
equal distribution. In other words, as a result of taxation, the inequality has reduced.
may be repeated here that frequency distributions differ in three ways: Average
value, Variability or dispersion, and Shape. Since the first two, that is, average
value and
                                                      -82
                                                      -
Caritas Business College                BUSINESS STATISTICS – OAD 221
variability or dispersion have already been discussed in previous chapters, here our
main spotlight will be on the shape of frequency distribution. Generally, there are two
a distribution. Two distributions may have the same mean and standard deviation but
may differ widely in their overall appearance as can be seen from the following:
nature.
distributions.
symmetrical distribution the mean, median and mode are identical. The more
the mean moves away from the mode, the larger the asymmetry or
skewness."
4. "A distribution is said to be 'skewed' when the mean and the median fall at
different points in the distribution, and the balance (or centre of gravity) is
                                                          -83
                                                          -
Caritas Business College             BUSINESS STATISTICS – OAD 221
The above definitions show that the term 'skewness' refers to lack of symmetry" i.e.,
distribution.
The concept of skewness will be clear from the following three diagrams showing a
distribution.
metrical distribution the values of mean, median and mode coincide. The
2. Asymmetrical Distribution. A
the mean is maximum and that of mode least-the median lies in between the
maximum and that of mean least-the median lies in between the two. In the
greater
                                                      -84
                                                      -
Caritas Business College              BUSINESS STATISTICS – OAD 221
range of values on the high-value end of the curve (the right-hand side) than
they are on the low-value end. In the negatively skewed distribution the
position is reversed, i.e. the excess tail is on the left-hand side. It should be
mean and the median is approximately one-third of the interval between the
          In order to ascertain whether a distribution is skewed or not the following tests may be
        applied. Skewness is present if:
                   1.      The values of mean, median and mode do not coincide.
2. When the data are plotted on a graph they do not give the normal bell-
shaped form i.e. when cut along a vertical line through the centre the two
3. The sum of the positive deviations from the median is not equal to the
the mode.
3. Sum of the positive deviations from the median is equal to the sum of
                                                       -85
                                                       -
Caritas Business College            BUSINESS STATISTICS – OAD 221
the mode.
There are four measures of skewness, each divided into absolute and relative
measures. The relative measure is known as the coefficient of skewness and is more
frequently used than the absolute measure of skewness. Further, when a comparison
which is used. The measures of skewness are: (i) Karl Pearson's measure, (ii)
Bowley’s measure, (iii) Kelly’s measure, and (iv) Moment’s measure. These
               Coefficient of skewness =
                                                Mean – Mode
                                              Standard Deviation
               In case the mode is indeterminate, the coefficient of skewness is:
greater than the mode or less than the mode. If it is greater than the mode, then
               skewness is
                                                       -86
                                                       -
Caritas Business College               BUSINESS STATISTICS – OAD 221
positive. But when the mean is less than the mode, it is negative. The difference
between the mean and mode indicates the extent of departure from symmetry. It is
the unit of measurement. It may be recalled that this observation was made in the
skewness lies between +1. If the mean is greater than the mode, then the coefficient
Example 3.11: Given the following data, calculate the Karl Pearson's coefficient of
Solution:
                       SkP =
                               Mean - Mode
                            Standard deviation
                             X 452
               Mean ( x               45.2
               )=
                               N 10 2                   2
               SD               x           x
                      x2
                  N             x2
                                                
                               
                                       N    
                                  N
                                 
                                                N 
                           2          2427  (45.2)    19.59
               24270      452
                          
                                                      2
                        
                       10  10 
               Applying the values of mean, mode and standard deviation in the above
               formula, Sk = 45.2 – 43.7
                          p
                        19.59
               =0.08
This shows that there is a positive skewness though the extent of skewness is
marginal.
Example 3.12: From the following data, calculate the measure of skewness using
                       X       10 -      20 -    30 - 40     40 -    50-   60 -   70 -
                                20        30                  50     60     70     80
                       f        18        30       40         55     38     20     16
                                                       -87
                                                       -
Caritas Business College                   BUSINESS STATISTICS – OAD 221
        Solution:
                                                                                  2
                              x        MVx         dx     f            fdx     fdX       cf
                            10 -        15         -3    18            -54     162       18
                             20
                            20 -         25        -2    30            -60     120       48
                             30
                            30 -         35        -1    40            -40      40       88
                             40
                           40-50         45=       0     55             0        0      143
                                           a
                           50 -           55       1     38            38       38      181
                            60
                           60 -          65        2     20            40       80      201
                            70
                           70 -          75        3     16            48      144      217
                            80
                                      Tota 217        -28     584
                                         l
               a = Assumed mean = 45, cf = Cumulative frequency, dx = Deviation from assumed
mean, and i = 10
                          fdx
               xa              i
                           N
               45  28
                           10  43.71
                      21
                      7
                               l2  l1
               Median= l1               (m  c)
                                 f1
                                     50  40
               Median  40                     (109  88)
                                      55
                      10
                40      21
                     55
= 43.82
               SD                =
                                                fd 2fd
                                                   x      x   
                                                                             584  28 2
                                                                                  
                                                f   f          10 
                                                                            217 217    10
                                                             -88
                                                             -
Caritas Business College       BUSINESS STATISTICS – OAD 221
                           =                   10  16.4
                                2.69 - 0.016
               Skewness    =   3 (Mean - Median)
= 3 (43.71 - 43.82)
= 3 x -0.011
                                               -89
                                               -
Caritas Business College               BUSINESS STATISTICS – OAD 221
= -0.33
Coefficient of skewness
                Skewness           or
                  SD
               =              -0.33
                              16.4
               =               -0.02
The result shows that the distribution is negatively skewed, but the extent of
               Skewness      Q3  Q1 
               =               2M Q3 
                               Q1
Where Q3 and Q1 are upper and lower quartiles and M is the median. The value of
this skewness varies between +1. In the case of open-ended distribution as well as
where extreme values are found in the series, this measure is particularly useful. In a
symmetrical distribution, skewness is zero. This means that Q3 and Q1 are positioned
when the distribution is skewed, then Q3 - Q2 will be different from Q2 – Q1' When Q3
written as:
while comparing two distributions where the units of measurement are different. In
below:
                                                      -90
                                                      -
Caritas Business College                BUSINESS STATISTICS – OAD 221
Solution:
                                                                      Q3  Q1  2M
               Bowley's coefficient of skewness is:           SkB =
                                                                        Q3  Q1
                           Q3  16.4 - (2 x
               SkB =           24.2)
Q3  16.4
                             Q3  16.4 -
               0.56           48.4
Q3  16.4
- 1.56 Q3 = - 41.184
                                          Q3   =       41.184
                                                        1.56    26.4
Now, we have the values of both the upper and the lower quartiles.
                                                                       Q     Q1 Q3  Q1
               Coefficient of quartile deviation =               =
                                                                       3
                                                      -91
                                                      -
Caritas Business College          BUSINESS STATISTICS – OAD 221
   26.4  16.4
   26.4  16.4                                               42.
                                                              8
                                               10                    0.234 Approx.
following data:
                                                 -92
                                                 -
Caritas Business College                    BUSINESS STATISTICS – OAD 221
Value in Rs Frequency
Less than 50 40
50 - 100 80
150 – 200 60
Solution: It should be noted that the series given in the question is an open-ended
be the most appropriate measure of skewness in this case. In order to calculate the
quartiles and the median, we have to use the cumulative frequency. The table is
50 - 100 80 120
                           l2  l1
               Q1 = l1              (m 
                             c) f1
                                                  341
                                n1                      = 85.25, which lies in 50 - 100 class
               Now     m=(               ) item     4
               =
                                 4
                            100  50
               Q1 = 50 +                  (85.25  40)  78.28
                                80
                     n1                                                                         4
               M=(            ) item
               =
                                                                -93
                                                                -
Caritas Business College              BUSINESS STATISTICS – OAD 221
341
      = 170.25, which lies in 100 -
150 class
 4
                                                  -94
                                                  -
Caritas Business College                BUSINESS STATISTICS – OAD 221
                            150  100
               M= 100 +                   (170.5  120)  119.4
                               130
                           l2  l1
               Q3 = l1            (m 
                             c) f1
m = 3(341)  4 = 255.75
                             200  150
               Q3 = 150 +                  (255.75  250)  154.79
                                 60
= - 0.075 approx.
This shows that there is a negative skewness, which has a very negligible magnitude.
                                                  D1  D9  2M
                                                    D9  D1
Where P and D stand for percentile and decile respectively. In order to calculate the
50th and 90th percentiles. Somehow, this measure of skewness is seldom used. All
Class Intervals f cf
                                10 - 20                 18        18
                                20 - 30                 30        48
                                                        -95
                                                        -
Caritas Business College                   BUSINESS STATISTICS – OAD 221
                                     30- 40                 40     88
                                     40- 50                 55     143
                                     50 - 60                38     181
                                     60 – 70                20     201
                                     70 - 80.               16     217
                            l2  l1
               PIO = l1               (m  c) , where m = (n + 1)/10th item
                                f1
               217  1
                            
                                       item
               21.8th
                 10
Kelley's skewness
                        88.87 - 87.64
               =            46.63
                                                            -96
                                                            -
Caritas Business College      BUSINESS STATISTICS – OAD 221
= 0.027
                                          -97
                                          -
Caritas Business College             BUSINESS STATISTICS – OAD 221
        This shows that the series is positively skewed though the extent of skewness is extremely
               negligible. It may be recalled that if there is a perfectly symmetrical distribution, then
               the skewness will be zero. One can see that the above answer is very close to zero.
        3.13   MOMENTS
In mechanics, the term moment is used to denote the rotating effect of a force. In
moments lies in the sense that they indicate different aspects of a given distribution.
dispersion or variability, skewness and the peakedness of the curve. The moments
about the actual arithmetic mean are denoted by . The first four moments
about mean or
               Third moment                    3      =
                                                                                   3
                                                                            x
                                                               x
                                                               N         1
               Fourth moment                   3      =
                                                                                   4
                                                               1            x
                                                                         1
                                                               x
                                                               N
                                                               1
                                                                         1
                                                               x
                                                               N
     1
           fix N                               2
                                          x
     1
           fix N
                                          x
                                                3      1
                                       -99
                                       -
Caritas Business College               BUSINESS STATISTICS – OAD 221
               Fourth moment                   3      =     1                  4
                                                                          x
                                                                       1
                                                              fix N
It may be noted that the first central moment is zero, that is, =
The third central moment 3 is used to measure skewness. The fourth central
Karl Pearson suggested another measure of skewness, which is based on the third
                      2
                1   3
                      3
                     2
Example 3.16: Find the (a) first, (b) second, (c) third and (d) fourth moments for the
Solution:
               (a)          x   2  3  4  5  6 20
                       x                           4
                            N           5         5
               (b)
                               x       22  32  42  52 
                       x
                                                62
                                 2
                                                5
                              N
                           4  9  16  25  36
                                  5             18
               (c)
                               x
                       x              23  33  43  53 
                                                63
                                 3
                                                5
                              N
                           8  27  64  125  216
                                    5              88
               (d)
                                                       -
                                                       100
Caritas Business College             BUSINESS STATISTICS – OAD 221
           x                         4
                                          34  44  54  64
    x                         4
                                                 5
         N         2
                                                       -
                                                       101
Caritas Business College                      BUSINESS STATISTICS – OAD 221
Example 3.17: Using the same set of five figures as given in Example 3.7, find the
(a) first, (b) second, (c) third and (d) fourth moments about the mean.
Solution:
                                          (x  x)   (2  4)  (3  4)  (4  4)  (5  4)  (6  4)
             m             (x  x)               
                   1
                                              N                             5
                               - 2 -1  0  1  2
                          =             5         =0
        m2  (x  x)2
                               (x  x)
                                        2
        
                                            (2  4)2  (3  4)2  (4  4)2  (5  4)2  (6  4)2
                                  N                                5
                       (-2)2  (_1) 2  02  12  22
               =
                                     5
                       41014
               =                              = 2. It may be noted that m2 is the
                               variance 5
                                 (x  x)
                                          3
                                              (2  4)3  (3  4)3  (4  4)3  (5  4)3  (6  4)3
        m3=  (x  x)3              N                                5
        
                       (-2)3  (_1)3  03  13 
                                   23                      - 8 -1  0  1  8
               =                                       =           5          0
                                    5
                                          4                  4
                                 (x  x)          (2  4)        (3  4)4  (4  4)4  (5  4)4  (6
                                    N                                        4)4
        m4=  (x  x)4                        
                                                                            5
                               (-2)4  (_1) 4  04  14  24
                           =
                                              5
                               16  1  0  1  016
                           =           5             6.8
Example 3.18: Calculate the first four central moments from the following data:
                                                                   -
                                                                   102
Caritas Business College            BUSINESS STATISTICS – OAD 221
                                                 -
                                                 103
Caritas Business College                      BUSINESS STATISTICS – OAD 221
                         fd  i          3 10
                 '                               0.6
                 1        N                  50
                     fd  i
                            2
                               63                   12.6
                2'     N    
                             10
                                50
                     fd  i
                        3
                                                    0.8
                        N
                2'            4
                             
                             10
                
                                 50
                      fd  i4
               2'           195                  19
                        N    
                             10
                                 50
               Moments about Mean
               1=1’ - 1’= -0.6-(-0.6) = 0
               2=2’ - 1’2=10-( -0.6)2= 10-3.6=6.4
               3=3’ - 32’’1+21’3=-0.8-3(12.6)(-0.6)+2(-0.6)3
               = -0.8 + 22.68 + 0.432 = 22.312
                                     ’2   ’4
               4=4’ - 43’’1+621 -31
               = 19 + 4(-0.8)(-0.6) + 6(10)(-0.6)2- 3(-0.6)4
               = 19 + 1.92 + 21.60 - 0.3888
               = 42.1312
        3.14   KURTOSIS
which means bulginess. While skewness signifies the extent of asymmetry, kurtosis
classified curves into three types on the basis of the shape of their peaks. These are
mesokurtic, leptokurtic and platykurtic. These three types of curves are shown in
figure below:
                                                             -
                                                             104
Caritas Business College              BUSINESS STATISTICS – OAD 221
curve of a normal
distribution. Leptokurtic
curve is a more peaked than the normal curve. In contrast, platykurtic is a relatively
flat curve. The coefficient of kurtosis as given by Karl Pearson is  2=4/22. In case
of a normal distribution, that is, mesokurtic curve, the value of 2=3. If 2 turn out to
be
> 3, the curve is called a leptokurtic curve and is more peaked than the normal curve.
Again, when 2 < 3, the curve is called a platykurtic curve and is less peaked than
the normal curve. The measure of kurtosis is very helpful in the selection of an
appropriate average. For example, for normal distribution, mean is most appropriate;
Example 3.19: From the data given in Example 3.18, calculate the kurtosis.
Solution: For this, we have to calculate 2 This can be done by using the formula
Another measure of kurtosis is based on both quartiles and percentiles and is given
                        Q
               K 
                     P90 
                     P10
                                                        -
                                                        105
Caritas Business College                     BUSINESS STATISTICS – OAD 221
percentile and P10 is the 10th percentile. This is also known as the percentile
Example 3.20: From the data given below, calculate the percentile coefficient of
kurtosis.
50- 60 10 10
60-70 14 24
70-80 18 42
80 - 90 24 66
90-100 16 82
100 -110 12 94
                       110 - 120                               6                              10
                                                                                              0
                       Total                                    100
Solution: It may be noted that the question involved first two columns and in order to
                                     l2  l1
               Q1      =        l1                (m  c) , where m = (n + 1)/4th item, which is = 25.25th item
                                        f1
                                         80  70
                       =         70 +                  (25.25       = 70.69
                                         24)
                                           18
                                        l2  l1
               Q3      =         l1 +               (m  c) , where m = 75.75
                                             f1
                                         100 
                       =
                       90        90 +                   (75.75 - 66) = 96.09
                                               16
                                                               -
                                                               106
Caritas Business College                   BUSINESS STATISTICS – OAD 221
                                          l2  l1
               PI0      =          l1 +             (m  c) , where m = 10.1
                                            f1
                                             70 
                        =          60 +       60      (10.01 -10) = 60.07
14
                                          l2  l1
               P90      =          l1 +             (m  c) , where m = 90.9
                                            f1
                         110 
               = 100 +
               100                        (90.9 - 82) = 107.41
                              12
                         Q
               K 
                      P90 
                      P10
                   1/ 2(Q3  Q1 )
               =      P90  P10
                   ½ (96.09 - 70.69)
               =    107.41 - 60.07
= 0.268
It will be seen that the above distribution is very close to normal distribution as the
        3.15   SUMMARY
        The average value cannot adequately describe a set of observations, unless all the
               observations are the same. It is necessary to describe the variability or dispersion of
               the observations. In two or more distributions the central value may be the same but
               still there can be wide disparities in the formation of distribution. Therefore, we have
               to use the measures of dispersion.
               Further, two distributions may have the same mean and standard deviation but may
                                                              -
                                                              107
Caritas Business College             BUSINESS STATISTICS – OAD 221
skewness.
2. “Variability is not an important factor because even though the outcome is more
certain, you still have an equal chance of falling either above or below the median.
Therefore, on an average, the outcome will be the same.” Do you agree with this
3. Why is the standard deviation the most widely used measure of dispersion? Explain.
6. What are the different measures of skewness? Which one is repeatedly used?
(i) Compute the moment coefficients of skewness and kurtosis. (ii) Is the distribution
10. The first four moments of a distribution about the value 4 are 1,4, 10 and 45. Obtain
                                                     -
                                                     108
Caritas Business College               BUSINESS STATISTICS – OAD 221
11. Define kurtosis. If β1=1 and β2 =4 and variance = 9, find the values of β3 and β4 and
12. Calculate the first four moments about the mean from the following data. Also
No. of students5 12 18 40 15 7 3
New Delhi.
Hall, NJ.
                                                      -
                                                      109
Caritas Business College              BUSINESS STATISTICS – OAD 221
        Structure
        4.1    Introduction
        4.2    What is Correlation?
        4.3    Correlation Analysis
               4.3.1   Scatter Diagram
               4.3.2   Correlation Graph
               4.3.3   Pearson’s Coefficient of Correlation
               4.3.4   Spearman’s Rank Correlation
               4.3.5   Concurrent Deviation Method
        4.4    Limitations of Correlation Analysis
        4.5    Self-Assessment Questions
        4.6    Suggested Readings
 4.1    INTRODUCTION
        Statistical methods of measures of central tendency, dispersion, skewness and kurtosis are
helpful for the purpose of comparison and analysis of distributions involving only one
variable i.e. univariate distributions. However, describing the relationship between two or
In many business research situations, the key to decision making lies in understanding the
relationships between two or more variables. For example, in an effort to predict the
behavior of the bond market, a broker might find it useful to know whether the interest rate of
bonds is related to the prime interest rate. While studying the effect of advertising on sales,
an account executive may find it useful to know whether there is a strong relationship
The statistical methods of Correlation (discussed in the present lesson) and Regression (to
        be discussed in the next lesson) are helpful in knowing the relationship between two or more
Caritas Business College              BUSINESS STATISTICS – OAD 221
variables which may be related in same way, like interest rate of bonds and prime
interest rate; advertising expenditure and sales; income and consumption; crop-yield and
In all these cases involving two or more variables, we may be interested in seeing:
 if so, what form the relationship between the two variables takes;
 how we can make use of that relationship for predictive purposes, that is,
forecasting; and
Since these issues are inter related, correlation and regression analysis, as two sides of
a single process, consists of methods of examining the relationship between two or more
variables. If two (or more) variables are correlated, we can use information about one (or
more) variable(s) to predict the value of the other variable(s), and can measure the
Correlation is a measure of association between two or more variables. When two or more
exploratory research when the objective is to locate variables that might be related in some
Correlation can be classified in several ways. The important ways of classifying correlation
are:
If both the variables move in the same direction, we say that there is a positive correlation,
i.e., if one variable increases, the other variable also increases on an average or if one
On the other hand, if the variables are varying in opposite direction, we say that it is a case
                X       :   10   20   30     40    50
                Y :         25   50   75     100   125
The ratio of change in the above example is the same. It is, thus, a case of linear
correlation. If we plot these variables on graph paper, all the points will fall on the same
straight line.
On the other hand, if the amount of change in one variable does not follow a constant ratio
couple of figures in either series X or series Y are changed, it would give a non-linear
correlation.
The distinction amongst these three types of correlation depends upon the number of
variables involved in a study. If only two variables are involved in a study, then the
correlation is said to be simple correlation. When three or more variables are involved in a
study, then it is a problem of either partial or multiple correlation. In multiple correlation, three
or more variables are studied simultaneously. But in partial correlation we consider only two
variables influencing each other while the effect of other variable(s) is held constant.
        hours studied, Y is I.Q. and Z is the number of marks obtained in the examination. In a
Caritas Business College              BUSINESS STATISTICS – OAD 221
multiple correlation, we will study the relationship between the marks obtained (Z) and the
two variables, number of hours studied (X) and I.Q. (Y). In contrast, when we study the
The correlation analysis, in discovering the nature and degree of relationship between
variables, does not necessarily imply any cause and effect relationship between the
variables. Two variables may be related to each other but this does not mean that one
variable causes the other. For example, we may find that logical reasoning and creativity are
correlated, but that does not mean if we could increase peoples’ logical reasoning ability, we
reasoning ability does influence their creativity, then the two variables must be correlated
with each other. In other words, causation always implies correlation, however
1. The correlation may be due to chance particularly when the data pertain to a
small sample. A small sample bivariate series may show the relationship but such
2. It is possible that both the variables are influenced by one or more other variables.
households show a positive relationship because both have increased over time.
But, this is due to rise in family incomes over the same period. In other words, the
incomes.
3. There may be another situation where both the variables may be influencing each
other so that we cannot say which is the cause and which is the effect. For
                   example, take the case of price and demand. The rise in price of a commodity
Caritas Business College                BUSINESS STATISTICS – OAD 221
may lead to a decline in the demand for it. Here, price is the cause and the
demand is the effect. In yet another situation, an increase in demand may lead to
a rise in price. Here, the demand is the cause while price is the effect, which is
just the reverse of the earlier situation. In such situations, it is difficult to identify
The foregoing discussion clearly shows that correlation does not indicate any causation or
this has nothing to do with cause and effect relation. It only reveals co-variation between
two variables. Even when there is no cause-and-effect relationship in bivariate series and
one interprets the relationship as causal, such a correlation is called spurious or non-sense
correlation. Obviously, this will be misleading. As such, one has to be very careful in
correlation exercises and look into other relevant factors before concluding a cause-and-
effect relationship.
Correlation Analysis is a statistical technique used to indicate the nature and degree of
relationship existing between one variable and the other(s). It is also used along with
regression analysis to measure how well the regression line explains the variations of the
        The commonly used methods for studying linear relationship between two variables involve
        both graphic and algebraic methods. Some of the widely used methods include:
           1.      Scatter Diagram
2. Correlation Graph
This method is also known as Dotogram or Dot diagram. Scatter diagram is one of the
obtained is called "Scatter Diagram". By studying diagram, we can have rough idea about
the nature and degree of relationship between two variables. The term scatter refers to the
spreading of dots on the graph. We should keep the following points in mind while
interpreting correlation:
 if the plotted points are very close to each other, it indicates high degree of
correlation. If the plotted points are away from each other, it indicates low degree of
                correlation.
Caritas Business College             BUSINESS STATISTICS – OAD 221
 if the points on the diagram reveal any trend (either upward or downward), the
variables are said to be correlated and if no trend is revealed, the variables are
uncorrelated.
 if there is an upward trend rising from lower left hand corner and going upward to the
upper right hand corner, the correlation is positive since this reveals that the values
of the two variables move in the same direction. If, on the other hand, the points
depict a downward trend from the upper left hand corner to the lower right hand
corner, the correlation is negative since in this case the values of the two variables
 in particular, if all the points lie on a straight line starting from the left bottom and
going up towards the right top, the correlation is perfect and positive, and if all the
points like on a straight line starting from left top and coming down to right bottom,
The various diagrams of the scattered data in Figure 4-1 depict different forms of correlation.
Example 4-1
                                                     102
Caritas Business College                          BUSINESS STATISTICS – OAD 221
Given the following data on sales (in thousand units) and expenses (in thousand rupees) of a
        Month :           J          F             M      A     M            J   J    A         S    O
        Sales:            50       50              55    60     62       65      68   60        60   50
        Expenses:         11       13              14    16     16       15      15   14        13   13
                    a) Make a Scatter Diagram
Solution:(a) The Scatter Diagram of the given data is shown in Figure 4-2
                                         20
                                Expens
15
10
                                         0
                                              0          20           40         60        80
                                                                     Sales
(b) Figure 4-2 shows that the plotted points are close to each other and reveal an upward
trend. So there is a high degree of positive correlation between sales and expenses of the
firm.
This method, also known as Correlogram is very simple. The data pertaining to two series
are plotted on a graph sheet. We can find out the correlation by examining the direction and
closeness of two curves. If both the curves drawn on the graph are moving in the same
direction, it is a case of positive correlation. On the other hand, if both the curves are moving
in opposite direction, correlation is said to be negative. If the graph does not show
any definite pattern on account of erratic fluctuations in the curves, then it shows an absence
        of correlation.
                                                                103
Caritas Business College               BUSINESS STATISTICS – OAD 221
        Example 4-2
        Find out graphically, if there is any correlation between price yield per plot (qtls); denoted by
        Plot No.:       1         2       3     4      5       6       7      8       9      10
            Y:          3.5      4.3      5.2   5.8    6.4     7.3    7.2     7.5    7.8     8.3
X: 6 8 9 12 10 15 17 20 18 24
                            30
                            25
                            20
                            15
                            10
                              X and
                            5
                            0
                            12345678910
                            Plot Number
Figure 4-3 shows that the two curves move in the same direction and, moreover, they are
very close to each other, suggesting a close relationship between price yield per plot (qtls)
Remark: Both the Graphic methods - scatter diagram and correlation graph provide a
‘feel for’ of the data – by providing visual representation of the association between the
variables. These are readily comprehensible and enable us to form a fairly good,
though rough idea of the nature and degree of the relationship between the two variables.
However, these methods are unable to quantify the relationship between them. To quantify
the extent of correlation, we make use of algebraic methods - which calculate correlation
coefficient.
A mathematical method for measuring the intensity or the magnitude of linear relationship
                                                       104
Caritas Business College                    BUSINESS STATISTICS – OAD 221
between two variables was suggested by Karl Pearson (1867-1936), a great British
Biometrician and Statistician and, it is by far the most widely used method in practice.
Karl Pearson’s measure, known as Pearsonian correlation coefficient between two variables
relationship between them and is defined as the ratio of the covariance between X and Y,
Symbolically
                             Cov( X ,Y
                )r                                                …………(4.1)
                               Sx .S y
                     x
Y in a bivariate distribution,
                                       ( X  X )(Y 
                Cov(
                Y ) X ,Y )                                        …………(4.2a)
                                             N
               Sx            ( X  X )2                          …………(4.2b)
                                  N
        and     Sy                                                 …………(4.2c)
                
                             (Y  Y )2
                                  N
        Thus by substituting Eqs. (4.2) in Eq. (4.1), we can write the Pearsonian
        correlation coefficient as
                                  1
               rxy                     ( X  X )(Y  Y )
                                 N
                              1  ( X  X )21 (Y  Y )2
                              N              N
               rxy                                                 …………(4.3)
                             ( X  X )(Y  Y )
                           ( X  X )2 (Y  Y )2
                                                            105
Caritas Business College                                     BUSINESS STATISTICS – OAD 221
        If we denote, dNx  X                               an
        X                                                    d  dy YY
        We have
                       1
        Cov( X ,Y )       ( X  X )(Y  Y )
                     N               
                      1
                          XY 
                     XY N            
                      1           X Y
                         XY 
                      N           N     N
                                1                                                             …………(4.4)
                                    N  XY   X Y 
        and                 1
                S2                 ( X  X )
                                               2
                 x
                            N
                             1
                                        X 2 ( X ) 2
                            N        
                            1                              2
                                         2
                                X                     
                        
                            N
                                                         
                                                      N 
                                1                                    2
                                     N  X 2   X                                        …………(4.5a)
                        
                                N
        Similarly, we have
                        1                    2                                                …………(4.5b)
                S2        N  Y 2   Y 
                
                    y            2
                            N
                                                                              106
Caritas Business College        BUSINESS STATISTICS – OAD 221
        or     rxy N   XY   X Y                     …………(4.6)
                    N  X 2   X  N  Y 2   Y 
        
                                    2               2
                                             107
Caritas Business College                BUSINESS STATISTICS – OAD 221
Remark: Eq. (4.3) or Eq. (4.3a) is quite convenient to apply if the means X and
        Y come out to be integers. If X or/and Y is (are) fractional then the Eq. (4.3) or Eq. (4.3a) is
                                                                                    2              2
        quite cumbersome to apply, since the computations of              ( X  X ) ,    (Y  Y ) and
        ( X  X )(Y  Y
                         are quite time consuming and tedious. In such a case Eq. (4.6) may be
        )
used provided the values of X or/ and Y are small. But if X and Y assume large values, the
Thus if (i) X and Y are fractional and (ii) X and Y assume large values, the Eq. (4.3) and Eq.
(4.6) are not generally used for numerical problems. In such cases, the step deviation
method where we take the deviations of the variables X and Y from any arbitrary points is
-1 ≤ r ≤1
Remarks: (i) This property provides us a check on our calculations. If in any problem,
the obtained value of r lies outside the limits + 1, this implies that there is some mistake in
our calculations.
(ii) The sign of r indicate the nature of the correlation. Positive value of r indicates
absence of correlation.
various values of r:
                                                        108
Caritas Business College               BUSINESS STATISTICS – OAD 221
0 absence of correlation
                            XA
                       U=                   an             YB
                             h                       V 
                                            d               k
Where A, B, h and k are constants and h > 0, k > 0; then the correlation
i.e.,
Remark: This is one of the very important properties of the correlation coefficient and
is extremely helpful in numerical computation of r. We had already stated that Eq. (4.3) and
Eq.(4.6) become quite tedious to use in numerical problems if X and/or Y are in fractions or if
X and Y are large. In such cases we can conveniently change the origin and scale (if
possible) in X or/and Y to get new variables U and V and compute the correlation between U
           rxy  r
                   uv N  UV  U V                                         …………(4.7)
               
                       N  U 2   U  N  V 2  V 
                                                 2                2
                                                           109
Caritas Business College           BUSINESS STATISTICS – OAD 221
3. Two independent variables are uncorrelated but the converse is not true
                                                  110
Caritas Business College                  BUSINESS STATISTICS – OAD 221
rxy = 0
However, the converse of the theorem is not true i.e., uncorrelated variables need
distribution.
                       X        :          1       2    3     -3     -2     -1
                       Y        :          1       4    9      9      4      1
Hence in the above example the variable X and Y are uncorrelated. But if we
examine the data carefully we find that X and Y are not independent but are
connected by the relation Y = X2. The above example illustrates that uncorrelated
Remarks: One should not be confused with the words uncorrelation and
independence. rxy = 0 i.e., uncorrelation between the variables X and Y simply implies the
absence of any linear (straight line) relationship between them. They may, however, be
related in some other form other than straight line e.g., quadratic (as we have seen in the
coefficients, i.e.
The signs of both the regression coefficients are the same, and so the value of r will
This property will be dealt with in detail in the next lesson on Regression Analysis.
                                                        111
Caritas Business College              BUSINESS STATISTICS – OAD 221
determination.
better and useful measure for interpreting the value of r. This property will also
The correlation coefficient establishes the relationship of the two variables. After ascertaining
this level of relationship, we may be interested to find the extent upto which this coefficient is
dependable. Probable error of the correlation coefficient is such a measure of testing the
for the two variables under consideration, then the Probable Error, denoted by PE (r) is
expressed as
                        PE(r)  0.6745
                                         SE(r)
        or              PE(r)  0.6745 1  r 2
                                         N
PE(r), implying that if we take another random sample of the size N from the same
population, then the observed value of the correlation coefficient in the second
sample can be expected to lie within the limits given above, with 0.5 probability.
When sample size N is small, the concept or value of PE may lead to wrong
                                                       112
Caritas Business College                      BUSINESS STATISTICS – OAD 221
be fairly large.
correlation.
        Example 4-3
        Find the Pearsonian correlation coefficient between sales (in thousand units) and expenses (in
        Firm:             1         2          3          4    5        6          7    8      9         10
        Sales:            50        50         55         60   65       65         65   60     60        50
        Expenses:         11        13         14         16   16       15         15   14     13        13
                    Firm       X          Y                                   2           2        dx.dy
                                                                             dx          dy
                                                     dx         dy
                                                    XX       YY
                      1        50         11         -8         -3            64         9          24
                      2        50         13          -8           -1         64         1          8
                      3        55         14          -3           0          9          0          0
                      4        60         16          2            2          4          4          4
                      5        65         16          7            2          49         4          14
                      6        65         15          7            1          49         1          7
                      7        65         15          7            1          49         1          7
                      8        60         14          2            0          4          0          0
                      9        60         13          2            -1         4          1          -2
                     10        50         13          -8           -1         64         1          8
                               X        Y                                     2          2   dxdy
                                                                             d x       d y
                                                               113
Caritas Business College                      BUSINESS STATISTICS – OAD 221
                       X        580                                               Y        140
                  X=        =            = 58             and                 Y=        =          = 14
                        N           10                                             N         10
rxy   dx d y
                                    rxy          dd x
                                                          2
                                                              y
                                                                  2
                                                 70
                                    rxy        360x22
                                                70
                                       7920
                               rxy  0.78
        The value of rxy  0.78 , indicate a high degree of positive correlation between sales and expenses.
        Example 4-4
        The data on price and quantity purchased relating to a commodity for 5 months is
given below:
Find the Pearsonian correlation coefficient between prices and quantity and comment on
               Month             X                Y                      2                   2                 XY
                                                                        X                   Y
                  1             10                5                     100                 25                  50
                  2             10                6                     100                 36                  60
                  3             11                4                     121                 16                  44
                  4             12                3                     144                  9                  36
                  5             12                3                     144                  9                  36
                                                                      114
Caritas Business College                BUSINESS STATISTICS – OAD 221
                                                                  2             2
                            X =55        Y =21             X        609   Y  95         XY  226
                         rxy N   XY   X Y
                               N  X 2   X  N  Y 2   Y 
                                                         2                     2
                         rxy 
                               5x226  55x21
                               (5x609  55x55)(5x95  21x21)
                         rxy 
                               1130 1155
                              20x34
                         rxy 
                                25
                               680
rxy  0.98
The negative sign of r indicate negative correlation and its large magnitude indicate a very
high degree of correlation. So there is a high degree of negative correlation between prices
        Example 4-5
        Find the Pearsonian correlation coefficient from the following series of marks obtained by 10
        X:     45        70        65       30     90         40         50   75        85   60
        Y:     35        90        70       40     95         40         60   80        80   50
        Solution:
                                    Calculations for Coefficient of Correlation
                                                   {Using Eq. (4.7)}
                    X         Y         U           V                   2           2             UV
                                                                       U           V
                    45        35        -3          -6                  9          36             18
                    70        90        2           5                   4          25             10
                    65        70        1           1                   1          1              1
                                                              115
Caritas Business College                   BUSINESS STATISTICS – OAD 221
                   30            40        -6        -5         36                25        30
                   90            95        6         6          36                36        36
                   40            40        -4        -5         16                25        20
                   50            60        -2        -1          4                1             2
                   75            80        3         3           9                9             9
                   85            80        5         3          25                9         15
                   60            50        0         -3          0            9                 0
                                          U     V        U 
                                                                2
                                                                         V
                                                                              2
                                                                                   176   UV       
                                                                                                    141
                                           2       2         140
                       X
                U
                60                         and                 Y  65
                                                          V      5
                        5
                                      10x141  2x(2)
                             =
                                      10x140  2x2 10x176  (2)x(2)
                                 1410  4
                             =
                                 1400  4 1760  4
                             1414
                         =
                             2451376
= 0.9
and in Statistics.
PE(r)  0.6745 1  r 2
                                                          116
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                 1  0.9 2
                PE(r)  0.6745         10
PE(r)  0.0405
Sometimes we come across statistical series in which the variables under consideration
are not capable of quantitative measurement but can be arranged in serial order. This
happens when we are dealing with qualitative characteristics (attributes) such as honesty,
beauty, character, morality, etc., which cannot be measured quantitatively but can be
arranged serially. In such situations Karl Pearson’s coefficient of correlation cannot be used
which consists in obtaining the correlation coefficient between the ranks of N individuals in
Suppose we want to find if two characteristics A, say, intelligence and B, say, beauty are
related or not. Both the characteristics are incapable of quantitative measurements but we
can arrange a group of N individuals in order of merit (ranks) w.r.t. proficiency in the two
characteristics. Let the random variables X and Y denote the ranks of the individuals in the
individuals get the same rank in a characteristic then, obviously, X and Y assume numerical
The Pearsonian correlation coefficient between the ranks X and Y is called the rank
correlation coefficient between the characteristics A and B for the group of individuals.
Spearman’s rank correlation coefficient, usually denoted by ρ(Rho) is given by the equation
6 d 2
                       ρ =1                                         …………(4.8)
                                N (N 2 
                                1)
                                                      117
Caritas Business College                          BUSINESS STATISTICS – OAD 221
Where d is the difference between the pair of ranks of the same individual in the two
        Example 4-6
        Ten entries are submitted for a competition. Three judges study each entry and list the ten
        Entry:                    A         B      C        D          E       F        G        H      I       J
        Judge J1:                9          3      7        5          1       6        2        4      10      8
        Judge J2:                9          1      10       4          3       8        5        2      7       6
        Judge J3:                6          3      8        7          2       4        1        5      9       10
Calculate the appropriate rank correlation to help you answer the following questions:
           Entry                 Rank by
                                                                           Difference in Ranks
                                 Judges
                            J1        J2    J3   d(J1&J2)          2         d(J1&J3)        2       d(J2&J3)         2
                                                                  d                         d                        d
             A               9        9     6       0             0            +3           9          +3            9
             B               3        1     3      +2             4             0           0          -2            4
             C               7        10    8      -3             9            -1           1          +2            4
             D               5        4     7      +1             1            -2           4          -3            9
             E               1        3     2      -2             4            -1           1          +1            1
             F               6        8     4      -2             4            +2           4          +4            16
             G               2        5     1      -3             9            +1           1          +4            16
             H               4        2     5      +2             4            -1           1          -3            9
                 I          10        7     9      +3             9            +1           1          -2            4
             J               8        6    10      +2             4            -2           4          -4            16
                                                  6 d 2
                             (J1 & J2) = 1
                                                N (N 2  1)
                                                                       118
Caritas Business College             BUSINESS STATISTICS – OAD 221
=1  6 x 48
                                     10(102
                                      1)
                                    288
                              =1  990
                              =1 – 0.29
                              = +0.71
                                     6 d 2
                
                 (J1 & J3)   =1
                                     N (N 2 
                                        1)
                              =1 
                                     6 x 26
                                     10(102
                                      1)
                                   156
                              =1  990
                              =1 – 0.1575
                              = +0.8425
6 d 2
               
                (J2 & J3)    =1
                                     N (N 2 
                                        1)
                              =1 
                                     6 x 88
                                     10(102
                                      1)
                                   528
                              =1  990
                              =1 – 0.53
                              = +0.47
Spearman’s rank correlation Eq.(4.8) can also be used even if we are dealing with variables,
which are measured quantitatively, i.e. when the actual data but not the ranks relating to two
variables are given. In such a case we shall have to convert the data into ranks. The
                                                     119
Caritas Business College            BUSINESS STATISTICS – OAD 221
highest (or the smallest) observation is given the rank 1. The next highest (or the next
lowest) observation is given rank 2 and so on. It is immaterial in which way (descending or
ascending) the ranks are assigned. However, the same approach should be followed for all
                                                   120
Caritas Business College                 BUSINESS STATISTICS – OAD 221
        Example 4-7
        Calculate the rank coefficient of correlation from the following data:
               X:      75        88        95      70         60      80       81      50
               Y:      120       134       150     115        110     140      142     100
Solution:
                             75             5            120           5                 0          0
                             88             2            134           4                 -2         4
                             95             1            150           1                 0          0
                             70             6            115           6                 0          0
                             60             7            110           7                 0          0
                             80             4            140           3                +1          1
                             81             3            142           2                +1          1
                             50             8            100           8                0           0
                                                                                                  d2 = 6
                               6 d 2
                      =1
                             N (N 2  1)
                                 6x6
                       =1
                       
                             8(82 1)
                       =1        36
                       
                             504
                       = 1 – 0.07
                       = + 0.93
        Hence, there is a high degree of positive correlation between X and Y
Repeated Ranks
In case of attributes if there is a tie i.e., if any two or more individuals are placed together in
any classification w.r.t. an attribute or if in case of variable data there is more than one item
with the same value in either or both the series then Spearman’s Eq.(4.8) for calculating the
rank correlation coefficient breaks down, since in this case the variables X [the ranks
of
                                                              121
Caritas Business College                  BUSINESS STATISTICS – OAD 221
In this case common ranks are assigned to the repeated items. These common ranks are
the arithmetic mean of the ranks, which these items would have got if they were different
from each other and the next item will get the rank next to the rank used in computing the
common rank. For example, suppose an item is repeated at rank 4. Then the common rank
to be assigned to each item is (4+5)/2, i.e., 4.5 which is the average of 4 and 5, the ranks
which these observations would have assumed if they were different. The next item will be
assigned the rank 6. If an item is repeated thrice at rank 7, then the common rank to be
assigned to each value will be (7+8+9)/3, i.e., 8 which is the arithmetic mean of 7,8 and 9
viz., the ranks these observations would have got if they were different from each other. The
If only a small proportion of the ranks are tied, this technique may be applied together with
                      m(m 2
                       1)                              …………(4.8a)
12
                 2
        to  d       ; where m is the number of times an item is repeated. This correction factor is to be
        Example 4-8
        For a certain joint stock company, the prices of preference shares (X) and debentures (Y)
        are given below:
                 X:         73.2   85.8   78.9   75.8   77.2    81.2   83.8
                 Y:         97.8   99.2   98.8   98.3   98.3    96.7   97.1
                                                        122
Caritas Business College                   BUSINESS STATISTICS – OAD 221
        Use the method of rank correlation to determine the relationship between preference
        prices and debentures prices.
        Solution:
      Calculations for Coefficient of Rank Correlation
                                                 {Using Eq. (4.8) and (4.8a)}
In this case, due to repeated values of Y, we have to apply ranking as average of 2 ranks,
which could have been allotted, if they were different values. Thus ranks 3 and 4 have been
allotted as 3.5 to both the values of Y = 98.3. Now we also have to apply correction
factor
          m(m 2
                              2
           1)       to  d       , where m in the number of times the value is repeated, here m = 2.
            12
                                           2     mm 2  1
                                   6 d                  
                                                2
                                                          
                        =
                                       N (N 2  1)
                                           24  1
                                   6 48.5 
                                            12 
                                    
                     =
                                      7(72  1)
                                        6 x 49
                         =         1-
                                        7 x 48
                         =         0.125
        Hence, there is a very low degree of positive correlation, probably no correlation,
                                                             123
Caritas Business College           BUSINESS STATISTICS – OAD 221
                                                  124
Caritas Business College            BUSINESS STATISTICS – OAD 221
correlation coefficient, r, between the ranks, it can be interpreted in the same way
3. Karl Pearson’s correlation coefficient assumes that the parent population from
made about the from of the population from which sample observations are
drawn.
Pearson’s formula. The values obtained by the two formulae, viz Pearsonian r
and Spearman’s  are generally different. The difference arises due to the fact
that when ranking is used instead of full set of observations, there is always some
loss of information. Unless many ties exist, the coefficient of rank correlation
measured quantitatively but can be arranged serially. It can also be used where
6. Spearman’s formula has its limitations also. It is not practicable in the case of
bivariate frequency distribution. For N >30, this formula should not be used
                                                    125
Caritas Business College                   BUSINESS STATISTICS – OAD 221
This is a casual method of determining the correlation between two series when we are not
very serious about its precision. This is based on the signs of the deviations (i.e. the
direction of the change) of the values of the variable from its preceding value and does not
take into account the exact magnitude of the values of the variables. Thus we put a plus (+)
sign, minus (-) sign or equality (=) sign for the deviation if the value of the variable is greater
than, less than or equal to the preceding value respectively. The deviations in the values of
two variables are said to be concurrent if they have the same sign (either both deviations are
positive or both are negative or both are equal). The formula used for computing correlation
                 rc          2c  N 
                                                                    …………(4.9)
                               N 
Where c is the number of pairs of concurrent deviations and N is the number of pairs of
deviations. If (2c-N) is positive, we take positive sign in and outside the square root in Eq.
(4.9) and if (2c-N) is negative, we take negative sign in and outside the square root in Eq.
(4.9).
Remarks: (i) It should be clearly noted that here N is not the number of pairs of
observations but it is the number of pairs of deviations and as such it is one less than the
          “If the short time fluctuations of the time series are positively correlated or in other words,
        if their deviations are concurrent, their curves would move in the same direction and would
        indicate positive correlation between them”
        Example 4-9
                                                        126
Caritas Business College                  BUSINESS STATISTICS – OAD 221
        Supply:         112         125    126   118     118      121       125    125      131       135
        Price:          106         102    102   104     98       96        97     97       95        90
        Solution:
      Calculations for Coefficient of Concurrent Deviations
                                                 {Using Eq. (4.9)}
        We have
                 Number of pairs of deviations, N =10 – 1 = 9
                 c = Number of concurrent deviations
                  = Number of deviations having like signs
                  =2
        Coefficient of correlation by the method of concurrent deviations is given by:
                        rc 
                               2c  N 
                                       
                                N 
                               2x2  9 
                        rc            
                                9 
rc   0.5556
Since 2c – N = -5 (negative), we take negative sign inside and outside the square root
                                                         127
Caritas Business College             BUSINESS STATISTICS – OAD 221
                              0.5556
                       rc  
                       rc  
                            0.5556
                       rc  0.7
Hence there is a fairly good degree of negative correlation between supply and price.
As mentioned earlier, correlation analysis is a statistical tool, which should be properly used
reasonably sure that one variable is the cause while the other is the effect. Let us
take an example. .
and their earnings after, say, three years of their graduation. We may find that these
two variables are highly and positively related. At the same time, we must not forget
that both the variables might have been influenced by some other factors such as
interviewing process and so forth. If the data on these factors are available, then it is
that correlation explains 70 percent of the total variation in Y. The error can be seen
                                                     128
Caritas Business College              BUSINESS STATISTICS – OAD 221
determination r2 will be 0.49. This means that only 49 percent of the total variation in
Y is explained.
indicate causal relationship, that is, the percentage of the change in one variable is
3. Another mistake in the interpretation of the coefficient of correlation occurs when one
concludes a positive or negative relationship even though the two variables are
actually unrelated. For example, the age of students and their score in the
examination have no relation with each other. The two variables may show similar
movements but there does not seem to be a common link between them.
To sum up, one has to be extremely careful while interpreting coefficient of correlation. Be-
fore one concludes a causal relationship, one has to consider other relevant factors that
might have any influence on the dependent variable or on both the variables. Such an
approach will avoid many of the pitfalls in the interpretation of the coefficient of correlation. It
has been rightly said that the coefficient of correlation is not only one of the most
widely used, but also one of the widely abused statistical measures.
2. Explain the meaning and significance of the concept of correlation. Does correlation
always signify casual relationships between two variables? Explain with illustration
(a) Over a period of time there has been an increased financial aid to under
developed countries and also an increase in comedy act television shows. The
                                                        129
Caritas Business College              BUSINESS STATISTICS – OAD 221
(b) The correlation between salaries of school teachers and amount of liquor
4. What is a scatter diagram? How does it help in studying correlation between two
6. Draw a scatter diagram from the data given below and interpret it.
               X:     10      20       30     40     50         60    70   80
               Y:     32      20       24     36     40         28    38   44
advertising (X) and sales (Y) from the data given below:
               X:     39      65      62      90     82         75    25   98      36    78
               Y:     47      53      58      86     62         68    60   91      51    84
week (X) and the number of items purchased (Y) in that week.
               X:      5      10       4      0      2          7     3    6
               Y:      10     12       5      2      1          3     4    8
                                                      130
Caritas Business College                BUSINESS STATISTICS – OAD 221
9. Calculate coefficient of correlation between X and Y series from the following data
               X:       78        89     96    69       59    79     68     61
               Y:       125       137    156   112      107   136    123    108
10. In two set of variables X and Y, with 50 observations each, the following data
are observed:
                X         =       10,    SD of X = 3
                                                              rxy  0.3
                Y         =       6,     SD of Y = 2
               However, on subsequent verification, it was found that one value of X (=10) and one
value of Y (= 6) were inaccurate and hence weeded out with the remaining 49 pairs
of
11. Calculate coefficient of correlation r between the marks in statistics (X) and
               X:       52        74    93     55       41    23     92     64     40   71
               Y:       45        80    63     60       35    40     70     58     43   64
12. The coefficient of correlation between two variables X and Y is 0.48. The covariance
13. Twelve entries in painting competition were ranked by two judges as shown below:
               Entry:         A   B      C     D        E     F      G      H      I    J
               Judge I:       5   2      3     4        1     6      8      7      10   9
               Judge II: 4        5      2     1        6     7      10     9      3    8
14. Calculate Spearman’s rank correlation coefficient between advertisement cost (X)
                                                        131
Caritas Business College                       BUSINESS STATISTICS – OAD 221
                       X:       39    65        62    90       82     75      25     98      36     78
                       Y:       47    53        58    86       62     68      60     91      51     84
15. An examination of eight applicants for a clerical post was taken by a firm. From the
marks obtained by the applicants in the Accountancy (X) and Statistics (Y)
paper,
                       Applicant:      A        B     C        D       E       F     G       H
                       X:             15       20     28       12     40      60     20      80
                       Y:             40       30     50       30     20      10     30      60
16. Calculate the coefficient of concurrent deviation from the following data:
17. Obtain a suitable measure of correlation from the following data regarding changes in
                       Month:         A         M     J        J      A       S      O       N      D
                       A:             +4        +3    +2       -1     -3      +4     -5      +1     +2
                       B:             -2        +5    +3       -2     -1      -3     +4      -1     -3
18. The cross-classification table shows the marks obtained by 105 students in the
Marks in Statistics
                       70-79              16              9            20             -              45
                       80-89              -               -            -              -              -
                       Total          20              25              40             20             105
                                                               132
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                                   133
Caritas Business College             BUSINESS STATISTICS – OAD 221
        Objectives:    The overall objective of this lesson is to give you an understanding of linear
                       regression, there by enabling you to understand the importance and also
                       the limitations of regression analysis.
        Structure
        5.1    Introduction
        5.2    What is Regression?
        5.3    Linear Regression
               5.3.1   Regression Line of Y on X
                       5.3.1.1 Scatter Diagram
                       5.3.1.2 Fitting a Straight Line
                       5.3.1.3 Predicting an Estimate and its Preciseness
                       5.3.1.4 Error of Estimate
               5.3.2   Regression Line of X on Y
        5.4    Properties of Regression Coefficients
        5.5    Regression Lines and Coefficient of Correlation
        5.6    Coefficient of Determination
        5.7    Correlation Analysis Versus Regression Analysis
        5.8    Solved Problems
        5.9    Self-Assessment Questions
        5.10   Suggested Readings
                                                         134
Caritas Business College             BUSINESS STATISTICS – OAD 221
 5.1    INTRODUCTION
        In business, several times it becomes necessary to have some forecast so that the
order to make a forecast, one has to ascertain some relationship between two or more
how far the demand for television sets will increase in the next five years, keeping in mind
the growth of population in a certain town. Here, it clearly assumes that the increase in
population will lead to an increased demand for television sets. Thus, to determine the
nature and extent of relationship between these two variables becomes important for the
company.
In the preceding lesson, we studied in some depth linear correlation between two variables.
Here we have a similar concern, the association between variables, except that we develop
it further in two respects. First, we learn how to build statistical models of relationships
between the variables to have a better understanding of their features. Second, we extend
For this purpose, we have to use the technique - regression analysis - which forms the
In 1889, Sir Francis Galton, a cousin of Charles Darwin published a paper on heredity,
“Natural Inheritance”. He reported his discovery that sizes of seeds of sweet pea plants
reported results of a study of the relationship between heights of fathers and heights of their
sons. A straight line was fit to the data pairs: height of father versus height of son. Here, too,
he found a “regression to mediocrity” The heights of the sons represented a movement away
from their
                                                       135
Caritas Business College            BUSINESS STATISTICS – OAD 221
fathers, towards the average height. We credit Sir Galton with the idea of statistical
regression.
While most applications of regression analysis may have little to do with the “regression to
the mean” discovered by Galton, the term “regression” remains. It now refers to the
general sense, regression analysis means the estimation or prediction of the unknown value
of one variable from the known value(s) of the other variable(s). It is one of the most
important and widely used statistical techniques in almost all sciences - natural, social or
physical.
In this lesson we will focus only on simple regression –linear regression involving only two
Simple regression involves only two variables; one variable is predicted by another variable.
The variable to be predicted is called the dependent variable. The predictor is called the
independent variable, or explanatory variable. For example, when we are trying to predict
the demand for television sets on the basis of population growth, we are using the demand
for television sets as the dependent variable and the population growth as the independent
or predictor variable.
The decision, as to which variable is which sometimes, causes problems. Often the choice is
obvious, as in case of demand for television sets and population growth because it would
make no sense to suggest that population growth could be dependent on TV demand! The
population growth has to be the independent variable and the TV demand the dependent
variable.
                                                    136
Caritas Business College              BUSINESS STATISTICS – OAD 221
 if we have control over one of the variables then that is the independent. For
example, a manufacturer can decide how much to spend on advertising and expect
 it there is any lapse of time between the two variables being measured, then the
latter must depend upon the former, it cannot be the other way round
 if we want to predict the values of one variable from your knowledge of the other
The task of bringing out linear relationship consists of developing methods of fitting a
straight line, or a regression line as is often called, to the data on two variables.
The line of Regression is the graphical or relationship representation of the best estimate of
one variable for any given value of the other variable. The nomenclature of the line depends
on the independent and dependent variables. If X and Y are two variables of which
relationship is to be indicated, a line that gives best estimate of Y for any value of X, it is
For purposes of illustration as to how a straight line relationship is obtained, consider the
sample paired data on sales of each of the N = 5 months of a year and the marketing
Table 5-1
                                                       137
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                            Y                                X
                 April                      14                               10
                 May                        17                               12
                 June                       23                               15
                  July                      21                               20
                August                      25                               23
Let Y, the sales, be the dependent variable and X, the marketing expenditure, the
independent variable. We note that for each value of independent variable X, there is a
specific value of the dependent variable Y, so that each value of X and Y can be seen as
paired observations.
relationship between the two variables is linear, that is, the one which is best explained by a
straight line. A good way of doing this is to plot the data on X and Y on a graph so as to
yield a scatter diagram, as may be seen in Figure 5-1. A careful reading of the scatter
 the overall tendency of the points is to move upward, so the relationship is positive
 the general course of movement of the various points on the diagram can be
            there is a high degree of correlation between the variables, as the points are very
               close to each other
                                                      1
Caritas Business College                  BUSINESS STATISTICS – OAD 221
If the movement of various points on the scatter diagram is best described by a straight line,
the next step is to fit a straight line on the scatter diagram. It has to be so fitted that on the
whole it lies as close as possible to every point on the scatter diagram. The
necessary requirement for meeting this condition being that the sum of the squares of the
vertical deviations of the observed Y values from the straight line is minimum.
As shown in Figure 5-1, if dl, d2,..., dN are the vertical deviations' of observed Y values from
                d 2
                     d 2 ..................... d
                2                                      Nd 2
                  1     2                                    j
                        N                               j 1
is the minimum. The deviations dj have to be squared to avoid negative deviations canceling
out the positive deviations. Since a straight line so fitted best approximates all the points on
the scatter diagram, it is better known as the best approximating line or the line of best fit. A
Free hand drawing is the simplest method of fitting a straight line. After a careful inspection
of the movement and spread of various points on the scatter diagram, a straight line is
drawn through these points by using a transparent ruler such that on the
                                                                 135
Caritas Business College                 BUSINESS STATISTICS – OAD 221
whole it is closest to every point. A straight line so drawn is particularly useful when future
Whereas the use of free hand drawing may yield a line nearest to the line of best fit, the
major drawback is that the slope of the line so drawn varies from person to person because
of the influence of subjectivity. Consequently, the values of the dependent variable estimated
on the basis of such a line may not be as accurate and precise as those based on the line of
best fit.
The least square method of fitting a line of best fit requires minimizing the sum of the
squares of vertical deviations of each observed Y value from the fitted line. These
deviations, such as d1 and d3, are shown in Figure 5-1 and are given by Y - Yc, where Y is
the observed value and Yc the corresponding computed value given by the fitted line
                           Yc  a                                       …………(5.1)
                           bX i
The straight line relationship in Eq.(5.1), is stated in terms of two constants a and b
 The constant a is the Y-intercept; it indicates the height on the vertical axis
from where the straight line originates, representing the value of Y when X is zero.
 Constant b is a measure of the slope of the straight line; it shows the absolute
Since a straight line is completely defined by its intercept a and slope b, the task of fitting
the same reduces only to the computation of the values of these two constants. Once these
two values are known, the computed Yc values against each value of X can be easily
                                                          136
Caritas Business College                  BUSINESS STATISTICS – OAD 221
In the method of least squares the values of a and b are obtained by solving simultaneously
Y  aN  b X …………(5.2)
                 XY  a X  b X
                                              2                      …………(5.2)
observations and then can be substituted in the above equations to obtain the value of a and
b. Since simultaneous solving the two normal equations for a and b may quite often be
cumbersome and time consuming, the two values can be directly obtained as
                                                                     …………(5.3)
               a= Y b
               X
        and
                    N  XY   X Y
                b  N X 2   X 2                                   …………(5.4)
                             
        Note: Eq. (5.3) is obtained simply by dividing both sides of the first of Eqs. (5.2) by N and
        Eq.(5.4) is obtained by substituting (Y  b X ) in place of a in the second of Eqs. (5.2)
                a                                                   …………(5.5)
                                     2
                          N  X   X
                    2
                
and
               b=                                                    …………(5.6)
                          Y
                           aX
        Note: Eq. (5.5) is obtained by substituting N  XY   X Y 2        for b in Eq. (5.3) and Eq.
                                                     N  X 2   X 
      Table 5-2
      Computation of a and b
Y X XY X2 Y2
                                                      137
Caritas Business College                   BUSINESS STATISTICS – OAD 221
                            100x1398  80x1684
                     a                   2
                             5x1398  80
                            139800 134720
                             6990  6400
                           5080
                         = 590
                         = 8.6101695
        and
                         5x1684 
                      b  80x100 5x1398
                                 2
                           80
                           8420  8000
                          6990  6400
                           420
                         = 590
= 0.7118644
Yc = 8.61 + 0.71X..................................................................(5.1a)
                                                                138
Caritas Business College                        BUSINESS STATISTICS – OAD 221
Then, to fit the line of best fit on the scatter diagram, only two computed Yc values are
needed. These can be easily obtained by substituting any two values of X in Eq. (5.1a).
When these are plotted on the diagram against their corresponding values of X, we get
two points, by joining which (by means of a straight line) gives us the required line of best
We can have some important relationships for data analysis, involving other measures such as
Yc = ( Y  b X ) +bX
or Yc - Y = b(X- X )........................................................(5.7)
 XY   X  Y 
                                   N    N  N 
                                            2 
                         b               X 
                                   X
                                      2
                                                   
                                        N         N 
                                XY
                                           XY
                                   N
               or        b
                                         Sx 2
                                                                                    …………(5.8)
                               Cov( X ,Y )
               or        b
                                       Sx 2
                        Cov( X , Y )
                r xy
                           Sx Sy
                                                                 139
Caritas Business College                  BUSINESS STATISTICS – OAD 221
or Cov( X , Y )  rxy S x S y
                b  r Sx S y
                   xy 2
                       Sx
                     S
                br                                                                   …………(5.9)
                y
                     xy
                          Sx
                                   (X- X )...........................................................(5.10)
                Y-Y = r Sy
                 c     xy
                          Sx
The main objective of regression analysis is to know the nature of relationship between two
variables and to use it for predicting the most likely value of the dependent variable
corresponding to a given, known value of the independent variable. This can be done by
substituting in Eq.(5.1a) any known value of X corresponding to which the most likely
estimate of Y is to be found.
Yc = 8.61 + 0.71(15)
= 8.61 + 10.65
= 19.26
It may be appreciated that an estimate of Y derived from a regression equation will not be
exactly the same as the Y value which may actually be observed. The difference between
estimated Yc values and the corresponding observed Y values will depend on the extent of
                                                                 140
Caritas Business College                  BUSINESS STATISTICS – OAD 221
The closer the various paired sample points (Y, X) clustered around the line of best fit, the
smaller the difference between the estimated Yc and observed Y values, and vice-versa. On
the whole, the lesser the scatter of the various points around, and the lesser the vertical
distance by which these deviate from the line of best fit, the more likely it is that an estimated
The estimated Yc values will coincide the observed Y values only when all the points on the
scatter diagram fall in a straight line. If this were to be so, the sales for a given marketing
expenditure could have been estimated with l00 percent accuracy. But such a situation is too
rare to obtain. Since some of the points must lie above and some below the straight line,
perfect prediction is practically non-existent in the case of most business and economic
situations.
This means that the estimated values of one variable based on the known values of the
other variable are always bound to differ. The smaller the difference, the greater the
precision of the estimate, and vice-versa. Accordingly, the preciseness of an estimate can
be obtained only through a measure of the magnitude of error in the estimates, called the
error of estimate.
              Syx       Y  Y c
                                      2
                                                              …………(5.11)
                              N
Syx measures the average absolute amount by which observed Y values depart from the
large. In such cases Syx may be computed directly by using the equation:
                                                       141
Caritas Business College                BUSINESS STATISTICS – OAD 221
values of a and b
        We
        have
                                   2080  8.61x100  0.71x1684
                         Syx   =
                                                5
                                   2080  861 1195.64
                               =
                                            5
                                   23.36
                               =
                                     5
                               = 4.67
                               = 2.16
Interpretations of Syx
A careful observation of how the standard error of estimate is computed reveals the
following:
1. Syx is a concept statistically parallel to the standard deviation Sy . The only difference
between the two being that the standard deviation measures the dispersion around
the mean; the standard error of estimate measures the dispersion around the
regression line. Similar to the property of arithmetic mean, the sum of the deviations
That is
2. Syx tells us the amount by which the estimated Yc values will, on an average, deviate
from the observed Y values. Hence it is an estimate of the average amount of error in
the estimated Yc values. The actual error (the residual of Y and Yc) may, however, be
smaller or larger than the average error. Theoretically, these errors follow a normal
distribution. Thus, assuming that n ≥ 30, Yc ± 1.Syx means that 68.27% of the
estimates
                                                         142
Caritas Business College              BUSINESS STATISTICS – OAD 221
based on the regression equation will be within 1.Syx Similarly, Yc ± 2.Syx means that
thousand being Rs 19.26 lac, one may like to know how good this estimate is. Since
Syx is estimated to be Rs 2.16 lac, it means there are about 68 chances (68.27) out of
100 that this estimate is in error by not more than Rs 2.16 lac above or below
Rs
19.26 lac. That is, there are 68% chances that actual sales would fall between (19.26
3. Since Syx measures the closeness of the observed Y values and the estimated Yc
values, it also serves as a measure of the reliability of the estimate. Greater the
closeness between the observed and estimated values of Y, the lesser the error
4. Standard error of estimate Syx can also be seen as a measure of correlation insofar
regression line. The closer the observed Y values scattered around the regression
same units of measurement as the data on the dependent variable. This creates
correlation. It is mainly due to this limitation that the standard error of estimate is not
                                                       143
Caritas Business College                     BUSINESS STATISTICS – OAD 221
So far we have considered the regression of Y on X, in the sense that Y was in the role of
dependent and X in the role of an independent variable. In their reverse position, such that
X is now the dependent and Y the independent variable, we fit a line of regression of X
Xc = a’ + b’Y...........................................................................(5.13)
Where Xc denotes the computed values of X against the corresponding values of Y. a’ is the
                                                                                         …………(5.14)
                 X  a' N  b'Y
                 XY  a'Y  b'Y
                                   2
…………(5.14)
a’ = X - b’ Y...........................................................................(5.15)
        and
                     N  XY   X Y
                b'  N Y 2   Y 2                                                      …………(5.16)
                              
        or
                           2
                     X Y  Y  XY N
               a'                  2                                                    …………(5.17)
                        Y   Y 
                          2
and
                b'                                                                       …………(5.18)
                      X  a'
                         Y
                       Cov(Y , X
                b'                                                                      …………(5.19)
                )                 2
                              S
                              y
               b'  r
                          S
                                                                                         …………(5.20)
                          x
                       yx
                            Sy
Xc - X = b’ (Y- Y )................................................................(5.21)
                                    S
               Xc - X = r                   (Y - Y )........................................................(5.22)
                                    x
                                yx
                                S
                                        y
As before, once the values of a’ and b’ have been found, their substitution in Eq.(5.13) will
                          X  X 2
               Sxy =.......................................................................................(5.23)
                                c
                                N
               or
For example, if we want to estimate the marketing expenditure to achieve a sale target of
Xc = a’ + b’Y
So using Eqs. (5.17) and (5.16), and substituting the values of  X , Y 2 , Y and  XY
                       80x2080  100x1684
                a'                    2
                        5x2080  100
                 166400  168400
                 10400  10000
                     2000
               =     400
               =     -5.00
        and
                       5x1684  80x100
                b'                   2
                       5x2080  100
                   8420  8000
                 10400  10000
                                                                        145
Caritas Business College              BUSINESS STATISTICS – OAD 221
                  420
                = 400
= 1.05
Now given that a’= -5.00 and b’=1.05, Regression equation (5.13) takes the form
Xc = -5.00 +1.05Y
Xc = -5.00+1.05x40
= -5 + 42
= 37
marketing.
the effect on dependent variable if there is a unit change in the independent variable.
Since for a paired data on X and Y variables, there are two regression lines: regression line
The following are the important properties of regression coefficients that are helpful in data
analysis
1. The value of both the regression coefficients cannot be greater than 1. However,
value of both the coefficients can be below 1 or at least one of them must be
below 1, so that the square root of the product of two regression coefficients must
±1.
                                                      146
Caritas Business College                  BUSINESS STATISTICS – OAD 221
                             b. b'
                       r = ±............................................................................(5.25)
The signs of both the regression coefficients are the same, and so the value of r
3. The mean of both the regression coefficients is either equal to or greater than
                        b  b'
                          2 r
                              XA                                   YB
                        U=                   an           V 
                               h                                     k
                                             d
r2 = b.b’
                                                                    147
Caritas Business College                  BUSINESS STATISTICS – OAD 221
                      S
               b= r                                                   Sx
                                                 and         b’ =
               y                                                    r S
                                                                        y
                      Sx
               tan  = b 
                       b' 1 
                       bb'
                           S S
                          = x y r 2 1
                            S2 S2 r
                                      
                            x     y          
                                  
                                           r 2 1  
                             S
                            –1
                               Sx
                               2y    
               or  =          S  S 2  r                          …………(5.26)
               tan
                                 x y           
                                                         1
Caritas Business College               BUSINESS STATISTICS – OAD 221
correlation (r = -1),  = 0, so the two regression lines will coincide, i.e. we have only
The farther the two regression lines from each other, lesser will be the degree of
correlation and nearer the two regression lines, more will be the degree of
 If the variables are independent i.e. r = 0, the lines of regression will cut each other at
Note : Both the regression lines cut each other at mean value of X and mean value of Y i.e. at
X and Y .
accounted for by the independent variable. In other words, the coefficient of determination
gives the ratio of the explained variance to the total variance. The coefficient of
determination is given by the square of the correlation coefficient, i.e. r2. Thus,
Coefficient of determination
                 2 Explained Variance
                r = Total Variance
                r2 =  Y  Y                                          …………(5.27)
                            2
                     c   
                              2
                              
YY
                                                       149
Caritas Business College                       BUSINESS STATISTICS – OAD 221
                                  2
                         Y  Y 
                K2 =         2 c                                                             …………(5.28)
                          
                         YY
                K2 = 1- Explained Variance
                          Total Variance
                             ...............................................................................
                     = 1 - r2                                                                                (5.29)
The square root of the coefficient of non-determination, i.e. K gives the coefficient of
alienation
                                 1r2
                           K = ±...........................................................................(5.30)
A simple algebraic operation on Eq. (5.30) brings out some interesting points about the
        Y 
                      2                          and                                   2
        Yc
                      N                                               Y  Y            NS2
                     S2
                                  y                                                                y
                                                       2
                                       Y  Y 
                            K2                    c
                                                         2
                                      Y Y
                           K2 = N 2
                                  yx
                                      N S2
                                           y
                                      S2
                                   yx
                                    S
                                       2
                                       y
                                                                       150
Caritas Business College                BUSINESS STATISTICS – OAD 221
                               Syx2
        So        1 – r2   
                               Sy2
                      S yx
        or
                               1 r 2                          …………(5.31)
        =
                      Sy
                                                    151
Caritas Business College                      BUSINESS STATISTICS – OAD 221
r= r2
                                       Syx2
                         r2 = 1 
                                       Sy2
                                1  S yx
                         r .............................................................................(5.32)
                                    S y2
On carefully observing Eq. (5.32), it will be noticed that the ratio Syx/Sy will be large if the
coefficient of determination is small, and it will be small when the coefficient of determination
is large. Thus
Eq. (5.32) also implies that Syx is generally less than Sy. The two can at the most be equal,
Interpretations of r2:
1. Even though the coefficient of determination, whose under root measures the
dimensionless pure number, the unit in which Syx is measured becomes irrelevant.
This facilitates comparison between the two sets of data in terms of their coefficient
of determination r2 (or the coefficient of correlation r). This was not possible in
2. The value of r2 can range between 0 and 1. When r2 = 1, all the points on the scatter
diagram fall on the regression line and the entire variations are explained by the
straight line. On the other hand, when r2 = 0, none of the points on the scatter
diagram falls on the regression line, meaning thereby that there is no relationship
determination does
                                                                    152
Caritas Business College              BUSINESS STATISTICS – OAD 221
not tell us about the direction of the relationship (whether it is positive or negative)
3. When r2 = 0.7455 (or any other value), 74.55% of the total variations in sales are
explained by the marketing expenditure used. What remains is the coefficient of non-
unexplained, which are due to factors other than the changes in the marketing
expenditure.
4. r2 provides the necessary link between regression and correlation which are the two
between the variables under study, without making a distinction between the
dependent and independent ones. Nor does it, therefore, help in predicting the value
5. The coefficient of correlation overstates the degree of relationship and it’s meaning is
correlation between sales and marketing expenditure. Therefore, the coefficient of'
6. The sum of r and K never adds to one, unless one of the two is zero. That is, r + K
the relationship between the variables. If we have information on more than one variable, we
might be interested in seeing if there is any connection - any association - between them.
If
                                                       153
Caritas Business College              BUSINESS STATISTICS – OAD 221
we found such a association, we might again be interested in predicting the value of one
1. Correlation literally means the relationship between two or more variables that vary in
2. Correlation coefficient rxy between two variables X and Y is a measure of the direction
and degree of the linear relationship between two variables that is mutual. It is
symmetric, i.e., ryx = rxy and it is immaterial which of X and Y is dependent variable
Regression analysis aims at establishing the functional relationship between the two(
or more) variables under study and then using this relationship to predict or estimate
the value of the dependent variable for any given value of the independent
variable(s). It also reflects upon the nature of the variable, i.e., which is dependent
variable and which is independent variable. Regression coefficient are not symmetric
3. Correlation need not imply cause and effect relationship between the variable under
study. However, regression analysis clearly indicates the cause and effect
variable.
between ±1.
                                                     154
Caritas Business College             BUSINESS STATISTICS – OAD 221
On the other hand, the regression coefficients, byx and bxy are absolute measures
representing the change in the value of the variable Y (or X), for a unit change in the
value of the variable X (or Y). Once the functional form of regression curve is known;
by substituting the value of the independent variable we can obtain the value of the
dependent variable and this value will be in the units of measurement of the
dependent variable.
5. There may be non-sense correlation between two variables that is due to pure
chance and has no practical relevance, e.g., the correlation, between the size of
shoe and the intelligence of a group of individuals. There is no such thing like non-
sense regression.
of 5 years and the sale of motor tyres by a firm in that territory for the same period.
Solution: Here the dependent variable is number of tyres; dependent on motor registrations.
Hence we put motor registrations as X and sales of tyres as Y and we have to establish the
regression line of Y on X.
                                                      155
Caritas Business College                  BUSINESS STATISTICS – OAD 221
              X                   Y                                           2               dx dy
                                                                            dx
                                             dx = X- X   dy = Y- Y
              600                1,250          -100        -50            10,000            5,000
              630                1,100          -70          -200          4,900             14,000
              720                1,300          20             0            400                0
              750                1,350          50             50          2,500             2,500
              800                1,500         100           200        10,000               20,000
                                                                         2
          X = 3,500         Y = 6,500      dx =0      dy=0        d x = 27,800        dx d y =
                                                                                            41,500
                       X
                  X=         =    3,50 =700                                         6,500
                                                         and           Y     Y             = 1,300
                       N           0 =                                        N =     5
                                    5
Y- Y = byx (X- X )
Y = 1.4928 X + 255.04
When X = 850, the value of Y can be calculated from the above equation, by putting X =
= 1523.92
= 1,524 Tyres
        Example 5-2
        A panel of Judges A and B graded seven debators and independently awarded the
following marks:
                                                         156
Caritas Business College       BUSINESS STATISTICS – OAD 221
2 34 39
                                           157
Caritas Business College                     BUSINESS STATISTICS – OAD 221
                           3                        28                           26
                           4                        30                           30
                           5                        44                           38
                           6                        38                           34
                           7                        31                           28
An eighth debator was awarded 36 marks by judge A, while Judge B was not present.
If Judge B were also present, how many marks would you expect him to award to the eighth
debator, assuming that the same degree of relationship exists in their judgement?
Solution: Let us use marks from Judge A as X and those from Judge B as Y. Now we have
          N=7                                                   U
                                                                     2
                                                                          206        V
                                                                                           2
                                                                                                185          UV           121
                                   U  0         V  17
                  U              0                                              V                    17
        X= A            = 35 +        = 35         and              Y= A             = 30 +               = 32.43
                   N               7                                              N                    7
N UV  U V 
                byx = bvu =                           2
                                  N  U 2   U 
                                7x121 - 0x17
                            =    7x206 - 0  0.587
Y- Y = byx (X- X )
                                                              158
Caritas Business College                BUSINESS STATISTICS – OAD 221
or Y = 0.587X + 11.87
Y = 0.587 x 36 + 11.87
= 33
Thus if Judge B were present, he would have awarded 33 marks to the eighth debator.
        Example 5-3
        For some bivariate data, the following results were obtained.
X = 53.2 Y = 27.9
X, Y- Y = byx (X- X )
or Y = -1.5X + 107.7
Y = -1.5 x 60 + 107.7
= 17.7
r2 = byx bxy
                                                         159
Caritas Business College                   BUSINESS STATISTICS – OAD 221
= (-1.5) x (–0.2)
= 0.3
So r = ± 0.3 = ± 0.5477
Since both the regression coefficients are negative, we assign negative value to the
correlation coefficient
r = - 0.5477
        Example 5-4
        Write regression equations of X on Y and of Y on X for the following data
        X:         45        48    50          55        65         70     75           72      80       85
        Y:         25        30    35          30        40         50     45           55      60       65
Solution: We prepare the table for working out the values for the regression lines.
             X               Y         U = X-65          V = Y-45                  2             UV                   2
                                                                               U                                     V
             45             25           -20                  -20              400               400                400
             48             30           -17                  -15              289               255                225
             50             35           -15                  -10              225               150                100
             55             30           -10                  -15              100               150                225
             65             40            0                   -5                0                    0              25
             70             50            5                   5                 25               25                 25
             75             45            10                  0                100                   0               0
             72             55            7                   5                 49               35                 25
             80             60            15                  15               225               225                225
             85             65            20                  20               400              400                 400
We have,
                  X        645
        X=                                                                     435
                        =       = 64.5         and            Y     Y                 = 43.5
                  N         = 10                                     N =       10
                            N UV  U V 
                   byx =                        2
                             N  U 2   U 
                                                                    160
Caritas Business College                   BUSINESS STATISTICS – OAD 221
Regression equation of Y on X is
                            Y- Y               =    by (X- X )
                                                    x
or Y = 0.787X + 7.26
N UV  U V 
             bxy =                         2
                        N  V 2   V 
                             X-X               =    bx (Y- Y )
                                                    y
or X = 0.87Y + 26.65
        Example 5-5
        The lines of regression of a bivariate population
are 8X – 10Y + 66 = 0
8X – 10Y + 66 = 0
Since both the lines of regression pass through the mean values, the point ( X , Y ) will
as 8 X - 10 Y + 66 = 0
40 X - 18 Y - 214 = 0
X = 13 and Y = 17
(ii) For correlation coefficient between X and Y, we have to calculate the values of byx and
bxy
10Y = 8X + 66
                 2
                r = byx . bxy
So r = + 9 / 25
= + 0.6
Both the values of the regression coefficients being positive, we have to consider only
                                                      162
Caritas Business College                  BUSINESS STATISTICS – OAD 221
Sx = ± 3
We consider Sx = 3 as SD is always
Sy = 4/5 x 3/0.6
= 4
        Example 5-6
        The height of a child increases at a rate given in the table below. Fit the straight line using
the method of least-square and calculate the average increase and the standard error of
estimate.
        Month:             1        2       3          4    5            6       7      8          9    10
        Height:        52.5        58.7    65      70.2    75.4   81.1       87.2      95.5      102.2 108.4
                                                                     2
                       X =55              Y =796.2            X        385        XY  4887.5
Considering the regression line as Y = a + bX, we can obtain the values of a and b from
                                                           163
Caritas Business College                     BUSINESS STATISTICS – OAD 221
                                                        a
                                         2
                               Y  X    X  XY
                                                    2
                                   N  X 2   X 
= 45.73
                                N  XY   X Y N
                        b                    2
                                  X   X 
                                     2
                                10 x 4887.5 - 55 x 796.2
                           =       10 x 385 - 55 x 55
= 6.16
Y = 45.73 + 6.16X
For standard error of estimation, we note the calculated values of the variable against
                                                            164
Caritas Business College                        BUSINESS STATISTICS – OAD 221
                                                                                (Y        2
                                                                                       Yi )    10.421
                                            1
                          S yx      =
                                            N     (Y  Y )
                                                        i
                                                            2
                                            10.421
                                    =
                                              10
                                    = 1.02
        Example 5-7
        Given X = 4Y+5 and Y = kX + 4 are the lines of regression of X on Y and of Y on X
If k = 1/16, find the means of the two variables and coefficient of correlation between them.
So bxy = 4
We get byx = k
Now
                           2
                          r = bxy. byx
= 4k
Since 0  r 2  1, we obtain 0  4k  1,
                                     1
        Or                0k
                                    ,
                                    4
                      1
        Now for k =    ,
                      16
                                        1        1
                          r 2  4x           
                                     16          4
r=+½
                                                                165
Caritas Business College                   BUSINESS STATISTICS – OAD 221
        When k = 1
                           , the regression line of Y on X becomes
                      16
                                 1
                            Y=     X+4
                                 16
Or X – 16Y + 64 = 0
Since line of regression pass through the mean values of the variables, we obtain revised
equations as
X -4Y-5=0
X - 16 Y + 64 = 0
X = 28 and Y = 5.75
        Example 5-8
        A firm knows from its past experience that its monthly average expenses (X) on
advertisement are Rs 25,000 with standard deviation of Rs 25.25. Similarly, its average
monthly product sales (Y) have been Rs 45,000 with standard deviation of Rs 50.50. Given
this information and also the coefficient of correlation between sales and advertisement
Rs 50,000
(ii) the most appropriate advertisement expenditure for achieving a sales target
of Rs 80,000
X = Rs 25,000 Sx = Rs 25.25
Y = Rs 45,000 Sy = Rs 50.50
r = 0.75
                                                         166
Caritas Business College              BUSINESS STATISTICS – OAD 221
                                               S
        (i)    Using equation                  y    (X- X ), the most appropriate value of sales     for an
               Yc                    -Y = r         Yc
                                               S
                                               x
                     Yc – 45,000 =      50.50
                     0.75                       (50,000 – 25,000)
                                        25.25
Yc = 45,000 + 37,500
= Rs 82,500
                                                   Sx
        (ii)   Using equation Xc - X     =r             (Y - Y ), the most appropriate value of advertisement
                                                   Sy
               Xc – 25,000 =          25.25
               0.75                           (80,000 – 45,000)
                                      50.50
Xc = 13,125 + 25,000
= Rs 38,125
regression for each bivariate distribution? How the two regression lines are useful in
                                                     168
Caritas Business College                BUSINESS STATISTICS – OAD 221
statistical analysis.
                X         :      1        3       4        8       9      11     14
                Y         :      1        2       4        5       7      8      9
Hence obtain
d) X and Y
8. What are regression coefficients? Show that r2 = byx. bxy where the symbols have their
usual meanings. What can you say about the angle between the regression lines
when
9. Obtain the equations of the lines of regression of Y on X from the following data.
                X         :      12       18      24       30      36     42     48
                Y         :      5.27     5.68    6.25     7.21    8.02   8.71   8.42
10. The following table gives the ages and blood pressure of 9 women.
Age (X) : 56 42 36 47 49 42 60 72 63
Blood Pressure(Y) 147 125 118 128 145 140 155 160 149
                                                           169
Caritas Business College              BUSINESS STATISTICS – OAD 221
11. Given the following results for the height (X) and weight (Y) in appropriate units of
1,000 students:
Obtain the equations of the two lines of regression. Estimate the height of a student A
who weighs 200 units and also estimate the weight of the student B whose height
is 60 units.
12. From the following data, find out the probable yield when the rainfall is 29”.
                                              Rainfall                   Yield
               Mean                                25”           40 units per
                                                                 hectare
               Standard Deviation                   3”           6 units per hectare
13. A study of wheat prices at two cities yielded the following data:
City A City B
Coefficient of correlation r is 0.774. Estimate from the above data the most likely
price of wheat
14. Find out the regression equation showing the regression of capacity utilisation on
r = 0.62
                                                         170
Caritas Business College              BUSINESS STATISTICS – OAD 221
15. The following table shows the mean and standard deviation of the prices of two
share B.
16. Find out the regression coefficients of Y on X and of X on Y on the basis of following
data:
Variance of X = 4, Variance of Y = 9
17. Find the regression equation of X and Y and the coefficient of correlation from the
following data:
        18.    By using the following data, find out the two lines of regression and from them
               compute the Karl Pearson’s coefficient of correlation.
as 2X – 3Y = 0 and 4Y – 5X-8 = 0.
(i) Identify which of the two can be called regression line of Y on X and of X on Y.
Which of these is the lines of regression of X and Y. Find rxy and Sy when Sx = 3
                                                         171
Caritas Business College                  BUSINESS STATISTICS – OAD 221
21. The regression equation of profits (X) on sales (Y) of a certain firm is 3Y – 5X +108 =
0. The average sales of the firm were Rs 44,000 and the variance of profits is 9/16 th
of the variance of sales. Find the average profits and the coefficient of correlation
                                                         172
Caritas Business College            BUSINESS STATISTICS – OAD 221
 6.1    INTRODUCTION
        In business, managers and other decision makers may be concerned with the way in
which the values of variables change over time like prices paid for raw materials,
numbers of
                                                    173
Caritas Business College             BUSINESS STATISTICS – OAD 221
employees and customers, annual income and profits, and so on. Index numbers are one
If we turn to any journal devoted to economic and financial matters, we are very likely to
come across an index number of one or the other type. It may be an index number of share
production. The objective of these index numbers is to measure the changes that have
occurred in prices, cost of living, production, and so forth. For example, if a wholesale price
index number for the year 2000 with base year 1990 was 170; it shows that wholesale
the same index number moves to 180 in 2001, it shows that there has been 80 percent
With the help of various index numbers, economists and businessmen are able to
describe and appreciate business and economic situations quantitatively. Index numbers
were originally developed by economists for monitoring and comparing different groups of
index serieses, and to construct index series of your own. Having constructed your own
index, it can then be compared to a national one such as the RPI, a similar index for your
industry as a whole and also to indexes for your competitors. These comparisons are a very
                                                       174
Caritas Business College              BUSINESS STATISTICS – OAD 221
company's own sales and compare it to the index of the volume of sales for the general
supermarket industry. A graph of the two indexes will illustrate the company's performance
within the sector. It is immediately clear from Figure 6-1 that, after initially lagging behind the
general market, the supermarket company caught up and then overtook it. In the later
stages, the company was having better results than the general market but that, as with the
Our focus in this lesson will be on the discussion related to the methodology of index number
construction. The scope of the lesson is rather limited and as such, it does not discuss a
large number of index numbers that are presently compiled and published by different
the level of a certain phenomenon in two or more situations”. The phenomenon under
variable or a group of distinct but related variables. In Business and Economics, the
 the prices of a particular commodity like steel, gold, leather, etc. or a group of
commodities like consumer goods, cereals, milk and milk products, cosmetics, etc.
exports, stocks and shares, sales and profits of a business house and so on.
                                                       175
Caritas Business College              BUSINESS STATISTICS – OAD 221
 the national income of a country, wage structure of workers in various sectors, bank
profession, etc.
The utility of index numbers in facilitating comparison may be seen when, for example we
are interested in studying the general change in the price level of consumer goods, i.e.
say, low income group or middle income group or labour class and so on. Obviously these
changes are not directly measurable as the price quotations of the various commodities are
available in different units, e.g., cereals (wheat, rice, pulses, etc) are quoted in Rs per quintal
or kg; water in Rs per gallon; milk, petrol, kerosene, etc. in Rs per liter; cloth in Rs per miter
and so on.
Further, the prices of some of the commodities may be increasing while those of others may
be decreasing during the two periods and the rates of increase or decrease may be
different for different commodities. Index number is a statistical device, which enables us to
arrive at a single representative figure that gives the general level of the price of the
being made.”
                                                       176
Caritas Business College            BUSINESS STATISTICS – OAD 221
in practice.”
On the basis of above discussion, the following characteristics of index numbers are apparent:
measuring the central tendency of the data, representing a group of figures. Index
number has all these functions to perform. L R Connor states, "in its simplest form, it
measurement.
Examples of such magnitudes are 'price level', 'cost of living', 'business or economic
activity' etc. The statistical methods used in the construction of index numbers are
magnitude in such a manner that the changes in that magnitude may be measured in
4. Index Numbers are for comparison: The index numbers by their nature are
comparative. They compare changes taking place over time or between places or
                                                    177
Caritas Business College             BUSINESS STATISTICS – OAD 221
In brief, index number is a statistical technique used in measuring the composite change in
several similar economic variables over time. It measures only the composite change,
because some of the variables included may be showing an increase, while some others
may be showing a decrease. It synthesizes the changes taking place in different directions
and by varying extents into the one composite change. Thus, an index number is a device to
simplify comparison to show relative movements of the data concerned and to replace what
The first index number was constructed by an Italian, Mr G R Carli, in 1764 to compare the
changes in price for the year 1750 (current year) with the price level in 1500 (base year) in
order to study the effect of discovery of America on the price level in Italy. Though originally
money, today index numbers are extensively used for a variety of purposes in economics,
business, management, etc., and for quantitative data relating to production, consumption,
profits, personnel and financial matters etc., for comparing changes in the level of
phenomenon for two periods, places, etc. In fact there is hardly any field or quantitative
measurements where index numbers are not constructed. They are used in almost all
sciences – natural, social and physical. The main uses of index numbers can be
summarized as follows:
Index numbers are indispensable tools for the management personnel of any
and formulation of executive decisions. The indices of prices (wholesale & retail),
output (volume of trade, import and export, industrial and agricultural production) and
bank deposits, foreign exchange and reserves etc., throw light on the nature of, and
                                                      178
Caritas Business College            BUSINESS STATISTICS – OAD 221
variation in the general economic and business activity of the country. They are the
good appraisal of the general trade, economic development and business activity
                  “Index numbers are today one of the most widely used statistical devices.
                  They are used to take the pulse of the economy and they have come to be
                  used as indicators of inflationary or deflationary tendencies.”
               Like barometers, which are used in Physics and Chemistry to measure atmospheric
Since the index numbers study the relative change in the level of a phenomenon at
different periods of time, they are especially useful for the study of the general trend
for a group phenomenon in time series data. The indices of output (industrial and
agricultural production), volume of trade, import and export, etc., are extremely useful
for studying the changes in the level of phenomenon due to the various components
of a time series, viz. secular trend, seasonal and cyclical variations and irregular
components and reflect upon the general trend of production and business activity.
As a measure of average change in extensive group, the index numbers can be used
establishing a new undertaking, the study of the trend of changes in the prices,
general idea of the comparative courses, which the future holds for different
undertakings.
                                                     179
Caritas Business College            BUSINESS STATISTICS – OAD 221
Index numbers of the data relating to various business and economic variables
serve an important guide to the formulation of appropriate policy. For example, the
cost of living index numbers are used by the government and, the industrial and
business concerns for the regulation of dearness allowance (D.A.) or grant of bonus
to the workers so as to enable them to meet the increased cost of living from time to
according to the index numbers of the consumption of the commodity from time to
planning of their future production. Although index numbers are now widely used to
study the general economic and business conditions of the society, they are also
health and educational authorities etc., for formulating and revising their policies from
time to time.
Since the changes in prices and purchasing power of money are inversely related, an
increase in the general price index indicates that the purchasing power of money has
gone down.
                               Purchasing Power              1
                                                                        x100
                                                       General Price
                                                          Index
Accordingly, if the consumer price index for a given year is 150, the purchasing
power of a rupee is (1/150) 100 = 0.67. That is, the purchasing power of a rupee in
                                                     180
Caritas Business College            BUSINESS STATISTICS – OAD 221
With the increase in prices, the amount of goods and services which money wages
can buy (or the real wages) goes on decreasing. Index numbers tell us the change
in real
A real wage index equal to, say, 120 corresponding to money wage index of 160 will
indicate an increase in real wages by only 20 per cent as against 60 per cent
Index numbers also serve as the basis of determining the terms of exchange.
The terms of exchange are the parity rate at which one set of commodities is
exchanged for another set of commodities. It is determined by taking the ratio of the
price index for the two groups of commodities and expressing it in percentage.
For example, if A and B are the two groups of commodities with 120 and 150 as their
price index in a particular year, respectively, the ratio 120/150 multiplied by 100 is 80
per cent. It means that prices of A group of commodities in terms of those in group B
Consumer price indices or cost of living index numbers are used for deflation of net
obtaining real wages from the given nominal wages (as explained in use 4 above)
can be used to find real income from inflated money income, real sales from nominal
                                                     181
Caritas Business College              BUSINESS STATISTICS – OAD 221
the phenomenon they study. Although index numbers can be constructed for measuring
relative changes in any field of quantitative measurement, we shall primarily confine the
discussion to the data relating to economics and business i.e., data relating to prices,
production (output) and consumption. In this context index numbers may be broadly
1. Price Index Numbers: The price index numbers measure the general changes in the
(i) Wholesale Price Index Numbers: The wholesale price index numbers reflect
(ii) Retail Price Index Numbers: These indices reflect the general changes in
the retail prices of various commodities such as consumption goods, stocks and
(iii) Consumer Price Index: Commonly known as the Cost of living Index, CPI is
a specialized kind of retail price index and enables us to study the effect of changes
living of a particular class or section of the people like labour class, industrial or
2. Quantity Index Numbers: Quantity index numbers study the changes in the volume
agricultural production, industrial production, imports and exports, etc. They are
                                                          182
Caritas Business College                     BUSINESS STATISTICS – OAD 221
3. Value Index Numbers: These are intended to study the change in the total value
inventories. However, these indices are not as common as price and quantity indices.
                                                          Notations Used
          Since index numbers are computed for prices, quantities, and values, these are denoted by the lower
          case letters:
                   p, q, and v represent respectively the price, the quantity, and the value of an individual
                                     commodity.
          Subscripts 0, 1, 2,… i, ... are attached to these lower case letters to distinguish price, quantity, or value
          in any one period from those in the other. Thus,
pi denotes the price of a commodity in the ith period, where i = 1,2,3, ...
Similar meanings are assigned to q0, q1, ... qi, ... and v0, v1, … vi, …
          Capital letters P, Q and V are used to represent the price, quantity, and value index numbers,
          respectively. Subscripts attached to P, Q, and V indicates the years compared. Thus,
PI2 means the price index for period 2 relative to period 1, and so on.
          Similar meanings are assigned to quantity Q and value V indices. It may be noted that all indices are
          expressed in percent with 100 as the index for the base period, the period with which comparison is to be
          made.
Various indices can also be distinguished on the basis of the number of commodities that go
into the construction of an index. Indices constructed for individual commodities or variable
are termed as simple index numbers. Those constructed for a group of commodities or
                                                                  183
Caritas Business College                   BUSINESS STATISTICS – OAD 221
Here, in this lesson, we will develop methods of constructing simple as well as composite
indices.
construct a simple index, we first have to decide on the base period and then find ratio of the
value at any subsequent period to the value in that base period - the price/quantity relative.
                                                          Value in Period i
                        Index for any Period i                                 x100
                        
                                                  Value in Base
                                                  Year
        i.e. Simple Price Index for period i = 1,2,3 ... will be
                               P0i 
                                     pp x10                          …………(6-1)
                                        0
                                      0
        Example 6-1
        Given are the following price-quantity data of fish, with price quoted in Rs per kg and production in
        qtls.
                Year            :          1980       1981    1982    1983     1984   1985
                Price           :           15           17    16      18       22    20
                Production      :           500       550      480    610       650   600
        Construct:
                (a)     the price index for each year taking price of 1980 as base,
                (b)     the quantity index for each year taking quantity of 1980 as base.
        Solution:
                                       Simple Price and Quantity Indices of
                                                  Fish (Base Year = 1980)
                                                              184
Caritas Business College                        BUSINESS STATISTICS – OAD 221
percentages. Given such an index, it is easy to find the percent by which the price/quantity
may have changed in a given period as compared to the base period. For example,
observing the index computed in Example 6-1, one can firmly say that the output of fish was
It may also be noted that given the simple price/quantity for the base year and the index for
the period i = 1, 2, 3, …; the actual price/quantity for the period i = 1, 2, 3, … may easily be
        obtained
        as:             p                                         …………(6-3)
                                        P0i
                                       
                             i        0      
                                          100
                                           
                                       Q0i
        and            q q                                        …………(6-4)
                       
                         i
                                       100
                                        
                                      0
                                      
                                          122.00 
                       q 
                                  i               
                                 500 100
                                                  
                                 = 610
                                                             185
Caritas Business College                      BUSINESS STATISTICS – OAD 221
numbers are used. Depending upon the method used for constructing an index, composite
Irrespective of the units in which prices/quantities are quoted, this index for given
                 (i)      Find the aggregate of prices/quantities of all commodities for each period (or
                          place).
                 (ii)     Selecting one period as the base, divide the aggregate prices/quantities
The computation procedure contained in the above steps can be expressed as:
                                   pi
                            P               x10                    …………(6-5)
                                             0
                            0i
                                 p      0
                                   qi
        and                Q                x100                   …………(6-6)
                            0i
                                  q     0
        Example 6-2
        Given are the following price-quantity data, with price quoted in Rs per kg and
production in qtls.
                                               1980                              1985
              Item                Price               Production        Price           Production
                                                              186
Caritas Business College                   BUSINESS STATISTICS – OAD 221
        Find   (a)     Simple Aggregative Price Index with 1980 as the base.
               (b)     Simple Aggregative Quantity Index with 1980 as the base.
        Solution:
                                              Calculations for
                                   Simple Aggregative Price and Quantity Indices
                                               (Base Year = 1980)
                                P0i        67
                                               x100
                                           55
P0i  121.82
Although Simple Aggregative Index is simple to calculate, it has two important limitations:
First, equal weights get assigned to every item entering into the construction of this index
irrespective of relative importance of each individual item being different. For example,
                                                             187
Caritas Business College               BUSINESS STATISTICS – OAD 221
items like pencil and milk are assigned equal importance in the construction of this index.
Second, different units in which the prices are quoted also sometimes unduly affect
this index. Prices quoted in higher weights, such as price of wheat per bushel as compared
to a price per kg, will have unduly large influence on this index. Consequently, the prices of
only a few commodities may dominate the index. This problem no longer exists when the
units in which the prices of various commodities are quoted have a common base.
Even the condition of common base will provide no real solution because commodities with
relatively high prices such as gold, which is not as important as milk, will continue to
dominate this index excessively. For example, in the Example 6-2 given above chicken
prices are relatively higher than those of fish, and hence chicken prices tend to influence this
This index makes an improvement over the index of simple aggregative prices/quantities as
it is not affected by the difference in the units of measurement in which prices/quantities are
expressed. However, this also suffers from the problem of equal importance getting
Given the prices/quantities of a number of commodities that enter into the construction of this
(i) After selecting the base year, find the price relative/quantity relative of each
commodity for each year with respect to the base year price/quantity. As
period is the ratio of the price/quantity of that commodity in the given period
                                                         188
Caritas Business College                 BUSINESS STATISTICS – OAD 221
(ii) Multiply the result for each commodity by 100, to get simple price/quantity
(iii) Take the average of the simple price/quantity indices by using arithmetic
                                                      
        and                     Q0i  Average of q x100 
 0 
        Using arithmetic
        mean
                                                   
                                                x100
                                            pi
                                             p
                               P0i          0                      …………(6-7)
                                          
                                              N
                                                  
                                            qi x100
                                            q
        and                    Q0i          0                      …………(6-8)
                                          
                                              N
        and                                                         …………(6-10)
                              Anti log log
                         Q0i 
                         x100          N      q
                                           0            
        Example 6-3
        From the data in Example 6.2 find:
(a) Index of Average of Price Relatives (base year 1980); using mean, median and
geometric mean.
              (b) Index of Average of Quantity Relatives (base year 1980); using mean, median
                                                        189
Caritas Business College         BUSINESS STATISTICS – OAD 221
Solution:
                                             190
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                               Calculations for
                             Index of Average of Price Relatives and Quantity Relatives
                                                 (Base Year = 1980)
                                                               
                                                         pi x100
                                                        p
        Using arithmetic mean                 P          0     
                                                 
                                              0i          N
                                                      370.19
                                                       3
                                                   123.39
        Using Median                          P  Size of  N  1 
                                              0i          
                                                                 th item
                                                          
                                                              2  
                                                           3  1
                                                 Size of
                                                                    th item
                                                                    
                                                               2
                                            Size of 2nd item
                                                   127.77
1  pi 
                                                      191
Caritas Business College                BUSINESS STATISTICS – OAD 221
                                                                
                                                          qi x100
                                                       
                                                          q
        Using arithmetic mean                  Q          0     
                                                  
                                               0i          N
                                                       339.58
                                                        3
                                                    113.19
        Using Median                           Q  Size of  N  1 
                                                0i         
                                                                  th item
                                                           
                                                                2 
                                                            3  1
                                                  Size of
                                                                    th item
                                                                    
                                                                2
                                             Size of 2nd item
                                                    111.11
1  qi 
Apart from the inherent drawback that this index accords equal importance to all items
entering into its construction, a simple arithmetic mean and median are not appropriate
injects an upward bias in the index. So geometric mean is considered a more appropriate
We have noted that the simple aggregative price/quantity indices do not take care of the
differences in the weights to be assigned to different commodities that enter into their
construction. It is primarily because of this limitation that the simple aggregative indices are
of very limited use. Weighted aggregative Indices make up this deficiency by assigning
                                                       192
Caritas Business College                  BUSINESS STATISTICS – OAD 221
Among several ways of assigning weights, two widely used ways are:
                                La         pi q0 x10                     …………(6-11)
                            P                    0
                                0i
                                           p q0   0
                                La         qi p0 x10                     …………(6-12)
                            Q                    0
                                 0i
                                           q p0   0
                                  Pa    pi
                                P          x10                           …………(6-13)
                                qi          0
                                0i
                                           p q0   i
                              Pa    qi pi x10                            …………(6-14)
                             Q            0
                                 0i
                                           q p0   i
        Example 6-4
        From the data in Example 6.2 find:
(a) Laspeyre’s Price Index for 1985, using 1980 as the base
(b) Laspeyre’s Quantity Index for 1985, using 1980 as the base
(c) Paasche’s Price Index for 1985, using 1980 as the base
                                                        193
Caritas Business College               BUSINESS STATISTICS – OAD 221
(d) Paasche’s Quantity Index for 1985, using 1980 as the base
Solution:
                                                  Calculations for
                                            Laspeyre’s and Paasche’s
                                            Indices
                                                   (Base Year = 1980)
                Item           p0 q0                    p1 q0       p0 q1     p1 q1
                Fish           7500                     10000       9000      12000
                Mutton         10620                    13570       11520     14720
               Chicken          9900                    10800       11000     12000
                Sum      →     28020                    34370       31520     38720
(a) Laspeyre’s Price Index for 1985, using 1980 as the base
                                           La         pi q0 x100
                                       P         
                                            0i
                                                      p q0   0
                                                   34370
                                                  28020 x100
 122.66
(b) Laspeyre’s Quantity Index for 1985, using 1980 as the base
                                           La         qi p0 x100
                                       Q         
                                            0i
                                                      q 0p   0
                                                   31520
                                                  28020 x100
 112.49
(c) Paasche’s Price Index for 1985, using 1980 as the base
                                             Pa    pi
                                           P          x100
                                           qi
                                           0i
                                                      p q0   i
                                                   38720
                                                  31520 x100
                                                              194
Caritas Business College           BUSINESS STATISTICS – OAD 221
 122.84
(d) Paasche’s Quantity Index for 1985, using 1980 as the base
                                                   195
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                            Pa         qi pi x100
                                        Q         
                                             0i
                                                       q 0p   i
                                                    38720
                                                   34370 x100
 112.66
precise interpretations:
1. It compares the cost of collection of a fixed basket of goods selected in the base
period with the cost of collecting the same basket of goods in the given (current)
period.
Accordingly, the cost of collection of 500 qtls of fish, 590 qtls of mutton and 450 qtls
of chicken has increased by 22.66 per cent in 1985 as compared to what it was in
1980. Viewed differently, it indicates that a fixed amount of goods sold at 1985
prices yield 22.66 per cent more revenue than what it did at 1980 prices.
2. It also implies that a fixed amount of goods when purchased at 1985 prices would
cost 22.66 per cent more than what it did at 1980 prices. In this interpretation, the
Laspeyre's Price Index serves as the basis of constructing the cost of living
index, for it tells how much more does it cost to maintain the base period standard
Laspeyre's Quantity Index, too, has precise interpretations. It reveals the percentage change
in total expenditure in the given (current) period as compared to the base period if varying
amounts of the same basket of goods are sold at the base period prices. When viewed in
this manner, we will be required to spend 12.49 per cent more in 1985 as compared to 1980
if the quantities of fish, mutton and chicken for 1965 are sold at the base period (1980)
prices.
                                                               196
Caritas Business College             BUSINESS STATISTICS – OAD 221
A careful examination of the Paasche's Price Index will show that this too is amenable to the
1. It compares the cost of collection of a fixed basket of goods selected in the given
period with the cost of collection of the same basket of goods in the base period.
Accordingly, the cost of collection of a fixed basket of goods containing 600 qtls of
fish, 640 qtls of mutton and 500 qtls of chicken in 1985 is about 22.84 per cent
more than the cost of collecting the same basket of goods in 1980. Viewed a little
differently, it indicates that a fixed basket of goods sold at 1985 prices yields 22.84
per cent more revenue than what it would have earned had it been sold at the base
2. It also tells that a fixed amount of goods purchased at 1985 prices will cost
22.84 per cent more than what it would have cost if this fixed amount of goods had
Analogously, Paasche's Quantity Index, too, has its own precise meaning. It tells the per
cent change in total expenditure in the given period as compared to the base period if
varying amounts of the same basket of goods are to be sold at given period prices. When so
viewed, we will be required to spend 12.66 per cent more in 1985 as compared to
1980 if the quantities of fish, mutton and chicken for 1980 are sold at the given period
(1985) prices.
In order to understand the relationship between Laspeyre’s and Paasche’s Indices, the
Laspeyre's index is based on the assumption that unless there is a change in tastes and
preferences, people continue to buy a fixed basket of goods irrespective of how high or low
                                                     197
Caritas Business College                  BUSINESS STATISTICS – OAD 221
the prices are likely to be in the future. Paasche's index, on the other hand, assumes that
people would have bought the same amount of a given basket of goods in the past
However, the basic contention implied in the assumptions on which the two indices are
based is not true. For, people do make shifts in their purchase pattern and preferences by
buying more of goods that tend to become cheaper and less of those that tend to become
costlier. In view of this, the following two situations that are likely to emerge need
consideration:
1. When the prices of goods that enter into the construction of these indices show a
general tendency to rise, those whose prices increase more than the average
increase in prices will have smaller quantities in the given period than the
corresponding quantities in the base period. That is, qi’s will be smaller than q0’s
when prices in general are rising. Consequently, Paasche's index will have relatively
smaller weights
               than those in the Laspeyre's index and, therefore, the former ( P Pa ) will be smaller
                                                                                        0
                                     La
               than the latter ( P        ). In other words, Paasche's index will show a relatively smaller
                                     0
2. On the contrary, when prices in general are falling, goods whose prices show a
relatively smaller fall than the average fall in prices, will have smaller quantities in
the given period than the corresponding quantities in the base period. This means
that qi’s will be smaller than q0’s when prices in general are falling. Consequently,
Paasche's index will have smaller weights than those in the Laspeyre's index
and,
               therefore, the former ( P Pa ) will be smaller than the latter ( P La ). In other words,
                                        0i                                       0i
Paasche's index will show a relatively greater fall when the prices in general tend
to fall.
An important inference based on the above discussion is that the Paasche's index has a
downward bias and the Laspeyre's index an upward bias. This directly follows from the fact
                                                          198
Caritas Business College                        BUSINESS STATISTICS – OAD 221
that the Paasche's index, relative to the Laspeyre's index, shows a smaller rise when the
prices in general are rising, and a greater fall when the prices in general are falling.
It may, however, be noted that when the quantity demanded increases because of change in
real income, tastes and preferences, advertising, etc., the prices remaining unchanged, the
Paasche's index will show a higher value than the Laspeyre's index. In such situations, the
Paasche's index will overstate, and the Laspeyre's will understate, the changes in prices.
The former now represents the upper limit, and the latter the lower limit, of the range of price
changes.
The relationship between the two indices can be derived more precisely by making use of
                                         fXY       fX   fY 
                                                  
                                          N        N            
                                                                   N            
                                                                                  
                                                                                             …………(6.15)
                           r        
                                    
                               xy
                                                        Sx S y
                                                                                                   pi
        in which X and Y denote the relative price                                                      ) and relative quantity
        movements(                                                                                 p0
                      qi
        movements(         ) respectively. Sx and Sy are the standard deviations of price and quantity
                     q0
movements, respectively. While rxy represents the coefficient of correlation between the
relative price and quantity movements; f represents the weights assigned, that is, p0 q0. N
is
Substituting the values of X, Y, f and N in Eq. (6-15), and then rearranging the expression, we
have
                       rxy Sx                  pi qi           pi q0            p0 qi
                                                                             x
                           Sy                  p q0    0        p q0    0        p q0    0
                                                                              199
Caritas Business College                    BUSINESS STATISTICS – OAD 221
              pi qi
        If             , is the index of value expanded between the base period and the ith period,
              p q0 0 V0i
                            rxy Sx
                                        pi
                              Sy 1                       p0 qi
                                   q0                 x
                              V0i                          pi qi
                                       p0
                                      q0
                            rxy Sx
                                                  1
                              S y  1  P La x
                                         0
                                               P0iP
                              V0i
                               La
                            P0i  1       rxy S x
                                                                             …………(6.16)
                            P P            Sy
                             0i
                                           V0i
                       La
                1      P            when either rxy , Sx and Sy is equal to zero. That is, the two indices will
                       Pa
                       0i       0i
give the same result either when there is no correlation between the price and
quantity movements, or when the price or quantity movements are in the same ratio
2. Since in actual practice rxy will have a negative value between 0 and -1, and as
neither Sx = 0 nor Sy = 0, the right hand side of Eq. (6-16) will be less than 1.
This
3. Given the overall movement in the index of value ( V0i ) expanded, the greater the
coefficient of correlation (rxy) between price and quantity movements and/or the
200
                                                                     P
Caritas Business College           BUSINESS STATISTICS – OAD 221
greater the degree of dispersion (Sx and Sy) in the price and quantity movements,
the
4. The longer the time interval between the two periods to be compared, the more the
chances for price and quantity movements leading to higher values of Sx and Sy.
breaking down
                                                    201
Caritas Business College                BUSINESS STATISTICS – OAD 221
over a longer period, people do find enough time to make shifts in their
consumption pattern, buying more of goods that may have become relatively
cheaper and less of those that may have become relatively dearer. All this will end
                 movement.
                                    0                      P0
                 Consequently P La will diverge from         Pa
                                                                  more in the long run than in the short run.
                 ,
                 So long as the periods to be compared are not much apart, P La will be quite close to
                                                                                     0
                   PPa
                   0i
are opposite to each other. Such an impression is not sound because both serve the same
purpose, although they may give different results when applied to the same data.
This raises an important question. Which one of them gives more accurate results and which
one should be preferred over the other? The answer to this question is rather difficult since
Despite a very useful and precise difference in interpretation, in actual practice the
Laspeyre's index is used more frequently than the Paasche's index for the simple reason
that the latter requires frequent revision to take into account the yearly changes in weights.
No such revision is required in the case of the Laspeyre's index where once the weights
have been determined, these do not require any change in any subsequent period. It is on
this count that the Laspeyre's index is preferred over the Paasche's index.
However, this does not render the Paasche's index altogether useless. In fact, it
supplements the practical utility of the Laspeyre's index. The fact that the Laspeyre's index
has an upward bias and the Paasche's index downward bias, the two provide the range
between which the index can vary between the base period and the given period.
Interestingly, thus, the former represents the upper limit, and the latter the lower limit.
                                                       202
Caritas Business College              BUSINESS STATISTICS – OAD 221
To overcome the difficulty of overstatement of changes in prices by the Laspeyre's index and
compromise and improve upon them. These are particularly useful when the given period
and the base period fall quite apart and result in a greater divergence between Laspeyre's
1. Marshall-Edgeworth Index
The Marshall-Edgeworth Index uses the average of the base period and given
                                       q0  qi 
                                  p
                                 i      2   
                        ME                    
                       P0i                                              …………(6-17)
                        q              x100
                                         q
                                      0    i
                               p 0         
                                       2    
                                       p0  pi 
                                  q
                                i       2   
                        ME                    
                       Q0i                                              …………(6-18)
                        p             x100
                                         p
                                      0    i
                                q 0        
                                       2    
The Dorbish and Bowley Index is defined as the arithmetic mean of the
                                     La    Pa
                                 DBP    P
                               P      0i 0i                              …………(6-19)
                                0i
                                         2
                                      La     Pa
                                 DB Q    i Q0i
                                         0                                …………(6-20)
                               Q     
                                 0i
                                         2
                                                    203
Caritas Business College                    BUSINESS STATISTICS – OAD 221
The Fisher’s Ideal Index is defined as the geometric mean of the Laspeyre’s and
Paasche’s indices.
                                      F
                                     P0 ..........................................................................(6-21)
                                            P0i0i
                                                La .PPa
                                    Q0 ..........................................................................(6-22)
                                       F
                                           Q0i0i
                                               La .QPa
An alternative system of assigning weights lies in using value weights. The value weight v for
any single commodity is the product of its price and quantity, that is, v = pq.
  pi 
                                v          x100
                                            p
                                            0
                        P0i                                                                      …………(6-23)
                                           v
(i) the product of the base period prices and the base period quantities denoted as
v0 that is, v0 = p0 q0 , or
(ii) the product of the base period prices and the given period quantities
  pi 
                                p0 q0 
                                                p x100
                                               0
                       0 P0i                                                                    …………(6-24)
                                            p q0      0
It may be seen that 0 P0i is the same as the Laspeyre’s aggregative price index.
expressed as
                                                                   204
Caritas Business College                  BUSINESS STATISTICS – OAD 221
  pi 
                                p0 qi 
                                              p x100
                                             0
                       i P0i                                               …………(6-25)
                                          p q0      i
It may be seen that i P0i is the same as the Paasche’s aggregative price
  qi 
                                  v    x100
                       Q0i            q 0
                                                                            …………(6-26)
                                         v
(i) the product of the base period quantities and the base period prices denoted as
v0 that is, v0 = q0 p0 , or
(ii) the product of the base period quantities and the given period prices
  qi 
                             q 0 p 0 
                                         q x100
                                        0                                   …………(6-27)
                        Q  
                       0 0i           q0 p0
It may be seen that 0 Q0i is the same as the Laspeyre’s aggregative quantity index.
                             q 0 pi     x100
                                      q 0                                   …………(6-28)
                        Q  
                       i 0i          q0 pi
                                                               205
Caritas Business College            BUSINESS STATISTICS – OAD 221
It may be seen that i Q0i is the same as the Paasche’s aggregative quantity index.
        Example 6-5
        From the data in Example 6.2 find the:
                                                    206
Caritas Business College               BUSINESS STATISTICS – OAD 221
Solution:
                                              Calculations for
                                   Index of Weighted Average of Price
                                               Relatives (Base Year = 1980)
                                                                        pi          p1
                                                                   p q          p q
               Item            v0 = p0 q0            v1 = p0 q1                     
                                                                   x100         x100
                                                                    0 0        0 1       
                                                                        0p         0p    
               Fish            7500                   9000            1000000       1200000
              Mutton           10620                 11520               1357000   1472000
              Chicken           9900                 11000               1080000   1200000
              Sum       →      28020                 31520               3437000   3872000
                                              p0 q0 
                                                         p x100
                                                      0      
                                       0 P0i 
                                                     p 0q 0
                                             3437000
                                             28020
 122.66
                                              p0 qi 
                                                             p x100 
                                                          207
Caritas Business College   BUSINESS STATISTICS – OAD 221
                                           0        
                           i P0i 
                                          p q0   i
                                          208
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                             3872000
                                             31520
 122.84
                                              Calculations for
                                 Index of Weighted Average of Quantity Relatives
                                               (Base Year = 1980)
                                                                       q1             q1
                                                                 q p              q p
               Item            v0 = q0 p0          v1 = q0 p1                         
                                                                 x100             x100
                                                                  0 0            0 1       
                                                                       0q            0q    
               Fish            7500                  10000           900000           1200000
              Mutton           10620                 13570            1152000         1472000
              Chicken           9900                 10800            1100000         1200000
              Sum       →      28020                 34370            3152000         3872000
                                             q 0 p 0 
                                                         q x100
                                                       0    
                                        Q 
                                       0 0i           q0 p0
                                              3152000
                                              28020
 112.49
                                              q 0 pi 
                                                         q x100 
                                                       0    
                                        Q 
                                       i 0i           q0 pi
                                             3872000
                                             34370
 112.66
Although the indices of weighted average of price/quantity relatives yield the same results
                                                          209
Caritas Business College   BUSINESS STATISTICS – OAD 221
also in
                                       210
Caritas Business College                BUSINESS STATISTICS – OAD 221
situations when it is necessary and advantageous to do so. Some such situations are
as follows:
the group, the price relative of the latter is weighted by the group as a whole.
(iii) Price/quantity relatives serve a useful purpose in splicing two index series
formulae measures the price changes or quantity changes with perfection and has some
bias. The problem is to choose the most appropriate formula in a given situation. As a
measure of the formula error a number of mathematical tests, known as the tests of
consistency or tests of adequacy of index number formulae have been suggested. In this
section we will discuss these tests, which are also sometimes termed as the criteria for a
1. Unit Test: This test requires that the index number formula should be independent of
the units in which the prices or quantities of various commodities are quoted. All the
formulae discussed in the lesson except the index number based on Simple
2. Time Reversal Test: The time reversal test, proposed by Prof Irving Fisher requires
the index number formula to possess time consistency by working both forward and
                                                        211
Caritas Business College             BUSINESS STATISTICS – OAD 221
In other words, if the index numbers are computed for the same data relating to two
periods by the same formula but with the bases reversed, then the two index
should have
 1 or more generally
(vi) Weighted Geometric Mean of Price Relatives formula with fixed weights
Lespeyre’s and Pasche’s index numbers do not satisfy the time reversal test.
3. Factor Reversal Test: This is the second of the two important tests of consistency
“Just as our formula should permit the interchange of two times without
and quantities without giving inconsistent results – i.e., the two results
multiplied together should give the true value ratio, except for a constant
of proportionality.”
                                                     212
Caritas Business College           BUSINESS STATISTICS – OAD 221
This implies that if the price and quantity indices are obtained for the same data,
same base and current periods and using the same formula, then their product
(without the
                                                   213
Caritas Business College               BUSINESS STATISTICS – OAD 221
factor 100) should give the true value ratio. Symbolically, we should have (without
               factor 100).
                                                  p1 q1
                                                           
                                P xQ     V                                         …………(6-30)
                                01 01
                                              p0 q0             0
Fisher’s formula satisfies the factor reversal test. In fact fisher’s index is the only
index satisfying this test as none of the formulae discussed in the lesson satisfies this
test.
Remark: Since Fisher’s index is the only index that satisfies both the time reversal
reversal test for more than two periods and is based on the shift ability of the base
period. This requires the index to work in a circular manner and this property enables
us to find the index numbers from period to period without referring back to the
original base each time. For three periods a,b,c, the test requires :
               For Instance
                                                   p2 q1         p0 q2
                 La    La    La          x                  x                1
               P    xP    xP    
               p1q0
                01 12              0         0         1 1            2   2
                    21          p q
                                                 pq           p q
Hence Laspeyre’s index does not satisfy the circular test. In fact, circular test is not
satisfied by any of the weighted aggregative formulae with changing weights. This
                                                                214
Caritas Business College            BUSINESS STATISTICS – OAD 221
                                                       215
Caritas Business College                  BUSINESS STATISTICS – OAD 221
(i) when the base period of a given index number series is to be made
more recent, or
(ii) when two index number series with different base periods are to be
compared, or
(iii) when there is need for splicing two overlapping index number series.
        Example 6-6
        Reconstruct the following indices using 1997 as base:
        Solution:
       Shifting the Base Period
                                                         216
Caritas Business College              BUSINESS STATISTICS – OAD 221
base periods into a single series either at the base period of the old series (one with an old
base year), or at the base period of the new series (one with a recent base year). This
actually amounts to changing the weights of one series into the weights of the other series.
           1. Splicing the New Series to Make it Continuous with the Old Series
                Here we reduce the new series into the old series after the base year of the former.
As shown in Table 6.8.2(i), splicing here takes place at the base year (1980) of the
new series. To do this, a ratio of the index for 1980 in the old series (200) to the
index of 1980 in the new series (100) is computed and the index for each of the
                                               Table 6.8.2(i)
                                Splicing the New Series with the Old Series
2. Splicing the Old Series to Make it Continuous with the New Series
                                                       217
Caritas Business College              BUSINESS STATISTICS – OAD 221
This means reducing the old series into the new series before the base period of the
letter. As shown in Table 6.8.2(ii), splicing here takes place at the base period of the
new series. To do this, a ratio of the index of 1980 of the new series (100) to the
index of 1980 of the old series (200) is computed and the index for each of the
preceding years of the old series are then multiplied by this ratio.
                                               Table 6.8.2(ii)
                                Splicing the Old Series with the New Series
                 1976              100                    --                           50
                 1977              120                    --                           60
                 1978              146                    --                      73.50
                 1979              172                    --                           86
                 1980              200                   100                       100
                 1981               --                   110                       110
                 1982               --                   116                       116
                 1983               --                   125                       125
                 1984               --                   140                       140
year quantities/prices (or the given year quantities/prices) are used as weights. In a dynamic
situation where tastes, preferences, and habits are constantly changing, the weights should
be revised on a continuous basis so that new commodities are included and the old ones
This is all the more necessary in a developing society where new substitutes keep replacing
the old ones, and completely new commodities are entering the market. To take care of such
changes, the base year should be the most recent, that is, the year immediately preceding
the
                                                       218
Caritas Business College                BUSINESS STATISTICS – OAD 221
given year. This means that as we move forward, the base year should move along the
As shown in Table 6.8.3(i), to convert fixed-base index numbers into chain-base index
                                               Table 6.8.3(i)
                           Conversion of Fixed-base Index into Chain-base Index
As shown in Table 6.8.3(ii), to convert fixed-base index numbers into chain-base index
 The first year's index is taken what the chain base index is; but if it is to form the
                                                        219
Caritas Business College                BUSINESS STATISTICS – OAD 221
                                               Table 6.8.3(ii)
                            Conversion of Chain-base Index into Fixed-base Index
1. Choice of the Base Period: Choice of the base period is a critical decision because
of its importance in the construction of index numbers. A base period is the reference
period for describing and comparing the changes in prices or quantities in a given
period. The selection of a base year or period does not pose difficult theoretical
questions. To a large extent, the choice of the base year depends on the objective
of the index. A major consideration should be to ensure that the base year is not an
abnormal year. For example, a base period with very low price/quantity will unduly
inflate, while the one with a very high figure will unduly depress, the entire index
number series. An index number series constructed with any such period as the base
may give very misleading results. It is, therefore, necessary that the base period be
selected carefully.
Another important consideration is that the base year should not be too remote in the
past. A more recent year needs to be selected as the base year. The use of a
particular
                                                      220
Caritas Business College             BUSINESS STATISTICS – OAD 221
year for a prolonged period would distort the changes that it purports to measure.
That is why we find that the base year of major index numbers, such as consumer
2. Selection of Weights to be Used: It should be amply clear from the various indices
discussed in the lesson that the choice of the system of weights, which may be used,
is fairly large. Since any system of weights has its own merits and is capable of
It is also worthwhile to bear in, mind that the use of any system of weights should
represent the relative importance of individual commodities that enter into the
index number are also important in deciding the weights. The use of a system of
specific to simple average indices. Theoretically, one can use any of the several
averages that we have, such as mean, median, mode, harmonic mean, and
geometric mean. Besides being locational averages, median and mode are not the
appropriate averages to use especially where the number of years for which an index
While the use of harmonic mean and geometric mean has some definite merits over
availability of different types of indices giving different results when applied to the
same data. Out of the various indices discussed, the choice should be in favour of
one
                                                      221
Caritas Business College            BUSINESS STATISTICS – OAD 221
which is capable of giving more accurate and precise results, and which provides
While the Fisher's index may be considered ideal for its ability to satisfy the tests of
adequacy, this too suffers from two important drawbacks. First, it involves too
the Laspeyre's and Paasche's indices. The use of the term ideal does not, however,
mean that it is the best to use under all types of situations. Other indices are more
In this context, it is important to note that the selection of commodities must not be
based on random sampling. The reason being that in random sampling every
commodity, including those that are not important and relevant, have equal chance of
being selected, and consequently, the index may not be representative. The choice
of commodities has, therefore, to be deliberate and in keeping with the relevance and
im- portance of each individual commodity to the purpose for which the index is
constructed.
6. Data Collection: Collection of data through a sample is the most important issue
in the construction of index numbers. The data collected are the raw material of
an index. Data quality is the basic factor that determines the usefulness of an index.
as possible.
                                                     222
Caritas Business College             BUSINESS STATISTICS – OAD 221
The practical utility of an index also depends on how readily it can be constructed.
Therefore, data should be collected from where these can be easily available. While
the purpose of an index number will indicate what type of data are to be collected, it
also determines the source from where the data can be available.
related variables”. Discuss this statement and point out the important uses of index
numbers.
a. Base period
b. Price relatives
6. “Laspeyre’s index has an upward bias and the Paasche’s index downward bias”.
                                                     223
Caritas Business College                BUSINESS STATISTICS – OAD 221
8. State and explain the Fisher’s ideal formula for price index number. Show how it
satisfies the time-reversal and factor- reversal test? Why is it used little in practice?
a. Base-shifting
b. Splicing
c. Deflating
10. From the following data, construct the price index for each year with price of 1995
as base.
Price of Commodity: 40 50 45 55 65 70
11. From the following data, construct an index number for 2004 taking 2003 as base
year:
                 Articles:                    A     B        C        D        E
                 Prices (2003):          100      125    50          40        5
                 Prices (2004):          140      200    80          60        10
12. Find the index number for 1982 and 1983 taking 1981 as base year by the Simple
Average of Price Relatives Method, using (i) Mean, (ii) Median, and (iii) Geometric
Mean:
13. Construct index number of price and index number of quantity from the following
data using:
                                                             224
Caritas Business College                 BUSINESS STATISTICS – OAD 221
a. Laspeyre’s formula,
b. Paasche’s formula,
14. Calculate index number using Kelly’s Method of Standard Weights, from the
following data:
15. From the following data, construct price index by using Weighted Average of Price
Relatives Method:
                                                            225
Caritas Business College               BUSINESS STATISTICS – OAD 221
           16.   From the information given below, calculate the Cost of Living Index number for
                 1985, with 1984 as base year by
                    a.      Aggregative Expenditure Method, and
information:
What changes in the cost of living figure of 2002 have taken place as compared
to 2001?
19. Given below are two sets of indices one with 1975 as base and the other with 1979 as
                                                         226
Caritas Business College               BUSINESS STATISTICS – OAD 221
Second Set
Price : 25 30 45 60 90
21. Convert into Chain Base Index Number from Fixed Base Index Number
22. From the Chain Base Index numbers given below, construct Fixed
23. From the following data, prepare index number for real wages of workers:
24. During certain period, the Cost of Living Index number went up from 110 to 200 and
salary of a worker also raised from 325 to 500. State by how much the worker has
                                                        227
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                                   228
Caritas Business College                 BUSINESS STATISTICS – OAD 221
        Objective:                 This lesson would enable you to understand the meaning, importance,
                                   models, and components of time series along with details of
                                   methods of measuring trends.
        Structure
        7.1.         Introduction
        7.2.         Objectives of time series analysis
        7.3.         Components of time series
        7.4.         Time series decomposition models
        7.5.         Measurement of secular trend
        7.6.         Seasonal variations
        7.7.         Measurement of cyclical variations
        7.8.         Measurement of irregular variations
        7.9.         Questions
        7.10.        Suggested readings
        7.1.         INTRODUCTION
        A series of observations, on a variable, recorded after successive intervals of time is called a
        time series. The successive intervals are usually equal time intervals, e.g., it can be 10
        years, a year, a quarter, a month, a week, a day, and an hour, etc. The data on the
        population of India is a time series data where time interval between two successive figures
        is 10 years. Similarly figures of national income, agricultural and industrial production, etc.,
        are available on yearly basis.
                                                           229
Caritas Business College             BUSINESS STATISTICS – OAD 221
        In the typical time-series there are three main components which seem to be independent
        of the and seems to be influencing time-series data.
        Trend- It is the broad long-term tendency of either upward or downward movement in the
        average (or mean) value of the forecast variable y over time. The rate of trend growth
        usually varies over time, as shown in fig 7.1(a) and (b).
        Cycles- An upward and downward oscillation of uncertain duration and magnitude about
        the trend line due to seasonal effect with fairly regular period or long period with irregular
        swings is called a cycle. A business cycle may vary in length, usually greater than one year
        but less than 5 to 7 years. The movement is through four phases: from peak (prosperity) to
        contradiction (recession) to trough (depression) to expansion (recovery or growth) as
        shown in Fig. 7.1 (b) and (c).
        Seasonal- It is a special case of a cycle component of time series in which the magnitude
        and duration of the cycle do not vary but happen at a regular interval each year. For
        example, average sales for a retail store may increase greatly during festival seasons.
        Irregular- An irregular or erratic (or residual) movements in a time series is caused by short-
        term unanticipated and non-recurring factors. These follow no specific pattern.
        The purpose of decomposition models is to break a time series into its components: Trend
        (T), Cyclical (C), Seasonality (S), and Irregularity (I). Decomposition of time series provides
        a basis for forecasting. There are many models by which a time series can be analysed;
        two models commonly used for decomposition of a time series are discussed below.
        7.4.1. Multiplicative Model
        This is a most widely used model which assumes that forecast (Y) is the product of the
        four components at a particular time period. That is, the effect of four components on the
        time series is interdependent.
                                    Y=T x C x S × I  Multiplicative model
        The multiplicative model is appropriate in situations where the effect of S, C, and I is
        measured in relative sense and is not in absolute sense. The geometric mean of S, C, and
        I is assumed to be less than one. For example, let the actual sales for period 20 be Y20 =
        423.36. Further let, this value be broken down into its components as: let trend component
        (mean sales) be 400; effect of current cycle (0.90) is to depress sales by 10 per cent;
        seasonality of the series (1.20) boosts sales by 20 per cent. Thus besides the random
        fluctuation, the expected value of sales for the period is 400 × 0.90 × 1.20 = 432. If the
        random factor depresses sales by 2 per cent in this period, then the actual sales value will
        be 432 × 0.98 = 423.36.
        7.4.2. Additive Model
        In this model, it is assumed that the effect of various components can be estimated by
        adding the various components of a time-series. It is stated as:
                                    Y=T + C + S + I          Additive model
        Here S, C, and I are absolute quantities and can have positive or negative values. It is
        assumed that these four components are independent of each other. However, in real-life
        time series data this assumption does not hold good.
                                                      230
Caritas Business College               BUSINESS STATISTICS – OAD 221
(i) The trend line should be smooth- a straight line or mix of long gradual curves.
           (ii)    The sum of the vertical deviations of the observations above the trend line should
                   equal the sum of the vertical deviations of the observations below the trend line.
           (iii)   The sum of squares of the vertical deviations of the observations from the trend
                   line should be as small as possible.
           (iv)    The trend line should bisect the cycles so that area above the trend line should
                   be equal to the area below the trend line, not only for the entire series but as
                   much as possible for each full cycle.
        Example 7.1: Fit a trend line to the following data by using the freehand method.
         Year             1991 1992 1993 1994 1995 1996 1997 1998
         Sales turnover : 80    90   92   83   94   99   92   104
         (Rs. in lakh)
                        80                                                                obtained
        by the trend                                                                      extending
                        1991 1992 1993 1994 1995 1996 1997 1998
                        Years                                                             line.
                                                       2
Caritas Business College              BUSINESS STATISTICS – OAD 221
           (i)     This method is highly subjective because the trend line depends on personal
                   judgement and therefore what happens to be a good-fit for one individual may not
                   be so for another.
           (ii)    The trend line drawn cannot have much value if it is used as a basis for
                   predictions.
                                                       221
Caritas Business College              BUSINESS STATISTICS – OAD 221
           (i)      As the size of n (the number of periods averaged) increases, it smoothens the
                    variations better, but it also makes the method less sensitive to real changes in
                    the data.
           (ii)     Moving averages cannot pick-up trends very well. Since these are averages, it
                    will always stay within past levels and will not predict a change to either a higher
                    or lower level.
                                                      222
Caritas Business College            BUSINESS STATISTICS – OAD 221
             1994              26                     78                26.00                 0
             1995              27                     79                26.33               0.67
             1996              26                      -                  -                   -
        When the chosen period of length n is an odd number, the moving average at
        year i is centred on i, the middle year in the consecutive sequence of n yearly
        values used to compute i. For instance with n =5, MA3(5) is centred on the third
        year, MA4(5) is centred on the fourth year…, and MA 9(5) is centred on the ninth
        year.
        No moving average can be obtained for the first (n-1)/2 years or the last (n- 1/2)
        year of the series. Thus for a 5-year moving average, we cannot make
        computations for the just two years or the last two years of the series.
        When the chosen period of length n is an even numbers, equal parts can easily
        be formed and an average of each part is obtained. For example, if n = 4, then
        the first moving average M 3 (placed at period 3) is an average of the first four
        data values, and the second moving average M 4 (placed at period 4) is the
        average of data values 2 through 5). The average of M3 and M4 is placed
        at period 3 because it is an average of data values for period 1 through 5.
        Example 7.3: Assume a four-yearly cycle and calculate the trend by the method of
        moving average from the following data relating to the production of tea in India.
               Year           Production (million             Year            Production (million
                                      lbs)                                             lbs)
               1987                   464                     1992                     540
                1988                   515                    1993                    557
                1989                   518                    1994                    571
                1990                   467                    1995                    586
                1991                   502                    1996                    612
        Solution: The first 4-year moving average is:
                                             464 + 515 + 518+ 467              1964
                                MA3(4) =                                    =         = 491.00
                                                      4                         4
        This moving average is centred on the middle value, that is, the third year of the
        series. Similarly,
                                             515 + 518 + 467+ 502              2002
                                                     223
Caritas Business College            BUSINESS STATISTICS – OAD 221
                             MA4(4) =                                     =        = 500.50
                                                       4                     4
        This moving average is centred on the fourth year of the series.
        Table 7.2. presents the data along with the computations of 4-year moving averages.
        Table 7.2: Calculation of Trend and Short-term Fluctuations
             Year            Production          4-yearly        4-Yearly     4-Yearly Moving
                              (mm lbs)       Moving Totals        Moving      Average Centred
                                                                  Average
             1987                464                 -                -               -
             1988                515                 -                -               -
                                                   1964           491.00
             1989                518                                               495.75
                                                   2002           500.50
             1990                467                                               503.62
                                                   2027           506.75
             1991                502                                               511.62
                                                   2066           516.50
             1992                540                                               529.50
                                                   2170           542.50
             1993                557                                               553.00
                                                   2254           563.50
             1994                571                                               572.00
                                                   2326            581.50             -
             1995                586                 -                -               -
             1996                612                 -                -               -
        Weighted Moving Averages
        In moving averages, each observation is given equal importance (weight). However,
        different values may be assigned to calculate a weighted average of the most recent n
        values. Choice of weights is somewhat arbitrary because there is no set formula to
        determine them. In most cases, the most recent observation receives the most weightage,
        and the weight decreases for older data values.
Solution: The results of 3-month weighted average are shown in Table 7.3.
                                                   224
Caritas Business College          BUSINESS STATISTICS – OAD 221
                                                  6                           6
              5                   19              1 [316)  (2 13)  112] 
                                                    141
                                                  6                           3
              6                   23              1 [319)
                                                    17
                                                            (2 16)  113] 
                                                  6
              7                   26              1 [3
                                                    
                                                        23)  (2 19)  116] 20
                                                                                1
                                                  6                             2
              8                   30              1 [3
                                                    
                                                        26)  (2  23)  119] 235
                                                  6                            6
              9                   28              1 [3 30)  (2  26)  1
                                                    23]                      271
                                                  6                            2
             10                   18              1 [3 28)  (2  30)  1
                                                    26]                      289
                                                  6                             3
             11                   16              1 [318)
                                                    
                                                            (2  28)  1 30] 23
                                                                                 1
                                                  6                            3
             12                   14              1 [316)  (2 18)  1 28] 
                                                    182
                                                     6                           3
        Example 7.5: A food processor uses a moving average to forecast next month’s demand.
        Past actual demand( in units) is shown below:
        Month                   :    43     44     45      46     47    48    49      50   51
        Actual demand           :    105    106    110 110        114   121   130     128  137
        (in units)
        (a) Compute a simple five-month moving average to forecast demand for month 52.
        (b) Compute a weighted three-month moving average where the weights are highest for
        the latest months and descend in order of 3, 2, 1.
        Solution: Calculation for five-month moving average are shown in Table 7.4.
                Month             Actual Demand          5-month Moving     5-month Moving
                                                              Total             Average
                   43                   105                     -                   -
                   44                   106                     -                   -
                   45                   110                   545                109.50
                   46                   110                   561                 112.2
                   47                   114                   585                 117.0
                   48                   121                   603                 120.6
                                                  225
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                  51           3 × 137     =    141
                                  50           2 × 128     =    256
                                  49           1 × 130     =    130
                                               6                797
                  797
                   6
         MAWT =         = 133 units
        Semi-Average Method
        The semi-average method permits us to estimate the slope and intercept of the trend the
        quite easily if a linear function will adequately described the data. The procedure is simply
        to divide the data into two parts and compute their respective arithmetic means. These two
        points are plotted corresponding to their midpoint of the class interval covered by the
        respective part and then these points are joined by a straight line, which is the required
        trend line. The arithmetic mean of the first part is the intercept value, and the slope is
        determined by the ratio of the difference in the arithmetic mean of the number of years
        between them,
        that is, the change per unit time. The resultant is a time series of the form yˆ  a  bx . The
        :
         yˆ is the calculated trend value and a and b are the intercept and slope values respectively.
        The equation should always be stated completely with reference to the year where x =0 and
        a description of the units of x and y.
        The semi-average method of developing a trend equation is relatively easy to commute and
        may be satisfactory if the trend is linear. If the data deviate much from linearity, the forecast
        will be biased and less reliable.
        Example 7.6: Fit a trend line to the following data by the method of semi-average
        and forecast the sales for the year 2002.
                  Year                 Sales of Firm              Year        Sales of Firm
                                     (thousand                                         (thousand
                                     units)                                            units)
                 1993                       102                   1997                  108
                1994                      105                    1998                  116
                1995                      114                    1999                  112
                1996                      110
                                                         226
Caritas Business College             BUSINESS STATISTICS – OAD 221
        Solution: Since number of years are odd in number, therefore divide the data into equal
        parts (A and B) of 3 years ignoring the middle year (1996). The average of part A and B is
                                      102 + 105 + 114               321
                        yA     =                             =             = 107 units
                                           3              3
120
115
                                     110
                                   Sal
105
                                     100
                                       199        1994      1995       1996    1997   1998   1999
                                       3
                                                                       Years
                                                112 – 107               5
                                         =                         =     = 1.25
                                              1998 – 1994 4
                       Intercept = a = 107 units at 1994
        Thus, the trend line is :        yˆ = 107 +
        1.25x
        Since 2002 is 8 year distant from the origin (1994), therefore we have
                                       yˆ = 107 + 1.25(8) = 117
        Exponential Smoothing Methods
        Exponential smoothing is a type of moving-average forecasting technique which weighs
        past data in an exponential manner so that the most recent data carries more weight in the
        moving average. Simple exponential smoothing makes no explicit adjustment for trend
        effects whereas adjusted exponential smoothing does take trend effect into account (see
        next section for details).
        Simple Exponential Smoothing
                                                      227
Caritas Business College             BUSINESS STATISTICS – OAD 221
        With simple exponential smoothing, the forecast is made up of the last period forecast plus
        a portion of the difference between the last period’s actual demand and the last period’s
        forecast.
                        Ft = Ft-1 +  (Dt-1 – Ft-1) = (1-)Ft-1+ Dt-1           …(7.1)
        Where Ft = current period forecast
                        Ft-1 = last period forecast
                         = a weight called smoothing constant (0  
                        1) Dt-1 = last period actual demand
        From Eqn. (7.1), we may notice that each forecast is simply the previous forecast plus
        some correction for demand in the last period. If demand was above the last period
        forecast the correction will be positive, and if below it will negative.
        When smoothing constant  is low, more weight is given to past data, and when it is high,
        more weight is given to recent data. When  is equal to 0.9, then 99.99 per cent of the
        forecast value is determined by the four most recent demands. When  is as low as 0.1,
        only
        34.39 per cent of the average is due to these last 4 periods and the smoothing effect
        is equivalent to a 19-period arithmetic moving average.
        If  were assigned a value as high as 1, each forecast would reflect total adjustment to the
        recent demand and the forecast would simply be last period’s actual demand, that is, Ft =
        1.0Dt-1. Since demand fluctuations are typically random, the value of  is generally kept in
        the range of 0.005 to 0.30 in order to ‘smooth’ the forecast. The exact value depends upon
        the response to demand that is best for the individual firm.
        The following table helps illustrate this concept. For example, when  = 0.5, we can see
        that the new forecast is based on demand in the last three or four periods. When  = 0.1,
        the forecast places little weight on recent demand and takes a 19-period arithmetic moving
        average.
                                                          Weight Assigned
                                                                  to
        Smoothin              Most             nd
                                             2 Most            3rd Most      4th Most   5th Most
        g                    Recent           Recent            Recent        Recent     Recent
        Constant             Period           Period            Period        Period     Period
                               ()            (1-)            (1-)2      (1-)3    (1-)4
           = 0.1           0.1            0.09            0.081          0.073          0.066
                       0.5            0.25            0.125          0.063          0.031
                                                     228
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                                      Forecast errors
                                        MAD =
                                                                 n
        where Standard deviation  = 1.25 MAD
        The exponential smoothing method also facilities continuous updating of the estimate
        of MAD. The current MADt is given by
        MADt = Actual values- Forecasted values+ (1-) MADt-1
        Higher values of smoothing constant  make the current MAD more responsive to
        current forecast errors.
        Example 7.7: A firm uses simple exponential smoothing with  =0.1 to forecast demand.
        The forecast for the week of February 1 was 500 units whereas actual demand turned out
        to be 450 units.
        (a) Forecast the demand for the week of February 8.
        (b) Assume the actual demand during the week of February 8 turned out to be 505 units.
        Forecast the demand for the week of February 15. Continue forecasting through March
        15, assuming that subsequent demands were actually 516, 488, 467, 554 and 510 units.
        Solution: Given Ft-1 = 500, D t-1 = 450, and  = 0.1
        (a) Ft = F t-1 – (Dt-1 - Ft-1) = 500 + 0.1(450-500) = 495 units
        (b) Forecast of demand for the week of February 15 is shown in Table 7.5
                                                       229
Caritas Business College             BUSINESS STATISTICS – OAD 221
Where the data can be coded so that x = 0, two terms in three equations drop
        Coding is easily done with time-series data. For coding the data, we choose
        the centre of the time period as x = 0 and have an equal number of plus and minus
        periods on each side of the trend line which sum to zero.
        Alternately, we can also find the values of constants a and b for any regression
        line as:
                                  xy 
                       b=
                       nxy                    and a = y  bx
                              x 2  n(x) 2
                                                      230
Caritas Business College             BUSINESS STATISTICS – OAD 221
        Example 7.9: Below are given the figures of production (in thousand quintals) of
        a sugar factory:
        Year                         :        1992       1993    1994          1995        1996   1997    1998
        Production                   :         80         90      92            83          94     99      92
                                  x         28             y       630
                           x                    4, y                   90
                                    n         7             n        7
a = y  bx  90  2(4)  82
yˆ  a  bx  82  2x
        The slope b = 2 indicates that over the past 7 years, the production of sugar had
        an average growth of about 2 thousand quintals per year.
                                                          231
Caritas Business College                       BUSINESS STATISTICS – OAD 221
100
95
                           Producti
                                      90
85
80
                                      75
                                       1992       1993   1994     1995   1996   1997   1998
                                                                 Years
        (b) Plotting points on the graph paper, we get an actual graph representing
        production of sugar over the past 7 years. Join the point a = 82 and b = 2
        (corresponds to 1993) on the graph we get a trend line as shown in Fig. 7.4.
yˆ = a + bx + cx2
y = na + bx + cx2
                                                                232
Caritas Business College              BUSINESS STATISTICS – OAD 221
        When the data can be coded so that x = 0 and x3 = 0, two term in the above
        expressions drop out and we have
                        y = na + cx2
xy = bx2
        To find the exact estimated value of the variable y, the values of constants a, b,
        and c need to be calculated. The values of these constants can be calculated
        by using the following shortest method:
             yc                  xy
                 2
               x             ;b                  n  x2 y   x2 
        a=                                 c
                 n            and                 y n  x4  ( x2
                                                    2
                               x2                )
        Example 7.10: The prices of a commodity during 1999-2004 are given below.
        Fit a parabola to these data. Estimate the price of the commodity for the year
        2005.
                Year                      Price                  Year              Price
                1999                      100                    2002              140
                2000                      107                    2003              181
                2001                      128                    2004              192
                                                         233
Caritas Business College           BUSINESS STATISTICS – OAD 221
        Solving eqns. (iv) and (v) for b and c we get b =18.04 and c = 1.78.
        Substituting values of b and c in eqn. (i), we get a = 126.68.
                                                   234
Caritas Business College           BUSINESS STATISTICS – OAD 221
                        600
                        500
                  Yea   400
                        300
                        200
                        100
                           0
                           1994   1995    1996    1997    1998     1999
                                           Price (Rs.)
Fig. 7.5
        The characteristics property of this law is that the rate of growth, that is, the rate
        of change of y with respect to x is proportional to the values of the function. The
        following function has this property.
y = abcx, a > 0
        To assume that the law of growth will continue is usually unwarranted, so only
        short range predictions can be made with any considerable degree or reliability.
                                                 235
Caritas Business College           BUSINESS STATISTICS – OAD 221
If we take logarithms (with base 10) of both sides of the above equation, we obtain
        For b =10, log b =1, but for b=e, log b =0.4343 (approx.). In either case, this
        equation is of the form y = c + dx
        Equation (7.2) represents a straight line. A method of fitting an exponential trend line
        to a set of observed values of y is to fit a straight trend line to the logarithms of the
        y-values.
        In order to find out the values of constants a and b in the exponential function,
        the two normal equations to be solved are
         log y = n log a + log bx
        Coding is easily done with time-series data by simply designating the center of
        the time period as x =0, and have equal number of plus and minus period on
        each side which sum to zero.
        Example 7.11: The sales (Rs. In million) of a company for the years 1995 to 1999
        are:
        Year :             1995         1996              1997       1998           1999
        Sales :             1.6          4.5              13.8       40.2           125.0
        Find the exponential trend for the given data and estimate the sales for
        2002.
                                                    236
Caritas Business College              BUSINESS STATISTICS – OAD 221
        Solution: The computational time can be reduced by coding the data. For this
        consider u = x-3. The necessary computations are shown in Table 7.8.
10 5.6983 4.7366
                                          1                  1
                                log a =        log y            (5.6983) = 1.1397
                                =                            5
                                          n
                                               u log   4.7366
                                    log b =      y    =        = 0.4737
                                               u2
                                                                   1
           (i)     Shift the origin, simply by adding or subtracting the desired number of
                   periods from independent variable x in the original forecasting equation.
                                                        237
Caritas Business College            BUSINESS STATISTICS – OAD 221
           (ii)    Change the time units from annual values to monthly values by
                   dividing independent variable x by 12.
           (iii)   Change the y        units from annual to monthly values, the entire right-
                   hand side of the equation must be divided by 12.
        Solution: (a) Shifting of origin can be done by adding the desired number of
        period 5(=1997-1992) to x in the given equation. That is
                      Linear trend :         a  b
                                        yˆ      x
                                            12 24
                                                   238
Caritas Business College                BUSINESS STATISTICS – OAD 221
                         Parabolic trend :            yˆ  x               x2
                                                         a                           1
                                                         1                           7
                                                         2                          2
                                                                      c              8
                                                      
                                                          144
        But if data are given as monthly averages per year, then value of ‘a’ remains
        unchanged ‘b’ is divided by 12 and ‘c’ by 144.
        If the time series data are in terms of annual figures, the seasonal variations
        are absent. These variations are likely to be present in data recorded on quarterly
        or monthly or weekly or daily or hourly basis. As discussed earlier, the seasonal
        variations are of periodic in nature with period less than or equal to one year.
        These       variations   reflect        the       annual      repetitive   pattern   of the economic or
        business activity of any society. The main objectives of measuring seasonal
        variations are:
                                                                    239
Caritas Business College            BUSINESS STATISTICS – OAD 221
        This method is used when the time series variable consists of only the seasonal
        and random components. The effect of taking average of data corresponding to the
        same period (say 1st quarter of each year) is to eliminate the effect of random
        component and thus, the resulting averages consist of only seasonal component.
        These averages are then converted into seasonal indices, as explained in the
        following examples.
Example 7.13.
        Assuming that trend and cyclical variations are absent compute the seasonal
        index for each month of the following data of sales (in Rs. ‘000) of a company.
        Year   Jan Feb        Mar    Apr   May Jun       Jul    Aug   Sep Oct     Nov     Dec
        1987    46     45     44     46    45     47     46     43    40    40    41      45
        1988    45     44     43     46    46     45     47     42    43    42    43      44
        1989    42     41     40     44    45     45     46     43    41    40    42      45
        Solution
                                           Calculation Table
        Yea    Jan Feb      Mar     Apr    May    Jun    Jul    Aug   Se    Oct   Nov     Dec
         r                                                             p
        198     46    45      44     46     45    47     46     43    40    40    41       45
         7
        198     45    44      43     46     46    45     47     42    43    42    43       44
         8
        198     42    41      40     44     45    45     46     43    41    40    42       45
         9
        Tot    133    130   127     136    136    137    139    128   124   122   126     134
         al
         At    44.3 43.3      42.   45.3   45.3   45.7   46.3   42.   41.   40.   42.     44.7
                               3                                 7     3     7     0
        S.l.   101. 99.1      96.   103.   103.   104.   105.   97.   94.   93.   96.     102.
                4              8     7      7      6      9      7     5     1     1       3
        In the above table, A denotes the average and S.I the seasonal index for a
        particular month of various years. To calculate the seasonal index, we
                                                  240
Caritas Business College            BUSINESS STATISTICS – OAD 221
        Remarks: The total equal to 1200, in case of monthly indices and 400, in
        case of quarterly indices, indicate that the ups and downs in the time series, due
        to seasons, neutralise themselves within that year. It is because of this that the
        annual data are free from seasonal component.
Example 7.14
        Compute the seasonal index from the following data by the method of simple
        averages.
         Year     Quarter     Y       Year     Quarter           Y          Year     Quarter         Y
         1980        I       106      1982        I              90         1984         I           80
                     II      124                  II            112                     II          104
                    III      104                 III            101                    III           95
                    IV        90                 IV              85                    IV            83
         1981        I        84       1983       I              76         1985         I          104
                     II      114                  II             94                     II          112
                    III      107                 III             91                    III          102
                    IV        88                 IV              76                    IV            84
Solution
                                                    241
Caritas Business College                      BUSINESS STATISTICS – OAD 221
              Ai
                                      93.67               114.49            104.07           87.77
              100
              G
                                         384.33
        We have G                                96.08 . Further, since the sum of terms in the last
                At                        4
                                4
        row of the table is 400, no adjustment is needed. These terms are the seasonal
        indices of respective quarters.
        This method is used when cyclical variations are absent from the data, i.e.
        the        time     series    variable      Y    consists   of   trend,   seasonal   and   random
        components.
              (i)         Obtain the trend values for each month or quarter, etc. by the method
                          of least squares.
              (ii)        Divide the original values by the corresponding trend values. This would
                          eliminate trend values from the data. To get figures in percentages, the
                          quotients are multiplied by 100.
                                                             242
Caritas Business College                 BUSINESS STATISTICS – OAD 221
Example 7.15
        Assuming that the trend is linear, calculate seasonal indices by the ratio to moving
        average method from the following data:
             Year                    I                   II              III                IV
             1982                   65                 58                56                 61
             1983                   68                 63                63                 67
             1984                   70                 59                56                 52
             1985                   60                 55                51                 58
Solution
        By adding the values of all the quarters                  of a year, we can obtain annual
        output for each of the four years. Fit a linear trend to the data and obtain
        trend values for each quarter.
                                                   962                       72
        From the above table, we get          a                 and b            3.6
                                                                            20
                                              240.5
                                                   4
Thus, the trend line is Y=240.5 – 3.6X, Origin: Ist January 1984, unit of X:6 months.
                                                         243
Caritas Business College               BUSINESS STATISTICS – OAD 221
                           1
        Y=60.13-0.45(X+
                               ) = 59.9-0.45X, origin I-quarter, unit of X=1 quarter.
                           2
             Year                 I                    II              III           IV
             1982              63.50                 63.05           62.50          62.15
             1983              61.70                 61.25           60.80          60.35
             1984              59.90                 59.45           59.00          58.55
             1985              58.10                 57.65           57.20          56.75
                                                        Y
        The table of Ratio to Trend Values, i.e.             100
                                                            T
             Year                 I                    II              III           IV
             1982              102.36                91.99           89.46          98.15
             1983              110.21               102.86          103.62         111.02
             1984              116.86                99.24           94.92          88.81
             1985              103.27                95.40           89.16         102.20
             Total             432.70               389.49          377.16         400.18
           Average             108.18                97.37           94.29         100.05
                S.I.           108.20                97.40           94.32         100.08
Example 7.16.
Find seasonal variations by the ratio to trend method, from the following
data:
                                                       244
Caritas Business College   BUSINESS STATISTICS – OAD 221
Solution
                                       245
Caritas Business College              BUSINESS STATISTICS – OAD 221
                    1120                    480
         Now a =            224 and b =           48
5 10
                                                     224
         The quarterly trend equation is Y=                 48 X=56+3X, origin: Ist July 1997,
                                                         4   16
        unit of X = 1 quarter.
        Y = 56 + 3 (X+
        1                      ) = 57.5 + 3X
             Year                     I                     II          III          IV
            1995                    27.5                   30.5       33.5          36.5
            1996                    39.5                   42.5       45.5          48.5
            1997                    51.5                   54.5       57.5          60.5
            1998                    63.5                   66.5       69.5          72.5
            1999                    75.5                   78.5       81.5          84.5
             Year                     I                     II          III          IV
            1995                    109.1                131.1        107.5         93.2
            1996                    86.1                 122.4        109.9         90.7
                                                         246
Caritas Business College    BUSINESS STATISTICS – OAD 221
                                        247
Caritas Business College              BUSINESS STATISTICS – OAD 221
                                                403.12
        Note that the Grand Average G=                    100.78. Also check that the sum of
                                                    4
        indices is 400.
        The ratio to moving average is the most commonly used method of measuring
        seasonal variations. This method assumes the presence of all the four
        components of a time series. Various steps in the computation of seasonal
        indices are as follows:
           (i)         Compute the moving averages with period equal to the period of
                       seasonal variations. This would eliminate the seasonal component and
                       minimise the effect of random component. The resulting moving
                       averages would consist of trend, cyclical and random components.
           (ii)        The original values, for each quarter (or month) are divided by the
                       respective moving average figures and the ratio is expressed as a
                                                    248
Caritas Business College           BUSINESS STATISTICS – OAD 221
Example 7.17
        Given the following quarterly sale figures, in thousand of rupees, for the year
        1996-1999, find the specific seasonal indices by the method of moving
        averages.
             Year                  I                 II            III          IV
             1996                  34                33            34           37
             1997                  37                35            37           39
             1998                  39                37            38           40
             1999                  42                41            42           44
Solution
                                                   249
Caritas Business College            BUSINESS STATISTICS – OAD 221
             Year                   I                    II                III                IV
             1996                   -                     -             97.4                104.2
             1997                 102.5                95.1             99.2                103.2
             1998                 102.4                96.4             97.7                100.5
             1999                 102.9                98.1                -                  -
             Total                307.8                289.6            294.3               307.9
              At                  102.6                96.5             98.1                102.6
             S.I.                 102.7                96.5             98.1                102.7
                                                399.8
        Note that the Grand Average G=                  =99.95. Also check that the sum of
                                                   4
        indices is 400.
        This method assumes that all the four components of a time series are present
        and, therefore, widely used for measuring seasonal                 variations. However, the
        seasonal     variations   are   not   completely      eliminated     if   the    cycles of these
        variations are not of regular nature. Further, some information is always lost at the
        ends of the time series.
        This method is based on the assumption that the trend is linear and cyclical
        variations are of uniform pattern. As discussed in earlier chapter, the link relatives
        are percentages of the current period (quarter             or   month)          as compared with
        previous period. With the computation of link relatives and their average, the effect
        of cyclical and random component is minimised. Further, the trend gets eliminated
        in the process of adjustment of chained relatives. The following steps are involved in
        the computation of seasonal indices by this method:
        (i) Compute the link relative (L.R.) of each period by dividing the figure of that
        period with the figure of previous period. For example, link relative of
                           figure of 3rd quarter
        3rd quarter =                                   100
                           figure of 2nd quarter
                                                    250
Caritas Business College                 BUSINESS STATISTICS – OAD 221
        (ii) Obtain the average of link relatives of a given quarter (or month) of various
        years. A.M. or Md can be used for this purpose. Theoretically, the later is
        preferable because the former gives undue importance to extreme items.
        (iii) These averages are converted into chained relatives by assuming the chained
        relative of the first quarter (or month) equal to 100. The chained relative (C.R.) for
        the current period (quarter or month)
        (iv) Compute the C.R. of first quarter (or month) on the basis of the last
        quarter (or month). This is given by
                     C.R. of the last quarter (or month) × L.R. of 1st quarter (or month)
                 =
                                                      100
        This value, in general, be different from 100 due to long term trend in the data.
        The chained relatives, obtained above, are to be adjusted for the effect of this
        trend. The adjustment factor is
        d=
        1        [New C.R. for Ist quarter-100] for quarterly data
        and d = 1
                            [New C.R. for Ist month –100] for monthly data.
                       12
On the assumption that the trend is linear, d, 2d, 3d, etc. is respectively
subtracted from the 2nd, 3rd, 4th, etc., quarter (or month).
        (vi) Make sure that the sum of these indices is 400 for quarterly data and 1200
        for monthly data.
Example 7.18
                                                       251
Caritas Business College         BUSINESS STATISTICS – OAD 221
Determine the seasonal indices from the following data by the method of link relatives:
         Solution
                                          Calculation Table
               Year              I              II                 III            IV
               2000              -             73.1               78.9           66.7
               2001           360.0            80.5               79.3           95.7
               2002           181.8            62.5               80.0           75.0
               2003           306.7            56.5               76.9           90.0
               2004           233.3            66.7               85.7           87.5
        The chained
                Total relative 1081.8
                               (C.R.) of the Ist quarter on the400.8
                                              339.3                             of the 4th quarter
                                                                 basis of C. R.414.0
          270 Mean
                 45.2
        =               122.3 270.5           67.9             80.2            83.0
                      100
                C.R.           100.0           67.9             54.5            45.2
                                        1
        The
         C.R.trend adjustment factor
               (adjusted)      100.0d = (122.3    100)  5.6 43.3
                                               62.3                             28.4
                                            4
                S.I.          170.9           106.5               74.0           48.6
        Thus, the adjusted C.R. of 1st quarter = 100
                                                252
Caritas Business College            BUSINESS STATISTICS – OAD 221
                                                  Adjusted C.R. ×
        The seasonal index of a quarter =
                                                                100 G
        Merits and Demerits
        This method is less complicated than the ratio to moving average and the ratio
        to trend methods. However, this method is based upon the assumption of a
        linear trend, which may not always hold true.
Deseasonalisation of Data
        It may be pointed out here that the deseasonalization of a data is done under
        the assumption that the pattern of seasonal variations, computed on the basis of
        past data, is similar to the pattern of seasonal variations in the year of
        deseasonalization.
        Example 7.19
        Deseasonalise the following data on the sales of a company during various
        months of 1990 by using their respective seasonal indices. Also interpret the
        deseasonalised values.
          Month          Sales             S.I.         Month         Sales            S.I.
                       (Rs. ‘000)                                   (Rs. ‘000)
        Jan           16.5           109          Jul              36.5          85
        Feb           21.3           105          Aug              44.4          88
        Mar           27.1           108          Sep              54.9          98
        Apr           31.0           102          Oct              62.0          102
        May           35.5           100          Nov              67.6          104
                                                  253
Caritas Business College           BUSINESS STATISTICS – OAD 221
Solution
        Let Y denote monthly sales and DS denote the deseasonalised sales. Then,
        we can write
                              DS  Y
                                       100
                                   S.I
        The deseasonalised figures of sales for each month represent the monthly sales
        that would have been in the absence of seasonal variations.
        As mentioned earlier that a typical time-series has four components: secular trend
        (T), seasonal variation (S), cyclical variation (C), and irregular variation (I). In a
        multiplicative time-series model, these components are written as:
Y=T×C×S×I
        The deseasonalization data can be adjusted for trend analysis these by the
        corresponding trend and seasonal variation values. Thus we are left with only
        cyclical (C) and irregular (I) variations in the data set as shown below:
                              Y           T×C×S×I
                                     =                        =C×I
                           T×S                  T×S
                                                    254
Caritas Business College           BUSINESS STATISTICS – OAD 221
        Alternately, trend (T), seasonal (S), and cyclical (C) components of the given time-
        series are estimated and then the residual is taken as the irregular variation. Thus,
        in the case of multiplicative time-series model, we have
                             Y             T×C×S×I
                                      =                       = I
                           T×C×S              T×C×S
7.9. QUESTIONS
                                                   255
Caritas Business College                BUSINESS STATISTICS – OAD 221
        2. What is measured by a moving average? Why are 4-quarter and 12- month
              moving averages used to develop a seasonal index?
        6. Apply the method of link relatives to the following data and calculate seasonal
              indexes.
                                                        256
Caritas Business College              BUSINESS STATISTICS – OAD 221
        12.      Explain briefly the additive and multiplicative models of time series. Which of these
                 models is more popular in practice and why?
        13.      A company that manufactures steel observed the production of steel (in
                 metric tonnes) represented by the time-series:
        Year                      :    1990    1991    1992     1993  1994    1995    1996
        Production in steel       :     60       72     75       65     80     85      95
        (a) Find the linear equation that describes the trend in the production of steel by
        the company.
        (b) Estimate the production of steel in 1997.
        14.      The sales (Rs. In lakh) of a company for the years 1990 to 1996 are
                 given below:
        Year                          :     1990     1991     1992     1993     1994     1995     1996
        Sales                         :      32       47       65       88       132     190       275
        Find trend values by using the equation Yc = abx and estimate the value for
        1997.
        15.      A company that specializes in the production of petrol filters has
                 recorded the following production (in 1000 units) over the last 7 years.
        Year                         :    1994   1995     1996     1997     1998    1999          2000
        Production                   :     42     49       62       75       92      122           158
        (a) Develop a second degree estimating equation that best describes these data.
        (b) Estimate the production in 2004.
3. R.P. Hooda: Statistic for Business and Economic, McMillan India Ltd.
                                                       257
Caritas Business College             BUSINESS STATISTICS – OAD 221
 8.1    INTRODUCTION
        Life is full of uncertainties. ‘Probably’, ‘likely’, ‘possibly’, ‘chance’ etc. is some of the most
commonly used terms in our day-to-day conversation. All these terms more or less
convey the same sense - “the situation under consideration is uncertain and
commenting on the
                                                       258
Caritas Business College             BUSINESS STATISTICS – OAD 221
formal and precise expressions for the uncertainties involved. For example, product
demand is uncertain but study of demand spelled out in a form amenable for analysis may
go a long to help analyze, and facilitate decisions on sales planning and inventory
management. Intuitively, we see that if there is a high chance of a high demand in the
coming year, we may decide to stock more. We may also take some decisions regarding the
price increase, reducing sales expenses etc. to manage the demand. However, in order to
make such decisions, we need to quantify the chances of different quantities of demand in
the coming year. Probability theory provides us with the ways and means to quantify the
event.
Since uncertainty is an integral part of human life, people have always been interested -
Having its origin associated with gamblers, the theory of probability today is an
indispensable tool in the analysis of situations involving uncertainty. It forms the basis for
inferential statistics as well as for other fields that require quantitative assessments of
chance occurrences, such as quality control, management decision analysis, and almost all
happening. In order that we are able to compute it, a proper understanding of certain basic
                                                     259
Caritas Business College             BUSINESS STATISTICS – OAD 221
       8.2.1   EXPERIMENT
               An experiment is a process that leads to one of several possible outcomes. An
The term experiment is used in probability theory in a much broader sense than in physics or
chemistry. Any action, whether it is the drawing a card out of a deck of 52 cards, or reading
launching of a new product in the market, constitute an experiment in the probability theory
terminology.
For example, the product we are measuring may turn out to be undersize or right size or
oversize, and we are not certain which way it will be when we measure it. Similarly,
launching a new product involves uncertain outcome of meeting with a success or failure in
the market.
So each outcome is visualized as a sample point in the sample space. The sample spaces
                                                    260
Caritas Business College              BUSINESS STATISTICS – OAD 221
       8.2.3   EVENT
        An event, in probability theory, constitutes one or more possible outcomes of an experiment.
that the event occurs if the experiment gives rise to a basic outcome
For the experiment of drawing a card, we may obtain different events A, B, and C like:
ace
In the first case, out of the 52 sample points that constitute the sample space, only one
sample point or outcome defines the event, whereas the number of outcomes used in the
mainly to cater to the three different types of situations under which probability measures are
normally required. We will study these approaches with the help of examples of distinct types
of experiments.
Consider the following situations marked by three distinct types of experiments. The events
that we are interested in, within these experiments, are also given.
        Situation I
        Experiment      :              Drawing a Card Out of a Deck of 52
        Cards Event A:                 On any draw, a king is there
        Situation II
        Experiment      :              Administering a Taste Test for a New
        Soup Event B :                 A consumer likes the taste
        Situation III
                                                      261
Caritas Business College              BUSINESS STATISTICS – OAD 221
The first situation is characterized by the fact that for a given experiment we have a sample
space with equally likely basic outcomes. When a card is drawn out of a well-shuffled deck,
every one of the cards (the basic outcomes) is as likely to occur as any other. This type of
situations, marked by the presence of "equally likely" outcomes, gave rise to the Classical
defined as the relative size of the event with respect to the size of the sample space. Since
there are 4 kings and there are 52 cards, the size of A is 4 and the size of the sample
space is
The rule we use in computing probabilities, assuming equal likelihood of all basic outcomes,
is as follows:
P(A) = n ( A ) …………(8-1)
                                        N(S
                                        )
If we try to apply the classical definition of probability in the second experiment, we find that
we cannot say that consumers will equally like the taste of the soup. Moreover, we do not
know as to how many persons have been tested. This implies that we should have the
past data on people who were administered the soup and the number that liked the taste. In
the absence of past data, we have to undertake an experiment, where we administer the
                                                       262
Caritas Business College               BUSINESS STATISTICS – OAD 221
The Relative Frequency Approach is used to compute probability in such cases. As per
this approach, the probability of occurrence of an event is given by the observed relative
frequency of an event in a very large number of trials. In other words, the probability of
occurrence of an event is the ratio of the number of times the event occurs to the total
It is appreciated in this approach that, in order to take such a measure, we should have the
soup tested for a large number of people. In other words, the total number of trials in the
The third situation seems apparently similar to the second one. We may be tempted here to
apply the Relative Frequency Approach. We may calculate the probability of the event that
the venture is a success as the ratio of number of successful ventures to the total number of
such ventures undertaken i.e. the relative frequency of successes will be a measure of the
probability.
In practice, a solar power plant being a relatively new development involving the latest
technology, past experiences are not available. Experimentation is also ruled out because of
high cost and time involved, unlike the taste testing situation. In such cases, the only way out
is the Subjective Approach to probability. In this approach, we try to assess the probability
from our own experiences. We may bring in any information to assess this. In the situation
                                                             263
Caritas Business College             BUSINESS STATISTICS – OAD 221
cited, we may, perhaps, look into the performance of the commissioning authority in other
Therefore the Subjective Approach involves personal judgment, information, intuition, and
recovery and an expert assessing the probability of success of a merger offer are both
making a personal judgment based upon what they know and feel about the situation. The
area of subjective probability - which is relatively new, having been first developed in the
1930s - is somewhat controversial. One person's subjective probability may very well be
different from another person's subjective probability of the same event. We may note here
that since the assessment is a purely subjective one, it will vary from person to person and,
situations. So these approaches are not contradictory to one another. In fact, these
complement each other in the sense that where one fails, the other becomes applicable.
These are identical inasmuch as probability is defined as a ratio or a weight assigned to the
approach, the first two approaches - Classical and Relative Frequency - provide an objective
We can bring out the commonality between the Classical Approach and the Relative
Frequency Approach with the help of an example. Let us assume that we are interested in
finding out the chances of getting a head in the toss of a coin. By now, you would have come
up with the answer by the Classical Approach, using the argument, that there are two
outcomes, heads and tails, which are equally likely. Hence, given that a head can occur only
once, the probability is ½ : Consider the following alternative line of argument, where the
                                                     264
Caritas Business College              BUSINESS STATISTICS – OAD 221
probability can be estimated using the Relative Frequency Approach. If we toss the coin for
a sufficiently large number of times and note down the number of times the head occurs, the
proportion of times that a head occurs will give us the required probability.
Thus, given our definition of the approaches, we find both the arguments to be valid. This
brings out, in a way, the commonality between the Relative Frequency and the Classical
Approach. The difference, however, is that the probability computed by using the Relative
apriori that the chances are ½ , based on our assumption of "equally likely" outcomes.
Example 8-1
A fair coin is tossed twice. Find the probabilities of the following events:
Solution: Being a Two-Trial Coin Tossing Experiment, it gives rise to the following On = 2n
HH HT TH TT
                                                       265
Caritas Business College             BUSINESS STATISTICS – OAD 221
We can use the Classical Approach to find out the required probabilities.
n(A) = 1 { HH }
P(A) = n ( A )
                                                 N(S)
                                             =
                                                 1
n(B) = 2 { HT, TH }
P(B) = n ( B )
                                                 N(S)
                                             =
                                                 2
                                              1
                                             =2
P(C) = n ( A )
                                                 N(S)
                                             =
                                                 4
=1
                                                        266
Caritas Business College   BUSINESS STATISTICS – OAD 221
n(D) = 0
                                       267
Caritas Business College              BUSINESS STATISTICS – OAD 221
P(D) = n ( D )
                                                  N(S)
                                             =
                                                  0
=0
Example 8-2
A newspaper boy wants to find out the chances that on any day he will be able to sell more
than 90 copies of The Times of India. From his dairy where he recorded the daily sales of
the last year, he finds out that out of 365 days, on 75 days he had sold 80 copies, on 144
days he had sold 85 copies, on 62 days he had sold 95 copies and on 84 days he had sold
100 copies of The Times of India. Find out the required probability for the newspaper boy.
Thus, the number of days when his sales were more than 90 = (62 + 84) days = 146 days
= 146
365
= 0.4
                                                            268
Caritas Business College              BUSINESS STATISTICS – OAD 221
               (b)     The probability of occurrence of one or the other of all possible events is
                       equal to one. As S denotes the sample space or the set of all possible
                       events, we write
                              P(S) = 1...........................................................(8-4)
If two or more events together define the total sample space, the events are said to be
collectively exhaustive.
Given the above axioms, we may now define probability as a function, which assigns
                                                             269
Caritas Business College              BUSINESS STATISTICS – OAD 221
We have also seen that 0 and 1, both values inclusive, sets the range of values that the
When an event cannot occur (impossible event), its probability is zero. The probability of the
empty set is zero: P(Φ) = 0. In a deck where half the cards are red and half are black, the
probability of drawing a green card is zero because the set corresponding to that event is the
Events that are certain to occur have probability 1.00. The probability of the entire sample
space S is equal to 1.00: P(S) = 1.00. If we draw a card out of a deck, 1 of the 52 cards in
the deck will certainly be drawn, and so the probability of the sample space, the set of all 52
Within the range of values 0 to 1, the greater the probability, the more confidence we have in
the occurrence of the event in question. A probability of 0.95 implies a very high confidence
in the occurrence of the event. A probability of 0.80 implies a high confidence. When the
probability is 0.5, the event is as likely to occur as it is not to occur. When the probability is
0.2, the event is not very likely to occur. When we assign a probability of 0.05, we believe
the event is unlikely to occur, and so on. Figure 8-2 is an informal aid in interpreting
probability.
                  267
Caritas Business College              BUSINESS STATISTICS – OAD 221
Note that probability is a measure that goes from 0 to 1. In everyday conversation we often
describe probability in less formal terms. For example, people sometimes talk about odds. If
        the odds are 1 to 1, the probability         1 i .e . ; if the odds are 1 to 2, the probability is
        is                                          1  1
                                                     1      2
          1
             i .e . ; and so on. Also, people sometimes say, "The probability is 80 percent."
         1  1
          2       3
were relatively simple, so that direct application of the definition of probability could be used
for computation. Quite often, we are interested in the probability of occurrence of more
complex events. Consider for example, that you want to find the probability that a king or a
club will occur in a draw from a deck of 52 cards. Similarly, on examining couples with two
children, if one child is known as a boy, you may be interested in the probability of the event
of both the children being boys. These two situations, we find, are not as simple as those
discussed in the earlier section. As a sequel to the theoretical development in the field of
probability, certain results are available which help us in computing probabilities in such
Theorem) allows us to write the probability of the union of two events in terms of the
Consider two events A and B defined over the sample space S, as shone in Figure 8-3
                                                          2
Caritas Business College               BUSINESS STATISTICS – OAD 221
        We may
        define
                                P(A  B)         n(A  B)
                                =
                                                   N(S)
                                                n(A)                   n(A  B
                                            =         n(B        
                                                       )
                                                N(S)   N(S              N(S)
                                                       )
The probability of the intersection of two events P ( A  B  is called their joint probability.
The meaning of this rule is very simple and intuitive: When we add the probabilities of A and
measuring the relative size of A within the sample space and once when doing this with B.
Since the relative size, or probability, of the intersection of the two sets is counted twice, we
subtract it once so that we are left with the true probability of the union of the two events.
The rule of unions is especially useful when we do not have the sample space for the union
Example 8-3
                                                       269
Caritas Business College             BUSINESS STATISTICS – OAD 221
A card is drawn from a well-shuffled pack of playing cards. Find the probability that the card
                                                     270
Caritas Business College              BUSINESS STATISTICS – OAD 221
Solution: Let A be the event that a club is drawn and B the event that a king is drawn. Then,
= 16/52
= 4/13
Example 8-4
Suppose your chance of being offered a certain job is 0.45, your probability of getting
another job is 0.55, and your probability of being offered both jobs is 0.30. What is the
probability that you will be offered at least one of the two jobs?
Solution: Let A be the event that the first job is offered and B the event that the second job is
offered. Then,
= 0.70
                                                       271
Caritas Business College              BUSINESS STATISTICS – OAD 221
For mutually exclusive events, the probability of the intersection of the events is zero. This is
so because the intersection of the events is the empty set, and we know that the probability
                        P(A  B 
                                                                     …………(8.6)
                        0
This fact gives us a special rule for unions of mutually exclusive events. Since the probability
        of the intersection of the two events is zero, there is no need to           P(A  B       when
        subtract                                                                     
This is not really a new rule since we can always use the rule of unions for the union of two
events: If the events happen to be mutually exclusive, we subtract zero as the probability of
the intersection.
Example 8-5
A card is drawn from a well-shuffled pack of playing cards. Find the probability that the card
Solution: Let A be the event that a king is drawn and B the event that a queen is drawn.
= 4/52 + 4/52
= 8/52
= 2/13
We can extend the Rule of Unions to three (or more) events. Let A, B, and C be the
three events defined over the sample space S, as shown in Figure 8-5
P(A  B  C)=
When the three events are mutually exclusive (see Figure 8-6), the Rule of Unions is
Example 8-6
A card is drawn from a well-shuffled pack of playing cards. Find the probability that the card
drawn is
Solution: (a) Let A be the event that a heart is drawn, B the event that an honour is drawn
                                                          273
Caritas Business College      BUSINESS STATISTICS – OAD 221
        and    n(A     B  C 1
               
                                          274
Caritas Business College               BUSINESS STATISTICS – OAD 221
= 28/52
= 7/13
(b) Let A be the event that an ace is drawn, B the event that a king is drawn and C
Since A, B and C are mutually exclusive events, the required probability (using Eq. (8.9) is
= 12/52
= 3/13
of the probability of the original event. Consider event A defined over the sample space S.
The
complement of set A, denoted by A , is a subset, which contains all outcomes, which do not
so P(A + A ) = P(S)
or P(A) + P( A ) = 1
or P( A ) = 1 - P(A)..............................................(8.10)
                                                            275
Caritas Business College                BUSINESS STATISTICS – OAD 221
Eq. (8.10) is our Rule of Complements. As a simple example, if the probability of rain
tomorrow is 0.3, then the probability of no rain tomorrow must be 1 - 0.3 = 0.7. If the
probability of drawing a king is 4/52, then the probability of the drawn card's not being a
Example 8-7
Find the probability of the event of getting a total of less than 12 in the experiment of
The event of getting a total of less than 12 is the complement of A, so the required
probability is
P( A ) = 1 - P(A)
P( A ) = 1 – 1/36
P( A ) = 35/36
where the probability of an event A is influenced by the information that another event B has
occurred. Thus, the probability we would give the event "Xerox stock price will go up
tomorrow" depends on what we know about the company and its performance; the
probability is conditional upon our information set. If we know much about the company, we
may assign a different probability to the event than if we know little about the company. We
may define the probability of event A conditional upon the occurrence of event B. In this
example, event A may be the event that the stock will go up tomorrow, and event B may be
                                                       276
Caritas Business College              BUSINESS STATISTICS – OAD 221
Consider two events A and B defined over the sample space S, as shown in Figure 8-8
                        P(A/ B)
                                     n(A  B)
n(B)
                                       n(A B)
                        P(A / B)                      N
                                          n(B)
                                                  N
Therefore, the probability of event A given the occurrence of event B is defined as the
Example 8-8
(a) of the event of getting a total of 9, given that the die has shown up points
(b) of the event of getting points between 4 and 6 (both inclusive), given that a total
Solution: Let getting a total 9 be the event A and the die showing points between 4 and
                                                          277
Caritas Business College                 BUSINESS STATISTICS – OAD 221
        and P ( A
                       B ) = {(4,5) (5,4)}
            
        (a)                     P(A  B)
                 P(A/ B)
                 
                                    P(B)
                P(A/ B)         2 / 36
                
                                9 / 36
                P(A/ B)         2
                
                                9
                                P(B  A
        (b)     P(B / A)           )
                
                                    P(A)
                P(B / A)
                                2 / 36
                
                                4 / 36
                P(B / A)
                               1
                P(A/ B)
                               P(A  B)
P(B)
                                                          278
Caritas Business College             BUSINESS STATISTICS – OAD 221
               P(A  B)
                               P ( A / B ). P ( B )
                     
                                                      279
Caritas Business College              BUSINESS STATISTICS – OAD 221
The Product Rule states that the probability that both A and B will occur simultaneously is
equal to the probability that B (or A) will occur multiplied by the conditional probability that A
(or B) will occur, when it is known that B (or A) is certain to occur or has already
occurred.
Example 8-9
A box contains 10 balls out of which 2 are green, 5 are red and 3 are black. If two balls are
drawn at random, one after the other without replacement, from the box. Find the
probabilities that:
(d) the first ball is red and the second one is black
(e) the first ball is green and the second one is red
        Solution:     (a)         P (G 1  G 2 ) P (G      / G 1 ). P (G 1 )
                                                2
                                            1    2
                                           9 x 10
                                            1
                                           45
                         P ( B1
        (b)                       B 2 )  P ( B 2 / B1 ). P ( B1 )
                                            2    3
                                           9 x 10
                                             1
                                           15
        (c)              P ( R1    R2 )    P ( R / R1 ). P ( R1 )
                                         2
                                            4    5
                                           9 x 10
                                            2
                                           9
        (d)              P ( R1           B ) 
                                            2
                                                        280
Caritas Business College          BUSINESS STATISTICS – OAD 221
P(B2
               / R1 ). P ( R1 )
                                              281
Caritas Business College                 BUSINESS STATISTICS – OAD 221
                                              3    5
                                             9 x 10
                                              1
                                             6
        (e)             P (G 1       R2)      P ( R / G 1 ). P (G 1 )
                                            2
                                              5    2
                                             9 x 10
                                              1
                                             9
Example 8-10
A consulting firm is bidding for two jobs, one with each of two large multinational
corporations. The company executives estimate that the probability of obtaining the
consulting job with firm A, event A, is 0.45. The executives also feel that if the company
should get the job with firm A, then there is a 0.90 probability that firm B will also give the
company the consulting job. What are the company's chances of getting both jobs?
Solution: We are given P(A) = 0.45. We also know that P(B / A) = 0.90, and we are looking
        for P ( A
                      B ) , which is the probability that both A and B will occur.
            
        So      P(A       B)       P ( B / A ). P ( A )
                         
                P(A  B)
                                   0 .90 x 0 .45
                                  0 .405
                               
                                                   Independent Events
        Two events are said to be independent of each other if the occurrence or non-occurrence
of one event in any trial does not affect the occurrence of the other event in any trial.
Events A and B are independent of each other if and only if the following three conditions
                                                           282
Caritas Business College          BUSINESS STATISTICS – OAD 221
                P(B / A)
                            P(B                          …………(8.13b)
                
                            )
                                              283
Caritas Business College               BUSINESS STATISTICS – OAD 221
The first two equations have a clear, intuitive appeal. The top equation says that when A and
B are independent of each other, then the probability of A stays the same even when we
know that B has occurred - it is a simple way of saying that knowledge of B tells us nothing
about A when the two events are independent. Similarly, when A and B are independent,
then knowledge that A has occurred gives us absolutely no information about B and its
likelihood of occurring.
The third equation, however, is the most useful in applications. It tells us that when A and B
are independent (and only when they are independent), we can obtain the probability of the
joint occurrence of A and B (i.e. the probability of their intersection) simply by multiplying the
two separate probabilities. This rule is thus called the Product Rule for Independent
Events.
As an example of independent events, consider the following: Suppose I roll a single die.
What is the probability that the number 5 will turn up? The answer is 1/6. Now suppose that I
told you that I just tossed a coin and it turned up heads. What is now the probability that the
die will show the number 5? The answer is unchanged, 1/6, because events of the die and
the
        coin are independent of each other. We see              P (6 / H )   P (6 ) , which is the first rule
        that                                                    
above.
The rules for union and intersection of two independent events can be extended to
Intersection Rule
The probability of the intersection of several independent events A1, A2, ……is just the
        P ( A1    A2       A3 )    P ( A1 ). P ( A2 ). P ( A3                   …………(8.15)
                                ).........
Union Rule
                                                        284
Caritas Business College                   BUSINESS STATISTICS – OAD 221
The probability of the union of several independent events A1, A2, ……is given by the
following equation
        P ( A1        A2     A3 ..........)   1                                           …………(8.16)
                                                  P ( A1 ). P ( A2 ). P ( A3
                                                     ).........
The union of several events is the event that at least one of the events happens.
Example 8-11
A problem in mathematics is given to five students A, B,C, D and E. Their chances of solving
it are 1/2, 1/3, 1/3, 1/4 and 1/5 respectively. Find the probability that the problem will
(b) be solved
Solution: (a) The problem will not be solved when none of the students solve it. So the
(b) The problem will be solved when at least one of the students solve it. So the
                    P(A  B  C  D  E) 1                                      
                                              P ( A ). P ( B ). P (C ). P ( D ). P ( E )
                                        1  2 / 15
                                        13 / 15
situations. Quite often, whether it is in our personal life or our work life, decision-making is an
ongoing process. Consider for example, a seller of winter garments, who is interested in
the demand of the product. In deciding on the amount he should stock for this winter, he
                                                             285
Caritas Business College   BUSINESS STATISTICS – OAD 221
has
                                       286
Caritas Business College              BUSINESS STATISTICS – OAD 221
computed the probability of selling different quantities and has noted that the chance
of selling a large quantity is very high. Accordingly, he has taken the decision to stock a
large quantity of the product. Suppose, when finally the winter comes and the season ends,
he discovers that he is left with a large quantity of stock. Assuming that he is in this
business, he feels that the earlier probability calculation should be updated given the new
experience to help him decide on the stock for the next winter.
Similar to the situation of the seller of winter garment, situations exist where we are
interested in an event on an ongoing basis. Every time some new information is available,
we do revise our odds mentally. This revision of probability with added information is
formalised in probability theory with the help of famous Bayes' Theorem. The theorem,
discovered in 1761 by the English clergyman Thomas Bayes, has had a profound impact on
the development of statistics and is responsible for the emergence of a new philosophy of
science. Bayes himself is said to have been unsure of his extraordinary result, which was
presented to the Royal Society by a friend in 1763 - after Bayes' death. We will first
understand The Law of Total Probability, which is helpful for derivation of Bayes' Theorem.
Consider two events A and B. Whatever may be the relation between the two events, we can
always say that the probability of A is equal to the probability of the intersection of A and B,
        or      P(A)       P ( A / B ). P ( B )                     …………(8.17)
                                              P ( A / B ). P ( B
                                                )
The sets B and B form a partition of the sample space. A partition of a space is the
division of the sample space into a set of events that are mutually exclusive (disjoint sets)
and cover
the whole space. Whatever event B may be, either B or B must occur, but not both. Figure
                                                        287
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                                      288
Caritas Business College               BUSINESS STATISTICS – OAD 221
The law of total probability may be extended to more complex situations, where the sample
space X is partitioned into more than two events. Say, we have partition of the space into a
collection of n sets B1, B2,………Bn .The law of total probability in this situation is:
               P(A) 
                             P(A
                             n
                                        Bi )
                            
                           i
                           
                           1
        or      P(A)
                            P ( A / B i ). P ( B i
                             n                                        …………(8.18)
                            )
                           i
                           
                           1
Figure 8-10 shows the partition of a sample space into five events B 1, B2, B3, B4 and B5 ; and
We can demonstrate the rule with a more specific example. Let us define A as the event that
an honour card is drawn out of a deck of 52 cards (the honour cards are the aces, kings,
queens, jacks and 10). Letting H, C, D, and S denote the events that the card drawn is a
heart, club, diamond, or spade, respectively, we find that the probability of an honour card is:
                                                       289
Caritas Business College                    BUSINESS STATISTICS – OAD 221
= 20/52
= 5/13
which is what we know the probability of an honour card to be just by counting 20 honour
cards out of a total of 52 cards in the deck. The situation is shown in Figure 8-11.
As can be seen from the figure, the event A is the set addition of the intersections of A
Example 8-12
A market analyst believes that the stock market has a 0.70 probability of going up in the next
year if the economy should do well, and a 0.20 probability of going up if the economy should
not do well during the year. The analyst believes that there is a 0.80 probability that the
economy will do well in the coming year. What is the probability that stock market will go up
next year?
Solution: Let U be the event that the stock market will go and W is the event that the
Then
            P (U )
                    P (U / W ). P (W ) P (U
                                                / W ). P (W )
                  ( 0 .70 )( 0 .80 )  ( 0 .20 )( 0 .20 )
                  0 .56  0 .04
        .         0 .60
BAYES’ THEOREM
We will now develop the Bayes’ theorem. Bayes' theorem is easily derived from the law of
                                                         290
Caritas Business College                BUSINESS STATISTICS – OAD 221
                 P(B / A)       P(B  A
                                  )                                   …………(8.19)
P(A)
                 P(B 
                           A) ) P ( A     B)     P ( A / B ). P ( B         …………(8.20)
                                              )
                 P(B / A)       P ( A / B ). P ( B
                                       )                                   …………(8.21)
P(A)
                 P ( A )  P ( A / B ). P ( B )               
                                               P ( A / B ). P ( B )
Substituting this expression for P(A) in the denominator of Eq.(8.21), we have the Bayes’
theorem
        P(B / A)                   P ( A / B ). P ( B
                                                                           …………(8.22)
                                   )
                        P ( A / B ). P ( B )  P ( A / B ). P ( B )
Thus the theorem allows us to reverse the conditionality of events: we can obtain the
As we see from the theorem, the probability of B given A is obtained from the probabilities
The probabilities P(B) and P( B ) are called prior probabilities of the events B and B ; the
Bayes'
theorem in terms of B and A, thus giving the posterior probability of B , P( B /A). Bayes'
theorem may be viewed as a means of transforming our prior probability of an event B into a
The Bayes' theorem can be extended to a partition of more than two sets. This is done by
using the law of total probability involving a partition in sets B 1, B2, ……… Bn .The resulting
                                                     292
Caritas Business College                BUSINESS STATISTICS – OAD 221
        P(                  P ( A / B i ). P ( B i
        Bi     / A)                                                          …………(8.23)
                           )
                           n
                          P ( A / B i ). P ( B i )
                         i
                         
                         1
The theorem gives the probability of one of the sets in the partition Bi, given the occurrence
of event A.
Example 8-13
An Economist believes that during periods of high economic growth, the Indian Rupee
appreciates with probability 0.40; and during periods of low economic growth, the Rupee
appreciates with probability 0.20.During any period of time the probability of high economic
growth is 0.30; the probability of moderate economic growth is 0.50 and the probability of
low economic growth is 0.20. Suppose the Rupee value has been appreciating during the
present period. What is the probability that we are experiencing the period of (a) high, (b)
Solution: Our partition consists of three events: high economic growth (event H), moderate
economic growth (event M) and low economic growth (event L). The prior probabilities of
Let A be the event that the rupee appreciates. We have the conditional probabilities
By using the Bayes’ theorem we can find out the required probabilities
                                                           293
Caritas Business College                  BUSINESS STATISTICS – OAD 221
                P(H                                           P ( A / H ). P ( H )
                       / A)       P ( A / H ). P ( H )                                      P ( A / L ). P ( L )
                                                             P ( A / M ). P ( M )
                                                             
                                                     ( 0 .70 )( 0 .30 )
                              ( 0 .70 )( 0 .30 )  ( 0 .40 )( 0 .50 )  ( 0 .20 )( 0 .20 )
                               0 .467
                                                              P ( A / M ). P ( M )
                P(M / A)           P ( A / H ). P ( H )                                     P ( A / L ). P ( L )
                                                              P ( A / M ). P ( M )
                                   
                                                               
                                                      ( 0 .40 )( 0 .50 )
                               ( 0 .70 )( 0 .30 )  ( 0 .40 )( 0 .50 )  ( 0 .20 )( 0 .20 )
                                0 .444
                P(L / A)                                     P ( A / L ). P ( L )
                                P ( A / H ). P ( H )        P ( A / M ). P ( M )         P ( A / L ). P ( L )
                                                            
                                                    ( 0 .20 )( 0 .20 )
                             ( 0 .70 )( 0 .30 )  ( 0 .40 )( 0 .50 )  ( 0 .20 )( 0 .20 )
                              0 .089
Suppose that a bank has two branches, each branch has two departments, and each
department has four employees. Then there are (2)(2)(4) choices of employees, and the
We may view the choice as done sequentially: First a branch is randomly chosen, then a
department within the branch, and then the employee within the department. This is
                                                                286
Caritas Business College                   BUSINESS STATISTICS – OAD 221
We denote n factorial by n!. The number n! is the number of ways in which n objects can be
ordered. By definition, 0! = 1.
For example, 5! is the number of possible arrangements of five objects. We have 5! = (5) (4)
(3) (2) (1) = 120. Suppose that five applications arrive at a center on the same day, all
written at different times. What is the probability that they will be read in the order in which
they were written? Since there are 120 ways to order five applications, the probability of a
particular order (the order in which the applications were written) is 1/120.
Permutations are the possible ordered selections of r objects out of a total of n objects. The
                         n            n!
                             P                                                  …………(8.26)
                             r
                                   (n  r
                             )!
Suppose that 4 people are to be randomly chosen out of 10 people who agreed to be
interviewed in a market survey. The four people are to be assigned to four interviewers. How
many possibilities are there? The first interviewer has 10 choices, the second 9 choices, the
third 8, and the fourth 7. Thus, there are (10)(9)(8)(7) = 5,040 selections. We can see that
this
( n  r )!
                                             288
Caritas Business College                 BUSINESS STATISTICS – OAD 221
Combinations are the possible selections of r items from a group of n items regardless of the
                                                                         n
        order of selection. The number of combinations is denoted by         C r and is read n choose r.
                        n               n!
                            Cr                                    …………(8.27)
                                   r! ( n  r )!
Suppose that 3 out of the 10 members of the board of directors of a large corporation are
selections
        are there? Using Eq. (8.27), we find that the number of combinations is   n
                                                                                                 n!
                                                                                      C r  r! ( n  r )! =
10!/(3!7!) = 120.
If the committee is chosen in a truly random fashion, what is the probability that the three-
committee members chosen will be the three senior board members? This is 1 combination
2. What are different approaches to the definition of probability? Are these approaches
contradictory to one another? Which of these approaches you will apply for
(c) Mr. Bhupinder S. Hooda will win the assembly election from Kiloi.
                                                     289
Caritas Business College            BUSINESS STATISTICS – OAD 221
page will be less than 10. From his past experience he finds that out of 3600 pages
he has proofed, 200 pages contained no errors, 1200 pages contained 5 errors, and
2200 pages contained 11 or more errors. Can you help him in finding the required
probability?
5. Explain the concept of conditional probability with the help of a suitable example.
7. State the Bayes’ Theoram of probability. Using an appropriate example, develop the
(a) In how many ways we can select three players out of 12 players of the
9. What is the probability that a non leap year, selected at random, will contain
                                                     290
Caritas Business College            BUSINESS STATISTICS – OAD 221
10. A card is drawn at random from well shuffled deck of 52 cards, find the probability
that
11. From a well-shuffled deck of 52 cards, two cards are drawn at random.
(a) If the cards are drawn simultaneously, find the probability that these
consists of (i) both clubs, (ii) a king and a queen, (iii) a face card and a 8.
(b) If the cards are drawn one after the other with replacement. Find the
probability that these consists of (i) both clubs, (ii) a king and a queen, (iii) a
12. A problem in mathematics is given to four students A, B,C, and D their chances of
solving it are 1/2 , 1/3, 1/4 and 1/5 respectively. Find the probability that the problem
will
                      (a)     be solved
                      (b)     not be solved
13. The odds that A speaks the truth are 3:2 and the odds that B does so are 7:3. In what
14. Among the sales staff engaged by a company 60% are males. In terms of
their professional qualifications, 70% of males and 50% of females have a degree in
marketing. Find the probability that a sales person selected at random will be
                                                      291
Caritas Business College                BUSINESS STATISTICS – OAD 221
15. A and B play for a prize of Rs. 10,000. A is to throw a die first and is to win if he
throw again and to win if he throws 3, 2 or 1: and so on. Find their respective
expectations.
16. A factory has three units A, B, and C. Unit A produces 50% of its products, and units
B and C each produces 25% of the products. The percentage of defective items
selected at random from the total production of the factory is found defective, what is
                                                         292
Caritas Business College             BUSINESS STATISTICS – OAD 221
        Objectives: The overall objective of this lesson is to discuss the concept of random
                       variable and discrete probability distributions. After              successful
                       completion of the lesson the students will be able to appreciate the
                       usefulness of probability distributions in decision-making and also
                       identify situations where Binomial and Poisson probability distributions
                       can be applied.
        Structure
        9.1    Introduction
        9.2    Discrete Probability Distribution
        9.3    Bernoulli Random Variable
        9.4    The Binomial Distribution
        9.5    The Poisson Distribution
        9.6    Self-Assessment Questions
        9.7    Suggested Readings
 9.1    INTRODUCTION
        In many situations, our interest does not lie in the outcomes of an experiment as such;
we may find it more useful to describe a particular property or attribute of the outcomes of an
experiment in numerical terms. For example, out of three births; our interest may be in
the
                                                     293
Caritas Business College              BUSINESS STATISTICS – OAD 221
matter of the probabilities of the number of boys. Consider the sample space of 8 equally
Now look at the variable “the number of boys out of three births”. This number varies
among sample points in the sample space and can take values 0,1,2,3, and it is random –
given to chance.
A random variable has a probability law - a rule that assigns probabilities to different values
of the random variable. This probability law - the probability assignment is called the
probability distribution of the random variable. We usually denote the random variable by
the introduction of the lesson, is a discrete random variable; so it will have a discrete
sample space. We can see the correspondence of sample points with the values of the
                                                      294
Caritas Business College              BUSINESS STATISTICS – OAD 221
The correspondence between sample points and the value of the random variable allows us to
The above probability statement constitute the probability distribution of the random variable
X = number of boys in three births. We may appreciate how this probability law is obtained
simply by associating values of X with sets in the sample space. (For example, the set GGB,
GBG, BGG leads to X = 1). We may write down the probability distribution of X in table
format (see Table 9-1) or we may plot it graphically by means of probability Histogram (see
Table 9-1 Probability Distribution of the Number of Boys out of Three Births
                                                        295
Caritas Business College                          BUSINESS STATISTICS – OAD 221
                                     (1, 0.375)
                P(
                                                                                  P(
                                                  (2, 0.375)
X X
Figure 9-1 Probability Distribution of the Number of Boys out of Three Births
The probability distribution of a discrete random variable X must satisfy the following two
conditions:
               2.                  PX  x  1
                                 all x
These conditions must hold because the P(X = x) values are probabilities. First condition
specifies that all probabilities must be greater than or equal to zero, as we know from Lesson
8.
For the second condition, we note that for each value x, P(x) = P(X = x) is the probability of
the event that the random variable equals x. Since by definition all x means all the values the
random variable X may take, and since X may take on only one value at a time, the
occurrences of these values are mutually exclusive events, and one of them must take
place. Therefore, the sum of all the probabilities P(X = x) must be 1.00.
The probability distribution of a discrete random variable lists the probabilities of occurrence
of the random variable. That is, we may be interested in the probability that the value of
the
                                                                            296
Caritas Business College              BUSINESS STATISTICS – OAD 221
random variable is at most some value x. This is the sum of all the probabilities of the values i
The cumulative distribution function (also called cumulative probability function) F(X =
               F(X = x) = P(X  x) =
                                          Pi
                                        all i  x
For example, to find the probability of at most two boys out of three births, we have
               F(X = 2) = P(X  2) =
                                          Pi
                                        all i  2
= 7/8
The expected value of a discrete random variable X is equal to the sum of all values of
                = E(X) =  x.Px
                          all x
In the same way we can calculate the other summary measures viz. skewness, kurtosis
and moments.
X : 0 1 2 3
                                                       297
Caritas Business College              BUSINESS STATISTICS – OAD 221
Now imagine this experiment is repeated 200 times, we may expect ‘no head’ and ‘three
heads’ will each occur 25 times; ‘one head’ and ‘two heads’ each will occur 75 times. Since
these results are what we expect on the basis of theory, the resultant distribution is called a
However, when the experiment is actually performed 200 times, the results, which we may
actually obtain, will normally differ from the theoretically expected results. It is quite possible
that in actual experiment ‘no head’ and ‘three heads’ may occur 20 and 28 times respectively
and ‘one head’ and ‘two heads’ may occur 66 and 86 times respectively. The distribution so
In practice, however, assessing the probability of every possible value of a random variable
through actual experiment can be difficult, even impossible, especially when the probabilities
are very small. But we may be able to find out what type of random variable the one at
hand is by examining the causes that make it random. Knowing the type, we can often
approximate the random variable to a standard one for which convenient formulae are
available.
The proper identification of experiments with certain known processes in Probability theory
can help us in writing down the probability distribution function. Two such processes are the
Bernoulli Process and the Poisson Process. The standard discrete probability
distributions that are consequent to these processes are the Binomial and the Poisson
distribution. We will now look into the conditions that characterize these processes, and
examine the standard distributions associated with the processes. This will enable us to
Let us first study the Bernoulli random variable, named so in honor of the mathematician
Jakob Bernoulli (1654-1705). It is the building block for other random variables and the
                                                        298
Caritas Business College              BUSINESS STATISTICS – OAD 221
that it does not always produce a good pin. Rather, it has a probability p of producing a good
pin and (1 - p) of producing a defective one. Let us denote a good pin as “success” and a
Just after the operator produces one pin, it is inspected; let X denote the "number of good
Now analyzing the trial- “inspecting a pin” and our random variable X-“number of
 The trial-“inspecting a pin” has only two possible outcomes, which are mutually
exclusive. Such a trial, whose outcome can only be either a success or a failure, is a
S = {success, failure}
 The random variable, X, that measures number of successes in one Bernoulli trial, is a
X : 0 1
P(X) : p 1-p
X ~ BER (p)
Where ~ is read as “is distributed as” and BER stands for Bernoulli.
A Bernoulli random variable is too simple to be of immediate practical use. But it forms the
building block of the Binomial random variable, which is quite useful in practice. The
binomial random variable in turn is the basis for many other useful cases, such as Poisson
random variable.
                                                       299
Caritas Business College              BUSINESS STATISTICS – OAD 221
X1, X2 ………, Xn. Here, identically means that they all have the same p, and independently
means that the value of one X does not in any way affect the value of another. For example,
the value of X2 does not affect the value of X3 or X8 and so on. Such a sequence of
Suppose an operator produces n pins, one by one, on a lathe that has probability p of
making a good pin at each trial, the sequence of numbers (1 or 0) denoting the good and
defective pins produced in each of the n trials is a Bernoulli process. For example, in the
001011001
the third, fifth, sixth and ninth are good pins, or successes. The rest are failures.
In practice, we are usually interested in the total number of good pins rather than the
sequence of 1's and 0's. In the example above, four out of nine are good. In the general
case, let X denote the total number of good pins produced in n trials. We then have
X = X1 + X2 +………+ Xn
Variable.
We may appreciate that the condition to be satisfied for a binomial random variable is that
                                                       300
Caritas Business College              BUSINESS STATISTICS – OAD 221
Any uncertain situation or experiment that is marked by the following three properties is
 There are only two mutually exclusive and collectively exhaustive outcomes in the
Distribution
Now we will develop the distribution of our Binomial random variable. To describe the
X ~ B (n, p)
Let us analyze the probability that the number of successes X in the n trials is exactly x
(obviously number of failures are n-x) i.e. X = x and x = 0,1,2,…………. n; as n trials are
Now we know that there are nCx ways of getting x successes out of n trials. We also observe
that each of these nCx possibilities has px(1-p)n-x probability of occurrence corresponding to x
This equation is the Binomial probability formula. If we denote the probability of failure as q
                                                        301
Caritas Business College                BUSINESS STATISTICS – OAD 221
We may write down the Binomial probability distribution in table format (see Table 9-2)
                                            XX=x                 P(X = x)
                                                                n     0 n
                                                 0               C0 p q
                                                                n     1 n-1
                                             1                    C1 p q
                                             …                        …
                                                                n       x n-x
                                             x                    Cx p q
                                             …                       …
                                             …                       …
                                                                 n
                                             n                     Cn pn q0
Each of the term for x = 0,1,2,………, n correspond to the Binomial expansion of (p + q)n
                E (X) =  = n x.P(x)
                            x0
=np
2. Variance
                                          2
                               =  (x
                                   n   ) .P(x)
                                 x0
        The rth moment about the origin denoted by m0 , of a Binomial distribution is computed as:
                                                           r
                           0     r
                        mr = n x .P(x)
                                 x0
                                                         302
Caritas Business College               BUSINESS STATISTICS – OAD 221
                           0
                        m1 = n x.P(x)
                                x0
= np
=
                           0     2
                        m2 = n x .P(x)
                                x0
= n(n-1)p2 + np
        The rth moment about the mean denoted by m  , of a binomial distribution is computed as:
                                                      r
                           
                        mr = n (x   ) .P(x)
                                        r
x0
                                      1
                        m1 = 
                             n (x   ) .P(x)
                                x0
=0
= npq
= 2
= npq(q-p)
                                                     303
Caritas Business College                     BUSINESS STATISTICS – OAD 221
                                         n
                            
                           =  (x   ) 4 .P(x)
                         m4 x0
= 3(npq)2 + npq(1-6pq)
5. Skewness
To bring out the skewness of a Binomial distribution we can calculate, moment coefficient of
skewness, 1
1 = 1
                                (m )2
                        =       3
                                (m
                                  2
                                    )3
                                 
                             m
                        =         3      3
                            m  
                                     2
                            npq(q  p)
                        =            3
                               npq 
                             q p
                        =
                              npq
        Evaluating 1
                            q p
        =                        we note:
                             npq
 the Binomial distribution is skewed to the right i.e. has positive skewness when 1 >
 the Binomial distribution is skewed to the left i.e. has negative skewness when 1
so when p = q
Thus, n being the same, the degree of skewness in a Binomial distribution tends to vanish as
p approaches ½ i.e. as p ½
                                                         304
Caritas Business College                  BUSINESS STATISTICS – OAD 221
 for a given value of p, as n increases the Binomial distribution moves to the right,
5. Kurtosis
kurtosis 2
2 = 2 – 3
                                
                              m4
                        =         3
                               
                              2
                            1  6 pq
                       =       npq
                            16
        Evaluating 2 =                we note
        pq
                             npq
 the Binomial distribution is leptokurtic when 2 > 0, which is so when 6pq <1.
 the Binomial distribution is platykurtic when 2 < 0, which is so when 6pq >1.
If n is large and if neither of p or q is too close to zero, the Binomial distribution can be
                        Z = X  np
                                   npq
                                                         305
Caritas Business College               BUSINESS STATISTICS – OAD 221
n  ∝ and p  0
Example 9-1
Assuming the probability of male birth as ½, find the probability distribution of number of
(b) Out of 960 families with 5 children each find the expected number of families with (i)
Solution: Let the random variable X measures the number of boys out of 5 births. Clearly
X ~ B (5, ½)
X=x : 0 1 2 3 4 5
= 1- 1/32
= 31/32
= 26/32
                                                        306
Caritas Business College                BUSINESS STATISTICS – OAD 221
(b) Out of 960 families with 5 children, the expected number of families with
= 960 * 31/32
= 930
= 960 * 26/32
= 720
(1781- 1840). If a random variable X is said to follow a Poisson Distribution, then its
probability
distribution is given by
                             
                            e            x = 0,1,2,………
                               x
                 P(X = x) = 
                              x!
 the constant probability of success p, for each trial is infinitely small i.e. p  0
(obviously q  1)
 np =  is finite
We can develop the Poisson probability rule from the Binomial probability rule under the
above conditions.
                                                           307
Caritas Business College                  BUSINESS STATISTICS – OAD 221
Let us consider a Bernoulli process with n trials and probability of success in any trial
               
        p =        , where   0. Then, we know that the probability of x successes in n trials is given
               n
        by
                                   x         n x
               P (X = x)                     
                                = nCx   1  
                                      n  n
                                     n!     x
                                                          n x
                                =               1 
                                                        
                                    x!(n  x)! n   n 
tending to 1
                      
        and 1          e  if n  ∝
                     n
                    
                   n
        Thus we
        have
                                       
                                      e
                        P(X = x) =              x = 0, 1, 2,………
                                         x
                                       
                                        x!
                                                      308
Caritas Business College              BUSINESS STATISTICS – OAD 221
Thus, we have seen that to describe the distribution of Poisson random variable we need
If X ~ POI ()
                                                       309
Caritas Business College              BUSINESS STATISTICS – OAD 221
                                
               Then P(X = x) = e               x = 0, 1, 2,………
                                  x
                                
                                 x!
We may write down the Poisson probability distribution in table format (see Table 9-3)
Poisson distribution may be expected in situations where the chance of occurrence of any
event is small, and we are interested in the occurrence of the event and not in its non-
occurrence. For example, number of road accidents, number of defective items, number of
deaths in flood or because of snakebite or because of a rare disease etc. In these situations,
we know about the occurrence of an event although its probability is very small, but we do
not know how many times it does not occur. For instance, we can say that two road
accidents took place today, but it is almost impossible to say as to how many times, accident
fails to take place. The reason is that the number of trials is very large here and the nature of
event is of rare type. The Poisson random variable X, counts the number of times a rare
               E (X) =  =  x.P(x)
                           all x
                                                      310
Caritas Business College                  BUSINESS STATISTICS – OAD 221
2. Variance
                               =  (x   ) 2 .P(x)
                                 all x
2 = 
        The rth moments about the origin denoted by m0 , of a Poisson distribution is computed as:
                                                          r
                           0
                        mr =  x .P(x)
                                r
all x
                           0
                        m1 =  x.P(x)
                                  all x
=
                           0
                        m2 =  x .P(x)
                                2
all x
= + 2
        The rth moments about the mean denoted by m  , of a Poisson distribution is computed as:
                                                         r
                           
                        mr =  (x   ) .P(x)
                                       r
all x
                                                      311
Caritas Business College                    BUSINESS STATISTICS – OAD 221
                           
                       m1 =  (x   ) .P(x)
                                      1
all x
=0
                                     2
                       m2 =  (x   ) .P(x)
                                 all x
=2
=
                           
                       m3 =  (x   ) .P(x)
                                      3
all x
=
                                     4
                       m4 =  (x   ) .P(x)
                                 all x
= 32+
5. Skewness
1 = 1
                               (m )2
                      =        3
                               (m
                                 2
                                   )3
                                
                           m     3
                       =                3
                           m  
                                    2
                           1
                      =
                           
                                                        312
Caritas Business College                   BUSINESS STATISTICS – OAD 221
        Evaluating 1           1
        =                           we note that Poisson distribution is always skewed to the right i.e. has
                            
6. Kurtosis
kurtosis 2
2 = 2 – 3
                                  
                                m4
                        =           3
                                 
                                2
                                1
                        =
                            
        Evaluating 2       1
                                    we note that the Poisson distribution is leptokurtic.
        =
                            
n  ∝ and p  0
Example 9-2
At a parking place the average number of car-arrivals during a specified period of 15 minutes
is 2. If the arrival process is well described by a Poisson process, find the probability that
                                                            313
Caritas Business College              BUSINESS STATISTICS – OAD 221
Solution: Let X denote the number of cars arrivals during the specified period of 15
                                                                 2
        (a)    P(no car will arrive) = P(X = 0)              = e
                                                                 0
                                                                2
0!
= 0.1353
                                                        2   2 1
                                                = 1-[ e    +e    2 ]
                                                         0    1!
                                                       2
                                                        0!
= 1-[0.1353 + 0.2707]
= 1 – 0.4060
= 0.5940
                                                             2
                                                    3    e
                                                 =    x
                                                x 0 2
x!
= 0.8571
                                                = P(X  3) - P(X = 0)
                                                         x 0
                                                                                       2     x
                                                     3                             e      2
                                                  =                                    x!
                                                             314
Caritas Business College   BUSINESS STATISTICS – OAD 221
       2 0                            -        0!
   e     2
= 0.8571 –0.1353
= 0.7218
                                       315
Caritas Business College             BUSINESS STATISTICS – OAD 221
Explain this statement. Also develop and generalize Binomial probability rule with
4. Under what condition can the Poisson distribution approximate Binomial distribution?
Develop the Poisson probability rule from the Binomial probability rule under these
conditions.
5. List some of the important areas where Poisson distribution is used. Also state
Out of 200 samples of 4 items, find the expected number of samples with (a), (b), and
(c) above
service on transplantation. Find the mean, standard deviation and moment coefficient
seedlings.
                                                      316
Caritas Business College                BUSINESS STATISTICS – OAD 221
8. If the sum of mean and variance of a binomial distribution of 5 trials is 9/5, find the
binomial distribution.
9. The mean and variance of a binomial distribution are 2 and 1.5 respectively. Find
the probability of
10. 150 random samples of 4 units each are inspected for number of defective item.
Number of Samples : 28 62 46 10 4
11. The probability that a particular injection will have reaction to an individual is
0.002. Find the probability that out of 1000 individuals (a) no, (b) 1, (c) at least 1, and
12. In a razor blades manufacturing factory, there is small chance of 1/500 for any blade
to be defective. The blades are supplied in packets of 10. Find the approximate
number of packets containing (a) no, (b) 1, and (c) 2 defective blades in a
13. If P(x = 1) = P(x = 2), for a distribution of Poisson random variable X. Find the
                                                      317
Caritas Business College             BUSINESS STATISTICS – OAD 221
           1. Statistics (Theory & Practice) by Dr. B.N. Gupta. Sahitya Bhawan Publishers and
               Distributors (P) Ltd., Agra.
           2. Statistics for Management by G.C. Beri. Tata McGraw Hills Publishing Company
               Ltd., New Delhi.
           3. Business Statistics by Amir D. Aczel and J. Sounderpandian. Tata McGraw Hill
               Publishing Company Ltd., New Delhi.
           4. Statistics for Business and Economics by R.P. Hooda. MacMillan India Ltd., New
               Delhi.
           5. Business Statistics by S.P. Gupta and M.P. Gupta. Sultan Chand and Sons.,
               New Delhi.
           6. Statistical Method by S.P. Gupta. Sultan Chand and Sons., New Delhi.
           7. Statistics for Management by Richard I. Levin and David S. Rubin. Prentice Hall of
               India Pvt. Ltd., New Delhi.
           8. Statistics for Business and Economics by Kohlar Heinz. Harper Collins., New York.
                                                  318
Caritas Business College             BUSINESS STATISTICS – OAD 221
        Objectives: The overall objective of the present lesson is to overview the concept of
                       continuous random variable and Normal distribution. After successful
                       completion of the lesson the students will be able to appreciate the
                       usefulness of normal distribution in decision-making and also identify
                       situations where normal probability distribution can be applied.
        Structure
        10.1   Introduction
        10.2   Continuous Probability Distribution
        10.3   The Normal Distribution
        10.4   The Standard Normal Distribution
        10.5   The Transformation of Normal Random Variables
        10.6   Self-Assessment Questions
        10.7   Suggested Readings
 10.1   INTRODUCTION
        We have learnt that a probability distribution is basically a convenient representation of the
different values a random variable may take, together with their respective probabilities of
occurrence. In the last lesson, we have examined situations involving discrete random
                                                      319
Caritas Business College             BUSINESS STATISTICS – OAD 221
variables and the resulting discrete probability distributions. Consider the following random
In the first case, Binomial random variable X1 could take only finite number of integer values;
0,1,2…n; whereas in the second case, Poisson random variable X2 could take an infinite
the
sense that they could be listed in a sequence, finite or infinite. In contrast to these, let us
consider a situation, where the variable of interest may take any value within a given range.
Suppose we are planning for measuring the variability of an automatic bottling process that
fills ½-liter (500 cm3) bottles with cola. The variable, say X, indicating the deviation of the
actual volume from the normal (average) volume can take any real value - positive or
negative; integer or decimal. This type of random variable, which can take an infinite number
of values in a given range, is called a continuous random variable, and the probability
and assumption inherent in the treatment of such distributions are quite different from those
used in the context of a discrete distribution. In the present lesson, after understanding the
½-liter (500cm3) bottles with cola. The random variable X indicates ‘the deviation of the
actual volume from the normal (average) volume.’ Let us, for some time, measure our
                                                      317
Caritas Business College              BUSINESS STATISTICS – OAD 221
      F
      Figure 10-1 Histograms of the Distribution of X as Measurements is refined to Smaller
        and Smaller Intervals of Volume, and the Limiting Density Function f(x)
probability of each value of X is the area of the rectangle over the value. Since the rectangle
will have the same base, the height of each rectangle is proportional to the probability. The
Volume is a continuous random variable; it can take on any value measured on an interval of
numbers. Now let us imagine the process of refining the measurement scale of X to
the nearest 1/2 cm3, the nearest 1/10 cm3… and so on. Obviously, as the process of refining
the measurement scale continues, the number of rectangles in the histogram increases and
the width of each rectangle decreases. The probability of each value is still measured by the
area of the rectangle above it, and the total area of all rectangles remains 1.00. As
continuous probability distribution. The step like surface formed by the tops of the rectangles
in the histogram tends to a smooth function. This function is denoted by f(x) and is called the
probability density function of the continuous random variable X. The density function is
the
                                                       318
Caritas Business College               BUSINESS STATISTICS – OAD 221
limit of the histograms as the number of rectangles approaches infinity and the width of each
rectangle approaches zero. The density function of the limiting continuous variable X
is shown in Figure 10-1 i.e. the values X can assume between the intervals –2.00 to –3.00
approaches infinity. The probability that X assumes a particular value (Say X = 1.5)
approaches zero. Probabilities are still measured as areas under the curve. The
probability that deviation will be between –1.50 and –1.00 is the area under f(x) between the
points x = -
The probabilities associated with a continuous random variable X are determined by the
probability density function of the random variable. The function, denoted by f(x), has the
following properties:
2. The probability that X will be between two numbers a and b is equal to the
                                            b
                       P(a < X < b) =  f ( x ).dx
                                       a
3. The total area under the entire curve of f(x) is equal to 1.00.
                                                
                       P (                       f (x          1.00
                       X                )        ).dx
                                       
                                                -
                                                
When the sample space is continuous, the probability of any single given value is zero. For a
continuous random variable, therefore, the probability of occurrence of any given value is
zero. We see this from property 2, noting that the area under a curve between a point
and itself is the area of a line, which is zero. For a continuous random variable, non-zero
                                                           319
Caritas Business College               BUSINESS STATISTICS – OAD 221
We define the cumulative distribution function F(x) for a continuous random variable similarly
to the way we defined it for a discrete random variable: F(x) is the probability that X is less
                F(x) = P(X = x) = area under f(x) between the smallest possible value of X (often -∝)
                                     and point x
                                      x
                                 =
                                      f ( x).dx
                                      -
The expected value of a continuous random variable X, denoted by E(X), and its
variance, denoted by V(X), require the use of calculus for their computation. Thus
                         E(X )  
                                      x. f ( x ).dx
                                     -
                         V(X)                      2
                                       x  E ( x)  . f ( x).dx
                                     -
It is being widely used in all data-based research in the field of agriculture, trade, business
A large number of random variables occurring in practice can be approximated to the normal
distribution.
                                                        320
Caritas Business College               BUSINESS STATISTICS – OAD 221
The lengths of pins made by an automatic machine; the times taken by an assembly worker
to complete the assigned task repeatedly; the weights of baseballs; the tensile strengths
of a batch of bolts; and the volumes of cola in a particular brand of canned cola - are good
examples of normally distributed random variables. All of these are affected by several
independent causes where the effect of each cause is small. This knowledge helps us in
calculating the probabilities of different events in varied situations, which in turn is useful for
decision-making.
In many real life situations, we face the problem of making statistical inferences about
processes based on limited data. Limited data is basically a sample from the full body of
data on the process. Irrespective of how the full body of data is distributed, it has been found
that the Normal Distribution can be used to characterize the sampling distribution of many of
the sample statistics. (we will see it in next few lessons). This helps considerably in
Statistical Inferences.
X ~ N (μ, σ2)
                                                       321
Caritas Business College              BUSINESS STATISTICS – OAD 221
In the equation e is the base of natural logarithm, equal to 2.71828.. .By substituting desired
values for and  , we can get any desired density function. For example, a
        distribution with mean 100 and standard deviation 5 will have the density function.
                                                       2
                                1        1  x 100 
                          f (x)              
                                                      -∝ < x < +∝
                                      2 5   2 5
                                           e    
This function when plotted (see Figure 10-2) will give the famous bell-shaped mesokurtic
normal curve.
Many mathematicians have worked on the mathematics behind the normal distribution and
have made many independent discoveries. In the initial stages, the normal distribution was
                                  Figure
        developed by Abraham De Moivre    10-1 Normal
                                       (1667-1754).      Curve
                                                    His work was later taken up by Pierre S
Laplace (1949-1827). But the discovery of equation for the normal density function is
attributed to Carl Friedrich Gauss (1777-1855),who did much work with the formula. In
1. The normal curve is not a single curve representing only one continuous distribution.
Obviously, it represents a family of normal curves; since for each different value of
                                                      322
Caritas Business College              BUSINESS STATISTICS – OAD 221
and  , there is a specific normal curve different in its positioning on the X-axis and
the extent of spread around the mean. Figure 10-3 shows three different normal
2. The normal curve is bell-shaped and perfectly symmetric about its mean. As a result
50% of the area lies to the right of mean and balance 50% to the left of mean. Perfect
symmetry, obviously, implies that mean, median and mode coincide in case of a
normal distribution. The normal curve gradually tapers off in height as it moves in
either direction away from the mean, and gets closer to the X-axis.
3. The normal curve has a (relative) kurtosis of 0, which means it has average
4. Theoretically, the normal curve never touches the horizontal axis and extends to
5. If several independent random variables are normally distributed, then their sum will
also be normally distributed. The mean of the sum will be the sum of all the
individual means, and by virtue of the independence, the variance of the sum will be
If X1, X2,…............Xn are independent normal variables, the their sum S will also be a
                                                      323
Caritas Business College                 BUSINESS STATISTICS – OAD 221
a and b are constants and a  0; the resultant Y variable will also be normally
If X1, X2,…............Xn are independent random variables that are normally distributed, then the
Let us see the application of this result with the help of an example.
Example 10-1
A cost accountant needs to forecast the unit cost of a product for the next year. He notes
that each unit of the product requires 10 labor hours and 5 kg of raw material. In addition,
each unit of the product is assigned an overhead cost of Rs 200. He estimates that the cost
of a labor hour next year will be normally distributed with an expected value of Rs 45 and a
standard deviation of Rs 2; the cost of raw material will be normally distributed with an
expected value of Rs 60 and a standard deviation of Rs 3. Find the distribution of the unit
Solution: Since the cost of labor L may not influence the cost of raw material M, we can
assume that the two are independent. This makes the unit cost of the product Q a random
variable. So if
                                                           324
Caritas Business College              BUSINESS STATISTICS – OAD 221
= 950
= 100(4) + 25(9)
= 625
So Q ~ N (950, 252)
+ 2.58σ is 0.99
for different values of μ and σ2. So the range probability P(a < X < b) will be different for
different normal curves. We can make use of integral calculus to compute the required
range probability
                                      b
                    P(a < X < b) =  f ( x ).dx
                                    a
It may be appreciated that we can simplify this process of computing range probabilities to a
great extent by tabulating the range probabilities. Since it is not practicable and indeed
impossible to have separate probability tables for each of the infinitely many possible normal
curves, we select one normal curve to serve as a standard. Probabilities associated with the
range of values of this standard normal random variable are tabulated. A special
                                                       325
Caritas Business College             BUSINESS STATISTICS – OAD 221
transformation then allows us to apply the tabulated probabilities to any normal random
variable. The standard normal random variable is denoted by a special name, Z (rather
We say
Z ~ N (0,12)
The probabilities associated with standard normal distribution are tabulated in two ways –
say Type I and Type II tables, as shown in Figure 10-4. Type I Tables give the area between
0 and any other z value, as shown by vertical hatched area in Figure 10-4a. The hatched
Type II Tables give the area towards the tail–end of the standard normal curve beyond the
ordinate at any particular z value. The hatched area shown in Figure 10-4b is P (Z >
z).
As the normal curve is perfectly symmetrical, the areas given by Type 1 Tables
when subtracted from 0.5 will provide the same areas as given by Type II Tables and vice-
versa.
                                                      326
Caritas Business College                 BUSINESS STATISTICS – OAD 221
We will now illustrate the use of standard normal area tables for calculating the range
probabilities. Probability of intervals is areas under the density curve (z) over the intervals
in question.
Example 10-2
Find the probability that the value of the standard normal random variable will be…
That is, we want P(0 < Z < 1.74). In Figure 10-4a, substitute 1.74 for the point z on the
graph. We are looking for the table area in the row labeled 1.7 and the column labeled 0.04.
That is, we want P(Z < -1.47). By the symmetry of the normal curve, the area to the left of -
= 0.5000 - 0.4292
= 0.0808
That is, we want P(1.3 < Z < 2). The required probability is the area under the curve between
the two points 1.3 and 2. The table gives us the area under the curve between 0 and 1.3,
and the area under the curve between 0 and 2. Areas are additive; therefore,
                                                          327
Caritas Business College               BUSINESS STATISTICS – OAD 221
= 0.4772 - 0.4032
= 0.0740
That is, we want P(-1< Z < 2). The required probability is the area under the curve between
the two points -1 and 2. The table gives us the area under the curve between 0 and 1, and
the area under the curve between 0 and 2. Areas are additive; therefore,
                              = 0.3413 + 0.4772
                              = 0.8185
In cases, where we need probabilities based on values with greater than second-decimal
accuracy, we may use a linear interpolation between two probabilities obtained from the
table.
Example 10-3
Solution: P(0 ≤ Z ≤ 1.645) is found as the midpoint between the two probabilities P(0 ≤ Z ≤
= ½[0.4495 + 0.4505]
= 0.45
In many situations, instead of finding the probability that a standard normal random variable
will be within a given interval; we may be interested in the reverse: finding an interval with a
Example 10-4
                                                        328
Caritas Business College              BUSINESS STATISTICS – OAD 221
Find a value z of the standard normal random variable such that the probability that
Solution: We look inside the table for the value closest to 0.40. The closest value we find to
0.40 is the table area 0.3997. This value corresponds to 1.28 (row 1.2 and column .08).
Example 10-5
Find the value of the standard normal random variable that cuts off an area of 0.90 to its left.
Solution: Since the area to the left of the given point z is greater than 0.50, z must be on the
right side of 0. Furthermore, the area to the left of 0 all the way to -∝ is equal to 0.50.
Therefore, TA = 0.90 - 0.50 = 0.40. We need to find the point z such that TA = 0.40.
Thus z =1.28 cuts off an area of 0.90 to the left of standard normal curve.
Example 10-6
Find a 0.99 probability interval, symmetric about 0, for the standard normal random variable.
Solution: The required area between the two z values that are equidistant from 0 on
either side is 0.99. Therefore, the area under the curve between 0 and the positive z value
is TA = 0.99/2 = 0.495. We now look in our normal probability table for the area closest to
0.495. The area 0.495 lies exactly between the two areas 0.4949 and 0.4951,
random variable may be transformed to the standard normal random variable. If we want
to
                                                        329
Caritas Business College             BUSINESS STATISTICS – OAD 221
transform X, where X ~ N (μ, σ 2), into the standard normal random variable Z ~ N (0, 12), we
                                             X 
                                        Z    
We move the distribution from its center of μ to a center of 0. This is done by subtracting μ
from all the values of X. Thus, we shift the distribution μ units back so that its new center is
0. To make the standard deviation of the distribution equal to 1, we divide the random
variable by its standard deviation σ. The area under the curve adjusts so that the total
remains the same. All probabilities (areas under the curve) adjust accordingly. Thus, the
transformation from X to Z is achieved by first subtracting μ from X and then dividing the
result by σ.
Example 10-7
If X ~ N (50, 10 2), find the probability that the value of the random variable X will be greater
than 60
        Solution
        :                               X         60  
                       P(X > 60) = P(          >
                                                   )
                                                    10
                                         
                                  = P( Z > 60  50 )
                                             10
= P( Z >1)
= 0.5000 - 0.3413
= 0.1587
Example 10-8
The weekly wage of 2000 workmen is normally distribution with mean wage of Rs 70 and
wage standard deviation of Rs 5. Estimate the number of workers whose weekly wages are
                                                         330
Caritas Business College            BUSINESS STATISTICS – OAD 221
X ~ N (70, 5 2)
                                                70         X 
         So            P(70 < X < 71) = P(               <             71  
                                                                  <            )
                                                                      
                                               70  70
                                        = P(             <Z<
                                                  5              71  70
                                                                    5    )
= 0.0793
= 2000 x 0.0793
= 159
                                                69         X 
         So            P(69 < X < 73) = P(               <             73  
                                                                  <            )
                                                                      
                                               69  70
                                        = P(             <Z<
                                                  5              73  70
                                                                    5    )
= 0.0793 + 0.2257
= 0.3050
                                                         331
Caritas Business College   BUSINESS STATISTICS – OAD 221
= 2000 x 0.3050
= 610
                                        332
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                                  X
         So                      P(X > 72) = P(               72  
                                                        >
                                                               )
                                                              
                                       = P(Z > 72  70 )
                                                  5
= 0.5 – 0.1554
= 0.3446
= 2000 x 0.3446
= 689
                                        X
        So            P( X < 65) = P(                 65  
                                                 <
                                                       )
                                                      
                                       = P( Z < 65  70 )
                                                   5
= P(Z >1.0)
= 0.5 - 0.3413
= 0.1567
= 2000 x 0.1567
= 313
                                                        333
Caritas Business College              BUSINESS STATISTICS – OAD 221
        The transformation Z  X           takes us from a random variable X with mean μ, and standard
                           
                               
deviation σ to the standard normal random variable. We also have an opposite, or inverse,
transformation, which takes us from the standard normal random variable Z to the random
variable X with mean μ and standard deviation σ. The inverse transformation is given as
X    Z
We use the inverse transformation when we want to get from a given probability, the value
Example 10-9
The amount of fuel consumed by the engines of a jetliner on a flight between two cities is a
normally distributed random variable X with mean = 5.7 tons and standard derivation  =
0.5 tons. Carrying too much fuel is inefficient as it slows the plans. If, however, too little fuel
is loaded on the plane, an emergency landing may be necessary. What should be the
amount of fuel to load so that there is 0.99 probability that the plane will arrive at its
                                     X
        or
                               P(           < z) = 0.99
                                     
= 0.5 + 0.49
So x  z
                                                           334
Caritas Business College                  BUSINESS STATISTICS – OAD 221
x  5.7  2.33x0.5
x  6.865
Therefore, the plane should be loaded with 6.865 tons of fuel to give 0.99 probability that the
Example 10-10
Monthly sale of beer at a bar is believed to be approximately normally distributed with mean
2450 units and standard 400 units. To determine the level of orders and stock, the
management wants to find two values symmetrically on either side of mean, such that the
probability that sales of beer during the month will be between the two values is
                             x1         X         x2  
        
        or              P(            <          <            ) =0.95
                                                    
                        x1                                           x2     z2
                                                              and
                               z1                                      
                                                              335
Caritas Business College                   BUSINESS STATISTICS – OAD 221
Therefore, the management may be 95% sure that sales in any given month will be
                              x1         X         x2  
        
        or               P(            <          <            ) =0.99
                                                     
So z1 = -2.58 and z2 =
                        x1                                  and       x2     z2
                               z1                                       
Therefore, the management may be 99% sure that sales in any given month will be
We can summarize the procedure of obtaining values of a normal random variable, given
a probability, as:
 draw a picture of the normal distribution in question and the standard normal
distribution
 use the table to find the z value (or values) that gives the required probability
 use the transformation from Z to X to get the appropriate value (or values) of
                                                               336
Caritas Business College               BUSINESS STATISTICS – OAD 221
2. (a) Define normal random variable. State the probability density function of a
4. What do you mean by standard normal variable? Bring out the need for having a
5. Find the probability that a standard normal variable will have a value
provides are normally distributed with mean 0 and variance 1.00. Find the probability
7. The deviation of a magnetic needle from the magnetic pole in a certain area
standard deviation 1.00. What is the probability that the absolute value of the
deviation from the north pole at a given moment will be more than 2.4?
8. Find two values of the standard normal random variable, z and -z, such that
(a) the two corresponding "tail areas" of the distribution add to 0.01.
deviation σ = 3. Find
(a) P(10 < X< 18) (b) P(16 < X< 18) (c) P(X > 14)
                                                       337
Caritas Business College               BUSINESS STATISTICS – OAD 221
10. For a normally distributed random variable with mean -44 and standard deviation 16,
find the probability that the value of the random variable will be
11. A normal random variable has mean 0 and standard deviation 4. Find the probability
12. The time it takes an international telephone operator to place an overseas phone call
(a) What is the probability that my call will go through in less than 1 minute?
(b) What is the probability that my call will get through in less than 40 seconds?
(c) What is the probability that I will have to wait more than 70 seconds for my
call to go through?
approximately normally distributed with mean 8,000 and standard deviation 1,000.
The proposition needs at least 9,322 votes in order to pass. What is the
probability that the proposition will pass? (Assume numbers are on a continuous
scale.)
from a foreign supplier. From past experience, the materials manager notes that the
company’s demand for glue during the uncertain lead-time is normally distributed
with a mean of 187.6 gallons and a standard deviation of 12.4 gallons. The company
follows a policy of placing the order when the glue stock falls to a predetermined
value, called “re-order point”. It the demand during lead-time exceeds the reorder
level, the glue would go ‘stock-out’ and production process would have to stop.
(a) If the re-order point is kept at 187.6 gallons, what is the probability that a
                                                         338
Caritas Business College             BUSINESS STATISTICS – OAD 221
(b) If the reorder point is kept at 200 gallons, what is the probability that a stock-
(c) If the company wants to be 95% confident that the stock-out condition will
not occur, what should be the reorder point? The reorder point minus the
mean demand during lead-time is known as the "safety stock." What is the
(d) If the company wants to be 99% confident that the stock-out condition will not
occur, what should be the reorder point? What is the safety stock in this
case?
15. If X is a normally distributed random variable with mean 125 and standard deviation
44, find a value x such that the probability that X will be less than x is 0.66.
16. For a normal random variable with mean 10.5 and standard deviation 0.4, find a point
of the distribution such that there is a 0.95 probability that the value of the random
17. For a normal random variable with mean 29,500 and standard deviation 410, find a
point of the distribution such that the probability that the random variable will exceed
this value is
18. Find two values of the normal random variable with mean 80 and standard deviation
5 lying symmetrically on either side of the mean and covering an area of 0.98
between them.
19. For X~ N(32, 72), find two values x1 and x2, symmetrically lying on each side of the
mean, with
(a) P(x1 < X< x2) = 0.99 (b) P(x1 < X < x2) = 0.95
                                                      339
Caritas Business College                  BUSINESS STATISTICS – OAD 221
A candidate gets failed if he/she obtains less than 40% marks, while one must
obtain at least 75% marks to pass with distinction. Determine the mean and standard
21. The demand for gasoline at a service station is normally distributed with mean
27,009 gallons per day and standard deviation 4,530. Find two values that will give a
symmetric 0.95 probability interval for the amount of gasoline demanded daily.
22. The percentage of protein in a certain brand of dog food is a normally distributed
random variable with mean 11.2 % and standard deviation 0.6 %. The manufacturer
would like to state on the package that the product has a protein content of at least
x1
% and no more than x %. He wants the statement to be true for 99% of the packages
               1. Statistics (Theory & Practice) by Dr. B.N. Gupta. Sahitya Bhawan Publishers and
                       Distributors (P) Ltd., Agra.
               2. Statistics for Management by G.C. Beri. Tata McGraw Hills Publishing Company
                       Ltd., New Delhi.
               3. Business Statistics by Amir D. Aczel and J. Sounderpandian. Tata McGraw Hill
                       Publishing Company Ltd., New Delhi.
               4. Statistics for Business and Economics by R.P. Hooda. MacMillan India Ltd., New
                       Delhi.
               5. Business Statistics by S.P. Gupta and M.P. Gupta. Sultan Chand and Sons.,
                       New Delhi.
               6. Statistical Method by S.P. Gupta. Sultan Chand and Sons., New Delhi.
               7. Statistics for Management by Richard I. Levin and David S. Rubin. Prentice Hall
                       of India Pvt. Ltd., New Delhi.
               8. Statistics for Business and Economics by Kohlar Heinz. Harper Collins., New
                       York.
                                                        340
Caritas Business College            BUSINESS STATISTICS – OAD 221
        Objective: After going through this chapter, you will be able to understand: various terms
                      associated with sampling; various methods of probability and non-probability
                      sampling and how to determine sample size.
        Structure
        every unit of the population is considered and the respective data on the various
               characteristics are compiled,
        the analysis made on the basis of census data is very accurate and reliable,
               and
                                                    341
Caritas Business College             BUSINESS STATISTICS – OAD 221
        1.1. the population is very large, i.e., infinite and it would be impossible to
               conduct census surveys;
        2.1. when quick results are required it would be appropriate to conduct sample
               surveys rather than census surveys;
        5.1. some times accuracy may be lost because of the large size of the
               population. Sampling involves a small portion of the population and
               therefore, would involve very few people for conducting surveys and for
               data collection and compilation. This would not be               so   in   the census
               method and the chances of committing errors would increase.
        As the sampling involves less time and money, it would be possible to give attention to
        different characteristics of the elementary units. A sample using same money and time can
        produce a detailed study of lesser number of units. The process of sampling involves
        selecting a sample, collecting all relevant information, and finally drawing conclusions about
        the population from which the sample has been drawn.
                                                     342
Caritas Business College             BUSINESS STATISTICS – OAD 221
        11.2. Definitions
        The surveys are concerned with the attributes of certain entities, such as business
        enterprises, human beings, etc. The attributes that are the object of the study are known as
        characteristics and the units possessing them are called the elementary units.
        The aggregate of elementary units to which the conclusions of the study apply is termed
        as population/universe, and the units that form the basis of the sampling process are
        called sampling units. The sampling unit may be an elementary unit.
        The sample is defined as an aggregate of sampling units actually chosen in obtaining a
        representative subset from which inferences about the population are drawn. The frame— a
        list or directory, defines all the sampling units in the universe to be covered. This frame is
        either constructed for the purpose of a particular survey or may consist of previously
        available description of the population; the latter is the commonly used method. For
        example, telephone directory can be used as a frame for conducting opinion surveys in a
        city or locality.
        In order that, sampling results reflect the characteristics of the population, it is necessary
        that the sample selected for study should be
        1.1. Truly representative, i.e., the selected sample truly represent the universe
               so that the results can be generalised;
        2.1. Adequate, i.e., the size of the sample or the sample size should be
               adequate enough to represent the various characteristics of the universe;
        4.1.   Homogeneous, i.e., there should not be any basic difference between the
               characteristics of the units in the sample and that of the population.
               This means that if two or more samples are drawn from the same
               population, the results should be more or less identical.
        11.3. Probability samples vs. non-probability samples
        A probability sample is one for which the inclusion or exclusion of any individual element of
        the population depends upon the application of probability methods and not on a personal
        judgement. It is so designed and drawn that the probability of inclusion of an element is
        known. The essential feature of drawing such a sample is the randomness. As against the
        probability sample, we have a variety of other samples, termed as judgement samples,
        purposive samples, quota samples, etc. These samples have one common distinguishing
        feature: personal judgement rather than the random procedure to determine the composition
        of what is to be taken as a representative sample. The judgement affects the choice of the
        individual elements. All such samples are non-random, and no objective measure of
        precision may be attached to the results arrived at.
                                                     343
Caritas Business College              BUSINESS STATISTICS – OAD 221
        In a probability sampling, it is possible to estimate the error in the estimates and they can be
        minimized also. It is also possible to evaluate the relative efficiency of the various probability
        sampling designs. Probability sampling does not depend upon the detailed information about
        population for its effectiveness. However, probability sampling requires a high level of skill
        and experience for its use. It also requires sufficient time and money to execute.
        Non-probability sampling is a procedure of selecting a sample without the use of probability
        or randomisation. It is based on convenience, judgement, etc. The major difference between
        the two approaches is that it is possible to estimate the sampling variability in the case of
        probability sampling while it is not possible to estimate the same in the non-probability
        sampling. The classification of various probability and non-probability methods are shown in
        Fig. 11.1.
                                                 POPULATION
                                                  SAMPLING
                                                       344
Caritas Business College             BUSINESS STATISTICS – OAD 221
        members randomly selected. The lottery method of selecting these five members from a
        group of 200 would be first to prepare 200 slips of identical shape and size and write the
        name of each student on a slip. Fold these 200 slips identically and mix them well in a
        container. Then select five folded slips, from the container at random. The five students
        so selected would constitute a welfare committee of the hostel.
        There are, however, some difficulties in these procedures. For, if N is large, the task
        becomes physically difficult. So it is desirable to use better methods for ensuring
        randomness. One such method is the use of random number tables.
       Use of random number tables
        If the N elements of a total population are numbered serially from 1 to N, a random sample
        may be most readily and reliably drawn by using a table of random numbers. Such tables
        enable us to select n numbers at random from the full list of serial numbers from 1 to N. In a
        random number table, digits in each column are in random order and so are the digits in
        each row. As the arrangement is random in all directions, it makes no difference where we
        begin in our selection of random numbers from such a table. However, the column
        arrangement is generally found more convenient for references.
        Several random number tables are available for use. These numbers have been
        adequately tested for randomness. Among them, the most popular ones are:
        2.1.   Fisher and Yates (1938) table of random numbers with 1,500 sets of ten-
               digited random numbers; and
        Tippett’s numbers have been subjected to numerous tests and used in many
        investigations and their randomness has been well established for all practical purposes.
        An example to illustrate how Tippett’s table of random numbers may be used is given
        below.
        Suppose ten numbers from out of 0 and 80 are required. We start anywhere in the table
        and write down the numbers in pairs. The table can be read horizontally, vertically,
        diagonally or in any methodical way. Starting with the first and reading horizontally first we
        obtain 29, 52, 66, 41, 39, 92, 97, 92, 79, 69, 59, 11, 31, 70, 56, 24, 41, 67 and so on.
        Ignoring the numbers
        greater than 80, we obtain for one purpose ten random numbers, namely 29, 52, 66, 41, 39,
        79, 69, 59, 11 and 31.
                                                      345
Caritas Business College              BUSINESS STATISTICS – OAD 221
        The sampling procedure described above is quite satisfactory for a small population. With a
        large population, the process of identification of numbers to each elementary sampling unit
        becomes very prohibitive with respect to both time and money. Moreover, the population is
        often geographically spread out or composed of clearly identified strata possessing unique
        characteristics. Whenever any of the above situations arise, alternative sampling schemes
        that are sophisticated combinations of simple random sampling provide significantly better
        results for the same expenditure and time. As a result, the simple random sampling method
        is not very frequently used in practice. However, the simple random sampling scheme is the
        basis of any other probabilistic sampling schemes.
       (b)      Stratified random sampling method
        In simple random sampling, the population to be sampled is treated as homogeneous and
        the individual elements are drawn at random from the whole universe. However, it is often
        possible and desirable to classify the population into distinctive classes or strata and then
        obtain a sample by drawing at random the specified number of sampling units from each of
        the classes thus constructed. This may be desirable because of our interest in the distinct
        classes of the universe as a whole.
        In stratified random sampling, the population is sub-divided into strata before the sample is
        drawn. Strata are so designed that they should not overlap. A sample of specified size is
        drawn at random from the sampling units that make up each stratum. If a given stratum is of
        our interest, the corresponding sub-sample provides the basis for estimates concerning the
        attributes of the population stratum, or sub-universe from which it is drawn. The total of sub-
        samples constitutes the aggregate sample on which estimates of attributes of the entire
        population are based.
        Stratified samples may be either proportional or non-proportional. In a proportional stratified
        sampling, the number of elements to be drawn from each stratum is proportional to the size
        of that stratum compared with the population. For example, if a sample size of 500
        elementary units have to be drawn from a population with 10,000 units divided in four strata
        in the following way:
                                               Population size                      Sample size
                       Stratum I =                         2000                 500 × 0.2 = 100
                       Stratum II =                        3000                 500 × 0.3 = 150
                      Stratum III =                        4000                 500 × 0.4 = 200
                      Stratum IV =                         1000                   500 × 0.1 = 50
                                                    Total 10000                              500
        Thus, the elements to be drawn from each stratum would be 100, 150, 200 and 50
        respectively. Proportional stratification yields a sample that represents the population with
        respect to the proportion in each stratum in the population. Proportional stratified sampling
        yields satisfactory results if the dispersion in the various strata is of proportionately the same
        magnitude. If there is a significant difference in dispersion from stratum to stratum, sample
        estimates will be much more efficient if non-proportional stratified random sampling is used.
        Here, equal numbers of elements are selected from each stratum regardless of how the
        stratum is represented in the population.
        Thus, in the earlier example, an equal number, i.e., 125, of elementary units will be drawn to
        constitute the sample.
        A sample drawn by stratified random sampling scheme ensures a representative sample as
        the population is first divided into various strata and then a sample is drawn from each
        stratum.
        Stratified random sampling also ensures greater accuracy and it is maximum if each
        stratum is formed in such a way that it consists of uniform or homogeneous items.
        Compared with a simple random sample, a stratified sample can be more concentrated
        geographically, i.e., the elementary units from different strata may be selected in such a
        way that all of them are
                                                       346
Caritas Business College              BUSINESS STATISTICS – OAD 221
        located in one geographical area. This would also reduce both time and cost involved in
        data collection. However, care should be exercised in dividing the population into various
        strata. Each stratum must contain, as far as possible, homogeneous units, as otherwise the
        reliability of the results would be lost.
        In conclusion, stratification is an effective sampling device to the extent that it creates
        classes that are more homogeneous than the total. When this can be done, the classes are
        distinguished that differ among themselves in respect of a stated characteristic. Stratification
        may be futile if classes do not differ among themselves. Thus, there should be homogeneity
        within classes and heterogeneity between classes.
       (c) Cluster sampling or multistage sampling
        Under this method, the random selection is made of primary, intermediate and final (or the
        ultimate) units from a given population or stratum. There are several stages in which the
        sampling process is carried out. At first, the first stage units are sampled by some suitable
        method, such as simple random sampling. Then, a sample of second stage unit is selected
        from each of the selected first stage units, again by some suitable method which may be
        same as or different from the method employed for the first stage units. Further stages may
        be added as required. The procedure may be illustrated as follows:
        Suppose we want to take a sample of 5,000 households from the State of Haryana. At the
        first stage, the state may be divided into a number of districts and a few districts are selected
        at random. At the second stage, each district may be sub-divided into a number of villages
        and a sample of villages may be taken at random. At the third stage, a number of
        households may be selected from each of the villages selected at second stage. To take
        another example supposes in a particular survey, we wish to take a sample of 10,000
        students from a University. We may take colleges at the first stage, then draw departments
        at the second stage, and choose students as the third and last stage.
        Merits: Multi-stage sampling introduces flexibility in the sampling method which is lacking
        in the other methods. It enables existing divisions and sub-divisions of the population to be
        used as units at various stages, and permits the field work to be concentrated and yet
        large area to be covered.
        Another advantage of this method is that sub-division into second stage units need be
        carried out for only those first stage units which are included in the sample. It is, therefore,
        particularly valuable in surveys of under-developed areas where no frame is generally
        sufficiently detailed and accurate for subdivision of the material into reasonably small
        sampling units.
        Limitations: However, a multi-stage sample is in general less accurate than a sample
        containing the same number of final stage units which have been selected by some
        suitable single stage process.
       (d) Systematic sampling
        Another sampling form, simple in design and execution, may be employed when the
        members of population to be sampled are arranged in order, the order corresponding to
        consecutive numbers. The arrangements of names in a telephone directory or income-tax
        returns in the income tax department are the illustrations of such orderings. A sample of
        suitable size is obtained by taking every unit say, seventh unit of the population, one of the
        first seven units in this ordered arrangement is chosen at random and the sample is
        completely by selecting every seventh unit from the rest of the list. If the first unit selected is
        the fifth, the researcher will include in his sample 12th, 19th, 26th, 33rd, etc. We can generalize
        the approach as follows: if the requirements of the survey call for the inclusion of one unit
        out of every m units in the population, a unit is chosen at random from the first m units,
        thereafter, every mth unit in the population when arranged in order, is included in the
        sample.
                                                        347
Caritas Business College             BUSINESS STATISTICS – OAD 221
        2.1. It should be efficient: Efficiency is with respect to sample size and it means
               that the sample estimates should be clustered as closely possible to the
               population parameter being estimated for a given sample size. For
               example, when the population is normally distributed, both the sample
               mean and the median are unbiased estimators of the population mean.
               However, for any given sample size, the sample means cluster more
               closely around the population mean than do the sample medians. Thus,
               both mean and the median are the unbiased estimators of the population
               mean. However, the sample mean is the unbiased efficient estimator of
               the population mean.
        In stratified random sampling, where stratification is meaningful, a stratified random sample
        will be more efficient than a simple random sample of the same size. A sampling design is
        considered efficient with respect to cost if the sample estimates cluster more closely
        around the population parameter being estimated than they would for any alternative
        sampling scheme involving equivalent rupee expenditure.
                                                      348
Caritas Business College             BUSINESS STATISTICS – OAD 221
       1. Convenience sampling
        In this scheme, a sample is obtained by selecting ‘convenient’ population elements. For
        example, a sample selected from the readily available sources or lists such as telephone
        directory or a register of the small scale industrial units, etc. will give us a convenient
        sample. In these cases, even if a random approach is used for identifying the units, the
        scheme will not be considered as simple random sampling. For example, if one studies the
        wage structure in a close by textile industry by interviewing a few selected workers, then the
        scheme adopted here is convenient sampling. The results obtained by convenience
        sampling method can hardly be said to be representative of the population parameters.
        Therefore, the results obtained are generally biased and unsatisfactory. However,
        convenient sampling approach is generally used for making pilot studies, particularly for
        testing a questionnaire and to obtain preliminary information about the population.
       2. Quota sampling
        In this method of sampling, the basic parameters which describe the population are
        identified first. Then the sample is selected which conform to these parameters. Thus, in a
        quota sample, quotas are fixed according to these parameters, and each field investigator is
        assigned with quotas of the number of units to be interviewed. Within the preassigned
        quotas, the selection of the sample elements depends on the personal judgement. For
        example, if one is studying the consumer preferences for ice creams among children and
        college going students and supposes it is fixed to interview 250 individuals from each
        category. If the city has five colleges, one decides to fix up a quota of 50 students to be
        interviewed from each college. It entirely depends upon the interviewer who will constitute
        this sub-sample of 50 students in a college— they may be the first 50 students who visit the
        ice cream parlour or may be the 50 students who visit the parlour between 4 p.m. and 6
        p.m., etc.
        Quota sampling method has the advantage that the sample will conform to the selected
        parameters of the population. The cost and time involved in getting information from the
        sample will be relatively less for a quota sample but there are many weaknesses too. Some
        of these are:
        3.1. Even when the sample does conform to the characteristics used in the
               quotas, the sample may be distorted on other factors of importance in the
               study. For example, interviewing first 50 students or the last 50 students
               visiting the ice cream parlour can make a lot of difference particularly
               about their purchasing capacity, tastes, etc. This may completely distort
               the results.
        Quota sampling method is generally used in public opinion studies, election forecast polls,
        as there is not sufficient time to adopt a probability sampling scheme.
                                                     349
Caritas Business College             BUSINESS STATISTICS – OAD 221
       3. Judgement sampling
        Judgement sampling method can also be called as sampling by opinion. In this method,
        someone who is well acquainted with the population decides which members (elementary
        units) in his or her judgement would constitute a proper cross-section representing the
        parameters of relevance to the study. This method of sampling is generally used in studies
        involving performance of personnel. For example, if one is studying the performance of sales
        staff in a marketing organisation, the people here are classified into top grade, medium
        grade and low grade performers. Having specified qualities that are important in the study,
        the expert (possibly here the Vice-President-sales) indicates the people who, in his or her
        knowledge, would be representative of each of the three categories mentioned earlier. This,
        of course, is not a scientific method, but in the absence of better evidence, such a judgement
        method may have to be used.
        11.6. Determination of sample size
        We prefer samples to complete enumeration because of convenience and reduced cost of
        data collection. However, in sampling, there is a likelihood of missing some useful
        information about the population. For a high level of precision, we need to take a larger
        sample. How large should be the sample and what should be the level of precision? In
        specifying a sample size, care should be taken such that (i) neither so few are selected so
        as to render the risk of sampling error intolerably large, nor (ii) too many units are included,
        which would raise the cost of the study to make it inefficient. It is, therefore, necessary to
        make a trade-off between
        (i) increasing sample size, which would reduce the sampling error but increase the cost, and
        (ii) decreasing the sample size, which might increase the sampling error while decreasing
        the cost. Therefore, one has to make a compromise between obtaining data with greater
        precision and with that of lower cost of data collection. Several factors need to be
        considered before determining the sample size.
        The first and the foremost is the size of the error that would be tolerable for the purposes of
        decision-making. The second consideration would be the degree of confidence with the
        results of the study, i.e., if one wants to be 100 per cent confident of the results, the entire
        population must be studied. However, this is generally too impractical and costly. Therefore,
        one must accept something less than 100 per cent confidence. In practice, the confidence
        limits most often used are 99 per cent, 95 per cent and 90 per cent. Most commonly used
        confidence limit is 95 per cent. This means that there is a 5 per cent risk that the true
        population statistic is outside the range of possible error specified by the confidence interval.
        This 5 per cent risk appears to be acceptable in most of the decisions. Thus, for 95 per cent
        level of confidence, Z value is 1.96. The Z value can be obtained from normal probability
        distribution for a specified level of confidence. For determining the sample size, we make
        use of the following relationship:
                                                  
          = standard error of the estimate =
           x                                   n
          x can be calculated if we know the upper and lower confidence limits. Let these limits be
        Y, then
        Zx =Y
        Where Z is the value of the normal variate for a given confidence level. The procedure
        has been explained using the illustration given below:
        Illustration 11.1. A state cooperative department is performing a survey to determine the
        annual salary earned by managers numbering 3000 in the cooperative sector within the
        state. How large a sample size it should take in order to estimate the mean annual earnings
        within
                                                      350
Caritas Business College                BUSINESS STATISTICS – OAD 221
        plus and minus 1,000 and at 95 per cent confidence level? The standard deviation of
        annual earnings of the entire population is known to be Rs. 3,000.
        Solution. As the desired upper and lower limit is Rs. 1,000, i.e., we want to estimate
        the annual earnings within plus and minus Rs. 1,000.
                      z  = 1,000
        x
        As the level of confidence is 95 per cent, the Z value is 1.96
                        1.96     = 1,000
        x
                                       1‚000
                                =           = 510.20
                                x      1.96
        The standard error  is given by / n where  is the population standard deviation
        x
                              
                              n =       510.20
                             3000
        i.e.,                     =      510.20
                             n
                                         3000
        i.e.,                     n =           = 5.88
                                         510
        This gives n = 34.57
        Therefore, the desired sample size is about 35.
        11.6.1. Sample size for stratified sampling
        Once the strata have been established, we are interested in the size of the stratified
        random sample. The size will depend upon whether the proportional or disproportional
        (optimal) sample is being taken.
        A proportional stratified sample is one in which the sample units in a given stratum are
        allocated in proportion to the relative size of the stratum. The following formula is used for
        calculation of the proportional sample for each stratum
             Ni
        ni =    ×n
                N
        Where ni = number of sample units from stratum i, N = the total number of units in the
        population, Ni = the total number of units in the stratum i, n = sample size desired.
        The standard error of mean is
                      k
         x =            2 2
                      wi i /ni
                     i=1
        where wi = the weight of stratum i = Ni/N, i = the standard deviation of the ith stratum, k =
        the total number of strata. In case of disproportionate stratified sampling, the proportion of
        units in the sample stratum is not equal to the proportion of the population. The formula for
        sample allocation in this case is
              wiin
        n i= k
             wii
             1
        Thus, the disproportional stratified sample is more desirable if standard deviation (i) of
        each stratum is known. The standard error of the mean of a disproportionate stratified
        sample is
                                                         351
Caritas Business College               BUSINESS STATISTICS – OAD 221
                      k
                        (wii)2
        x               1
         =                 ni
        It may be observed that the standard error for stratified sample is smaller than for simple
        random sample, i.e., much smaller samples may be utilized when the population has
        been stratified.
        Illustration 11.2. In a market area, shops are divided into two categories, viz., those that
        have daily turnover of more than Rs. 2000 and those that have daily turnover of less than
        Rs. 2000 for the study of estimating the total sales in the area. The total number of shops in
        the first stratum are 420 and in the second stratum 180. A sample of 50 was selected, the
        standard deviation has been found to be 70 for first stratum and 95 for second stratum. What
        size of stratified random sample should be taken under proportional and disproportional
        stratified sampling?
        Solution. Under the proportional stratified sampling, the sample size is given
                         Ni
                 by ni =    ×n
                     N
                                      420
               and, therefore n1 =            × 50 = 35
                                       6
                                180
               and       n2 =         × 50 = 15
                                6
                                                               2i2
               The standard error ( x ) =
                                                              wi ni
                                2     (70)2 (0.3)2 × (95)2
               =         (0.7)
                                ×        +        15
                                   35
               =        122.75 = 11.079
       For disproportionate sampling, the sample size is given by:
                                  wiin
                           ni     w ii
                                    0.7 × 70 × 50         2450
                        n1 = 0.7 × 70 + 0.3 × 95= 77.5 = 32.0
                                    0.3 × 95 × 50         1425
                      and n2 = 0.7 × 70 + 0.3 × 95= 77.5 = 18.0
       The standard error is given by
                                           k
                                          (wii)2
                         x =                            (0.7 × 70 + 0.3 × 95)2
                                           1
                                              ni     =            50
                             =     120.125 = 10.96
       11.6.2.        Cost as a factor in the determination of the sample size
        Another consideration in determining the sample size is the cost. Management may
        reduce the level of confidence in an attempt to reduce the cost of sampling. An illustration
        will clarify how cost of sampling can be reduced by reducing the sample size.
        Illustration 11.3. In a market area there are 600 shops. A researcher wishes to estimate
        number of customers visiting these shops per day. The researcher wants to estimate the
        sampling error in the number of customers visiting is no larger than ± 10 with probability of
        0.95. The previous studies indicated that the standard deviation is 85 customers. If the cost
                                                          352
Caritas Business College              BUSINESS STATISTICS – OAD 221
        per interview is Rs. 20 (this includes field work, supervision of interviewers, coding, editing
        and tabulation of results and report writing, etc.), calculate the total cost involved.
        Researcher is willing to sacrifice some accuracy in order to reduce cost. If he settles for an
        estimate with
        0.90 probability, how much reduction in cost can be achieved?
        Solution. For 95 per cent confidence levels,
                          Z x = Y
                 i.e., 1.96     = 10.0
                 x
                                    10
                          x=
                                   1.96
        Now,  is given by / n and therefore, the sample size will be determined by the equation
        x
                                     10
                             n = 1.96
        Since  = 85, we have
                             8       10
                             5 = 1.96
                              n
                            n = 277.6
        Thus, if the sample is taken as 278, the total cost involved will be 278 × 20 = Rs. 5560. As
        this cost is considered to be on the higher side by the researcher and in order to reduce the
        cost, the researcher has now settled to 90 per cent confidence level. At 90 per cent
        confidence level, the sample size can be calculated as follows:
                            Z x =     10
                               
                        1.65 x =       10
                               
                        1.65 x =       10
                                       10
                           or  x =    1.65
                                       10
                            n = 1.65
                               85
                         i.e.,          10
                                 n  =  1.65
                                  n=   196.7
        The cost of survey for this sample size will be 197 × 20 = Rs. 3940. Thus, we have
        observed that by reducing the confidence level from 95 per cent to 90 per cent, the
        researcher would reduce the cost from Rs. 5560 to Rs. 3940. The researcher may not like
        to reduce the confidence level further and so further cost reduction may not be desirable.
        11.7. Self-Test Questions
        3.     Specify and explain the factors that make sampling preferable to a complete census
               in a statistical investigation.
                                                     353
Caritas Business College          BUSINESS STATISTICS – OAD 221
        4.     How would you determine the sample size for stratified sampling? Explain
               with the help of a suitable example.
                                                   354
Caritas Business College               BUSINESS STATISTICS – OAD 221
        12.1        Introduction
        12.2        Sampling Distribution of the Mean
        12.3        Central Limit Theorem
        12.4        Sampling Distribution of the Proportion
        12.5        Sampling Distribution of the Difference of Sample Means
        12.6        Sampling Distribution of the Difference of Sample Proportions
        12.7        Small Sampling Distributions
        12.8        Sampling Distribution of the Variance
        12.9        F Distribution
        12.10       t-Distribution
        12.11       Self-Assessment Questions
        12.12       Suggested Readings
 12.1   INTRODUCTION
        Having discussed the various methods available for picking up a sample from a
generalizations
                                                        355
Caritas Business College                 BUSINESS STATISTICS – OAD 221
about the population on the basis of a sample drawn from it. The generalizations to be made
of sample data
These generalizations, together with the measurement of their reliability, are made in terms
of the relationship between the values of any sample statistic and those of the
estimated) for the entire population viz. population mean, population median, population
proportion, population variance and so on. Population parameter is unknown but fixed,
whose value is to be estimated from the sample statistic that is known but random. Sample
Statistic is any numbers computed from our sample data viz. sample mean, sample
It may be appreciated that no single value of the sample statistic is likely to be equal to the
corresponding population parameter. This owes to the fact that the sample statistic being
random, assumes different values in different samples of the same size drawn from the
same population.
Referring to our earlier discussion on the concept of a random variable in the lessons on
Probability Distributions, it is not difficult to see that any sample statistics is a random
variable and, therefore, has a probability distribution better known as the Sampling
all possible values the statistic may take when computed from random
                                                      3
Caritas Business College             BUSINESS STATISTICS – OAD 221
In reality, of course we do not have all possible samples and all possible values of
the statistic. We have only one sample and one value of the statistic. This value is
interpreted with respect to all other outcomes that might have happened, as represented by
the sampling distribution of the statistic. In this lesson, we will refer to the sampling
distributions of only the commonly used sample statistics like sample mean, sample
proportion, sample variance etc., which have a role in making inferences about the
population.
Sample statistics form the basis of all inferences drawn about populations. Thus, sampling
distributions are of great value in inferential statistics. The sampling distribution of a sample
statistic possess well-defined properties which help lay down rules for making
generalizations about a population on the basis of a single sample drawn from it. The
variations in the value of sample statistic not only determine the shape of its sampling
distribution, but also account for the element of error in statistical inference. If we know the
probability distribution of the sample statistic, then we can calculate risks (error due to
                                                      356
Caritas Business College              BUSINESS STATISTICS – OAD 221
chance) involved in making generalizations about the population. With the help of the
properties of sampling distribution of a sample statistic, we can calculate the probability that
the sample statistic assumes a particular value (if it is a discrete random variable) or has a
value in a given interval. This ability to calculate the probability that the sample statistic lies
in a particular interval is the most important factor in all statistical inferences. We will
Suppose we know that 40% of the population of all users of hair oil prefers our brand to the
next competing brand. A "new improved" version of our brand has been developed and
given to a random sample of 100 users for use. If 55 of these prefer our "new improved"
version to the next competing brand, what should we conclude? For an answer, we would
like to know the probability that the sample proportion in a sample of size 100 is as large as
55% or higher when the true population proportion is only 40%, i.e. assuming that the new
version is no better than the old. If this probability is quite large, say 0.5, we might conclude
that the high sample proportion viz. 55% is perhaps because of sampling errors and the new
version is not really superior to the old. On the other hand, if this probability works out to a
very small figure, say 0.001, then rather than concluding that we have observed a rare event
we might conclude that the true population proportion is higher than 40%, i.e. the new
version is actually superior to the old one as perceived by members of the population. To
calculate this probability, we need to know the probability distribution of sample proportion
and
        denote the observation        x1 , x2 ,.        respectively. The sample mean for this sample is
        as                            ...............
                                      xn
defined as:
                                                           357
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                x1  x2 . . xn
                          X           n
        If we pick up another sample of size n from the same population, we might end up with a
totally different set of sample values and so a different sample mean. Therefore, there are
many (perhaps infinite) possible values of the sample mean and the particular value that we
obtain, if we pick up only one sample, is determined only by chance. In other words, the
sample mean is a random variable. The possible values of this random variable depends
on the possible values of the elements in the random sample from which sample mean is to
be computed. The random sample, in turn, depends on the distribution of the population
from
To observe the distribution of X empirically, we have to take many samples of size n and
determine the value of X for each sample. Then, looking at the various observed values
of
X , it might be possible to get an idea of the nature of the distribution. We will derive the
Let us assume we have a population, with mean and variance  2, which is infinitely large.
                                                        358
Caritas Business College                      BUSINESS STATISTICS – OAD 221
                                               x1  x2 .  xn
                 Sample Mean X 
                                                    n
       here
              x1 representing the first observed values in the sample, is a random variable since it may
take any of the population values. Similarly x2 , representing the second observed value in
sample is also a random variable since it may take any of the population values. In other
variable.
Now when the population is infinitely large, whatever is the value of x1 , the distribution of
x2 is not affected by it. This is true for any other pair of random variables as well. In other
        words; x , x ,.            are independent random variables and all are picked up from the same
                 1 2
               ...............
               xn
population.
        Finally, we have
         = E X
                                 x1  x2 ... xn 
                           =E
                                                  
          x
                                         n        
                                x1     x2 
                           =E xn   E                             [as E(A + B) = E(A) + E(B)]
                           E
                                               
                              n      n            n 
                                      
                             1        1                1
                                   Ex   ...... 
                             Ex                                     [as E(nA) = n E(A)]
                                      
                           =   E x1       2                       2
                             n       n                      n
                             1     1          1
                           =     +     +.…..+
                             n     n          n
                           =μ
        and
          2                 x1  x2 . . xn 
         = Var( X ) = Var
                                                       
          x
                                              n        
                                                                359
Caritas Business College               BUSINESS STATISTICS – OAD 221
                               x1             x2                       xn 
                      = Var             Var                      Var
                                                           
                           n            n                  n 
                                                                          [as Var(A + B) = Var (A) + Var (B)]
                                                            360
Caritas Business College                                BUSINESS STATISTICS – OAD 221
                                       1                   1                              1
                               =           Var(x )               Var(x )  ......           Var(x
                                                                                                           [as Var(nA) = n2 Var(A)]
                               )
                                       2            1         2         2                 2       n
                                   n                      n                           n
                                   1                1
                               =           2            2  ......    1 2
                                                                         2
                                                                       n
                                       2           2
                                   n             n
                                           2
                               = 
                                  n
                                                
               So,  = SD( X ) =
                          x                     n
        12.2.2                 Sampling With Replacement from Finite Populations
The above results have been obtained under the assumption that the random variables
large. It is also valid when the sampling is done with replacement, so that the population is
back to the same form before the next sample member is picked up. Hence, if the sampling
            = E X  = μ                                                            2                                      
                                                                                                 or       = SD( X ) =
         x                                                x
                                                              2
                                                                  = Var( X ) =
            and                                                                   n                          x            n
When sampling without replacement from a finite population, the probability distribution of
the second random variable depends on what has been the outcome of the first pick and so
on. In other words, the n random variables representing the n sample members do not
remain
        independent, the expression for the variance of X changes. The results in this case will be:
         = E X  = μ
           x
                                             Nn    2
                                                                                                             Nn
        and           2
                            = Var( X ) =  .                                 or            = S.D( X ) =      .
                          x
                                         n N1                                                x             n    N1
        By comparing these expressions with the ones derived above we find that the variance of
                                                                              361
Caritas Business College              BUSINESS STATISTICS – OAD 221
        factor                                                Nn
                                                                      . This factor is, therefore, known as
                                                               N 1
                                                        362
Caritas Business College             BUSINESS STATISTICS – OAD 221
In practice, almost all the samples are picked up without replacement. Also, most
populations are finite although they may be very large and so the variance of the mean
should theoretically be found by using the expression given above. However, if the
population size
(N) is large and consequently the sampling ratio (n/N) small, then the finite population
multiplier is close to 1 and is not used, thus treating large finite populations as if they were
infinitely large. For example, if N = 100,000 and n = 100, the finite population multiplier will
be 0.9995, which is very close to 1 and the variance of the mean would, for all practical
purposes, be the same whether the population is treated as finite or infinite. As a rule of that,
the finite population multiplier may not be used if the sampling ratio (n/N) is smaller than
0.05.
Above discussion on the sampling distribution of mean, presents two very important
results, which we shall be using very often in statistical estimation and hypotheses testing.
We have seen that the expected value of the sample mean is the same as the population
mean.
Similarly, that the variance of the sample mean is the variance of the population divided
by the sample size (and multiplied by the correction factor when appropriate).
The fact that the sampling distribution of X has mean μ is very important. It means that, on
the average, the sample mean is equal to the population mean. The distribution of the
statistic
is centered on the parameter to be estimated, and this makes the statistic X a good
estimator of μ. This fact will become clearer in the next lesson, where we will discuss
        estimators and their properties. The fact that the standard deviation of X is            means
                                                                            n
size increases, the standard deviation of X decreases, making X more likely to be close to
μ. This is another desirable property of a good estimator, to be discussed in the next lesson.
                                                      363
Caritas Business College           BUSINESS STATISTICS – OAD 221
If we take a large number of samples of size n, then the average value of the sample means
tends to be close to the true population mean. On the other hand, if the sample size
is
                                                   364
Caritas Business College                    BUSINESS STATISTICS – OAD 221
increased then the variance of X gets reduced and by selecting an appropriately large
The standard deviation of X is also called the standard error of the mean. It indicates the
extent to which the observed value of sample mean can be away from the true value, due to
sampling errors. For example, if the standard error of the mean is small, we may be
reasonably confident that whatever sample mean value we have observed cannot be very
Before discussing the shape of the sampling distribution of mean, let us verify the above
Consider a discrete uniform population consisting of the values 1, 2, and 3. If the random
                      Xi          6
               μ=              =       =2
                       N           3
        and variance is                 2
                           X                                         2
                             i             (1  2)2  (2  2)2  (3  2)        2
               σ =2
                                        =                                    =
                              N                           3                      3
If random samples of size n = 2 are drawn with replacement from this population, we will
have Nn = 32 = 9 possible samples. These are shown in Box 12-1 along with the
X : 1 1.5 2 2.5 3
Box 12-1
                                                              365
Caritas Business College                     BUSINESS STATISTICS – OAD 221
Now we can find out the mean and variance of the sampling distribution, the necessary
                                                                                                   2
                                      PX  1           X .PX  2          PX .[ X  EX ]  1/
                                                                                            3
                         2
                                  = Var( X )
                              x
                                                             366
Caritas Business College                   BUSINESS STATISTICS – OAD 221
If random samples of size n = 2 are drawn without replacement from this population, we will
        have   Pn = 3P2 = 6 possible samples. These are shown in Box 12-2 along with the
               N
Box 12-2
X : 1.5 2 2.5
Now we can find out the mean and variance of the sampling distribution, the necessary
                         2x
                               = Var( X )
                                                           367
Caritas Business College                         BUSINESS STATISTICS – OAD 221
Now if we compare the shapes of the parent population and the resulting sampling
distribution of mean, we find that although our parent population is uniformly distributed, the
If we increase the sample size n we observe an interesting and important fact. As n increases
 the possible values X can assume increases, so the number of rectangles increases
 the probability that X assumes a particular value decreases i.e. the width of
                   rectangles decreases
              Probabili
In the limiting case when the sample size n increases infinitely, the particular values X can
assume approaches infinity and the probability that X assumes a particular value
        Thus as n → ∝                 X ~ N (μ,           n )2
                                                      
                                        f( X )
                                          Aa
                                          Aaaa
                                          Aaaaaaaa
                                          aa aa
                                           365
f (X)
Total Area
The result we just stated - the limiting distribution of X is the normal distribution - is one of
the most important results in statistics. It is popularly known as the central limit theorem.
                size n increases.
                                                             2
         For "Large Enough" n:            X ~ N (μ,         n )
                                                        
         The central limit theorem is remarkable because it states that the distribution of the sample
mean X tends to a normal distribution regardless of the distribution of the population from
which the random sample is drawn. The theorem allows us to make probability statements
about the possible range of values the sample mean may take. It allows us to
compute
probabilities of how far away X may be from the population mean it estimates. We will
extensively use the central limit theorem in the next two lessons about statistical estimation
                                                        3
Caritas Business College               BUSINESS STATISTICS – OAD 221
The central limit theorem says that, in the limit, as n goes to infinity (n → ∝), the distribution
rate at which the distribution approaches a normal distribution does depend, however, on the
 if the population itself is normally distributed, the distribution of X is normal for any
sample size n
relatively large sample size is required to achieve a good normal approximation for
the distribution of X
Figure 12-4 shows several parent population distributions and the resulting sampling
Since we often do not know the shape of the population distribution, it would be useful to
have some general rule of thumb telling us when a sample is “Large Enough” that we may
                                                         367
Caritas Business College            BUSINESS STATISTICS – OAD 221
We emphasize that this is a general, and somewhat arbitrary, rule. A larger minimum sample
size may be required for a good normal approximation when the population distribution is
very different from a normal distribution. By the same reason, a smaller minimum sample
size may suffice for a good normal approximation when the population distribution is close to
a normal distribution.
Figure 12-5 should help clarify the distinction between the population distribution and the
sampling distribution of X . The figure emphasizes the three aspects of the central limit
theorem:
The last statement is the key to the important fact that as the sample size increases,
the variation of X about its mean μ decreases. Stated another way, as we buy more
information
(take a larger sample), our uncertainty (measured by the standard deviation) about the
                                                      368
Caritas Business College             BUSINESS STATISTICS – OAD 221
What we call the central limit theorem actually comprises several theorems developed over
the years. The first such theorem was the discovery of the normal curve by Abraham De
Moivre in 1733, when he discovered the normal distribution as the limit of the binomial
distribution. The fact that the normal distribution appears as a limit of the binomial distribu-
tion as n increases is a form of the central limit theorem. Around the turn of the twentieth
century, Liapunov gave a more general form of the central limit theorem, and in 1922
Lindeberg gave the final form we use in applied statistics. In 1935, W Feller gave the proof
Let us now look at an example of the use of the central limit theorem.
Example 12-1
ABC Tool Company makes Laser XR; a special engine used in speedboats. The company’s
engineers believe that the engine delivers an average power of 220 horsepower, and that
sample 100 engines (each engine to be run a single time). What is the probability that
the sample
Here our random variable X is normal (or at least approximately so, by the central limit
                                                      369
Caritas Business College                BUSINESS STATISTICS – OAD 221
                                                     2
                                  X ~ N (μ,         n )
                                                
                                                           2
                                                   15 2100 )
                         or       X ~ N (220,
                                                               X 
        So we can use the standard normal variable Z =              to find the required probability,
                                                               n
= 0.0228
So there is a small probability that the potential buyer’s tests will result in a sample mean
possesses a particular attribute that is of interest to us. This also implies that a proportion q
(=1-p) of the population does not possess the attribute of interest. If we pick up a sample
of size n with
replacement and found x successes in the sample, the sample proportion of success ( p ) is
        given by
                     x
                p=
                     n
x is a binomial random variable, the possible value of this random variable depends on the
P( x) = nCx p x qn-x
                     x
        Since p =        and n is fixed (determined before the sampling) the distribution of the number
                     n
The expected value and the variance of x i.e. number of successes in a sample of size n is
known to be:
E(x) = n p
Var (x) = n p q
                           p                 
                                             n
                                            1
                                           = E(x) = 1 .n p =
                                            n       pn
                           2                        x
        and              = Var                = Var n
                                                      
                           p
                               p                        
                                               1                 1            pq
                                           =        . Var(x) =        .npq=
                                                2                n2           n
                                               n
                        = SD p               pq
                       =                       n
                               p
When sampling is without replacement, we can use the finite population correction factor, so
        Mean                        =p
                                       p
        Variance                   
                                   =
                                       2   pq  N  n 
                                             .     
                                       p   n  N1
                                                                 371
Caritas Business College               BUSINESS STATISTICS – OAD 221
As the sample size n increases, the central limit theorem applies here as well. The rate at
which the distribution approaches a normal distribution does depend, however, on the shape
relatively large sample size is required to achieve a good normal approximation for
the distribution of p
In order to use the normal approximation for the sampling distribution of p , the sample size
needs to be large. A commonly used rule of thumb says that the normal approximation to the
We now state the central limit theorem when sampling for the population proportion p .
increases.
                                                            2
        For "Large Enough" n:             p ~ N (p,     pq n )
The estimated standard deviation of p is also called its standard error. We demonstrate
Example 12-2
produced are defective. A random sample of 400 screws is examined for the proportion
                                                        372
Caritas Business College   BUSINESS STATISTICS – OAD 221
of defective
                                       373
Caritas Business College             BUSINESS STATISTICS – OAD 221
screws. Find the probability that the proportion of the defective screws ( p ) in the sample
So q = 0.08 (= 1-0.02)
        Since the population is infinite and also the sample size is large, the central limit theorem
                                                2
        applies. So             p ~ N (p,   pq n )
                                                                2
                                p ~ N (0.02,    (0.02)(0.08) 400 )
                                                                                  p p
                                                                                   
        We can find the required probability using standard normal variable Z =       
                                                                                pq / n 
                                                                                             
                                                                                  
                                                          Z                      
                                         0.01  0.02               0.03  0.02    
               P(0.01 < p < 0.03) = P                      
                                         (0.02)(0.08)
                                        400                        (0.02)(0.08)   
                                                                    400            
                                                                                  
                                                           0.01
                                                           
                                         0.01
                                  =P            Z  0.007
                                                           
                                                      
and 0.03.
                                                         374
Caritas Business College                 BUSINESS STATISTICS – OAD 221
Let us consider independent random sampling from the populations so that the sample
distribution of X 1 - X 2 .
                                            = μ1 - μ2
                          = VarX - X 2                  VarX 
        and                                = VarX
                                                                    2
                   2                                    1
                  X1
                   X
                   2                 1
                                                 2    2
                                            =  1  2 ; when sampling is with replacement
                                              n1 n2
                                                1 2  N  n   22  N  n 
                                            =       . 1 1        . 2 2  ; when sampling is without
                                                n1  N1  1 n2 N 2  1 
                                                              
replacement
                                                            375
Caritas Business College             BUSINESS STATISTICS – OAD 221
As the sample sizes n1 and n2 increases, the central limit theorem applies here as well. So we
        state the central limit theorem when sampling for the difference of population      X1 -X2
        means
distribution of the
                                                     376
Caritas Business College              BUSINESS STATISTICS – OAD 221
                                                                2        2
              with mean μ1 - μ2 and standard deviation            1        2   as the sample sizes n1
                                                                n1n2
and n2 increases.
Example 12-3
The makers of Duracell batteries claims that the size AA battery lasts on an average of
45 minutes longer than Duracell’s main competitor, the Energizer. Two independent
random
 2  67 minutes, find the probability that the difference in the average lives of Duracell and
μ1 - μ2 = 45
σ1 = 84 and σ2 = 67
        Let X 1 and        denote the two sample average lives of Duracell and Energizer batteries
            X2
        respectively. Since the population is infinite and also the sample sizes are large, the
        central limit theorem applies.
                                                            2
        i.e             X1 -                     2    2
                                 ~ N (μ1 - μ2,  1  2 )
                        X2                    n1n2
                                          842  672 2
                                 ~ N (45, 100100     )
                      X1 -X2
                                                        377
Caritas Business College             BUSINESS STATISTICS – OAD 221
Let us consider independent random sampling from the populations so that the sample
distribution of p1 - p2 .
                                                                378
Caritas Business College          BUSINESS STATISTICS – OAD 221
samples of size n1 and n2 are taken from two specified binomial populations.
                                                 379
Caritas Business College               BUSINESS STATISTICS – OAD 221
                                              = p1 - p2
                        = Varp - p2
        and                                  = Varp  Varp 
                   2                                             1              2
                  p1
                  
                  p2
                                1
                                                      p1q1 p2 q2
                                                  =              ; when sampling is with replacement
                                                           n2
                                                      n1
                                              p1q1  N1  n1        p2 q2  N 2  n2 
                                              =            .           .        ; when sampling is
                                                        n1  N  1  n2     N 1 
                                                                1
                                                              
                                                                                     
                                                                                            2
without replacement
As the sample sizes n1 and n2 increases, the central limit theorem applies here as well. So
we state the central limit theorem when sampling for the difference of population
proportions
p1 - p2
proportions
               deviation
                            p1q1  p2 q2 as the sample sizes n and n increases.
                                                              1     2
                             n1     n2
                                                                                                2
        For "Large Enough" n1 and n2:                 p1 -                           p1q1  p2 q2 )
                                                                     ~ N (p1 - p2,
                                                      p2                              n1     n2
                                                                      380
Caritas Business College           BUSINESS STATISTICS – OAD 221
        The estimated standard deviation    p1 - p2 is also called its standard error. We demonstrate
        of
Example 12-4
It has been experienced that proportions of defaulters (in tax payments) belonging to
business class and professional class are 0.20 and 0.15 respectively. The results of a
                                                   381
Caritas Business College             BUSINESS STATISTICS – OAD 221
               Proportion of defaulters:
                                                 p1                       p2  0.14
                                                 0.21
Find the probability of drawing two samples with a difference in the two sample proportions
p1 = 0.20 p2 = 0.15
n1 = 400 n2 = 420
p1  0.21 p2  0.14
        Since the population is infinite and also the sample sizes are large, the central limit
        theorem applies. i.e.
                                                              2
                      p1 - p2 ~ N (p1 - p2,   p1q1  p2 q2 )
                                               n1     n2
                                                                     2
                      p1 - p2
                              ~ N (0.05, (0.20)(0.80)  (0.15)(0.85)   )
                                                400               420
        So we can find the required probability using standard normal variable
                             p  p   p  p 
                       Z      1     2       1     2
                                    p1 q1  p 2 q 2
                                     n1      n2
                                                        382
Caritas Business College                BUSINESS STATISTICS – OAD 221
sampling distributions can be well approximated by a normal distribution for “Large Enough”
sample sizes. In other words, the Z-statistic is used in statistical inference when sample size
is large. It may, however, be appreciated that the sample size may be prohibited from
being large either due to physical limitations or due to practical difficulties of sampling costs
being too high. Consequently, for our statistical inferences, we may often have to contend
ourselves with a small sample size and limited information. The consequences of the sample
Thus, the basic difference which the sample size makes is that while the sampling
distributions based on large samples are approximately normal and sample variance S2 is an
unbiased estimator of 2 , the same does not occur when the sample is small.
It may be appreciated that the small sampling distributions are also known as exact sampling
distributions, as the statistical inferences based on them are not subject to approximation.
However, the assumption of population being normal is the basic qualification underlying the
In the category of small sampling distributions, the Binomial and Poisson distributions were
already discussed in lesson 9. Now we will discuss three more important small sampling
distributions – the chi-square, the F and the student t-distribution. The purpose of discussing
these distributions at this stage is limited only to understanding the variables, which define
them and their essential properties. The application of these distributions will be highlighted
                                                       383
Caritas Business College                 BUSINESS STATISTICS – OAD 221
The small sampling distributions are defined in terms of the concept of degrees of freedom.
The concept of degrees of freedom (df) is important for many statistical calculations and
number of observations contained in a set of sample data which can be freely chosen.
It refer to the number of independent variables which vary freely without being
                                                                                    1n
        Let x1 , x2          be n observations comprising a sample whose
                             mean                                                x   xi is a value
            ..............                                                           ni1
            xn
known to us. Obviously, we are free to assign any value to n-1 observation out of n
observations. Once the value are freely assigned to n-1observations, freedom to do the
same for the nth observation is lost and its value is automatically determined as
                                         n1
                                  = n x  xi
                                         i1
        As the value of nth observation must satisfy the restriction
                                   n
                                  xi  nx
                                 i1
We say that one degree of freedom, df is lost and the sum n x of n observations has n-1 df
For example, if the sum of four observations is 10, we are free to assign any value to three
                  x4  4 xi  (x1  x2  x3 )
                      i1
                                                        384
Caritas Business College              BUSINESS STATISTICS – OAD 221
x4  10  (2  1  4)
x4  3
Sampling essentially consists of defining various sample statistics and to make use them in
particular value of its estimator S2; the sample variance. The number of observations in the
sample being n, df = n-m = n-1 because 2 is the only parameter (i.e. m =1) to
variance and then present the chi-square distribution, which helps us in working out
By now it is implicitly clear that we use the sample mean to estimate the population
mean and sample proportion to estimate the population proportion, when those parameters
are unknown. Similarly, we use a sample statistic called the sample variance to estimate the
population variance.
As will see in the next lesson on Statistical Estimation a sample statistic is an unbiased
estimator of the population parameter when the expected value of sample statistic is equal to
                                                        385
Caritas Business College                       BUSINESS STATISTICS – OAD 221
Then E(S2) = 2
        However, it can be shown empirically that while calculating S2 if we divide the sum of square
                                                              x x
        of deviations from mean (SSD) i.e.                n         by n, it will not be an unbiased estimator of 2
                                                               2
                                                      
                                                      i
                                                      
                                                      1
                  n         2                                                         
                   x x                              n                         2  2
        and    E i1                          =         2  1           =
                                                            n                            n
                       n     
                             
                             
                      2                                                                                              2
                                                                                                                 
             x  x
        i.e.                       will underestimate the population variance 2 by the factor                           .
                                                                                                                             To
                     n             n
                                                                                                                      2
                                                                                                             x
                                                                                                              n
                                                                                                                  x
        compensate for this downward bias we                            x       by n-1, so          S2    i1        is
        divide                                                         i n 2       that                        n1
                                                                        x
                                                                       1
        an unbiased estimator of population variance 2 and we have:
                                  n        2 
                                 
                                   i 1x      2
                               E                =
                                  n1 
                                            
                                            
        In other words to get the unbiased estimator of population variance 2, we divide the
                               2
                 n x    x       by the degree of freedom n-1
        sum
        
               i1
X = X 1 , X 2.....X N 
        We may draw a random sample of size n                                  x1 , x2...xn values from this population.
        comprising
                                                                    386
Caritas Business College          BUSINESS STATISTICS – OAD 221
independent normal random variable with mean and variance 2. In other words
                                                 387
Caritas Business College             BUSINESS STATISTICS – OAD 221
Thus each of these n normally distribution random variable may be standardized so that
                                     xi 
                               Z 
                                                ~ N (0, 12)     where i = 1, 2, ………n
                                i
                                         
                               U  Z 2  Z 2 ......... Z 2
                                    1     2               n
                                     n
                                             2
                                U  i
                                     i1
                                                      2
                                     x
                                     n
                               U   i  
                                   i1                  
Which will take different values in repeated random sampling. Obviously, U is a random
variable. It is called chi-square variable, denoted by χ2. Thus the chi-square random
variables.
where e is the base of natural logarithm, n denotes the sample size (or the number of
area under the χ2 distribution is unity. χ2 values are determined in terms of degrees of
freedom, df
=n
Properties of χ2 Distribution
                                                           388
Caritas Business College                BUSINESS STATISTICS – OAD 221
4. The mean of a χ2 distribution is equal to the degrees of freedom df. The variance
χ2 distribution looks more and more like a normal. Thus for large df
                         χ2 ~ N (n,         2
                                       2n )
In general, for n ≥ 30, the probability of χ2 taking a value greater than or less than a
               freedo
               m           n
                           2
                             , n , n ,.........n . Then their sum  2   2   2 ......  also possesses
                            1 2 3               k                        1   2   3      k
               a χ2 distribution with df = n
                                                     n  n ......... nk .
                                                    1
                                                        2    3
We can write
                                                                 389
Caritas Business College                      BUSINESS STATISTICS – OAD 221
                                                                         2
                    n2 x  
                                     n   1                           
                        i
                                2  (xi  x)  (x  )
                 i1           i1
                                         n
                                     21   (x                                      x)(x  )
                                     i1       x)2  (x  ) 2 
                                           i  2(x
                                                                                  i
                                      1
                                     2 i1n (x  x)2 
                                                                     n (x    ) 2                   (xi
                                              i 1
                                                              2                             n                  x)
                                                             i                              (x  )
                                                           2                             2      i1
                                                           1                         
                                                                  2
                                    (n  1)S 2  x    
                                          2  
                                                / n 
                                                             
              n                     n                       n
         since   ( x  x)         2 ; ( x   )  n(x  ) and ( x  x) 0
                        2  (n  1)S
               i                                              i         
                                  i1                      i1        
i1
Now, we know that the LHS of the above equation is a random variable which has chi-square
distribution, with df = n
                                                         2
                          2         x ~ N (μ,           n )
             x                      
        Then     will have a chi-square distribution with df = 1
                 
               n
                                                                               n  1S 2
        Since the two terms on the RHS are independent,                                          will also has a chi-square
                                                                                      
        2
distribution with df = n-1. One degree of freedom is lost because all the deviations are
                                                                                                                n  1S 2
        In practice, therefore, we work with the distribution of                            1
                                                                                                                      2
                                                                                                                    
        of S2 directly.
                                                              390
Caritas Business College           BUSINESS STATISTICS – OAD 221
                                                    391
Caritas Business College                 BUSINESS STATISTICS – OAD 221
                         n  1S 2 
        So             E          n1
                       2      
                        
                           n1
                                    E(S 2 )  n  1
                               2
                           
E(S 2 )   2
                  (n  1)S 2 
        Also Var      2     2(n  1)
                      
                                                n                        2
                                                1S 2                      
        or                                 E                   (n         2(n  1)
                                           2                  1)
                                                                       
                    n                                        2
                      2              (n  1)2  2(n  n2 1S  2(n  1)
                   1 S 4           1)
        or        E                                    
                                                              
                         4                                  2 
                                  n  ES
                                                          
        or                              2  4
                                                   2S 2 2  2(n  1)
                                     1         4
                                                       2
                                       4              
                                     
                                                       2
                                        n  1
        or                                                 E(S 2   2 ) 2  2(n  1)
                                                   4
                                             
                                                                             2(n  1) 4
        or                                                 E(S 2   2 ) 2          
                                                                            (n  1)2
                                                                            2 4
        So                                                     Var(S 2 )
                                                                     
                                                                            n1
It may be noted that the conditions necessary for the central limit theorem to be operative in
the case of sample variance S2 are quite restrictive. For the sampling distribution of S2 to be
approximately normal requires not only that the parent population is normal, but also that the
                                                                  392
Caritas Business College             BUSINESS STATISTICS – OAD 221
Example 12-5
                                                 393
Caritas Business College              BUSINESS STATISTICS – OAD 221
In an automated process, a machine fills cans of coffee. The variance of the filling process is
known to be 30. In order to keep the process in control, from time to time regular checks of
the variance of the filling process are made. This is done by randomly sampling filled cans,
measuring their amounts and computing the sample variance. A random sample of 101
cans is selected for the purpose. What is the probability that the sample variance is between
Solution: We have
Population variance 2 = 30
n = 101
                              n 
                       χ2 =   1S 2
                                 2
                               
        So
        =      P(21.28 < S2 < 38.72)      (101  1)21.28   2  (101  1)38.72 
                                        P                  
                                                     394
Caritas Business College             BUSINESS STATISTICS – OAD 221
                                                                             
                                                 30                     30   
≈ 0.990 – 0.025
= 0.965
Since our population is normal and also sample size is quite large, we can also estimate
                                        2 4 2
                                2     2
        We have                S ~ ( , n  1 )
                                                                        
                                                                        
        So     P(21.28 <     < 38.72) =  21.28        Z    38.72   2 
                                        2                               
        S2                                                   2 4
                                       P                                
                                        2 4                 n1        
                                        n  1
                                                       395
Caritas Business College                       BUSINESS STATISTICS – OAD 221
                                                                                     
                                                                                     
                                                        21.28  30         38.72  30 
                                               =     P2x30x30  Z  2x30x30
                                                                                         
                                                      101  1        101  1             
                                                      8.72           8.72 
                                               =P               Z
                                                                        
                                                         4.36      4.36
                                               = P 2  Z  2
                                               = 2P0  Z  2
                                               = 2x0.4772
= 0.9544
sample of size n1 drawn from the first population, we have the chi-square variable
                               n  1S 2
                         
                          2           1        1
                         1                 2
                                          1
Similarly, for a random sample of size n2 drawn from the second population, we have the chi-
        square variable
                                    2
                         n  1S
                         2  2     2
                        2      2
                              2
                              21
                        F
                                      v1
                                 2
                                  2
                                      v2
        is a random variable known as F statistic, named in honor of the English statistician
                                                                 396
Caritas Business College       BUSINESS STATISTICS – OAD 221
                                           397
Caritas Business College            BUSINESS STATISTICS – OAD 221
Properties of F- Distribution
freedom of the numerator always listed as the first item in the parentheses and the
degrees of freedom of the denominator always listed as the second item in the
parentheses. So there are a large number of F distributions for each pair of v1 and v2.
2. As a ratio of two squared quantities, the F random variable cannot be negative and
                                                     398
Caritas Business College            BUSINESS STATISTICS – OAD 221
distribution looks more and more like a normal. In general, for v2 ≥ 30, the probability
                                                     399
Caritas Business College                   BUSINESS STATISTICS – OAD 221
of F taking a value greater than or less than a particular value can be approximated by
         5.    The F distributions defined                           F(v , and         F(v , are reciprocal of each other.
               as                                                    v )   as          v)
                                                                        1 2              2 1
                       i.e.
                                                         1
                                             F(v ,v ) =
                                                1     (v2F ,v1 )
                                  xi 
                       Z 
                                          ~ N (0, 12)                                            where i = 1, 2, ………n
                        i
                                   
                                       2                         n               2
                              n
                                  x
                                                 xi  x
        and            U            i
                                               i1
                                                                                 ~ χ2 (n-1 df)       where i = 1, 2, ………n
                                                                                 2
                                  i1                                   
                                                      xi  
                                   T                    
                                                        x x
                                                             n
                                                                     i
                                                                             2
                                             1
                                            i1
n1 2
                                                 xi  
                                   T 
                                            x  x 
                                             n
                                                  i                      2
                                                       i1
                                                      n1
                                           xi  
                                   T        S
This statistic - the ratio of the standard normal variable Z to the square root of the χ2
variable divided by its degree of freedom - is known as ‘t’ statistic or student ‘t’ statistic,
                                                                             400
Caritas Business College         BUSINESS STATISTICS – OAD 221
named after the pen name of Sir W S Gosset, who discovered the distribution of the
quantity.
                                               401
Caritas Business College                BUSINESS STATISTICS – OAD 221
                                 xi 
        The random                     follows t-distribution with n-1 degrees of freedom.
        variable
                                  S
                                  xi  
                                           ~ t (n-1 df)                 where i = 1, 2, ………n
                                    S
                                                                               2
        We know                            X ~ N (μ,                          n )
                                                                    
                                        X           ~ N (0, 12 )
        So                                 
                                              n
                                                                   xi  
                                 xi 
        Putting            for                                                             , we get
                                     in T 
                  X                                                                   2
                                  
                                                                                   
                                                                    n
                   n                                       xi  x
                                                        1 i1
                                                       n1    2
                                                               X 
                                                               
                                                                  n
                                           T 
                                                                    x x
                                                                    n
                                                                          i
                                                                                    2
                                                      1
                                                     i1
n1 2
                                                     X   
                                                            
        or                                 T 
                                                           x  x 
                                                           n
                                                               i                2
                                                                    i1
                                                 1
                                                            n(n  1)
                                                 
                                                           X 
        or                                 T 
                                                     1i1
                                                         n xi  x            2
nn  1
                                                 X 
        or                                 T
                                                                     402
Caritas Business College           BUSINESS STATISTICS – OAD 221
                                          S
                                              n
When defined as above, T again follows t-distribution with n-1 degrees of freedom.
                                                   403
Caritas Business College             BUSINESS STATISTICS – OAD 221
                               XS  
                                   n ~ t (n-1 df)     where i = 1, 2, ………n
Properties of t- Distribution
1. The t-distribution like Z distribution, is unimodal, symmetric about mean 0, and the t-
with the distribution are the df associated with the sample standard deviation.
3. The t-distribution has no mean for n = 2 i.e. for v = 1 and no variance for n ≤ 3 i.e. for v
≤ 2. However, for v >1, the mean and for v > 2, the variance is given as
                                                                  v
                        E(T) = 0                      Var(T) =
                                                                 v
                                                                  2
variable as against Z distribution which has variance 1. This follows from the fact what
while Z values vary from sample to sample owing to the change in the X alone, the
general, for n ≥ 30, the variance of t-distribution is approximately the same as that of Z
                                                      404
Caritas Business College              BUSINESS STATISTICS – OAD 221
2. How does the size of population and the kind of random sampling determine
(b) Suppose the population mean is μ = 125 and the population standard deviation
is 20. What are the expected value and the standard deviation of X ?
4. What is the most significant aspect of the central limit theorem? Discuss the
5. Under what conditions is the central limit theorem most useful in sampling for
6. If the population mean is 1,247, the population variance is 10,000, and the sample
size is 100, what is the probability that X will be less than 1,230?
7. When sampling is from a population with standard deviation σ = 55, using a sample of
size n = 150, what is the probability that X will be at least 8 units away from the
population mean μ?
8. The Colosseum, once the most popular monument in Rome, dates from about AD 70.
Since then, earthquakes have caused considerable damage to the huge structure,
and engineers are currently trying to make sure the building will survive future
shocks. The Colosseum can be divided into several thousand small sections.
Suppose that the average section can withstand a quake measuring 3.4 on the
Richter scale with a standard deviation of 1.5. A random sample of 100 sections is
selected and tested for the maximum earthquake force they can withstand. What is
                                                        405
Caritas Business College            BUSINESS STATISTICS – OAD 221
average section in the sample can withstand an earthquake measuring at least 3.6
9. On June 10, 1997, the average price per share on the Big Board Composite Index in
New York rose 15 cents. Assume the population standard deviation that day was 5
cents. If a random sample of 50 stocks is selected that day, what is the probability
that the average price change in this sample was a rise between 14 and 16 cents?
10. An economist wishes to estimate the average family income in a certain population.
The population standard deviation is known to be Rs 4,000, and the economist uses
a random sample of size n = 225. What is the probability that the sample mean will
11. When sampling is done from a population with population proportion p = 0.2, using a
12. When sampling is done for the proportion of defective items in a large shipment,
where the population proportion is 0.18 and the sample size is 200, what is the
13. A study of the investment industry claims that 55% of all mutual funds outperformed
the stock market as a whole last year. An analyst wants to test this claim and obtains
a random sample of 280 mutual funds. The analyst finds that only 128 of the funds
outperformed the market during the year. Determine the probability that another
random sample would lead to a sample proportion as low as or lower than the one
obtained by the analyst, assuming the proportion of all mutual funds that out-
14. In recent years, convertible sport coupes have become very popular in Japan.
                                                      406
Caritas Business College               BUSINESS STATISTICS – OAD 221
ships them back to Japan. Suppose that 25% of all Japanese in a given income and
prob- ability that at least 20% of those in the sample will express an interest in a
16. What do you understand by small sampling distributions? Why are the small
20. Define the t statistic. What are important properties of t-distribution? How does t
407