Statistics For Management II
Statistics For Management II
Address
https://gagecollege.net/
Email: - adtsm@gagecollege.net
                                            JANUARY, 2024
Contents
CHAPTER ONE ......................................................................................................................... 1
SAMPLING AND SAMPLING DISTRIBUTION ...................................................................... 1
   1.1 Sampling Theory ............................................................................................................... 1
   1.1.1 Basic Definitions ............................................................................................................ 1
   1.1.2. The need for samples ..................................................................................................... 2
   1.1.3. Designing and conducting a sampling study ................................................................... 2
   1.1.4. Bias and errors in sampling, non-sampling errors ........................................................... 3
   1.1.5. Types of sampling random and non-random sampling.................................................... 7
   1.2 Sampling Distribution ..................................................................................................... 10
   1.2.1 Definition ......................................................................... Error! Bookmark not defined.
   1.2.2         Sampling Distribution of the mean and proportion ................................................. 11
   1.2.3         Sampling Distribution of the difference between two means and two proportions ... 19
CHAPTER TWO ...................................................................................................................... 23
STATISTICAL ESTIMATIONS .............................................................................................. 23
   2.1. Basic concepts ................................................................................................................ 23
   2.2. Point estimators of the mean and proportion ................................................................... 23
   2.3. Interval estimators of the mean and proportion ............................................................... 23
   2.4 Interval estimation of the difference between two independent means .............................. 24
   2.5 Student’s t-distribution .................................................................................................... 26
   2.6. Determining the sample size ........................................................................................... 28
CHAPTER THREE .................................................................................................................. 31
HYPOTHESIS TESTING ......................................................................................................... 31
   3.1 Basic Concepts ................................................................................................................ 31
   3.2 Steps to Hypothesis Testing ............................................................................................ 32
   3.3 Type I and Type II errors ................................................................................................. 35
   3.4 One tailed/Is two tailed hypothesis tests ........................................................................... 36
   3.5. Hypothesis testing .......................................................................................................... 39
   3.5.1 Hypothesis testing of the population mean .................................................................... 43
CHAPTER FOUR..................................................................................................................... 52
                                                                    i
   4.1 Areas of application......................................................................................................... 52
                                                                     ii
                                    CHAPTER ONE
Examples
Examples:
1|Page
   6. Statistic: Characteristic or measure obtained from a sample.
   7. Sampling: The process or method of sample selection from the population.
   8. Sampling unit: the ultimate unit to be sampled or elements of the population to be
      sampled.
      Examples:
           If somebody studies Scio-economic status of the households, households is
               the sampling unit.
           If one studies performance of freshman students in some college, the student
               is the sampling unit
Because of the above consideration, in practice we take sample and make conclusion
about the population values such as population mean and population variance, known as
parameters of the population.
     Universality
     Qualitativeness
     Detailedness
     Non-representativeness
 Exercises:
2|Page
 4. An insurance company has insured 300,000 cars over the last six years. The company
    would like to know the number of cars involved in one or more accidents over this
    period. The manger selected 1000 cars from the files and made a record of cars that
    were involved in one or more accidents.
    a) What is the population?
    b) What is the sample?
    c) What is the variable of interest to the insurance company?
      First, there is a chance that some unusual or variable units exist in a population and
       are picked randomly, as this has a probability of occurring. Researchers can prevent
       this mistake by expanding the sample size, which reduces the likelihood of selecting
       rare or variant units.
      The second cause of sampling mistakes is sampling bias, which is the inclination
       to pick units with specific features. The most obvious type of sampling bias is
       known as selection bias, resulting from an improper sampling design. When a
       subset of possible units is excluded from the sample, this is known as selection bias.
When data is obtained unfairly, some individuals in the target population will have a lower
or greater sampling probability than others. The data collected from the whole population
may not accurately reflect the information collected in the systematic study. It often occurs
unintentionally and is often missed by the researcher.
      Self-Selection bias: People with particular qualities are more likely than others to
       consent to participate in research.
3|Page
      Coverage bias: People who decline to participate or drop out of a study differ
       systemically from those who participate.
      Under-coverage bias: Some population members are insufficiently represented in
       the sample.
      Advertising or pre-screening bias: A sample may be biased based on how
       volunteers are pre-screened or where research is publicized.
      Using a well-thought-out study design and method of sampling can assist you in
       preventing sample bias.
      Define a population of interest and a sampling frame
      Match the sample frame as closely as feasible to the target population to minimize
       the possibility of sampling bias.
      Make user-friendly questionnaires.
      Follow up with non-respondents.
In contrast, the discrepancy between a survey’s outcome and the population value is known
as sampling error. The assessment of sampling error can be anticipated, evaluated, and
accounted for in contrast to bias. Sampling error is quantified using P-values, standard
errors, confidence intervals, and coefficients of variation. These metrics are employed to
compute sample size before sampling and assess our confidence level in the results
following analysis.
Choosing the appropriate sampling procedure and sample size helps reduce sampling error.
Although sample bias cannot be precisely quantified, sampling error can be precisely
estimated. Preventing and reducing sample bias must be a top priority. Sampling error
happens when the confidence intervals are too broad because of the limited sample size.
 If researchers are not certain who to interview, they make a mistake in defining the
population.
      Sampling frame error: Sampling frame mistakes occur when researchers incorrectly
       target a subpopulation when choosing samples.
      Selection error: A selection error happens when respondents choose to engage in
       research on their own. Only those who are interested react. By requesting replies
       from the complete sample, it is possible to minimize selection mistakes. The
       response rate will be increased by pre-survey planning, follow-ups, and a well-
       organized survey design. Also, attempt CATI surveys and in-person interviews to
       increase response rates.
4|Page
      Sampling mistakes: Sampling errors are caused by a mismatch in the respondents’
       representativeness. It typically occurs when the researcher does not thoroughly
       arrange his sample. To minimize these errors, it is necessary to use a rigorous
       sample design. This sample size represents the total population, an online sample,
       or survey respondents.
      Increase sample size: A bigger sample yields more accurate results since the
       research becomes more representative of the total population.
      Instead of using a random sample, test groups are proportional to their size in the
       population.
      Know your audience: Examine your population and determine its demographic
       composition.
Non-sampling error refers to all sources of error that are unrelated to sampling. Non-
sampling errors are present in all types of survey, including censuses and administrative
data. They arise for a number of reasons: the frame may be incomplete, some respondents
may not accurately report data, data may be missing for some respondents, etc.
Non-sampling errors can be classified into two groups: random errors and systematic
errors.
      Random errors are errors whose effects approximately cancel out if a large enough
       sample is used, leading to increased variability.
      Systematic errors are errors that tend to go in the same direction, and thus
       accumulate over the entire sample leading to a bias in the final results. Unlike
       random errors, this bias is not reduced by increasing the sample size. Systematic
       errors are the principal cause of concern in terms of a survey’s data quality.
       Unfortunately, non-sampling errors are often extremely difficult, if not impossible,
       to measure.
Non-sampling error can occur in all aspects of the survey process, and can be classified
into the following categories: coverage error, measurement error, nonresponse error and
processing error.
Coverage error
5|Page
case of a census, it may be the main source of error. Coverage error can have both spatial
and temporal dimensions, and may cause bias in the estimates. The effect can vary for
different subgroups of the population. This error tends to be systematic and is usually due
to under coverage, which is why it’s important to reduce it as much as possible.
Measurement error
Measurement error, also called response error, is the difference between measured values
and true values. It consists of bias and variance, and it results when data are incorrectly
requested, provided, received or recorded. These errors may occur because of inefficiencies
with the questionnaire, the interviewer, the respondent or the survey process.
Non-response error
Estimates obtained after nonresponse has been observed and imputation has been used to
deal with this nonresponse are usually not equivalent to the estimates that would have been
obtained had all the desired values been observed without error. The difference between
these two types of estimates is called the nonresponse error. There are two types of non-
response errors: total and partial.
      Total nonresponse error occurs when all or almost all data for a sampling unit are
       missing. This can happen if the respondent is unavailable or temporarily absent, the
       respondent is unable to participate or refuses to participate in the survey, or if the
       dwelling is vacant. If a significant number of sampled units do not respond to a
6|Page
       survey, then the results may be biased since the characteristics of the non-
       respondents may differ from those who have participated.
      Partial nonresponse error occurs when respondents provide incomplete
       information. For certain people, some questions may be difficult to understand; they
       may refuse or forget to answer a question. Poorly designed questionnaire or poor
       interviewing techniques can also be reasons which result partial nonresponse error.
       To reduce this form of error, care should be taken in designing and testing
       questionnaires. Adequate interviewer training and appropriate edits and imputation
       strategies will also help minimize this error.
Processing error
Processing error occurs during data processing. It includes all data processing activities
after collection and prior to estimation, such as errors in data capture, coding, editing and
tabulation of the data as well as in the assignment of survey weights.
      Coding errors occur when different coders code the same answer differently,
       which can be caused by poor training, incomplete instructions, variance in coder
       performance (i.e. tiredness, illness), data entry errors, or machine malfunction
       (some processing errors are caused by errors in the computer programs).
      Data capture errors result when data are not entered into the computer exactly as
       they appear on the questionnaire. This can be caused by the complexity of
       alphanumeric data and by the lack of clarity in the answer provided. The physical
       layout of the questionnaire itself or the coding documents can cause data capture
       errors. The method of data capture, manual or automated (for example, using an
       optical scanner), can also result in errors.
      Editing and imputation errors can be caused by the poor quality of the original
       data or by its complex structure. When the editing and imputation processes are
       automated, errors can also be the result of faulty programs that were insufficiently
       tested. The choice of an inappropriate imputation method can introduce bias. Errors
       can also result from incorrectly changing data that were found to be in error, or by
       erroneously changing correct data.
 - Is a method of sampling in which all elements in the population have a pre-assigned non
zero probability to be included in to the sample.
7|Page
   Examples:
- Is a method of selecting items from a population such that every possible sample of
specific size has an equal chance of being selected, In this case, sampling may be with or
without replacement. Or
- All elements in the population have the same pre-assigned non zero probability to be
included in to the sample.
- Simple random sampling can be done either using the lottery method or table of random
numbers.
Table of random numbers are tables of the digits 0, 1, 2,…,, 9, each digit having an equal
chance of selection at any draw. For convenience, the numbers are put in blocks of five. In
using these tables to select a simple random sample, the steps are:
    i. Number the units in the population from 1 to N (prepare frame of the population).
   ii. Then proceed in the following way If the first digit of N is a number between 5 and
   9 inclusively, the following method of selection is adequate. Suppose N=528 and we
   want n=10. Select three columns from the table of random numbers, say columns 25 to
   27. Go down the three columns selecting the first 10 distinct numbers between 001 &
   528. These are 36, 509, 364, 417, 348, 127, 149, 186, 439, and 329. Then the units with
   these roll numbers are our samples.
        Note: If sampling is without replacement, reject all the numbers that comes more
        than once.
8|Page
           - Some of the criteria for dividing a population into strata are: Sex (male, female);
           Age (under 18, 18 to 28, and 29 to 39); Occupation (blue-collar, professional, and
           other).
3. Cluster Sampling:
      - A simple random sample of groups or cluster of elements is chosen and all the
sampling units in the selected clusters will be surveyed.
      - Clusters are formed in a way that elements within a cluster are heterogeneous, i.e.
observations in each cluster should be more or less dissimilar.
4. Systematic Sampling:
- A complete list of all elements within the population (sampling frame) is required.
- The procedure starts in determining the first element to be included in the sample.
        - Then the technique is to take the kth item from the sampling frame.
                                                                    𝑁
   - Let 𝑁 = 𝑃𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑠𝑖𝑧𝑒 , 𝑛 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒,               𝑘=        = 𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙.
                                                                    𝑛
   - The unit is selected at first and then (𝑗 + 𝑘)𝑡ℎ , (𝑗 + 2𝑘)𝑡ℎ , . . . . . 𝑒𝑡𝑐 until the required
sample size is reached.
Examples:
 Judgment sampling.
9|Page
       Convenience sampling
       Quota Sampling.
1. Judgment Sampling
    - In this case, the person taking the sample has direct or indirect control over which
items are selected for the sample.
2. Convenience Sampling
    - In this method, the decision maker selects a sample from the population in a manner
that is relatively easy and convenient.
3. Quota Sampling
   - In this method, the decision maker requires the sample to contain a certain number of
items with a given characteristic. Many political polls are, in part, quota sampling.
1.2.1 Definition
 The normal probability distribution is used to determine probabilities for the normally
distributed individual measurements, given the mean and the standard deviation.
Symbolically, the variable is the measurement X, with the population mean µ and
population standard deviation δ. In contrast to such distributions of individual
measurements, a sampling distribution is a probability distribution for the possible values
of a sample statistic.
Population distribution: Is the distribution of measured values of its members and have
mean denoted by𝜇 and variance 𝛿 2 and standard deviation 𝜎. The population standard
deviation describes the variation among values of members of the population; where as the
standard deviation of sampling distribution measures the variability among values of the
statistics (sample) such as mean values, proportion values due to sampling errors.
10 | P a g e
distribution is a probability distribution for the possible values of a sample statistic, such
as a sample mean.
NB: The sampling distribution of the mean is not the sample distribution, which is the
    distribution of the measured values of X in one random sample. Rather, the sampling
     distribution of the mean is the probability distribution for X , the sample mean.
For any given sample size n taken from a population with mean µ and standard deviation
δ, the value of the sample mean would vary from sample to sample if several random
samples were obtained from the population. This variability serves as the basis for
sampling distribution.
The sampling distribution of the mean is described by two parameters: the expected value
( X ) = X , or mean of the sampling distribution of the mean, and the standard deviation of
the mean  x , the standard error of the mean.
    1. The arithmetic mean 𝜇𝑥̅ of the sampling distribution of mean values is equal to the
       population mean 𝜇 regardless of the form of population distribution .i.e. 𝜇𝑥̅ =𝜇
    2. The sampling distribution has a standard deviation (also called standard error) equal
       to the population standard deviation divided by the square root of the sample size i.e.,
             σ
       δx̅ = n . This holds true if and only of n<0.05N and N is very large. If N is finite and
               √
                   𝑆              𝑆     𝑁−𝑛
        𝛿𝑥̅ =          Or𝑆𝛿𝑥̅ =        √𝑁−1 , 𝑛 ≥ 0.05𝑁
                √𝑛                √𝑛
                              𝑁−𝑛
    3. The expression√ 𝑁−1 is called finite population correction factor/finite population
        multiplier. In the calculation of the standard error of the mean, if the population
11 | P a g e
                                                                              
        standard deviation δ is unknown, the standard error of the mean x , can be estimated
                                                        S
        by using the sample standard error of the mean X which is calculated as follows:
                𝑆               𝑆     𝑁−𝑛
        𝑆𝑥̅ =        or 𝑆𝑥̅ =        √𝑁−1
                √𝑛              √𝑛
    A population consists of the following ages: 10, 20, 30, 40, and 50. A random sample of
    three is to be selected from this population and mean computed. Develop the sampling
    distribution of the mean.
Solution: The number of simple random samples of size n that can be drawn without
                                                         𝑁!
replacement from a population of size is 𝑁𝐶𝑛 = 𝑛!(𝑁−𝑛)! With N= 5 and n = 3, 5C3 = 10
samples can be drawn from the population as:
12 | P a g e
                 23.33                1                      0.1
                 26.67                2                      0.2
                 30.00                2                      0.2
                 33.33                2                      0.2
                 36.67                1                      0.1
                 40.00                1                      0.1
                 TOTAL              10.00                   1.00
        Columns 1 and 2 show frequency distribution of sample means.
        Columns 1 and 3 show sampling distribution of the mean.
        
             X   x  30, Regardless of the sample size   X
             N      n
          𝑥 (Observation)          𝑥−𝜇                    (𝑥 − 𝜇)2
10 -20 400
20 -10 100
30 0 0
40 10 100
50 20 400
                  X              
                                   2
                              X           1000
                                              14.142
                          i
                          N                  5
                             N  n 14.142   53
        X           *                   *       5.774
                  n           N 1     3     5 1
                                       2
  The relationship between the shape of the population distribution and the shape of the
  sampling distribution of the mean is called the Central Limit Theorem.
The significance of the Central Limit Theorem is that it permits us to use sample statistics
to make inference about population parameters without knowing anything about the shape
of the frequency distribution of that population other than what we can get from the sample.
It also permits us to use the normal distribution curve for analyzing distributions whose
shape is unknown. It creates the potential for applying the normal distribution to many
problems when the sample is sufficiently large.
As mentioned earlier the above properties must exist, given this value of sample mean 𝑋̅
is first converted in to a value Z on the standard normal distribution to know how any single
value deviates from 𝑋̅ of sample mean values (𝜇𝑥̅ ), by using the formula;
                                                𝑋̅−𝜇𝑥̅         𝑋̅−𝜇
                                      𝑍=                 =           𝛿   because 𝜇𝑥̅ = 𝜇
                                                 𝛿𝑥̅
                                                                    √𝑛
If the population is finite and samples of fixed size n are drawn without replacement, then
the standard error of sampling distribution of mean can be modified to adjust the continued
change in the size of population 𝜇 due to the several draws of samples of size n is as follows:
     Example 1.         The mean length of a certain tool is 41.5 hours with a standard
               deviation of 2.5 hours. What is the probability that a simple random sample
               of size 50 drawn from this population will have a mean between 40.5 hours
               and 42 hours?
P (40.5≤ 𝑋̅ ≤42.0) =?
                       𝛿        2.5       2.5
𝜇𝑥̅ = 𝜇        𝛿𝑥̅ =        =         =7.0711 = 0.3536
                       √𝑛       √50
The population distribution is unknown, but sample size n=50 is large enough to apply the
central limit theorem. Hence the normal distribution can be used to find the required
probability.
                                 𝑋̅1 −𝜇                𝑋̅2 −𝜇
P (40.5≤ 𝑋̅ ≤420) = P (                   ≤𝑍≤                   )
                                  𝛿𝑥̅                    𝛿𝑥̅
                                        40.5−41.5                    42−41.5
                                =P(                    ≤𝑍≤                     )
                                          0.3536                     0.3536
                             = P (−2.8281 ≤ 𝑍 ≤ 1.4140)
                             =P (𝑍 ≥ −2.8281) + P (𝑍 ≤ 1.4140)
                            =0.4977+0.4207=0.9184
14 | P a g e
   Thus 0.9184 is the probability of the tool having mean life between the required hours.
                                                                            𝛿 = 2.5
                                     0.4977
                                                                                      0.4207
Solution:
A. P (𝑥̅ ≥ 900) =?
   𝜇𝑋̅ = 𝜇=800gms                                                        𝛿=300gms
   n=16
   P (𝑥̅ ≥ 900) =?
           𝛿        300   300
   𝛿𝑥̅ =        =         =       = 75
           √𝑛       √16       4
0.0918
                                   𝑋̅−𝜇𝑥̅       900−800
   P (𝑥̅ ≥ 900) =P (Z≥                      =             )
                                     𝛿𝑥̅          75
                           =P (Z≥ 1.33)
                          =0.5000-0.4082
                          =0.0918
   15 | P a g e
B. Since Z=1.96 for the middle 95% area under the normal curve, therefore using the formula
for z to solve for the values of x̅ in terms of the known values are as follows.
    𝑥̅1 =𝜇𝑋̅ -Z𝛿𝑥̅                                      𝑥̅ 2 =𝜇𝑋̅ +Z𝛿𝑥̅
         =800-1.96(75)                                        =800+1.96(75)
         =653gms                                                   0.95
                                                                  =947gms
                                       𝛿=300
   The sample proportion 𝑃̅ having the characteristic of interest (success or failure, accept or
   reject, head or tail) is the best use for statistical inferences about the population parameter P.
   the sample proportion can be defined as:
      𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑢𝑐𝑐𝑒𝑠𝑠,𝑋
   𝑃̅= 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒,𝑛
   With same logic of sampling distribution of mean, the sampling distribution of sample
   proportions with mean 𝜇𝑃̅ and standard deviation also called standard error) 𝛿𝑃̅ is given by:
                     𝑝𝑞      𝑝(1−𝑃)
   𝜇𝑃̅ = P and 𝛿𝑃̅ =√ 𝑛 =√     𝑛
            A.    np≥5
            B.    nq≥5
   Then the sampling distribution of proportions is very closely normally distributed. It may be
   noted that the sampling distribution of the proportion would actually follow binomial
   distribution because population is binomially distributed.
   For finite population in which sampling is done without replacement we have;
                     𝑝𝑞     𝑁−𝑛
   𝜇𝑃̅ = P and 𝛿𝑃̅ =√ 𝑛 *√𝑁−1
   Under the same guidelines as mentioned in the previous sections, for a large sample size
   n≥30, the sampling distribution of proportion is closely approximated by a normal
   16 | P a g e
distribution with a mean and standard deviation as stated above. Hence, to standardize
sample proportion𝑃̅, the standard normal variable
     𝑃̅ −𝜇𝑃
          ̅    𝑃̅ −𝑃
Z=            =
      𝛿𝑃
       ̅              𝑝𝑞
                  √
                      𝑛
   Example 3.       Few years back, a policy was introduced to give loans to unemployed
            engineers to start their own business. Out of 1,000,000 engineers, 600,000
            accepted the policy and got the loan. A sample of 100 unemployed engineers
            is taken at the same time of allotment of loans. What is the probability that
            sample portion would have exceeded 50% acceptance?
Solution:
𝛿𝑃̅ =0.0489
                   𝑃̅ −𝜇        0.50−0.60
P (𝑃̅ ≥ 0.5) =P (Z≥ 𝛿 𝑃̅) =P (Z≥ 0.0489 ) =0.4793+0.5000=0.9793
                                     ̅
                                     𝑃
                                              0.4793
                                                                                0.5000
𝑃̅ = 0.5 P=0.60
         (0.4)(0.6)                                                                   𝑃̅ −𝑃
𝛿𝑃̅ =√                     =0.0346                       P (-0.03≤ 𝑃̅ ≤ 0.03) = 2P (Z≥ 𝛿 )
              200                                                                              ̅
                                                                                               𝑃
                                                                                              = 2P (Z ≤ 0.87)
                                                                                          =2x0.3078
                                                                                          =0.6156
17 | P a g e
                                         0.3078                                     0.3078
         (0.03)(0.97)
𝛿𝑃̅ =√                  =0.0098
             300
                                       𝑃̅ −𝑃             𝑃̅ −𝑃
P (-0.03≤ 𝑃̅ ≤ 0.03) = P ( 𝛿                   ≤𝑍≤               )
                                         ̅
                                         𝑃              𝛿𝑃
                                                         ̅
                                               0.02−0.03                 0.035−0.03
                                       =P(                 ≤𝑍≤                         )
                                                0.0098                        0.0098
                          = P (-1.02≤ 𝑍 ≤ 0.51)
                          =P (Z≥ −1.02) + P (Z≤ 0.51)
                        =0.3461+0.1950
                       = 0.5411
Hence the probability that the proportion of defective will lie between 0.02 and 0.035 is
0.5411
0.3461 0.1950
18 | P a g e
           1.2.3 Sampling Distribution of the difference between two means and two
                 proportions
                    -   Sampling Distribution Of The Difference Between Two Means
                The concept of sampling distribution of sample mean introduced earlier can also be used to
                compare a population of size 𝑁1 having mean 𝜇1 and standard deviation 𝛿1 with another
                similar type of population of size 𝑁2 having mean 𝜇2 and standard deviation𝛿2 .
                Let 𝑋̅1 𝑎𝑛𝑑 𝑋̅2 be the mean of sampling distribution of the mean of two populations,
                respectively. Then the difference between their mean values 𝜇1 and 𝜇2 can be estimated by
                generalizing the formula of standard normal variable as follows;
                               (𝑋̅1 −𝑋̅2 ) − (𝜇𝑋
                                               ̅ 1 − 𝜇𝑋
                                                      ̅2 )                     (𝑋̅1 −𝑋̅2 ) − (𝜇1− 𝜇2 )
                          Z=                                          =
                                        𝛿 (𝑋
                                           ̅ −𝑋̅ )                                    𝛿(𝑋
                                                                                        ̅ 1−𝑋
                                                                                            ̅2 )
                                                1   2
                                  𝛿 2       𝛿2 2
𝛿(𝑋̅1−𝑋̅2 ) = √𝛿𝑋̅1 2 + 𝛿𝑋̅2 2 = √ 𝑛1 +             (standard error of sampling distribution of difference of two means)
                                     1        𝑛2
                        𝑛1 And 𝑛2 are independent random samples drawn from first and second population
                , respectively.
                19 | P a g e
                                   =P (𝑍 ≥ −2)
                                   =0.5000+0.4772
                                   =0.9772 (area under normal curve)
0.9772
               Hence, the probability is very high that the life time of the stereos of A is 160
   hours more than that of b.
   Given:
   𝜇1 = 4,500                                                               𝜇2 = 4,000
   𝛿1 =200                                                                  𝛿2 =300
   𝑛1 =50                                                                   𝑛2 =100
                  𝛿 2      𝛿2 2        (200)2       (300)2
   𝛿(𝑋̅1 − 𝑋̅2 ) =√ 𝑛1 +          =√            +             = =41.23
                   1       𝑛2           50          100
                                                (𝑋̅1− 𝑋̅2 ) (𝜇1_ 𝜇2 )
   P (𝑋̅1 − 𝑋̅2 ≥600) =            P (𝑍 ≥                               )
                                                      𝛿(𝑋
                                                        ̅      ̅2 )
                                                            1− 𝑋
   20 | P a g e
                                            600−500
                             =P (𝑍 ≥                    )
                                                41.23
                             =P (𝑍 ≥ 2.43)
                             =0.4925
                             =0.5000 - 0.4925=0.0075 (area under normal curve)
Suppose two populations of size 𝑁1 and 𝑁2 are given. For each sample of size 𝑛1 from the first
population, compute sample proportion 𝑃̅1 and standard deviation𝛿𝑃̅1 . Similarly for each
sample size of 𝑛2 from the second population, compute sample proportion𝑃̅2 and standard
deviation𝛿𝑃̅2 .
For all combinations of these samples from these populations, we can obtain a sampling
distribution of the difference 𝑃̅1 − 𝑃̅2 of sample proportion. Such a distribution is called
sampling distribution of the difference of two proportions. The mean and standard deviation
of this distribution are given by;
𝜇𝑃̅1 − 𝜇𝑃̅2 = 𝑃1 − 𝑃2
                                     𝑃1𝑞1       𝑃2 𝑞2
𝛿(𝑃̅1−𝑃̅2 ) = √𝛿𝑃̅1 2 + 𝛿𝑃̅2 2 = √          +
                                      𝑛1         𝑛2
If sample size 𝑛1 𝑎𝑛𝑑 𝑛1 are large i.e. 𝑛1 ≥30, then the sampling distribution of difference
of proportions is closely approximated by a normal distribution.
                                     𝑃1𝑞1       𝑃2 𝑞2
𝛿(𝑃̅1−𝑃̅2 ) = √𝛿𝑃̅1 2 + 𝛿𝑃̅2 2 = √          +
                                     𝑛1          𝑛2
Exercise:
3. Suppose that 2% of all cell phone connections by a certain provider are dropped.
   Find the probability that in a random sample of 1,500 calls at most 40 will be
   dropped. First verify that the sample is sufficiently large to use the normal
   distribution
4. An airline claims that 72% of all its flights to a certain region arrive on time. In a
    random sample of 30 recent arrivals, 19 were on time. You may assume that
    the normal distribution applies.
   a. Compute the sample proportion.
   b. Assuming the airline’s claim is true, find the probability of a sample of
       size 30 producing a sample proportion so low as was observed in this sample
5. In one study it was found that 86% of all homes have a functional smoke detector.
    Suppose this proportion is valid for all homes. Find the probability that in a
    random sample of 600 homes, between 80% and 90% will have a functional smoke
    detector. You may assume that the normal distribution applies.
6. An outside financial auditor has observed that about 4% of all documents he
    examines contain an error of some sort. Assuming this proportion to be accurate, find
    the probability that a random sample of 700 documents will contain at least 30 with
    some sort of error. You may assume that the normal distribution applies.
22 | P a g e
                                       CHAPTER TWO
STATISTICAL ESTIMATIONS
Estimator is the rule or random variable that helps us to approximate a population parameter. But
estimate is the different possible values which an estimator can assume.
                                   ∑𝑋
Example: The sample mean ̅    X = 𝑛 𝑖 is an estimator for the population mean and X =10 is an
estimate, which is one of the possible values of ̅
                                                 X
     It should be unbiased.
     It should be consistent.
     It should be relatively efficient.
To explain these properties let 𝜃̂ be an estimator of θ
    1. Unbiased Estimator: An estimator whose expected value is the value of the parameter being
       estimated. i.e. 𝐸(𝜃̂) = 𝜃
    2. Consistent Estimator: An estimator which gets closer to the value of the parameter as the
       sample size increases. I.e. 𝜃̂ gets closer to θ as the sample size increases.
    3. Relatively Efficient Estimator: The estimator for a parameter with the smallest variance.
       This actually compares two or more estimators for one parameter.
23 | P a g e
2.4 Interval estimation of the difference between two independent means
Although X ̅ possesses nearly all the qualities of a good estimator, because of sampling error, we
know that it's not likely that our sample statistic will be equal to the population parameter, but
instead will fall into an interval of values. We will have to be satisfied knowing that the statistic is
"close to" the parameter. That leads to the obvious question, what is "close"?
We can phrase the latter question differently: How confident can we be that the value of the statistic
falls within a certain "distance" of the parameter? Or, what is the probability that the parameter's
value is within a certain range of the statistic's value? This range is the confidence interval.
The confidence level is the probability that the value of the parameter falls within the range
specified by the confidence interval surrounding the statistic. There are different cases to be
considered to construct confidence intervals.
Case 1:
Recall the Central Limit Theorem, which applies to the sampling distribution of the mean of a
sample. Consider a sample of size n drawn from a population, whose mean, is μ and standard
deviation is σ with replacement and order important. The population can have any frequency
                                           ̅ will have a mean
distribution. The sampling distribution of X
                                           𝜎
𝜇X̅ = 𝜇 and a standard deviation 𝜎x̅ =         , and approaches a normal distribution as n gets large.
                                          √𝑛
This allows us to use the normal distribution curve for computing confidence intervals.
          𝑋̅−𝜇
⟹ Z = 𝜎/           has a normal distribution with mean=0 and variance=1
              √𝑛
⟹μ=̅
   X ± 𝑍𝜎/√𝑛
       ̅ ± 𝜀 , 𝑤ℎ𝑒𝑟𝑒 𝜀 𝑖𝑠 𝑎 𝑚𝑒𝑎𝑠𝑢𝑟𝑒 𝑜𝑓 𝑒𝑟𝑟𝑜𝑟
      =X
⟹ ε = 𝑍𝜎/√𝑛
- For the interval estimator to be good the error should be small. How it is small?
                  By making n large
                  Small variability
                  Taking Z small
- To obtain the value of Z, we have to attach this to a theory of chance. That is, there is an area of
size 1−α such
Where 𝛼 = 𝑖𝑠 𝑡ℎ𝑒 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑡ℎ𝑎𝑡 𝑡ℎ𝑒 𝑝𝑎𝑟𝑎𝑚𝑒𝑡𝑒𝑟 𝑙𝑖𝑒𝑠 𝑜𝑢𝑡𝑠𝑖𝑑𝑒 𝑡ℎ𝑒 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙
24 | P a g e
         𝑍𝛼 = 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑓𝑜𝑟 𝑡ℎ𝑒 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑛𝑜𝑟𝑚𝑎𝑙 𝑣𝑎𝑟𝑖𝑎𝑏𝑙𝑒 𝑡𝑜 𝑡ℎ𝑒 𝑟𝑖𝑔ℎ𝑡 𝑜𝑓
              2
                           𝑋̅−𝜇
  ⟹ 𝑃(−𝑍𝛼/2 < 𝜎/                  < 𝑍𝛼/2 ) = 1 − 𝛼
                            √𝑛
  ⟹ (𝑋̅ − 𝑍𝛼/2 𝜎/√𝑛, 𝑋̅ + 𝑍𝛼/2 𝜎/√𝑛) Is a 100(1 − 𝛼 )% confidence interval for𝜇, But usually σ2
is not known, in that case we estimate by its point estimator S 2.
Here are the z values corresponding to the most commonly used confidence levels.
 ⟹ (𝑋̅ − 𝑡𝛼/2 𝑆/√𝑛, 𝑋̅ + 𝑡𝛼/2 𝑆/√𝑛) Is a 100(1 − 𝛼 )% confidence interval for𝜇. The unit of
measurement of the confidence interval is the standard error. This is just the standard deviation of
the sampling distribution of the statistic.
Example: From a normal sample of size 25 a mean of 32 was found .Given that the population
standard deviation is 4.2. Find
Solution:
a)
25 | P a g e
⟹ 𝑇ℎ𝑒 𝑟𝑒𝑞𝑢𝑖𝑟𝑒𝑑 𝑤𝑖𝑙𝑙 𝑏𝑒 𝑋̅ + 𝑡𝛼/2 𝜎/√𝑛
= 32 ± 1.96 × 4.2/√25
= 32 ± 1.65
= (30.35,33.65)
b)
= 32 ± 2.58 × 4.2/√25
= 32 ± 2.17
= (29.83,34.17)
Example2.A Drug Company is testing a new drug which is supposed to reduce blood pressure.
From the six people who are used as subjects, it is found that the average drop in blood pressure
is 2.28 points, with a standard deviation of 0.95 points. What is the 95% confidence interval for
the mean change in pressure?
= 28 ± 2.571 × 0.95/√6
= 2.28 ± 1.008
= (1.28,3.28)
That is, we can be 95% confident that the mean decrease in blood pressure is between 1.28 and
3.28 points.
               𝑋̅ − 𝜇
         t=                ~𝑡      𝑤𝑖𝑡ℎ 𝑛 − 1 𝑑𝑒𝑔𝑟𝑒𝑒 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚
               𝑆/√𝑛
26 | P a g e
-      After specifying α we have the following regions on the student t-distribution corresponding
                 H0          Reject H0 if       Accept H0 if       Inconclusive if
                   𝑋̅ − 𝜇0   10.06 − 10
    ⟹ 𝑡𝑐𝑎𝑙 =               =            = 0.76
                       𝑆        0.25
                      √𝑛        √10
    Step 6: Decision
    Step 7: Conclusion: At 1% level of significance, we have no evidence to say that the average
    height content of containers of the given lubricant is different from 10 litters, based on the given
    sample data.
    27 | P a g e
2.6. Determining the sample size
Sample size is a research term used for defining the number of individuals included in a research
study to represent a population. The sample size references the total number of respondents
included in a study, and the number is often broken down into sub-groups by demographics such
as age, gender, and location so that the total sample achieves represents the entire population.
Determining the appropriate sample size is one of the most important factors in statistical analysis.
If the sample size is too small, it will not yield valid results or adequately represent the realities of
the population being studied. On the other hand, while larger sample sizes yield smaller margins
of error and are more representative, a sample size that is too large may significantly increase the
cost and time taken to conduct the research.
This article will discuss considerations to put in place when determining your sample size and how
to calculate the sample size.
Confidence Level
The confidence level refers to the percentage of probability, or certainty that the confidence
interval would contain the true population parameter when you draw a random sample many times.
It is expressed as a percentage and represents how often the percentage of the population who
would pick an answer lies within the confidence interval. For example, a 99% confidence level
means that should you repeat an experiment or survey over and over again, 99 percent of the time,
your results will match the results you get from a population.
The larger your sample size, the more confident you can be that their answers truly reflect the
population. In other words, the larger your sample for a given confidence level, the smaller your
confidence interval.
28 | P a g e
Standard Deviation
Another critical measure when determining the sample size is the standard deviation, which
measures a data set’s distribution from its mean. In calculating the sample size, the standard
deviation is useful in estimating how much the responses you receive will vary from each other
and from the mean number, and the standard deviation of a sample can be used to approximate the
standard deviation of a population.
The higher the distribution or variability is the greater the standard deviation and the greater the
magnitude of the deviation. For example, once you have already sent out your survey, how much
variance do you expect in your responses? That variation in responses is the standard deviation.
Population Size
The other important consideration to make when determining your sample size is the size of the
entire population you want to study. A population is the entire group that you want to draw
conclusions about. It is from the population that a sample is selected, using probability or non-
probability samples. The population size may be known (such as the total number of employees in
a company), or unknown (such as the number of pet keepers in a country), but there’s a need for a
close estimate, especially when dealing with a relatively small or easy to measure groups of people.
As demonstrated through the calculation below, a sample size of about 385 will give you a
sufficient sample size to draw assumptions of nearly any population size at the 95% confidence
level with a 5% margin of error, which is why samples of 400 and 500 are often used in research.
However, if you are looking to draw comparisons between different sub-groups, for example,
provinces within a country, a larger sample size is required. GeoPoll typically recommends a
sample size of 400 per country as the minimum viable sample for a research project, 800 per
country for conducting a study with analysis by a second-level breakdown such as females versus
males, and 1200+ per country for doing third-level breakdowns such as males aged 18-24 in
Nairobi.
29 | P a g e
                           80%           1.28
85% 1.44
90% 1.65
95% 1.96
99% 2.58
6. Put these figures into the sample size formula to get your sample size.
Example: Say you choose to work with a 95% confidence level, a standard deviation of 0.5, and
a confidence interval (margin of error) of ± 5%, you just need to substitute the values in the
formula:
= 0.9604 / .0025
= 384.16
Exercises:
    1. An electrical firm manufactures light bulbs that have a length of life that is approximately
       normally distributed with a standard deviation of 40 hours. If a random sample of 30 bulbs
       has an average life of 780 hours, find a 99% confidence interval for the population mean
       of all bulbs produced by this firm.
    2. A random sample of 400 households was drawn from a town and a survey generated data
       on weekly earning. The mean in the sample was Birr 250 with a standard deviation Birr
       80. Construct a 95% confidence interval for the population mean earning.
    3. A major truck has kept extensive records on various transactions with its customers. If a
       random sample of 16 of these records shows average sales of 290 liters of diesel fuel with
       a standard deviation of 12 liters, construct a 95% confidence interval for the mean of the
       population sampled.
30 | P a g e
                                      CHAPTER THREE
HYPOTHESIS TESTING
Hypothesis is a statement about the value of a population parameter developed for testing.
    The method of hypothesis testing can be summarized in four steps. We will describe each of
these four steps in greater detail in the next section.
    1. To begin, we identify a hypothesis or claim that we feel should be tested. For example, we
        might want to test the claim that the mean number of hours that children in Ethiopia watch
        TV is 3 hours.
    2. We select a criterion upon which we decide that the claim being tested is true or not. For
        example, the claim is that children watch 3 hours of TV per week. Most samples we select
        should have a mean close to or equal to 3 hours if the claim we are testing is true. So at
        what point do we decide that the discrepancy between the sample mean and 3 is so big that
        the claim we are testing is likely not true? We answer this question in this step of hypothesis
        testing.
    3. Select a random sample from the population and measure the sample mean. For example,
        we could select 20 children and measure the mean time (in hours) that they watch TV per
        week.
    4. Compare what we observe in the sample to what we expect to observe if the claim we are
        testing is true. We expect the sample mean to be around 3 hours. If the discrepancy between
        the sample mean and population mean is small, then we will likely decide that the claim
31 | P a g e
        we are testing is indeed true. If the discrepancy is too large, then we will likely decide to
        reject the claim as being not true.
    The null hypothesis (H0), stated as the null, is a statement about a population parameter,
    such as the population mean, that is assumed to be true.
    The null hypothesis is a starting point. We will test whether the value stated in the null
    hypothesis is likely to be true.
Keep in mind that the only reason we are testing the null hypothesis is because we think it is wrong.
We state what we think is wrong about the null hypothesis in an alternative hypothesis. For the
children watching TV example, we may have reason to believe that children watch more than (>)
or less than (<) 3 hours of TV per week. When we are uncertain of the direction, we can state that
the value in the null hypothesis is not equal to (≠) 3 hours.
32 | P a g e
In a courtroom, since the defendant is assumed to be innocent (this is the null hypothesis so to
speak), the burden is on a prosecutor to conduct a trial to show evidence that the defendant is not
innocent. In a similar way, we assume the null hypothesis is true, placing the burden on the
researcher to conduct a study to show evidence that the null hypothesis is unlikely to be true.
Regardless, we always make a decision about the null hypothesis (that it is likely or unlikely to be
true). The alternative hypothesis is needed for Step 2.
An alternative hypothesis (H1) is a statement that directly contradicts a null hypothesis by stating
that that the actual value of a population parameter is less than, greater than, or not equal to the
value stated in the null hypothesis.
The alternative hypothesis states what we think is wrong about the null hypothesis, which is needed
for Step 2.
Step 2: Set the criteria for a decision. To set the criteria for a decision, we state the level of
significance for a test. This is similar to the criterion that jurors use in a criminal trial. Jurors
decide whether the evidence presented shows guilt beyond a reasonable doubt (this is the
criterion). Likewise, in hypothesis testing, we collect data to show that the null hypothesis is not
true, based on the likelihood of selecting a sample mean from a population (the likelihood is the
criterion). The likelihood or level of significance is typically set at 5% in behavioral research
studies. When the probability of obtaining a sample mean is less than 5% if the null hypothesis
were true, then we conclude that the sample we selected is too unlikely and so we reject the null
hypothesis.
In behavioral science, the criterion or level of significance is typically set at 5%. When the
probability of obtaining a sample mean is less than 5% if the null hypothesis were true, then
we reject the value stated in the null hypothesis.
    The alternative hypothesis establishes where to place the level of significance. Remember that
we know that the sample mean will equal the population mean on average if the null hypothesis is
true. All other possible values of the sample mean are normally distributed (central limit theorem).
33 | P a g e
The empirical rule tells us that at least 95% of all sample means fall within about 2 standard
deviations (SD) of the population mean, meaning that there is less than a 5% probability of
obtaining a sample mean that is beyond 2 SD from the population mean. For the children watching
TV example, we can look for the probability of obtaining a sample mean beyond 2 SD in the upper
tail (greater than 3), the lower tail (less than 3), or both tails (not equal to 3). Figure 8.2 shows that
the alternative hypothesis is used to determine which tail or tails to place the level of significance
for a hypothesis test.
Step 3: Compute the test statistic. Suppose we measure samples mean equal to 4 hours per week
that children watch TV. To make a decision, we need to evaluate how likely this sample outcome
is, if the population mean stated by the null hypothesis (3 hours per week) is true. We use a test
statistic to determine this likelihood. Specifically, a test statistic tells us how far, or how many
standard deviations, a sample mean is from the population mean. The larger the value of the test
statistic, the further the distance, or number of standard deviations, a sample mean is from the
population mean stated in the null hypothesis. The value of the test statistic is used to make a
decision in Step 4.
The test statistic is a mathematical formula that allows researchers to determine the likelihood of
obtaining sample outcomes if the null hypothesis were true. The value of the test statistic is used
to make a decision regarding the null hypothesis.
Step 4: Make a decision. We use the value of the test statistic to make a decision about the null
hypothesis. The decision is based on the probability of obtaining a sample mean, given that the
value stated in the null hypothesis is true. If the probability of obtaining a sample mean is less than
5% when the null hypothesis is true, then the decision is to reject the null hypothesis. If the
probability of obtaining a sample mean is greater than 5% when the null hypothesis is true, then
the decision is to retain the null hypothesis. In sum, there are two decisions a researcher can make:
    1. Reject the null hypothesis. The sample mean is associated with a low probability of
        occurrence when the null hypothesis is true.
    2. Retain the null hypothesis. The sample mean is associated with a high probability of
        occurrence when the null hypothesis is true.
The probability of obtaining a sample mean, given that the value stated in the null hypothesis is
true, is stated by the p value. The p value is a probability: It varies between 0 and 1 and can never
34 | P a g e
be negative. In Step 2, we stated the criterion or probability of obtaining a sample mean at which
point we will decide to reject the value stated in the null hypothesis, which is typically set at 5%
in behavioral research. To make a decision, we compare the p value to the criterion we set in Step
2.
     A p value is the probability of obtaining a sample outcome, given that the value stated in
     the null hypothesis is true. The p value for obtaining a sample outcome is compared to the
     level of significance.
When the p value is less than 5% (p < .05), we reject the null hypothesis. We will refer to p < .05
as the criterion for deciding to reject the null hypothesis, although note that when p = .05, the
decision is also to reject the null hypothesis. When the p value is greater than 5% (p > .05), we
retain the null hypothesis. The decision to reject or retain the null hypothesis is called significance.
When the p value is less than .05, we reach significance; the decision is to reject the null
hypothesis. When the p value is greater than .05, we fail to reach significance; the decision is to
retain the null hypothesis. Figure 8.3 shows the four steps of hypothesis testing.
35 | P a g e
DECISION: RETAIN THE NULL HYPOTHESIS
When we decide to retain the null hypothesis, we can be correct or incorrect. The correct decision
is to retain a true null hypothesis. This decision is called a null result or null finding. This is usually
an uninteresting decision because the decision is to retain what we already assumed: that the value
stated in the null hypothesis is correct. For this reason, null results alone are rarely published in
behavioral research.
The incorrect decision is to retain a false null hypothesis. This decision is an example of a Type
II error, or b error. With each test we make, there is always some probability that the decision
could be a Type II error. In this decision, we decide to retain previous notions of truth that are in
fact false. While it’s an error, we still did nothing; we retained the null hypothesis. We can always
go back and conduct more studies.
Type II error, or beta (b) error, is the probability of retaining a null hypothesis that is actually
false.
When we decide to reject the null hypothesis, we can be correct or incorrect. The incorrect decision
is to reject a true null hypothesis. This decision is an example of a Type I error. With each test
we make, there is always some probability that our decision is a Type I error. A researcher who
makes this error decides to reject previous notions of truth that are in fact true. Making this type
of error is analogous to finding an innocent person guilty. To minimize this error, we assume a
36 | P a g e
defendant is innocent when beginning a trial. Similarly, to minimize making a Type I error, we
assume the null hypothesis is true when beginning a hypothesis test.
    Type I error is the probability of rejecting a null hypothesis that is actually true.
    Researchers directly control for the probability of committing this type of error.
    An alpha (𝛼) level is the level of significance or criterion for a hypothesis test. It is the
    largest probability of committing a Type I error that we will allow and still decide to
    reject the null hypothesis.
    Since we assume the null hypothesis is true, we control for Type I error by stating a level of
significance. The level we set, called the alpha level (symbolized as a), is the largest probability
of committing a Type I error that we will allow and still decide to reject the null hypothesis. This
criterion is usually set at .05 (𝛼 = 0.05), and we compare the alpha level to the p value. When the
probability of a Type I error is less than 5% (p < .05), we decide to reject the null hypothesis;
otherwise, we retain the null hypothesis.
    The correct decision is to reject a false null hypothesis. There is always some probability that
we decide that the null hypothesis is false when it is indeed false. This decision is called the power
of the decision-making process. It is called power because it is the decision we aim for. Remember
that we are only testing the null hypothesis because we think it is wrong. Deciding to reject a false
null hypothesis, then, is the power, inasmuch as we learn the most about populations when we
accurately reject false notions of truth. This decision is the most published result in behavioral
research.
The power in hypothesis testing is the probability of rejecting a false null hypothesis. Specifically,
it is the probability that a randomly selected sample will show that the null hypothesis is false
when the null hypothesis is indeed false.
Example 1.      Templer and Tomeo (2002) reported that the population mean score on the
                quantitative portion of the Graduate Record Examination (GRE) General Test for
                students taking the exam between 2018 and 2020 was 558 ± 139 (m ± s). Suppose
                we select a sample of 100 participants (n = 100). We record a sample mean equal
                to 585 (M = 585). Compute the one–independent sample z test for whether or not
                we will retain the null hypothesis (m = 558) at a .05 level of significance (a = .05).
37 | P a g e
Step 1: State the hypotheses. The population mean is 558, and we are testing whether the null
hypothesis is (=) or is not (≠) correct:
H0: m = 558 Mean test scores are equal to 558 in the population.
H1: m ≠ 558 Mean test scores are not equal to 558 in the population.
Step 2: Set the criteria for a decision. The level of significance is .05, which makes the alpha level
a = .05. To locate the probability of obtaining a sample mean from a given population, we use the
standard normal distribution. We will locate the z scores in a standard normal distribution that are
the cutoffs, or critical values, for sample mean values with less than a 5% probability of
occurrence if the value stated in the null (m = 558) is true.
    A critical value is a cutoff value that defines the boundaries beyond which less than 5%
    of sample means can be obtained if the null hypothesis is true. Sample means obtained
    beyond a critical value will result in a decision to reject the null hypothesis.
    In a non-directional two-tailed test, we divide the alpha value in half so that an equal proportion
    of area is placed in the upper and lower tail. Table 8.4 gives the critical values for one-and two-
    tailed tests at a .05, .01, and .001 level of significance. Figure 8.4 displays a graph with the
    critical values for Example 8.1 shown. In this example a = .05, so we split this probability in
    half:
                          𝛼       0.05
    Splitting 𝛼 in half       =          = 0.0250 in each tail
                          2        2
    TABLE Critical values for one- and two-tailed tests at three commonly used levels of
    significance
38 | P a g e
Example 2.        A soft drink bottling company’s advertisement states that a bottle of its product
contains 330 milliliters (ml). But customers are complaining that the company is
under filling its products. To check whether the complaint is true or not, an
HO: The average content of a bottle of this product equals 330 ml against
H1: The average content of a bottle of this product is less than 330 ml.
Or symbolically,
HO:  ≤ 330 ml
If the inspector takes a random sample of bottles of this product and finds that the mean content
per bottle is much less than 330 ml, then he may conclude that the complaint of the customers is
correct.
determine whether the hypothesis is reasonable statement and should not be rejected, or is
unreasonable and should be rejected. Or hypothesis testing is a procedure for checking the validity
of a statistical hypothesis. It is the process by which we decide whether the null hypothesis should
be rejected or not. The value, computed from sample information, used to determine whether or
not to reject the null hypothesis is called test statistic.
As mentioned earlier, in order to determine whether HO is accepted or not, one computes a test
statistic from the sample data. Our decision is then based on where this figure (the test statistic)
falls.
39 | P a g e
Depending on the sampling distribution of the statistic under consideration, one can identify the
acceptance region and the rejection region. If the test statistic falls in the acceptance region, then
HO is accepted and if the test statistic falls in the rejection region also called (the critical region),
then we reject HO and accept H1. The value that is a borderline between acceptance and rejection
is called the critical value. The critical value is obtained from appropriate statistical table such as
standard normal distribution table, the student t distribution table and others.
The following three charts indicate the acceptance and rejection regions for a test of significance
                                                           𝑧 = 1.64
Note in the chart that:
    1. The area where the null hypothesis is not rejected includes the area to the left of 1.645.
        Where the value 1.645 is Z.05 read from the table of standard normal distribution.
3. A one-tailed test (right) is being applied (this will be explained soon in the unit)
6. The value 1.645 separates the regions where the null hypothesis is rejected and where it is
not rejected.
Figure 2 Sampling Distribution for the statistic Z, one-tailed (Left) test, 0.05 level of
significance.
40 | P a g e
                   𝑧 = −1.64
 Figure 3 Regions of non-rejection and Rejection for a Two-Tailed Test, Z-statistics, 0.05 level of
significance.
𝑧 = −1.64 𝑧 = 1.64
We can construct a similar acceptance and rejection region while using the t-statistics.
Types of tests
Based on the form of the null and alternative hypothesis, there are two types of tests: a one-sided
41 | P a g e
               H1: The mean income of males is greater than the mean income of females.
        i) A left-tailed test: This is a type of test in which the less than sign is involved in the
               alternative hypothesis. It has one rejection region at the left tail of the appropriate
distribution.
HO:   O
H1:  < O
Figure 2 indicates the acceptance and rejection region of left-tailed test using the Z-distribution.
ii) A right-tailed test: This is a type of test in which the greater than sign is involved in the
alternative hypothesis. It has one rejection region at the right tail of the appropriate
distribution.
Right-tailed test a one-tailed test in which the sample outcome is hypothesized to be at the
right tail of the sampling distribution.
Example 4.         Suppose  O is the assumed mean. A right-tailed test for  O is
HO:   O
H1:  >  O
42 | P a g e
3.5.1 Hypothesis testing of the population mean
Mean of the population can be tested presuming different situations such as the population may be
normal or other than normal, it may be finite or infinite, sample size may be large or small, variance
of the population may be known or unknown and the alternative hypothesis many be two-sided or
one-sided. Our testing technique will differ in different situations. We may consider some of the
important situations.
Case I Population normal, large sample and population standard deviation is known. The
alternative hypothesis may be one-sided or two-sided.
In such a situation, Z-test (Z statistics) is used for testing hypothesis of the mean and the test
statistic Z is worked out as under
                                                 𝑥̅ − 𝜇0
                                            𝑍=
                                                 𝜎/√𝑛
Example 5. Selam Hotel has been having average sales of 500 teacups per day. Because of the
development of bus stand nearby, he expects to increase its sales. For the first 50
days after the start of the bus stand, he recorded an average daily sale of 550 teacups
per day. From the past records, it is known that the sales standard deviation is 50.
On the bases of this sample information, can one conclude that Selam’s Hotel sales have increased?
Use 5 percent level of significance
Solution:
43 | P a g e
Taking the null hypothesis that sales average 500 teacups per day and they have not increased
unless proved, we can write:
H1:  > 500 (as we want to conclude that sales have increased)
As the sample size is 50 > 30 (n > 30), and the population standard deviation  = 50 is known, we
shall use Z-test assuming normal population and shall work out the test statistic Z as:
      𝑥̅ − 𝜇0
𝑍=
      𝜎/√𝑛
      550−500         50           1
  =             =            =           = ඥ50 = 7.03
      50/ඥ50        50/ඥ50       1/ඥ50
As H1 is one-sided, we shall determine the rejection region applying one-tailed test (in the right
tail because H1 is of more than type) at 5 percent level of significance and it comes to as under,
using table of Z-distribution, Z0.05 = 1.96
Z0.05 = 1.96
The rejection region is Z > 1.96, and as the observed (calculated) test statistic is 7.03 which is
greater than 1.96, i.e. which is not in the rejection region, thus, there is an evidence to reject H O.
I.e. HO is rejected and we can conclude that the sample data indicate that Selam’s Hotel sales have
shown a considerable increase.
deviation of 3. The production manager may welcome any change is mean value
towards higher side but would like to safeguard against decreasing values of mean.
44 | P a g e
He takes a sample of 36 items that gives a mean value of 48.5. What inference should the manager
take for the production process on the basis of sample results? Use 1 percent level of significance
Solution: -
HO: O ≤50
mean)
n = 36, assuming the population to be normal, we can work out the test statistic Z as under:
      𝑥̅ − 𝜇0
𝑍=
      𝜎/√𝑛
      48.5−50       −1.5
  =             =          = −3.00
       3/ඥ36        3/6
As H1 is one-sided in the given question, we shall determine the rejection region applying one-
tailed test (in the left tail as H1 is of less than type) at 1 percent level of significance and it comes
 R: Z < -2.33
45 | P a g e
                    Z = -2.33
The observed value of Z which we call Z calculated is –3 which is < - 2.33 i.e. it is in the rejection
region, and thus, HO is rejected at 1 percent level of significance. We can conclude that the
production process is showing mean which is significantly less than the population mean and this
calls for some corrective action concerning the said process.
Example 7. A sample of 400 male students is found to have a mean height of 67.47 inches. Can
                  67.39 inches and standard deviation 1.30 inches? Test at 5% level of significance.
Solution: Taking the null hypothesis that the mean height of the population is equal to 67.39
HO: O = 67.39
H1:   67.39
And the given information as 𝑋̅ = 67.47” = 1.30” n = 400. Assuming the population to be normal,
      𝑥̅ − 𝜇0
𝑍=
      𝜎/√𝑛
      67.47−67.39      0.08
  =                 = 0.065 = 1.231
      1.30/ඥ400
As H1 is two-sided in the given question, we shall be applying a two-tailed test. In the two-tailed
As the observed value of Z is 1.231 which is less than 1.96, 1.231, is in the acceptance region for
the rejection region is R: Z > 1.96. Therefore, Ho is accepted. i.e. we may conclude that the given
sample (with mean height = 67.47”) can be regarded to have been taken from a population with
mean height 67.39” and standard deviation 1.30” at 5% level of significance.
* In case if the population is finite but population standard deviation is known, one can use a test-
statistic
            𝑥̅ − 𝜇0
𝑍=
      𝜎     𝑁−𝑛
         ቆ√     ቇ
     √ 𝑛    𝑁−1
               N n
   Where            is what we call the finite population correction
               N 1
And the procedure of testing is the same as in the above three examples.
Example 8.        A workers’ union is on strike for higher wages. The union claims that the mean
                  salary for workers is at most Birr 8,400 per year. The legislator does not want to
                  reject the union’s claim, however, unless the evidence is very strong against it.
                  Assume that salaries follow a normal distribution and the population standard
                  deviation is known to be Birr 3000. A random sample of 64 workers is obtained,
                  and the sample mean is Br, 9,400. Test if the state legislator accepts the unions’
                  claim or not at 1% significance level.
Solution:
HO :   8,400
𝑥̅ = 9,400
𝑛 = 64
𝛼 = 0.01
𝑧0.01 = 2.33
     𝑥̅ −𝜇
Z= 𝛿
       ⁄
        √𝑛
     9,400−8,400
Z=    3,000
           ⁄
            √64
Z= 2.67
𝑧0.01 = 2.33
Case II testing for the population mean; large sample, population standard deviation is
unknown
In such a case, the population standard deviation  will be approximated by the sample standard
deviation S provided the sample size is considerably greater than 30. And the test statistic will be:
And the testing procedure will be the same as in case I for different level of significance and
different types of tests (one-tailed or two-tailed) as mentioned in the three examples in case I.
Example 9.         The Thompson’s discount store chain issues its own credit card. The credit manager
                   wants to find out if the mean monthly-unpaid balance is more than $ 400. The level
of significance is set at .05. A random check of 172 unpaid balances revealed the
sample mean to be $407 and the standard deviation of the sample to be $ 38. Should
48 | P a g e
                        the credit manager conclude that the population mean is greater than $ 400, or is it
HO:   $ 400
Because the alternative hypothesis states a direction, a one-tailed test is applied. The critical value
           X              $ 407  $ 400
      Z=                                    2.42
           S      n           38 172
As the computed value of the test statistic (2.42) is larger than the critical value (1.645) or as 2.42
is in the rejection region R: Z > 1.645, the null hypothesis is rejected and the credit manager can
conclude that the mean unpaid balance is greater than $ 400.
Case III Population Normal, Small sample and Standard deviation of the population is
Unknown
               𝑥̅ −𝜇0
       𝑡=                   Which follows a student t-distribution with (n – 1) degree of freedom.
               𝑆/√𝑛
                             Xi  X 
                                        2
                S=
                               n 1
2) If the population is finite, using the finite population correction, the test statistic used is modified
as:
                          𝑥̅ −𝜇0
            𝑡 =        𝑆   𝑁−𝑛
                                      With (n – 1) df.
                         ቆ√     ቇ
                      √𝑛    𝑁−1
49 | P a g e
While testing, the value of the test statistic calculated from the sample result will be compared
with the tabulated value of the t-distribution at the given level of significance.
Example 10. The specimen of copper wires drawn from a large lot has the following breaking
strength (in kg. Weight): 578, 572, 570, 568, 572, 578, 570, 572, 596, 544.
Test whether the mean breaking strength of the lot may be taken to be 578 kg. Weight at 5% level
of significance
Solution: -
Taking the null hypothesis that the population mean is equal to hypothesized mean of 578 kg., we
can write:
As the sample size is small (n = 10) and the population standard deviation is not known, we shall
use t-test assuming normal population and shall work out the test statistic t as under:
          𝑥̅ − 𝜇0
     𝑡=
          𝑆/√𝑛
Calculating the sample mean, and sample standard deviation, one can obtain
                      572−578
Hence          𝑡=
                     12.72/ඥ10
As H1 is two-sided, we shall determine the rejection regions applying two-tailed test at 5% level
of significance, and it comes to as under, using table of t-distribution for 9.d.f.
                                         or  t  > t / 2
50 | P a g e
 Acceptance region – 2.262 < t < 2.262
As the observed value of t (i.e. –1.488) is in the acceptance region, we accept HO at 5% level and
conclude that the mean breaking strength of copper wires lot may be taken as 578 kg. Weight
* For two-tailed test in t-distribution, if the level is , the area to the right tail and the area to the
EXERCISE:
1. A sample of 36 observations is selected from a normal population. The sample mean is 21,
        and the sample standard deviation is 5. Conduct the following test of hypothesis using the
        .05 level of significance.
HO :   20
H1 :  > 20
2. A machine is set to fill a small bottle with 9.0 gms of medicine. It is claimed that the mean
weight is less than 9 grams. The hypothesis is to be tested at the .01 significance level.
A sample revealed these weights (in gms): 9.2, 8.7, 8.9, 8.6 8.8, 8.5, 8.7, and 9.0.
number of TV sets that are sold within six months of production. A random sample of 600
51 | P a g e
                                     CHAPTER FOUR
CHI-SQUARE DISTRIBUTIONS
   1. It is a continuous distribution.
   2. The 𝑥 2 distribution has a single parameter; the degree of freedom, 𝑣
   3. The mean of the Chi-square distribution is 𝑣
   4. The variance of the Chi-square distribution is2𝑣. Thus the mean and the variance depend
        on the degree of freedom.
52 | P a g e
    5. It is based on a comparison of the sample of observed data (results) with the expected
        results under the assumption that the null hypothesis is true.
    6. It is skewed distribution and only non-negative values of the variable 𝑥 2 are possible. The
        skewness decreases as 𝑣 increases; and when 𝑣 increases without limit it approaches a
        normal distribution. It extends indefinitely in the positive direction.
    7. The area under the curve is 1.0
        Having the above characteristics, 𝑥 2 distribution has the following areas of application:
               1. Test for independence between two variables
               2. Testing for equality of several proportions
               3. Goodness of fit tests (Binomial, Normal and Poisson)
Example:
   1. A company planning a TV advertising campaign wants to determine which TV shows its
        target audience watches and thereby to know whether the choice of TV program an
        individual watches is independent of the individuals income. The table supporting this is
        shown below. Use a 5% level of significance and the null hypothesis.
53 | P a g e
               Low                  143                  70                  37              250
Medium 90 67 43 200
High 17 13 20 50
Solution :
𝐞𝟏𝟏 = 𝟐𝟓𝟎 × 𝟐𝟓𝟎⁄𝟓𝟎𝟎 = 𝟏𝟐𝟓 𝒆𝟏𝟐 = 𝟐𝟓𝟎 × 𝟏𝟓𝟎⁄𝟓𝟎𝟎 = 𝟕𝟓 𝒆𝟏𝟑 = 𝟐𝟓𝟎 × 𝟏𝟎𝟎⁄𝟓𝟎𝟎 = 𝟓𝟎
e21 = 200 × 250⁄500 = 100 𝒆𝟐𝟐 = 𝟐𝟎𝟎 × 𝟏𝟓𝟎⁄𝟓𝟎𝟎 = 60 𝑒23 = 200 × 100⁄500 = 40
A test of the null hypothesis that variables are independent of one another is based on the
magnitude of the differences between the observed frequencies and the expected frequencies.
Large differences between 𝑂𝑖𝑗 𝑎𝑛𝑑 𝑒𝑖𝑗 provide evidence that the null hypothesis is false. The test is
based on the following chi-square test statistic.
1
  For the 𝑅 × 𝐶 contingency table, the degree of freedom are calculated as (𝑅 − 1)(𝐶 − 1). The degrees of freedom
refers to the number of expected frequencies that can be chosen freely provided the row and column totals of expected
frequencies are identical to the row and column totals of the observed frequency table.
54 | P a g e
                                         2
                                             (𝑂𝑖𝑗 − 𝑒𝑖𝑗 )2     2
                                                                    (𝑓𝑜 − 𝑓𝑒 )2
                                        𝑥 =∑               𝑜𝑟 𝑥 = ∑
                                                  𝑒𝑖𝑗                   𝑓𝑒
𝐸𝑖𝑗 (𝑓𝑒 ) = 𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑓𝑜𝑟 𝑐𝑜𝑛𝑡𝑖𝑛𝑔𝑒𝑛𝑐𝑦 𝑡𝑎𝑏𝑙𝑒 𝑖𝑛 𝑟𝑜𝑤 𝑖 𝑎𝑛𝑑 𝑐𝑜𝑙𝑢𝑚𝑛 𝑗.
IV. Reject the null hypothesis that choice of TV program is independent from income level
    2. A human resource manager at EAGEL Inc. was interested in knowing whether the
         voluntary absence behavior of the firm’s employees was independent of the marital status.
         The employee files contained data on material status and on voluntary absenteeism
         behavior for a sample of 500 employees is shown below.
                                                                Marital status
Test the hypothesis that the absence behavior is independent of marital status at a significance
level of 1%.
Solution:
55 | P a g e
                                                𝛼 = 0.01
                                          𝑣 = (𝑅 − 1)(𝐶 − 1)
                                       𝑣 = (3 − 1)(4 − 1) = 6
                                       𝑥 2 𝛼,𝑣 = 𝑥 2 0.01,6 = 16.81
                                    Reject 𝐻𝑜 if sample 𝑥 2 > 16.81
  III. Compute the test statistic
36 30 36 1.200
64 60 16 0.267
50 60 100 1.667
16 20 16 0.800
34 40 36 0.900
50 40 100 2.500
14 10 16 1.600
20 20 0 0.000
16 20 16 0.800
34 40 36 0.900
82 80 4 0.500
84 80 16 0.200
                                                                    (𝑓𝑜 − 𝑓𝑒 )2 10.883
                                                                  ∑
                                                                        𝑓𝑒
    3. The personnel administrator of XYZ Company provided the following data as an example
        of selection among 40 male and 40 female applicants for 12 open positions.
                 Applicant                               Status
56 | P a g e
                     Male                             7            33           40
Female 5 35 40
Total 12 68 80
 A.        The 𝑥 2 test of independence was suggested as a way of determining if the decision to hire
           7 males and females should be interpreted as having a selection bias in favor of males.
           Conduct the test of independence using𝛼 = 0.10. What is your conclusion?
 B.        Using the same test, would the decision to hire 8 males and 4 females suggest concern for
           a selection bias?
 C.        How many males could be hired for the 12 open positions before the procedure would
           concern for a selection bias?
Solution:
      A.
     I.
Ho : There is no selection bias in favor of males. (Selection status and gender of the applicant are
        independent).
𝐻1 : There is selection bias in favor of males. (Selection status and gender of the applicant are not
     independent).
7 6 1 0.1667
33 34 1 0.0294
5 6 1 0.1667
35 34 1 0.0294
57 | P a g e
                                                                (𝑓𝑜 − 𝑓𝑒 )2 0.3922
                                                              ∑
                                                                    𝑓𝑒
8 6 4 0.6667
32 34 4 0.1176
4 6 4 0.6667
36 34 4 0.1176
                                                                  (𝑓𝑜 − 𝑓𝑒 )2 1.5686
                                                              ∑
                                                                      𝑓𝑒
IV.         Do not reject 𝐻𝑜 because 1.569< 2.71
             There is no selection bias in favor of males applicants
       C. There is no shortcut method to answer this question. Therefore, let’s try by increasing the
          number of male applicants who are accepted and decreasing the number of female
          applicants who are females.
 I.
There is no selection bias in favor of males. (Selection status and gender of the applicant are
     independent).
58 | P a g e
𝐻1 : There is selection bias in favor of males. (Selection status and gender of the applicant are not
     independent).
9 6 9 1.5000
31 34 9 0.2647
3 6 9 1.5000
37 34 9 0.2647
                                                           (𝑓𝑜 − 𝑓𝑒 )2 3.5294
                                                         ∑
                                                               𝑓𝑒
IV.    Reject 𝐻𝑜 because 3.5294 > 2.71
        There is no selection bias in favor of males applicants
        Therefore, 8 male and 4 female applicants must be hired for the 12 open positions so as to
        avoid selection bias in favor of males.
The chi-square test for independence is useful in helping to determine whether a relationship exists
between two variables, but it does not enable us to estimate or predict the values of one variable
based on the value of the other. If it is determine that dependence does exist between two
quantitative variables, then the techniques of regression analysis are useful in helping to find a
mathematical formula that expresses the nature of mathematical relationship.
Small expected frequencies can lead to inordinately large chi-square values with the chi-square
test of independence. Hence contingency tables should not be used with expected cell values of
less than 5 one way to avoid small expected values is to combine columns or rows whenever
possible and whenever doing so makes sense.
59 | P a g e
       4.1.2 Testing for the equality of several proportions
Testing for the equality of several proportions emphasizes on whether several proportions are equal
or not; and hence the null hypothesis takes the following form:
𝐻𝑜 : 𝑃1 = 𝑃2 = 𝑃3 = 𝑃20 ; 𝑃3 = 𝑃30 ; … 𝑃𝑘
              = 𝑃𝑘𝑜 ; 𝑎𝑛𝑑 𝑡ℎ𝑒 𝑎𝑙𝑡𝑒𝑟𝑛𝑎𝑡𝑖𝑣𝑒 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠 𝑡𝑎𝑘𝑒𝑠 𝑡ℎ𝑒 𝑓𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑓𝑜𝑟𝑚
Example:
   1. In the business credit institution industry the accounts receivable for companies are
         classified as being “current”. “Moderately late”, “very late” and “uncollectible”. Industry
         figure shows that the ratio of these four classes is 9: 3: 3: 1
 I.      ENDURANCE firm has 800 accounts receivable, with 439, 168, 133, and 60 failing in each
         class. Are these proportions in agreement with the industry ratio? 𝐿𝑒𝑡 𝛼 = 0.05
Solution:
 II.      𝛼 = 0.05
                                          𝑣 = (𝐾 − 1) = (4 − 1) = 3
                                            𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,3 = 7.81
                                         Reject 𝐻𝑜 if sample 𝑥 2 > 7.81
  III. Test statistic (sample 𝑥 2 )
                                                                             (𝑓𝑜 − 𝑓𝑒 )2 6.356
                                                                           ∑
                                                                                 𝑓𝑒
    II.     𝛼 = 0.05
                                               𝑣 = (𝐾 − 1) = (3 − 1) = 2
                                                 𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,2 = 5.99
                                              Reject 𝐻𝑜 if sample 𝑥 2 > 5.99
     III. Test statistic (sample 𝑥 2 )
Blue 20 40 400 10
Yellow 40 40 0 0.00
                                                                               (𝑓𝑜 − 𝑓𝑒 )2
                                                                           ∑               = 20
                                                                                   𝑓𝑒
     IV. Reject 𝐻𝑜 ; because 20 > 5.99. This means that customers do have color preference. It
         appears that red is the most popular color and blue is the least popular.
     3. Rating sciences, Inc., a TV program-rating service, surveyed 600 families where the
           television was turned on during the prime time on week nights. They found the following
           numbers of people turned to the various networks.
2
 Since the null hypothesis states that there is no color preference, each of the three colors is preferred by one third
of the purchases.
61 | P a g e
                  Name of the network                Type               Number of viewers
Arts 170
Balageru 165
600
      A. Test the hypothesis that all four networks have equal proportions of viewers during this
            prime time period. using𝛼 = 0.05.
      B. Eliminate the results for EBC and repeat the test of hypothesis for the three commercial
            networks, using𝛼 = 0.05.
      C. Test the hypothesis that each of the three major networks has 30% of the weeknight prime
            time market and EBC has 10% using𝛼 = 0.005.
Solution:
       A.
     I. 𝐻𝑜 : : 𝐴𝑙𝑙 𝑜𝑓 𝑡ℎ𝑒 𝑓𝑜𝑢𝑟 𝑛𝑒𝑡𝑤𝑜𝑟𝑘𝑠 𝑑𝑜 ℎ𝑎𝑣𝑒 𝑒𝑞𝑢𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑣𝑖𝑒𝑤𝑒𝑟𝑠; 𝑃1 = 𝑃2 = 𝑃3 =
          𝑃4 = 1⁄4
          𝐻1 : 𝐴𝑙𝑙 𝑜𝑓 𝑡ℎ𝑒 𝑓𝑜𝑢𝑟 𝑛𝑒𝑡𝑤𝑜𝑟𝑘𝑠 𝑑𝑜 𝑛𝑜𝑡 ℎ𝑎𝑣𝑒 𝑒𝑞𝑢𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑣𝑖𝑒𝑤𝑒𝑟𝑠
    II.     𝛼 = 0.05
                                            𝑣 = (𝐾 − 1) = (4 − 1) = 3
                                              𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,3 = 7.81
                                           Reject 𝐻𝑜 if sample 𝑥 2 > 7.81
3
  Since equal numbers of viewers are expected to watch each network, each of the four networks is watched by one
fourth of the viewers.
62 | P a g e
          Arts             170                     150                     400        2.6667
                                                                         (𝑓𝑜 − 𝑓𝑒 )2 88.3334
                                                                   ∑
                                                                             𝑓𝑒
 II.      𝛼 = 0.05
                                        𝑣 = (𝐾 − 1) = (3 − 1) = 2
                                          𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,2 = 5.99
                                       Reject 𝐻𝑜 if sample 𝑥 2 > 5.99
  III. Test statistic (sample 𝑥 2 )
                                                                          (𝑓𝑜 − 𝑓𝑒 )2 6.6946
                                                                   ∑
                                                                              𝑓𝑒
   C.
Solution:
I. 𝐻𝑜 : : 𝑃1 = 𝑃2 = 𝑃3 = 0.30 ; 𝑃4 = 0.10
𝐻1 : 𝑂𝑛𝑒 𝑜𝑟 𝑚𝑜𝑟𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛𝑠 𝑎𝑟𝑒 𝑛𝑜𝑡 𝑒𝑞𝑢𝑎𝑙 𝑡𝑜 𝑡ℎ𝑒 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛𝑠 𝑔𝑖𝑣𝑒𝑛 𝑖𝑛 𝑡ℎ𝑒 𝑛𝑢𝑙𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠.
63 | P a g e
 II.     𝛼 = 0.005
                                        𝑣 = (𝐾 − 1) = (4 − 1) = 3
                                        𝑥 2 𝛼,𝑣 = 𝑥 2 0.005,3 = 12.838
                                      Reject 𝐻𝑜 if sample 𝑥 2 > 12.838
  III. Test statistic (sample 𝑥 2 )
II. 𝛼 = 0.05
𝑣 = (𝐾 − 1) = (3 − 1) = 2
64 | P a g e
                                         𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,2 = 5.99
                                      Reject 𝐻𝑜 if sample 𝑥 2 > 5.99
          Because of the above change 𝐻𝑜 is translated as:
A 95 90 25 0.2778
B 85 80 25 0.1250
                                                                               (𝑓𝑜 − 𝑓𝑒 )2        3.9236
                                                                             ∑
                                                                                   𝑓𝑒
The chi-square test is widely used for a variety of analysis. One of the more important uses of chi-
square is the goodness-of-fit-test. That is, it can be used to decide whether a particular probability
distribution, such as the binomial, Poisson, or normal distribution. This is an important ability,
because as decision makers using statistics, we will need to choose a certain probability
distribution to represent the distribution of the data we happen to be considering.
In tests of hypothesis (Previous chapter), we assumed that the population was normal and tested
the hypothesis𝜇 = 𝜇𝑜 , 𝜌 = 𝜌𝑜 , , etc. but what if we want to check on the assumption of normality
itself? The multinomial 𝑥 2 goodness-of-fit-test can be applied.
4
  Expected frequency value for 𝑃4 is less than 5 (200*0.02=4). So, we have to combine 𝑃4 with one of other expected
frequencies, say 𝑃3 , to obtain a combined expected frequency of 30 (200*0.15). it can also be combined with other
expected frequency values.
65 | P a g e
The null hypothesis for a goodness-of-fit-test is test in that the distribution of the population from
which a sample it taken is the one specified. The alternative hypothesis is that the actual
distribution is not the specified distribution. Generally, a researcher specifies only the name of
distribution and uses the sample data to estimate the particular parameters of the distribution. In
this situation one degree of freedom is test for each parameter that has to be estimated. However,
if the research completely specifies the distribution including parameter values, then no additional
degrees of freedom is lost.
𝐻𝑜 : population is normal 𝜇, 𝜎 2
𝐻𝑜 : population is Poisson 𝜆 1
Example (Binomial)
    1. Miss Tsion, saleswoman for Moon Paper Company, has five accounts to visit per day. It is
        suggested that sales by Miss Tsion May be described by the binomial distribution, with the
        probability of selling each account being 0.40. Given the following frequency distribution
        of Miss Tsion’s number of sales per day, can we conclude that the data do in fact follow
        the binomial distribution? Uses 0.05 significance level
         No of sales per day        0        1         2           3        4           5
66 | P a g e
            Frequency               10          41         60             20            6        3
Solution:
 II.       𝛼 = 0.05
                                      𝑣 =𝐾−1−𝑚 =5−1−0 =4
  IV. Do not reject 𝐻𝑜 the data are well described by the binomial distribution with
      𝑛 = 5 , 𝑝 = 0.40
    2. A professional baseball player, Philippos, was at bat five times in each of 100 games.
           Philippos claims that he has a probability of 0.40 of getting a hit each time he goes to bat.
           Test his claim at the 0.05 level by seeing if the following data are distributed binomially.
            No of hits/game                               0       1        2        3       4    5
67 | P a g e
            No of games with that number of hits        12    38     27    17       5         1
Solution:
    II.    𝛼 = 0.05
                                     𝑣 =𝐾−1−𝑚 =5−1−0 =4
                                          𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,4 = 9.49
                                       Reject 𝐻𝑜 if sample 𝑥 2 > 9.49
           Because of the above change 𝐻𝑜 is translated as:
                                                                                  (𝑓𝑜 − 𝑓𝑒 )2 11.9940
                                                                              ∑
                                                                                      𝑓𝑒
IV. Reject 𝐻𝑜 the # of hit over the same is not normally distributed
      3. The Ethiopian postal service is interested in modeling the “mangled letter” problem. It has
           been suggested that any letter sent to a certain area has a 0.15 chance of being mangled.
           Since the post office is so big, it can be assumed that two letters chances of being mangled
           are independent. A sample of 310 people was selected, and two test letters were mailed to
           each of them. The number of people receiving zero, one, or two mangled letters was 260,40
68 | P a g e
           and 10, respectively. At 0.10 level of significance, is it reasonable to conclude that the
           number of mangled letters received by people follows a binomial distribution with𝑃 =
           0.15?
           Solution:
           𝐻𝑜: The number of mangled letters received by people follows a binomial distribution with
           𝑛=2 𝑎𝑛𝑑 𝑝=0.15
           𝐻1: The number of mangled letters received by people doesn’t′ follow a binomial
           distribution with 𝑛=2 𝑎𝑛𝑑 𝑝=0.15
Solution:
     I. 𝐻𝑜 : The number of mangled letters received by people follows a binomial distribution with 𝑛 =
        2 𝑎𝑛𝑑 𝑝 = 0.15
𝐻1 : The number of mangled letters received by people doesn′ t follow a binomial distribution with 𝑛
= 2 𝑎𝑛𝑑 𝑝 = 0.15
    II.    𝛼 = 0.10
                                         𝑣 =𝐾−1−𝑚 =3−1−0 =2
                                             𝑥 2 𝛼,𝑣 = 𝑥 2 0.10,2 = 4.61
                                           Reject 𝐻𝑜 if sample 𝑥 2 > 4.61
     III. Test statistic (sample 𝑥 2 )
                                                                                (𝑓𝑜 − 𝑓𝑒 )2 26.3967
                                                                            ∑
                                                                                    𝑓𝑒
     IV.   Reject 𝐻𝑜 .
           The number of mangled letters received by people doesn′ t follow a binomial distribution with 𝑛 =
           2 𝑎𝑛𝑑 𝑝 = 0.15
Example (Poisson)
69 | P a g e
            observed number of breakdowns per month during a sample of 100 months. Use a 5% level
            of significance and test the null hypothesis.
             Breakdowns                                   0 1   2      3    4      5 and above
Observed frequency 14 20 34 22 5 3
Solution:
  II.       𝛼 = 0.05
                                        𝑣 =𝐾−1−𝑚 =6−1−0 =5
                                           𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,5 = 11.07
                                         Reject 𝐻𝑜 if sample 𝑥 2 > 11.07
   III. Test statistic (sample 𝑥 2 )
                                                                                    (𝑓𝑜 − 𝑓𝑒 )2 6.3117
                                                                              ∑
                                                                                        𝑓𝑒
   IV.       Do not reject𝐻𝑜 . The number of breakdowns per month of a computer system at the
            university follows a Poisson distributionwith 𝜇 = 2. .
   2.       Suppose that a teller supervisor believes that the distribution of random arrivals at local
            bank is Poisson and sets out to test the hypothesis by gathering information. The following
            data represent a distribution of frequency of arrivals during one minute intervals at a bank.
            Use 𝛼 = 0.05 to test these data in an effort to determine whether they are Poisson
            Distributed.
 70 | P a g e
             No of arrivals                              0      1     2       3       4     5 and above
Observed frequency 7 18 25 17 12 5
          Solution:
          Before we solve the question, first we have to compute the arrival rate per minute, and
          hence one degree of freedom is lost.
                                 ∑(𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑎𝑟𝑟𝑖𝑣𝑎𝑙𝑠 ∗ 𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦)
                            𝜆=
                                            ∑(𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦)
               (0 ∗ 7) + (18 ∗ 1) + (25 ∗ 2) + (17 ∗ 3)(12 ∗ 4) + (5 ∗ 5) 192
           =                                                             =       = 2.3𝑐𝑢𝑠𝑡/𝑚𝑖𝑛
                                           84                               84
      I. 𝐻𝑜 : The arrival of customers at a bank is Poisson distributed with 𝜆 = 2.3.
         𝐻1 : The arrival of customers at a bank is not Poisson distributed with 𝜆 = 2.3.
     II.    𝛼 = 0.05
                                           𝑣 =𝐾−1−𝑚 =6−1−1 =4
                                              𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,4 = 9.488
                                            Reject 𝐻𝑜 if sample 𝑥 2 > 9.488
    III.    Test statistic (sample 𝑥 2 )
                                                                                      (𝑓𝑜 − 𝑓𝑒 )2 1.795
                                                                                  ∑
                                                                                          𝑓𝑒
    IV.     Do not reject𝐻𝑜 . The arrival of customers at a bank follows a Poisson distribution with 𝜆 =
            2.3
    71 | P a g e
      3.    The number of automobile accidents occurring per day in a particular city is believed to
            have a Poisson distribution. A sample of 80 days during the past year gives the data shown
            below. Do the data support the belief that the number of accidents per day has a poison
            distribution? Use 𝛼 = 0.05
No of accidents 0 1 2 3 4
Observed frequency(days) 34 25 11 7 3
            Solution:
            Before we solve the question, first we have to compute the occurrence rate per day, and
            hence one degree of freedom is lost.
                                 ∑(𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑎𝑟𝑟𝑖𝑣𝑎𝑙𝑠 ∗ 𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦)
                             𝜆=
                                              ∑(𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦)
                    (0 ∗ 34) + (25 ∗ 1) + (11 ∗ 2) + (7 ∗ 3)(3 ∗ 4) 80
                                                                   =     = 1𝑎𝑐𝑐𝑖𝑑𝑒𝑛𝑡/𝑑𝑎𝑦
                                          80                          80
      I. 𝐻𝑜 : The occurence of acciddents per day follows a Poisson distribution with 𝜆 = 1.0
        𝐻1 : : The occurence of acciddents per day does not follow a Poisson distribution with 𝜆
                        = 1.0
     II.    𝛼 = 0.05
                                           𝑣 =𝐾−1−𝑚 =4−1−1 =2
                                               𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,2 = 5.99
                                             Reject 𝐻𝑜 if sample 𝑥 2 > 5.99
    III.    Test statistic (sample 𝑥 2 )
                                                                                      (𝑓𝑜 − 𝑓𝑒 )2 4.3036
                                                                                  ∑
                                                                                          𝑓𝑒
    72 | P a g e
       IV.      Do not reject𝐻𝑜 .
                The occurence of acciddents follows a Poisson distribution with 𝜆 = 1.0
             Example (Normal)
        1.       Suppose that Ato Paulos developed an overall attitude scale to determine how his
                 company’s employees feel toward their company. In theory the scores can vary from 0 to
                 50. Ato Paulos retests his measurement instrument on a randomly selected group of 100
                 employees. He tallies the scores and summarizes them in to six categories as shown below.
                 Are these retest scores approximately normally distributed with 𝜇 = 24.9 𝑎𝑛𝑑 𝜎 = 7.194?
                 Use 𝛼 = 0.05
Frequency 11 14 24 28 13 10
  I.          𝐻𝑜 : The attitude scoress are normally distributed with 𝜇 = 24.9 𝑎𝑛𝑑 𝜎 = 7.194
              𝐻1 : The attitude scoress are not normally distributed with 𝜇 = 24.9 𝑎𝑛𝑑 𝜎 = 7.194
 II.         𝛼 = 0.05
                                                  𝑣 =𝐾−1−𝑚 =6−1−0 =5
                                                     𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,5 = 11.07
                                                   Reject 𝐻𝑜 if sample 𝑥 2 > 11.07
III.         Test statistic (sample 𝑥 2 )
                            𝑋−𝜇
                 With𝑧 =          , the expected probability of each category can be obtained as follows:
                              𝜎
   73 | P a g e
         25−24.9                                                      +0.00399
 𝑧25 =             = +0.01
          7.194
The six probabilities do not sum to 1.00. even though observed frequencies were obtained only for
these six categories, getting a score less than 10 or greater than 40 was also possible. Because 0.50
of the probabilities liee in each half of a normal distribution utilizing the sum of expected
probabilities on each side of the mean, 24.9, we can obtain a probability of the <
10 𝑐𝑎𝑡𝑒𝑔𝑜𝑟𝑦: 0.5 − (0.06456 + 0.16446 + 0.25175) =
0.01923. 𝑆𝑖𝑚𝑖𝑙𝑎𝑟𝑙𝑦, 𝑤𝑤𝑒 𝑐𝑎𝑛 𝑜𝑏𝑡𝑎𝑖𝑛 𝑡ℎ𝑒 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 > 40 𝑐𝑎𝑡𝑒𝑒𝑔𝑜𝑟𝑦: 0.5 −
(0.00399 + 0.25716 + 0.15809 + 0.06290) = 0.01786. expected frequencies can then be
obtained by multiplying each expected probability by thee total frequency (100), as shown below.
74 | P a g e
                20 − 25                 0.16845                 25.574               25.574
        As the < 10 𝑎𝑛𝑑 > 40 categories have values of less than 5, each must be combined
        with the adjacent category. As a result, the < 10 category becomes part of the 10-15
        categories and the > 40 category becomes part of the 35-40 categories.
10 − 15 0.08379 8.379
15 − 20 0.16446 16.446
20 − 25 0.25574 25.574
25 − 30 0.25716 25.716
30 − 35 0.15809 15.809
35 − 40 0.08076 8.076
75 | P a g e
                 35 − 40            10           0.08076        8.076            3.7018         0.4584
                                                                                 𝑓𝑜 − 𝑓𝑒 )2     2.4409
                                                                            ∑
                                                                                     𝑓𝑒
IV.       Do not reject 𝐻𝑜 . The attitude score are normally distributed with mean 24.9 and standard
          deviation 7.194.
        2. The director of a major soccer team believes that the ages f purchasers of game tickets are
             normally distributed. If the following data represent the distribution of ages for a sample
             of observed purchasers of major soccer game tickets, use the chi-square goodness-of-fit-
             test to determine whether this distribution is significantly different from the normal
             distribution. Assume that 𝛼 = 0.05.
Frequency 16 44 61 56 35 19
       II.    𝛼 = 0.05
                                              𝑣 =𝐾−1−𝑚 =6−1−2 =3
                                                  𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,3 = 7.81
                                                Reject 𝐻𝑜 if sample 𝑥 2 > 7.81
      76 | P a g e
                                                   ∑ 𝑓𝑚 9155
                                            𝑥̅ =       =     = 39.63
                                                     𝑛   231
                                                     2
                                        (∑ 𝑓𝑚)
                                        2                  (9155)2
                               √ ∑ 𝑓𝑚 −          √405375 −
                            𝑠=             𝑛   =             231 = 13.60
                                      𝑛−1             231 − 1
                     𝑋−𝜇
         With 𝑧 =          , the expected probability of each category can be obtained as follows:
                      𝜎
77 | P a g e
               Expected probability                                     0.15682
The six probabilities do not sum to 1.00. Even though the observed frequencies were obtained only
for these six categories, getting a score less than 10 or greater than 70 was also possible.
For < 10
For > 70
Then, the expected frequencies can be obtained by multiplying each expected probability by thee
total frequency (231), as shown below.
78 | P a g e
               50-60    0.15682                 36.225        36.225
        As the < 10 𝑎𝑛𝑑 > 70 categories have values of less than 5, each must be combined
        with the adjacent category. As a result, the < 10 category becomes part of the 10-20
        categories and the > 70 category becomes part of the 60-70 categories.
                                                                           𝑓𝑜 − 𝑓𝑒 )2    2.4497
                                                                       ∑
                                                                               𝑓𝑒
79 | P a g e
IV.       Do not reject 𝐻𝑜 . The age of purchasers of soccer game tickets are normally distributed.
        3.    The instructor for introductory statistics course attempts to construct the final examination
              so that the grades are normally distributed with a mean of 65. From the sample of grades
              appearing in the accompanying frequency distribution table, can you conclude that they
              have achieved his objective? Use 𝛼 = 0.05.
Frequency 4 17 29 49 33 18
       II.    α = 0.05
                                                𝑣 =𝐾−1−𝑚 =5−1−1 =3
                                                    𝑥 2 𝛼,𝑣 = 𝑥 2 0.05,3 = 7.81
                                                  Reject 𝐻𝑜 if sample 𝑥 2 > 7.81
      III.    Test statistic (sample 𝑥 2 )
                                                       ∑ 𝑓𝑚 9690
                                                𝑥̅ =       =     = 64.60~65
                                                         𝑛   150
                                                                 2
                                             2    (∑ 𝑓𝑚)              (9690)2
                                        √∑ 𝑓𝑚 −      𝑛     √649,750 −   150 = 12.63
                                   𝑠=                    =
                                                𝑛−1              150 − 1
                         𝑋−𝜇
              With𝑧 =            , the expected probability of each category can be obtained as follows:
                             𝜎
      80 | P a g e
                  For category 30-40    Probability
         30−65                           0.49720
 𝑧30 =           = −2.77
         12.63
         40−65                           0.47615
 𝑧40 =           = −1.98
         12.63
81 | P a g e
The six probabilities do not sum to 1.00. Even though the observed frequencies were obtained only
for these six categories, getting a score less than 30 or greater than 90 was also possible.
For < 30
For > 90
Then, the expected frequencies can be obtained by multiplying each expected probability by thee
total frequency (150), as shown below.
Since the < 30, 30 − 40 𝑎𝑛𝑑 > 90 categories have values of less than 5, each must be combined
with the adjacent category. As a result, the < 30 𝑎𝑛𝑑 30 − 40 category becomes part of the 40-
50 categories and the > 90 category becomes part of the 80-90 category.
82 | P a g e
               50-60    0.22756             52.5664
                                                                       𝑓𝑜 − 𝑓𝑒 )2    1.6190
                                                                   ∑
                                                                           𝑓𝑒
IV. Do not reject𝐻𝑜 . The grades of students are normally distributed with a mean 65.
Exercise
1. The theory predicts the proportion of beans in the 4 groups A, B, C, D should be 9:3:3:1. In
an experiment among 1600 beans, the numbers in the four groups were 882, 313, 287 and 118.
2. A study was conducted, among 100 professors from 3 different divisions for the preference
on beverages of 3 categories test if there is any relationship between the field of teaching and
preference of beverage.
83 | P a g e
3. A random sample of 600 students from Delhi University are selected and asked their opinion
ab out autonomous Status of Colleges. The results were given below. Test the hypothesis at 5%
level that opinions are independent of class groupings.
CHAPTER FIVE
ANALYSIS OF VARIANCE
An F test is used to test a hypothesis concerning the means of three or more populations, the
technique is called analysis of variance (commonly abbreviated as ANOVA). At first glance, you
might think that to compare the means of three or more samples, you can use the t test, comparing
two means at a time. But there are several reasons why the t test should not be done.
First, when you are comparing two means at a time, the rest of the means under study are ignored.
With the F test, all the means are compared simultaneously. Second, when you are comparing two
means at a time and making all pairwise comparisons, the probability of rejecting the null
hypothesis when it is true is increased, since the more t tests that are conducted, the greater is the
likelihood of getting significant differences by chance alone. Third, the more means there are to
compare, the more t tests are needed. For example, for the comparison of 3 means two at a time, 3
t tests are required. For the comparison of 5 means two at a time, 10 tests are required. And for the
comparison of 10 means two at a time, 45 tests are required.
84 | P a g e
With the F test, two different estimates of the population variance are made. The first estimate is
called the between-group variance, and it involves finding the variance of the means. The second
estimate, the within-group variance, is made by computing the variance using all the data and is
not affected by differences in the means. If there is no difference in the means, the between-group
variance estimate will be approximately equal to the within-group variance estimate, and the F test
value will be approximately equal to 1. The null hypothesis will not be rejected. However, when
the means differ significantly, the between-group variance will be much larger than the within-
group variance; the F test value will be significantly greater than 1; and the null hypothesis will be
rejected. Since variances are compared, this procedure is called analysis of variance (ANOVA).
Let 𝜇1 , 𝜇2 , 𝜇3 ,…..,= 𝜇𝑘 be the mean value for population 1,2,3,….K respectively. Then from
sample data we intend to test the following hypothesis.
𝐻0 =𝜇1 = 𝜇2 = 𝜇3 =…..,= 𝜇𝑘
𝐻1 = Not all 𝜇𝑗 are equal j=1, 2, 3….K
I.e. the null hypothesis should be rejected if any of the n sample means is different from others. A
significant test value means that there is a high probability that this difference in means is not due
to chance, but it does not indicate where the difference lies.
The following are few examples involving more than two populations where it is necessary to
conduct a comparison to arrive at a statistical inference:
85 | P a g e
The first step in the analysis of variance is to partition the total variation in the sample data in to
the following two component variations in such a way that it is possible to estimate the contribution
of factors that may cause variation.
    1. The amount of variation among the sample mean or the variation attributed to the
          difference among sample means. This variation is due to assignable causes.
    2. The amount of variation within the sample observations. This difference is considered due
          to chance causes or experimental [random] errors.
Suppose our main aim is to make inferences about 𝑘 population means based on sample data,
where, 𝜇𝑗 is the mean of the population of measurements associated with the treatment [𝑗 =
1,2,3, … 𝑘]. The null and alternative hypothesis to be tested is stated as:
𝐻0 =𝜇1 = 𝜇2 = 𝜇3 =…..,= 𝜇𝑘
𝐻1 = Not all 𝜇𝑗 are equal j=1,2,3….K
Let:
𝑛𝑗 = size of the 𝑗𝑡ℎ sample [𝑗 = 1,2,3, … 𝑘]
𝑛𝑗 = total number of observations in all samples combined i.e. [𝑛 = 𝑛1 + 𝑛2 + 𝑛3 … 𝑛𝑘 ]or [𝑛 =
𝑟𝑘] if 𝑖 = 𝑗
𝑥𝑖𝑗 = 𝑡ℎ𝑒 𝑖 𝑡ℎ Observation in 𝑗𝑡ℎ sample
 Observation                                   Populations [number of samples]
[measurements] 1 2 3
. . . .
86 | P a g e
                  .                     .                       .                        .
. . . .
Sum 𝑇1 𝑇2 𝑇𝑘 = 𝑇
Where:
        𝑟                                    𝑘                                   𝑟
                                                                              1
𝑇𝑖 = ∑ 𝑥𝑖𝑗                             𝑇 = ∑ 𝑇𝑖                          x̅i = ∑ 𝑥𝑖𝑗
                                                                              𝑟
       𝑖=1                                   𝑗=1                                𝑖=1
             𝑘           𝑟   𝑘
        1        1
𝑥̿ =      ∑ 𝑥̅𝑗 = ∑ ∑ 𝑥𝑖𝑗
       𝑟𝑘        𝑛
            𝑗=1         𝑖=1 𝑗=1
The values of x̅ are called sample means and 𝑥̿ is the grand mean of all observations (or
measurements) in all the samples. Since there are r rows and k columns in the table above, then
total number of observations is = 𝑛 , provided each row has equal number of observations. But if
the number of observations in each row varies, then the total number of observations is 𝑛 = 𝑛1 +
𝑛2 + 𝑛3 … 𝑛𝑘 = 𝑛
  Example 1. Three brands of tires, A, B, C were tested for durability. A sample of four types of
  each brand is subjected to the same test and the number of kilometers until wear out was noted
  from each brand of tires. The data in thousand kilometers is given below
       Observations                                Samples [number of brands]
A B C
1 26 18 23
2 25 16 19
3 28 17 26
4 12 18 30
Since the same number of observations is obtained from each brand of tires [population], therefore
the number of observations in the table is 𝑛 = 𝑟𝑘
                                             𝑛 = 4 × 3 = 12
87 | P a g e
        The sample [population] mean of the three samples are given by
        𝑟
     1
x̅j = ∑ 𝑥𝑖𝑗
     𝑟
       𝑖=1
         4                                    4                                    4
     1       1                            1       1                         1       1
x̅A = ∑ 𝑥𝑖1 = (91)                   x̅B = ∑ 𝑥𝑖2 = (69)                x̅C = ∑ 𝑥𝑖3 = (98)
     4       4                            4       4                         4       4
        𝑖=1                                  𝑖=1                                  𝑖=1
         4     3
     1            1
𝑥̿ =    ∑ ∑ 𝑥𝑖𝑗 =    (26 + 25 + 28 + ⋯ 19 + 30) = 𝟐𝟏. 𝟓𝟎
     12           12
        𝑖=1 𝑗=1
. The degrees of freedom for this application are the denominators in the mean squares, that is,
𝑉1 = 𝑘 − 1 and 𝑉2 = 𝑛 − 𝑘
 𝑉1 𝑑𝑓 𝑁𝑢𝑚𝑒𝑟𝑎𝑡𝑜𝑟 = 𝑘 − 1 ,𝑉1 = 3 − 1 = 𝟐
 𝑉2 𝑑𝑓 𝑑𝑒𝑛𝑜𝑚𝑖𝑛𝑎𝑡𝑜𝑟 = 𝑛 − 𝑘, 𝑉2 = 12 − 3 = 𝟗
      𝐹𝛼,(𝑘−1,𝑛−𝑘) = 𝐹0.05,(3−1,12−3) = 𝟒. 𝟐𝟔
𝑟𝑒𝑗𝑒𝑐𝑡𝑖𝑜𝑛 𝑟𝑒𝑔𝑖𝑜𝑛
4.26
STEP 3. Compute the test statistics, using the procedure outlined here
88 | P a g e
    a) Find the mean and variance of each sample
    b) Find the grand mean, denoted by 𝑥̿ , it is the mean of values in the samples
                                ∑ 𝑥 26 + 25 + 28 + ⋯ … … … … + 30
                         𝑥̿ =      =                              = 21.5
                                 𝑛               12
                                                          1                               22.75+17.25+24.50
        If the samples are equal in size then, 𝑥̿ = 𝑘 (𝑥̅1 , 𝑥̅ 2 , 𝑥̅ 3 … … . 𝑥̅ 𝑘 ) =                       = 21.5
                                                                                                 3
    c) Find the between group variance, denoted by MSB
    If the null hypothesis is true, the population means would all be equal. We would then expect
    that the sample means would be close to one another. If the alternative hypothesis is true,
    however, there would be large differences between some of the sample means.
Sum of squares between samples [SSB] measures the proximity of the sample means to each other.
Calculate the difference between the mean of each sample and the grand mean as x̅1 − 𝑥̿ , x̅2 −
𝑥̿ , x̅3 − 𝑥̿ … x̅k − 𝑥̿ . Multiply each of these by the number of observations in the corresponding
sample and sum. The total gives the sum of the squared differences between the sample means in
each group and is denoted by𝑆𝑆𝐵.
 𝑆𝑆𝐵 = ∑ 𝑛𝑗 (𝑥̅ − 𝑥̿ )2
         𝑗=1
= 𝑛𝐴 (𝑥̅𝐴 − 𝑥̿ )2 + 𝑛𝐵 (𝑥̅ 𝐵 − 𝑥̿ )2 + 𝑛𝐶 (𝑥̅𝐶 − 𝑥̿ )2
= 4(22.75 − 21.50)2 + 4(17.25 − 21.50)2 + 4(24.50 − 21.50)2
= 6.25 + 72.25 + 36
=𝟏𝟏𝟒. 𝟓𝟎
                         𝑆𝑆𝐵         ∑𝑘𝑗=1 𝑛𝑗 (𝑥̅ − 𝑥̿ )2 114.50
                   𝑀𝑆𝐵 =     = 𝑀𝑆𝐵 =                     =       = 𝟓𝟕. 𝟐𝟓
                         𝐾−1               𝐾−1             3−1
          4(22.75 − 21.5)2 + 4(17.25 − 21.5)2 + 4(24.5 − 21.5)2 114.5
        =                                                      =      = 57.25
                                    3−1                           2
    d) Find within group variance -within sample variability
If large differences exist between sample means, at least some sample means differ considerably
from the grand mean, producing a large value of SSB. It is then rationale to reject the null
hypothesis in favor of the alternative hypothesis. Sum of square within sample [SSW] provides a
measure of the amount of variation in the response variable that is not caused by the samples.
89 | P a g e
          𝑟    𝑘
Where:
∑𝑘𝑗=1(𝑥1𝑗 − 𝑥̅𝐴 )2 + ∑𝑘𝑗=1(𝑥2𝑗 − 𝑥̅ 𝐵 )2 + ∑𝑘𝑗=1(𝑥3𝑗 − 𝑥̅ 𝐶 )2 +… … … … . ∑𝑘𝑗=1(𝑥𝑖𝑗 − 𝑥̅𝑗 )2
= [(26 − 22.75)2 +(25 − 22.75)2 (28 − 22.75)2 (12 − 22.75)2 ]+[(18 − 17.25)2 + (16 −
17.25)2 + (17 − 17.25)2 + (18 − 17.25)2 ] + [(23 − 24.50)2 + (19 − 24.50)2 + (26 −
24.50)2 + (30 − 24.50)2 ]
=𝟐𝟐𝟔. 𝟓𝟎
                       𝑆𝑆𝑊    226.50
 𝑀𝑆𝑊 = 𝑀𝑆𝑆𝐸 = 𝑛−𝑘 =                    =𝟐𝟓. 𝟏𝟔𝟕
                              12−3
𝑀𝑆𝑊 = 𝑀𝑆𝑆𝐸
90 | P a g e
                                    ∑(𝑛𝑖 − 1)𝑠𝑖 2             ∑(𝑛𝑖 − 1)𝑠𝑖 2
                            𝑀𝑆𝑊 =                 = 𝑀𝑆𝑆𝐸 =
                                     ∑(𝑛𝑖 − 1)                 ∑(𝑛𝑖 − 1)
                                                 ∑(𝑛𝑖 − 1)𝑠𝑖 2
                                       𝑀𝑆𝑊 =
                                                  ∑(𝑛𝑖 − 1)
                     (4 − 1)(52.917) + (4 − 1)(0.9167) + (4 − 1)(21.67) 226.50
               𝑀𝑆𝑊 =                                                        =
                                 (4 − 1) + (4 − 1) + (4 − 1)                  9
                          = 𝟐𝟓. 𝟏𝟔𝟕
Note:
This formula finds an overall variance by calculating a weighted average of the variances. It does
not involve using differences of the means.
10 6 5
12 8 9
9 3 12
15 0 8
13 2 4
                   𝒙
                   ̅ 𝟏 = 𝟏𝟏. 𝟖              𝒙
                                            ̅ 𝟐 = 𝟑. 𝟖              𝒙
                                                                    ̅ 𝟑 = 𝟕. 𝟔
91 | P a g e
                   𝒔𝟏 𝟐 = 𝟓. 𝟕               𝑠2 2 = 10.2            𝑠3 2 = 10.3
        𝑉1 𝑑𝑓 𝑁𝑢𝑚𝑒𝑟𝑎𝑡𝑜𝑟 = 𝑘 − 1 ,𝑉1 = 3 − 1 = 𝟐
        𝑉2 𝑑𝑓 𝑑𝑒𝑛𝑜𝑚𝑖𝑛𝑎𝑡𝑜𝑟 = 𝑛 − 𝑘, 𝑉2 = 15 − 3 = 𝟏𝟐
      𝐹𝛼,(𝑘−1,𝑛−𝑘) = 𝐹0.05,(3−1,15−3) = 𝟑. 𝟖𝟗
92 | P a g e
                                                                    𝑟𝑒𝑗𝑒𝑐𝑡𝑖𝑜𝑛 𝑟𝑒𝑔𝑖𝑜𝑛
3.89
STEP 3. Compute the test statistics, using the procedure outlined here
                          ∑ 𝑥 10 + 12 + 9 + ⋯ … … … … + 4 116
                   𝑥̿ =      =                           =    = 𝟕. 𝟕𝟑
                           𝑛              12               15
    c) Find the between group variance, denoted by MSB
                                             ∑𝑘𝑗=1 𝑛𝑗 (𝑥̅ − 𝑥̿ )2
                                    𝑀𝑆𝐵 =
                                              𝐾−1
            5(11.8 − 7.73)2 + 5(3.8 − 7.73)2 + 5(7.6 − 7.73)2 160.13
          =                                                  =       = 𝟖𝟎. 𝟎𝟕
                                  3−1                           2
    d) Find within group variance -within sample variability, denoted by 𝑀𝑆𝑊
                                                      ∑(𝑛𝑖 − 1)𝑠𝑖 2
                                             𝑀𝑆𝑊 =
                                                        ∑(𝑛𝑖 − 1)
                         (5 − 1)(5.7) + (5 − 1)(10.2) + (5 − 1)(10.3) 104.80
               𝑀𝑆𝑊 =                                                    =           = 𝟖. 𝟕𝟑
                                  (5 − 1) + (5 − 1) + (5 − 1)                 12
Note:
This formula finds an overall variance by calculating a weighted average of the variances. It does
not involve using differences of the means.
93 | P a g e
Exercise
2. If there is a very strong correlation between two variables then the correlation coefficient
must be
3. In regression, the equation that describes how the response variable (y) is related to the
explanatory variable (x) is:
4. The relationship between number of beers consumed (x) and blood alcohol content (y) was
studied in 16 male college students by using least squares regression. The following regression
equation was obtained from this study:
94 | P a g e
       b. smaller than SST
c. equal to 1
d. equal to zero
b. independent variable
c. intervening variable
d. is usually x
8. Regression analysis was applied to return rates of sparrow hawk colonies. Regression analysis
was used to study the relationship between return rate (x: % of birds that return to the colony in a
given year) and immigration rate (y: % of new adults that join the colony per year). The
following regression equation was obtained.
Y’ = 31.9 – 0.34x
 Based on the above estimated regression equation, if the return rate were to decrease by 10% the
rate of immigration to the colony would:
a. increase by 34%
b. increase by 3.4%
c. decrease by 0.34%
d. decrease by 3.4%
95 | P a g e
                                         CHAPTER SIX
 Such perfect correlation is seldom encountered. We still need to measure correlational strength, –
defined as the degree to which data point adhere to an imaginary trend line passing through the
96 | P a g e
“scatter cloud.” Strong correlations are associated with scatter clouds that adhere closely to the
imaginary trend line. Weak correlations are associated with scatter clouds that adhere marginally
to the trend line.
 Examples of strong and weak correlations are shown below. Note: Correlational strength can not
be quantified visually. It is too subjective and is easily influenced by axis-scaling. The eye is not
a good judge of correlational strength.
To calculate a correlation coefficient, you normally need three different sums of squares (SS). The
sum of squares for variable X, the sum of square for variable Y, and the sum of the cross-product
of XY. The sum of squares for variable X is:
𝑆𝑆𝑋𝑋 = ∑(𝑥𝑖 − 𝑋̅ )2
This statistic keeps track of the spread of variable X. For the illustrative data, 𝑋̅ = 30.83 and SSXX
= (50-30.83)2 + (11-30.83)2 + . . . + (25-30.83)2 = 7855.67. Since this statistic is the numerator of
the variance of X (s ), it can also be calculated as SSXX = (𝑆𝑥2 )(𝑛 − 1). Thus, SSXX = (714.152)(12-
1) = 7855.67. 2
97 | P a g e
The sum of squares for variable Y is:
This statistic keeps track of the spread of variable Y and is the numerator of the variance of 𝑌(𝑆𝑥2 ) .
For the 2 illustrative data 𝑦̅ = 30.883 and SSYY = (22.1-30.883)2 + (35.9-30.883)2 + . . . + (38.4-
30.883)2 = 3159.68. An alterative way to calculate the sum of squares for variable Y is SS YY =
(𝑆𝑥2 )(𝑛 − 1). Thus, SSYY = (287.243)(12-1) = 2 3159.68.
                                                   𝑆𝑆𝑋𝑌
                                         𝑟=
                                              ඥ(𝑆𝑆𝑋𝑋 )(𝑆𝑆𝑌𝑌 )
                                         −4231.1333
                               𝑟=                            = −0.849
                                     ඥ(7855.67)(3159.68)
Coefficient of Determination
The coefficient of determination is the square of the correlation coefficient (r2). For illustrative
data, r2 = -0.8492 = 0.72. This statistic quantifies the proportion of the variance of one variable
“explained” (in a 2 2 statistical sense, not a causal sense) by the other. The illustrative coefficient
98 | P a g e
of determination of 0.72 suggests 72% of the variability in helmet use is explained by
socioeconomic status.
▪ Formula
                       6 ∑ 𝐷2
            𝑅𝑘 = 1 − 𝑁(𝑁2 −1)
100 | P a g e
                                                 y = α + βx
However, this equation (y = α + βx) is an exact one. Assuming that this equation is appropriate, if
the values of α and β had been calculated, then given a value of x, it would be possible to determine
with certainty what the value of y would be. Imagine -- a model which says with complete certainty
what the value of one variable will be given any value of the other!
Charles Spearman’s coefficient of correlation (or rank correlation) is the technique of determining
the degree of correlation between two variables in case of ordinal data where ranks are given to
the different values of the variables. The main objective of this coefficient is to determine the
extent to which the two sets of ranking are similar or dissimilar. This coefficient is determined as
under:
                                                         6∑𝑑 2
                                                         𝑖
Spearman's coefficient of correlation (or rs) = 1 − 𝑛(𝑛2−1)
101 | P a g e
                𝑋̅ = mean of X
                𝑌𝑖 = ith value of Y variable
                𝑌̅ = Mean of Y
                𝑛 = number of pairs of observations of X and Y
                𝜎𝑋 = Standard deviation of X
                𝜎𝑌 = Standard deviation of Y
In case we use assumed means (Ax and Ay for variables X and Y respectively) in place of true
means, then Karl Person’s formula is reduced to:
   Example 1.          Two points on the straight line that represents Mr. Jones’s salary are(x1 ;
       y1) = (5 ; 140 000) and (x2 ; y2) = (9 ; 216 000).
       The slope of the line is given by
                                                          𝑦2 − 𝑦1
                                                 𝑏=
                                                           𝑥2− 𝑥1
                                                          216,000−140,000
                                                      =
                                                                   9−5
                                                          76,000
                                                      =     4
= 𝟏𝟗, 𝟎𝟎𝟎
Two points on the straight line that represents Mr. Brown’s salary are (x1 ; y1) = (6 ; 160 000) and
(x2 ; y2) = (9 ; 244 000). The slope of the line is
                                                     244,000−160,000
                                               =            9−6
= 𝟐𝟖, 𝟎𝟎𝟎
Two points on the straight line that represents Mr. Smith’s salary are (x1 ; y1) = (7 ; 150 000) and
(x2 ; y2) = (9 ; 210 000).The slope of the line is
                       b = 30 000.
Mr. Smith’s salary increased at the highest rate.
   A. The equation is y = a + bx with b = 28000, thus y = a + 28000x.
                Substitute point (6 ; 160 000) into the equation:
102 | P a g e
                160 000 = a + 28 000 × 6
                a + 168 000 = 160 000
                a = −8 000.
                Thus y = −8000 + 28000x.
   B.
        The year 2008 corresponds to x = 10.
Substitute x = 10 into y = −8000 + 28000x which gives
y = −8 000 + 28 000 × 10
= 272 000.
In 2008 he will make R272 000.
        A.      You must do these calculations directly on your calculator. We are only showing
                mathematical how your calculator came to the answer.
103 | P a g e
 Year (𝑖)         Average               Yearly                       𝑥𝑖 2       𝑥𝑖 𝑦𝑖      𝑦𝑖 2
                  interest (𝑥𝑖 )        investment
                                        (𝑦𝑖 )
 1                13.8                  1,060                    190.44     14,628      1,123,600
                                 𝑛 ∑𝑛           𝑛       𝑛
                                    𝑖=1 𝑥𝑖 𝑦𝑖 −∑𝑖=1 𝑥𝑖 ∑𝑖=1 𝑦𝑖
                𝑟=
                                            2                        2
                     √𝑛 ∑𝑛     2   𝑛
                         𝑖=1 𝑥𝑖 −(∑𝑖=1 𝑥𝑖 )
                                              √𝑛 ∑𝑛     2   𝑛
                                                  𝑖=1 𝑦𝑖 −(∑𝑖=1 𝑦𝑖 )
                                      10(22,569)−(149.1)(14,730)
                     =
                         ඥ10(2,229.03)−(149.1)2ඥ10(23,501,300)(147,730)2
                  24,447
            = 32,759.8161
= 𝟎. 𝟖𝟗𝟖𝟗
104 | P a g e
           C. The equation of the straight line is 𝑦 = 𝑎 + 𝑏𝑥 where
                          ∑10          10      10
                           𝑖=1 𝑥𝑖 𝑦𝑖 −∑𝑖=1 𝑥𝑖 ∑𝑖=1 𝑦𝑖
                 𝑏=
                           √𝑛 ∑10 𝑥𝑖 2 −(∑10 𝑥𝑖 )2
                               𝑖=1        𝑖=1
                                  10(22,569)−(149.1)(14,730)
                              =      10(2,229.03)−(149.1)2
                                  24,447
                              =   59.49
                    = 𝟒𝟗𝟒. 𝟗𝟗
          ∑10
           𝑖=1 𝑦𝑖        𝑏 ∑10
                            𝑖=1 𝑥𝑖
And 𝑎 =             −
            𝑛               𝑛
                14,730        (494.99)(149.1)
           =              −
                    10               10
= −𝟓𝟗𝟎𝟕. 𝟑𝟎
        Example 3. The following table shows the number of loans approved for different
                              amounts during the second half of 2008.
                           Amount of loan in Birr Number of loans (y)
                           100,000 (x)
                                             2                          45
3 250
4 250
5 175
6 125
   Example 4.                   A study was undertaken at eight garages to determine how the resale value
       of a car is affected by its age. The following data was obtained:
 Garage                                     Age of car (in years)         Resale value (in Birr)
105 | P a g e
                1                                1                             41,250
2 6 10,250
3 4 24,310
4 2 38,720
5 5 8,740
6 4 26,110
7 1 38,650
8 2 36,200
The garage manager suspects a linear relationship between the two variables. Fit a curve of the
form y = a + bx to the data.
The equation for the regression line is
   1. The annual car sales of a small car manufacturer, c, and the annual advertising
      expenditure, £ a, has product moment correlation coefficient r ac. The data is coded as
                                  𝑎
          x =c − 7000 and 𝑦 = 1000 and the summary is shown in the table below.
         Year       2010       2011       2012   2013       2014      2015      2016       2017
       a) Find, by a statistical calculator, the value of the product moment correlation coefficient
       between x and y , denoted by rxy
       b) State with full justification the value of r ac
       c) Interpret the value of rac
   2. The table below shows the number of Math’s teachers x, working in 8 different towns and
the number of burglaries y, committed in a given month in the same 8 towns.
106 | P a g e
                     town         A           B            C           D             E          F            G              H
X 35 42 21 55 33 29 39 40
Y 30 28 21 38 35 27 30 K
0.0934
0.0062̅0.0075
      𝑃 1−                     a) Use a statistical calculator to find the product moment correlation coefficient between
 ̅(2𝑋̅ =
𝜇𝑃       ̅ 2)   =        𝜇(𝑋1 − 𝑋̅2 ) the
                           ̅          = number of math’s teachers and the number of burglaries, for the towns A to G.
      1− 𝑋
500         ̅
200 𝜇𝑋(1𝑋
0.02          ̅𝑃̅ −
                  1 ̅ 2 ) = b) Interpret the value of the product moment correlation coefficient in the context of this
                  1−−
                    𝑃
 ̅2 = 600
 𝑋
 ̅
 𝑋2 = 250
0.05                                  question.
               3. The table below shows the maximum daytime temperature, in °C, at a certain city Centre, and
               the amount of a certain pollutant in mg per liter.
Maximum Temperature 10 12 14 16 18 20 22 24
Amount of Pollutant 513 475 525 530 516 520 507 521
                   a) Find, using a statistical calculator, the value of the product moment correlation coefficient
               for the above data.
                   b) Test, at the 10% level of significance, whether there is evidence of positive correlation in
               these bivariate data.
               107 | P a g e
Answer key
Chapter one
   1. a) 𝜇𝑋̅ = 75 , 𝜎𝑋̅ = 1.09
                𝑝𝑞
   2.   p ±3√        = 0.08±0.05 and [0.03,0.13]⊂[0,1],0.1210
                 𝑛
                𝑝𝑞
   3. P ±3√          = 0.02±0.05 and [0.01, 0.03] ⊂ [0, 1], 0.9671
                𝑛
   4. A) 0.63         b) 0.1446
   5. 0.9977
108 | P a g e
   6. 0.3483
Chapter Two
   1. (761.19, 798.81)
   2. (242.16, 257.84)
   3. (283.61, 296.39)
Chapter Three
   1. a) ‘One-tailed’’ the alternate hypothesis is greater than direction
       c) This is evidence to conclude that the population mean is greater than 20.
   2. A machine is set to fill a small bottle with 9.0 grams of medicine. A sample of eight
   bottles revealed the following amounts (grams) in each bottle.
   9.2 8.7 8.9 8.6 8.8 8.5 8.7 9.0 At the .01 significance level, can we conclude that the mean
   weight is less than 9.0 grams?
      a) State the null hypothesis and the alternate hypothesis.
   Ho: mu=9 grams
   Ha: mu<9
   b. How many degrees of freedom are there? n-1 = 7
   c. Compute the value of t. What is your decision regarding the null
   hypothesis? X-bar = 8.8 grams; df = 7; Alpha = 1%;
   Test Statistic = (8.8-9)/[0.2268/sqrt(8))] = -.2/0.0802 = -2.4942
   d. Estimate the p-value. P-value = 0.0207
   Conclusion: Since p-value is greater than 1% there is insufficient evidence to reject Ho.
Chapter Four
   1. 0.98743
   2. 0.8634
   3. 0.6543
Chapter Five
Chapter Six
   1. a) rxy=0.969
      b) rac=0.969
109 | P a g e
       c) Strong positive correlation, i.e. The More spend on advertising the more the car sales
2. a) r= 0.792
       c) The critical view for h=7, at 5% significance is 0.6694 as 0.792>0.6694, i.e sufficient to
reject H0
d) Y=0.408x+15.1
3. a) r=0.320
110 | P a g e