0% found this document useful (0 votes)
75 views15 pages

Testing of Hypothesis: Structure

This document discusses hypothesis testing, which involves testing claims or assertions about population parameters using sample data. It covers: 1) The key components of hypothesis testing - the null hypothesis, alternative hypothesis, test statistic, and interpreting results. The null hypothesis is the initial claim being tested. 2) Formulating the null and alternative hypotheses - the null is the initial claim and the alternative is what would be supported if the null is rejected. They must be mutually exclusive and exhaustive. 3) The concept of a rejection region and critical value - this is used to determine whether to reject the null hypothesis based on how extreme the test statistic is. If the test statistic is in the rejection region, the null is rejected

Uploaded by

Ankush Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
75 views15 pages

Testing of Hypothesis: Structure

This document discusses hypothesis testing, which involves testing claims or assertions about population parameters using sample data. It covers: 1) The key components of hypothesis testing - the null hypothesis, alternative hypothesis, test statistic, and interpreting results. The null hypothesis is the initial claim being tested. 2) Formulating the null and alternative hypotheses - the null is the initial claim and the alternative is what would be supported if the null is rejected. They must be mutually exclusive and exhaustive. 3) The concept of a rejection region and critical value - this is used to determine whether to reject the null hypothesis based on how extreme the test statistic is. If the test statistic is in the rejection region, the null is rejected

Uploaded by

Ankush Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

TESTING OF HYPOTHESIS

Structure
19.0 Objectives
19.1 Introduction
19.2 Formulation of a Hypothesis
19.3 Rejection Region and Type of Errors
19.3.1 Rejection Region for Large Samples
19.3.2 One-tail and Two-tail Tests
19.3.3 Type I and Type I1 Errors
19.3.4 Rejection Region for Small Samples
19.4 Testing of Hypothesis for a Single Sample
19.4.1 Population Variance is Known
19.4.2 Population Variance not Known
19.5 Tests for Difference between two Samples
19.5.1 Population Variance is Known
19.5.2 Population Variance is not Known
19.6 Let Us Sum Up
19.7 Key Words
19.8 Some Useful Books
19.9 AnswersfHints to Check Your Progress Exercises

19.0 OBJECTIVES -

After going through this Unit you will be in a position to:


explain the concepts of null hypothesis and alternativehypothesis;
identify critical region based on the level of significance;
distinguish between Type I and Type I1 errors;
test for hypothesis concerning population mean on the basis of one sample; and
test for the difference in sample means obtained from two samples.

19.1 INTRODUCTION
In the previous Unit we learnt about the estimation of confidence interval for population
mean on the basis of sample data. In the present Unit we will look into another aspect
of statistical inference, that is, hypothesis testing. Hypothesis is a statement or assertion
or claim about the population parameter. For example, suppose we have a hitch that
the per capita income of Chhattisgarh state is Rs.20000 per annum. We can be sure
about the truth in the above statement if we urldertake a complete census of households
in the state. This implies we collect data on the income of all the households in
Chhattisgarh and calculate the per capita income of the state. However, constraints
such as time, money and manpower may restrict us to go for a s a p l e survey and draw '
a conclusion about the statement on the basis of sample information. The procedure
followed in the above is the subject matter of hypothesis testing.
Hypothesis testing is applied widely in various fields and to various situations. For
example, suppose the effectiveness of a new drug in curing tuberculosis needs to be
Statistical Inference tested. Obviously all the patients suffering from tuberculosis need not be,administered
with the new drug to see its effectiveness. What we need is a representative sample and
test whether the new drug is more effective than existing drugs. As another example let
us take the case of a planner who asserts that the crude birth rate is the same in the
states Bihar and Rajasthan. In this case it may not be possible on our part to go for a
census survey of all the births that have taken place in Bihar and Rajasthan during the
last year andcalculate the crude birth rate. Instead a sample survey is undertaken and
the assertion made by the planner is put to test.
In hypothesistesting we try to answer questions of the following types: Is the sample
under consideration drawn from a particular pbpulation? Is the difference between two
samples significant enough that they cannot belong to the same population?

19.2 FORMULATION OF A HYPOTHESIS


A hypothesis is a tentative statement about a characteristicof a population. It could be
an assertion or a claim also. For example, official records for recent years claim that
female literacy in Orissa is 5 1 per cent. Here a statement or a claim about the rate of
female literacy is being made. Thus it could be considered as a hypothesis.
In hypothesis testing there are four important components: i) null hypothesis, ii)
alternative hypothesis, iii) test statistic, and iv) interpretation of results. We discuss
each of these below.
Usually statistical hypotheses are denoted by the alphabet H. There are two types of
hypothesis: null hypothesis and alternative hypothesis. A null hypothesis is a statement
that we consider to be true about the population and put to test by a test statistic.
Usually we denote null hypothesis by H , .In the example on female literacy in Orissa
our null hypothesis is

where p is the parameter, in this case female literacy in Orissa.


There is a possibility that the null hypothesis that we intend to test is not true and
female literacy is not equal to 5 1 per cent. Thus there is a need for an alternative
hypothesis which holds true in case the null hypothesis is not true. We denote alternative
hypothesis by the symbol H A and formulate it as follows:

We have to keep in mind that null hypothesis and alternative hypothesis are mutually
exclusive, that is, both cannot be true simultaneously. Secondly, both H , and H A
exhaust all possible options regarding the parameter, that is, there cannot be a third
possibility. For example, in the case of female literacy in Orissa, there are two
possibilities -literacy rate is 5 1 per cent or it is not 5 1 per cent; a third possibility is
not there.
It is alrare coincidence that sample mean ( i ) is equal to population mean ( P ). In most
cases we find a difference between 5 and ,D . Is the difference because of sampling
fluctxtation or is there a genuine difference between the sample and the population? In
orderltoanswer this question we need a test statistic to test the difference between the
two. The result that we obtain by using the test statistic needs to be interpreted and a
decisiion needs to be taken regarding the acceptance or rejection of the null hypothesis.
The development of test statistic for hypothesis testing and interpretation of results
requires elaboration. Before discussing further on these two steps we present another
concept -critical or rejection region.
Testing of Hypothesis
19.3 REJECTION REGION AND TYPE OF ERRORS
The underlying idea behind hypothesis testing and interval estimation (discussed in the
previous Unit) is the same. Recall from Unit 18 that a confidence interval is built
around sample mean with certain confidence level. A confidence level of 95 per cent
implies that in 95 per cent cases the population mean would remain in the confidence
interval estimated from the sample mean. It is implicit that in 5 per cent cases the
population mean will not remain within the confidence interval. Note that when the
population mean does not remain within the confidence interval we should reject the
null hypothesis.

19.3.1 Rejection Region for Large Samples


Let us explain the concept of critical region for large samples. Later on we will extend
the concept to small samples.
As you already know from previous Units, sampling distribution of sample mean ( ) x
0
follows normal distribution with mean p and standard deviation -. Thus
J;; x can be
transformed into a standard normal variate, z, so that it follows normal distribution
0
with mean 0 and standard deviation 1. Recall that - is the standard error of 2.
J;;
i-p
In notations = -and
'
I - w-41)
Recall from Unit 15, Block 5 that area under the standard normal curve gives the
probability for different range of values assumed by z. These probabilities can also be
presented in a table (see Table 15.1 in Unit 15, Block 5).
Let us look into the standard normal curve preserited in Fig. 19.1, where the x-axis
represents the variable z and the y-axis represents the probability of z, that is p(z) .
We should note the following points.
When sample mean is equal to population mean (that is, x = p ) we fihd that z =
0. When 2 > p we find that z is positive. On the other hand, when x < p we find
that z is negative.
Note that we are concerned with the difference between 2 and p . Therefore,
negative or positive sign o f z does not matter much.
Higher the difference between x and P ,higher is the absolute value ofz. Thus
z-value measures the discrepancy between 2 and p ,and therefore can be used
as a test statistic.
We should find out a critical value of z beyond which the difference between 2
and p is significant.
If the absolute value of z is less than the critical value we should not reject the
null hypothesis.
If the absolute value of z exceeds the critical value we should reject the null
hypothesis and accept the alternative hypothesis.
Thus in the case of large sampleq the absolute value of z can be considered as test
statistic for hypothesis testing such that
Statistical Inference Let us explain the concept of critical region through the standard normal curve given
in Fig. 19.1 below. When we have'a confidence coefficient of 95 per cent, the area
covered under the standard normal curve is 95 per cent. Thus 95 per cent area under
the curve is bounded by - 1.96 5 5 1.96 . The remaining 5 per cent area is covered by
z I -1.96 and z 2 1.96 .Thus 2.5 per cent of area on both sides ofthe standard normal
curve constitute the rejection region. This area is shown in Fig. 19.1. If the sample
mean falls in the rejection region wr ;eject the null hypothesis.

Fig. 19.1: Critical Regions

19.3.2 One-tail and Two-tail Tests


In Fig. 19.1 we have shown the rejection region on both sides of the standard normal
curve. However, in many cases we may place the rejection region on one side (either
left or right) of the standard normal curve.
a
Remember that if a is the level of significance, then for a two-tail test - area is
2
placed on both sides of the standard normal curve. But if it is a one-tail test then a
area is placed on one-side of the standard normal curve. Thus the critical value for
one-tail and two- tail tests differ.
The selection of one-tail or two-tail test depends upon the formulation of the alternative
hypothesis. When the alternative hypothesis is of the type H A :Z # p we have a two-
tail test, because x' could be either greater than or less than p . On the other hand, if
alternative hypothesis is of the type H A : x' < p , then entire rejection is on the left
hand side ofthe standard normal curve. Similarly, ifthe alteinative hypothesis is of the
type H A : x' > p , then the entire rejection is on the right hand side of the standard
normal curve.
The critical values for z depend upon the level of significance. In Table 19.1 these
critical values for certain specified levels of significance (a ) are given for the tests to
be conducted under the assumption of normal distribution. The values are given for
both two-tail and one-tail tests.

Table 19.1: Critical Values for z-statistic

Significance Level (a) 0.1'0 0.05 0.01 0.005


two-tail test 1.65 1.96 2.58 2.81
one-tail test 1.28 1.65 2.33 2.58
Note: The table is derived from Table 15.1.
Testing of Hypothesis
19.3.3 Type I and Type I1 Errors
In hypothesis testing we reject or do not reject a hypothesis with certain degree of
confidence. As you know, a confidence coefficient of 0.95 implies that in 95 out of 100
samples the parameter remains within the acceptance region and in 5 per cent cases the
parameter remains in the rejection region. Thus in 5 per cent cases the szmple is drawn
from the population but sample mean is too far away from the population mean. In
such cases the sample belongs to the population but our test procedure rejects it.
Obviously we commit an error such that H, is true but gets rejected. This is called
'Type I error'. Similarly there could be situations when the H,is not true, but on the
basis of sample information we do not reject it. Such an error in decision making is
termed 'Type I1 error' (see Table 19.2).
Note that Type I error specifies how much error we are in a position to tolerate. Type
I error is equal to the level of significance, and is denoted by a . Remember that
confidence coeff~cientis equal to 1-a .

Table 19.2: Type of Errors


r

H, true H o not true

Reject Ho Type I Error Correct decision

Do not reject Ho Correct decision Type I1 Error

19.3.4 Rejection Region for Small Samples


Let us go back to Fig. 18.2 in Unit 18 where we have given certain criteria for use of
proper test statistic in interval estimation. We see that in the case of small samples
( n 30 ), if population standard deviation is known we apply z-statistic for hypothesis
testing. On the other hand, if population standard deviation is not known we apply t-
statistic. The same criteria apply to hypothesis testing also.
In the case of small samples if population standard deviation is known the test
statistic is

On the other hand, if population standard deviation is not known the test statistic is

In the case of t-statistic, however, the area under the curve (which implies probability)
changes according to degrees of freedom. p u s while finding the critical value oft we
should take into account the degrees of freedom. When sample size is n, degrees of
freedom is n - 1 . Thus we should remember two things while finding critical value of
t. These are: i) significance level, and ii) degtees of freedom.

Check Your Progress 1


1) Distinguish between the following:
a) Null hypothesis and Alternative hypothesis
b) One-tail and Two-tail tests
Statistical Inference Confidence level and Level of significance
c)
d) Type I and Type I1 errors

~u$posea sample of 100 students has mean age of 12.5 years. Show through
diagram the rejection region at 5 per cent level of significance to test the hypoth-
esis that the sample has a mean age greater than the population mean. Assume
thatpopulation mean and standard deviation are 10 years and 2 years respec-
tively.

19.4 TESTING OF HYPOTHESIS FOR


A SINGLE SAMPLE
In many situations we are asked to judge whether a sample is dgnificantly different
from a givkn population. For example, let us assume that we syrveyed a sample of 400

x"
households of Raigarh district of Chhattisgarh state and calcul ted the per capita income
of these hbuseholds. Subsequently, our task is to test the ypothesis that per capita
income calculated from the sample is not different from h e per capita income of the
district.
In the above example we can have two different sitwions: i) population (in this case
all the households of the district) variance is k n w n , ii) population variance is not
known to us. We explain the steps to be followyl in each case below.

19.4.1 Population Variance is K n w n


Let us consider the case that we know h e per capita income of Raigarh district of
Chhattisgarh as well as its variance./Suppose the data available in official records
show that per capita income of Raigarh district is Rs. 10000 and standard deviation of
per capita income is Rs. 1500. However, we did a sample survey of 400 households
and found that their per capita income is Rs. 10500. Do we accept the data provided in
official recgrds?
In this case' p = Rs. 10000
o = Rs. 1500

From the central limit theorem we know that when sample; size is large, sample is
approximatelynormally distributed. This is true even in cases where the parent population
is normally distributed. Thus the example is appropriate for application of normal
distribution.
Our null hypothesis in this case is Testing ofHypothesis

The null hypothesis suggests that sample mean is equal to population mean. In other
words, per capita income obtained from the sample is the same as the data provided in
official records.
Our alternative hypothesis is

Suppose we do not have any reason to say that per capita income obtained from the
sample ( 2 ) is greater than or smaller than the per capita income available in official
recdrds. Thus our alternative hypothesis is that 2 could be on either side of p .
Therefore, we should go for two-tail test so that rejection region is on both sides ofthe
i standard normal curve and the test statistic is -

By substituting values in the above we obtain

Recall the standard normal curve and the area for different values ofz (See Table 15.1
in Unit 15). We notice that when z = 1.96 the area covered under standard normal
curve is 0.4750. Therefore, the level of significance is 5 per cent. Similarly, when
z = 2.58 the area covered under standard normal curve is 0.4950. Therefore, the level
of significance is 1 per cent.
In the above case since z = 6.67, the sample lies in the critical region and we reject the
null hypothesis. Thus the per capita income obtained from the sample is significantly
different from the per capita income provided in official records.
The steps you should follow are:
I
1) Specify the null hypothesis.
2) Find out whether it requires one-tail or two-tail test. Accordingly identify your
critical region. This will help in specification of alternative hypothesis.
3) Apply sample values to z-statistic given at (19.6).
4) Find out from z-table the critical value according to level of significance.
5) If you obtain a value lower than the critical value do not reject the null hypoth-
esis.
6) If you obtain a value greater than the critical value reject the null hypothesis and
accept the alternative hypothesis.

Example 19.1
Suppose the voltage generated by certain brand of battery is normally distributed. A
random sample of 100 such batteries was tested and found to have a mean voltage of
1.4 volts. At 0.01 level of significance, does this indicate that these batteries have a
general average voltage, that is different from 1.5volts? Assume that population standard
deviation is 0.2 1 volts.
Statistical Inference Since average voltage of the sample can be different frem average voltage of
the population if it is either less than or more than 1.5 volts, our rejection region
is on both sides of the normal curve. Thus it is a case of two-tail test and
alternative hypothesis is

Since the population standard deviation CJ is known, the test statistic is

From the table for the area under the standard normal curve, we find that the
critical value at the 1 per cent level of significance is 2.58. Since the observed
value of z is greater than 2.58 we reject the null hypothesis at 1% level and
accept the alternative hypothesis that the average life of batteries is different
from 1.5 volts.

19.4.2 Population Variance not Known


The assumption that population standard deviation ( a )is known to us is unrealistic,
as we do not.know population mean itself. When a is unknown we have to estimate it
by sample stmdard deviation (s). In such situations there are two possibilities depending
upon the sample size. If the sample size is large ( n > 30) we apply z-statistic, that is,

In case the sample size is small ( n 5 30) we apply t-statistic with n - 1 degrees of
freedom. The test statistic is

The steps you should follow are:


1) Specify the null hypothesis. ,

2) Find out whether it requires one-tail or two-tail test. Accordingly identify the
rejection region in the standard normal curve. This will help in specification of
the alternative hypothesis.
3 Check whether sample size is large ( n > 30) or small ( n s 30 ).
4) In case n > 30,apply z-statistic given at (19.7).
5) Find out from z-table the critical value according to level of significance ( a ).
6) In case n s 3 0 ,apply t-statistic given at (19.8).
7) Find out from t-table (given in Table 15.3 in Block 5) the critical value for n - 1
degrees of freedom and level of significance (a).
8) If you obtain a value lower than q e critical value do not reject the null.hypoth-
esis.
9) If you obtain greater than the critical value reject the null hypothesis and
Example 19.2 Testing ofHypothesis*

A tablet is supposed to contain on an average 10 mg. of aspirin. A random sample of


100 tablets show a mean aspirin content of 10.2 mg. with a standaid deviation of 1.4
mg. Can you conclude at the 0.05 level of significance that the mean aspirin content is
indeed 10 mg.?

Here, the null hypothesis is Ho: CI = 10


The rejection region is on both sides of 10 mg. Thus it requires a two-tail test
and H A :p + l o .
Also, the sample mean is ,- = 10.2 and the sample size n = 100. Since population
standard deviation is not known we estimate it by sample standard deviations.
I? PI
-
Since sample size is large, bur test statistic is = -.By applying relevant
s/&
values from the sample we obtain

At 5 per cent level of significance the critical value of z is 1.96. Since the z
value that we have obtained is less than 1.96, we do not reject the null hypothesis.
Therefore the mean level of aspirin is 10 mg.

r Example 19.3
The population of Haripura district has a mean life expectancy of 60 years. Certain
health care measures are undertaken in the district. Subsequently,a random sample of
25 persons shows an average life expectancy of 60.5 years with a standard deviation
of 2 years. Can we conclude at the 0.05 level of significance that the average life
expectancy in the district has indeed gone up?

Here, Ho : p = 60

We have to test for an increase in life expectancy. Thus it is a case of one-tail


test and the rejection region will be on the right-hand tail ofthe curve.

I Hence our alternative hypothesis is

Here population standard deviation o is not known and we estimate it by the


sample standard deviation s. Here the sample size is small hence we have to
apply t-statistic given at (19.8).

Since sample size is 25, degrees of freedom is 25 - 1 = 24. From the t-table we
find that for 24 degrees of freedom, 5 per cent level of significance, and one-
tail test the t-value is 1.71.
Since t-value obtained above is less than the critical value we donot reject the
hypothesis. Therefore, life expectancy has not changed Therefore, we accept
the alternative hypothesis that life expectancy for the district has not changed
after the health care measures.
Statistical Inference Check Your Progress 2

1) A report claimed that in the 'School Leaving Examination', the average marks
scored in Mathematics was 78 with a standard deviation of 16. However, a ran-
dom sample of 37 students showed an average of 84 marks in Mathematics. In
the light of this evidence, can we conclude that actually the average was more
than 78? Use 0.05 level of significance.

2). A passenger car company claims that average fuel efficiency of cars is 35 km.
per litre of petrol. A random sample of 50 cars shows an average of 32 km. per
litre with a standard deviation of 1.2 km. Does this evidence falsify the claim of
the passenger car company at 0.01 level of significance?

3) A random sample of 200 tins of coconut oil gave an average weight of 4.95 kg
per tin with a standard deviation of 0.2 1 kg. Do we accept the hypothesis of net
weight of 5 kg per tin at 0.01 level of significance?

4) According to a report, the national average annual income of the government


employees during a recent year was Rs. 24,632 with a standard deviation of Rs.
1827.A random sample of 49 government employees during the same year shows
an average annual income of Rs. 25,4 15. On the evidence of this sample, at 0.05
level of significance, can we conclude that the national average annual income of
government employees is indeed Rs. 24,632?
Testing ofHypothesis
19.5 TESTS FOR DIFFERENCE BETWEEN TWO
SAMPLES
Many times we need to test for the difference between two samples. The objective may
be to ascertain whether both samples are drawn from the same population or to check
whether a particular characteristic is the same in twd populations. For example, we
formulate a hypothesis that the production per worker in plant A is the same as the
production per worker in plant B. We discuss below the procedure for testing of such
a hypothesis. i

Here again we deal with two different situations: whether variance of both the
populations are known. Another consideration is sample size: large or small.
The null hypothesis is the statement that population means of both the populgtions are
the same. In notations

I Ho : PI = P2 ...(19.9)
The alternativehypothesis is the statementthat both the population means are different.
In notations

19.5.1 Population Variance is Known


I

When standard deviations (positive square root of variance) of both the populations
are known we apply z statistic specified as follows:

In (19.1 1) above, subscript 1 refers to the first sample and subscript 2 refers tb the
second sample. By applying relevant data in (19.1 1) we obtain-theobserved value ofz
and compare it with the critical value for specified level of significance.

Example 19.4
A bank wants to find out the average savings of its customers in Delhi and Kolkata. A
sample of 250 accounts in Delhi shows an average savings of Rs. 22500 while a
C
sample of 200 accounts in Kolkata shows an average savings of Rs. 21500. Tt is
known that standard deviation of savings in Delhi is Rs. 150 and that in Kolkata is Rs.
200. Can we conclude at 1 per cent level of significancethat banking pattern of customers
in Delhi and Kolkata is the same?
In this case the null kypothesis is Ho : p, = p,

and the alternative hypothesis is H A : pI # p2


We are provided with the information that

2,= Rs. 22500 o, =Rs. 150

Z2= Rs. 22400 0, = ~ s . f k

n, = 250 n, = 200

, ,
Since o and o are known we apply z-test.
Statistical Inference
- x 2 ) - (PI - ~2 1
The test statistic is z = I -

By applying the information provided above we obtain

We find that at 1 per cent level of significance the critical value obtained from Table
19.2 is 2.58.
Since the observed value oft is greater than the critical value oft the null hypothesis is
rejected and the alternativehypothesis is accepted. Thusthe banking pattern of customers
in Delhi and Kolkata are different.

19.5.2 Population Variance is not Known


When population variance (,2) is not known we estimate it by sample variance (s2).
If both samples are large in size ( ~ 3 0then
) we apply z statistic as follows:

On the other hand, if samples are small in size (n5 30 ) then we apply t-statistic as
follows:
-
t = I(%
i--
- x2 1 -(PI - ~2 1
"2

Degrees freedom for t-test = (n,- 1) + (n,- 1) = n, + n2 2

Example 19.5
A mathematics teacher wants to compare the performance of Class X students in two
sections. She administersthe same set of questions to 25 students in Section A and 20
students in Section B. She finds that Section A students have a mean score of 78 marks
with standard deviation of 4 marks while Section B students have a mean score of 75
marks with standard deviation of 5 marks. Is the performance of students in both
Sectionsdifferent at 1 per cent level of significance?
In this case the null hypothesis is H , :p, = p2

and the alternative hypothesis is H A :p, # p2


We are provided with the information that
%=78 s, = 4
Since o , and 0 , are not known arid sample sizes are small we apply t-test. Testing of Hypothesis

t=
1 -2 1- 1 -2 1 178 - 751
3

The degrees of freedom in this case is 25+20-2 = 43.


\
We find out from Table 15.3 that at 1 per cent level of significance the t-value for 43
degrees of fyedom is 2.69.
Since the critical value o f t is less than the observed value o f t we reject the null
hypothesis and accept the alternative hypothesis. Therefore, students in Section A and
Section B are different with respecf to their performance in mathematics.

Check Your Progress 3

1) You are given the following information.

0 , =6.1 0, =6.1 .
Test the following hypothesis.

2, From two normal populations two samples are drawn. The following informa-
tion is obtained.
nl=l 5 n2=10

Test the hypothesis at 1 % level of significancethat there is no difference between


both the populations.

3) Suppose that samples of size n1=20and n2=15 are drawn from two normal popu-
lations. The sample statistics are as follows:
Statistical Inference
Can we conclude at the 5% level of significance that p, < p, ?

LET US SUM UP
In the present Unit we discussed about the methods of testing a hypothesis and drawing
conclusions about the population. Hypothesis is a statement about a parameter. In
order to test a hypothesis we formulate test statistic on the basis of the information
available to us. In this Unit we considered two situations: i) description of a single
sample, and ii) comparison between two samples.
Construction ofthe test statistic depends on the knowledge about the population variance
and sample size. When population variance is known to us or the sample size is
large we apply normal distribution and use z statistic to test the hypothesis. On the
other hand, when we do not know the population variance and sample size is small
we construct the test statistic on the basis o f t distribution. Remember that for large
samples t distribution approximates normal distribution and therefore we can use z
statistic.

19.7 KEY WORDS


Estimator : It is another name given to statistic in the theory of
estidation.
Parameter : It is a measure of some characteristic ofthe population.
Problem of : Some times we may have some information about
Hypothesis Testing certain features of the population and we want to
examine whether the information is tenable in the light
of a random sample drawn from the population. This
problem of statistical inference is called the problem
of hypothesis testing.
SamplingDistribution : It is the relative frequency or probability distribution
of a statisatic.
Sampling Error : In the sampling method, we try to approximate some
feature of a given population from a sample drawn
from it. Since in the sample all the members of the
population are not included, howsoever close the
approximation is, it is not identical to the required
population feature and some error is committed. This
error is called the sampling error.
Sampljng Fluctuation : It is the variation in the values of a statistic computed.
Statistic ' : It is a function of the values of the units that are
included in the sample. The basic purpose of a statistic
is to estimate some population parameter.
Standard Error It is the standard deviation ofthe sampling distribution
of a statistic.
Testing of Hypothesis
19.8 SOME USEFUL BOOKS
Nagar, A. L. and Das, R. K., 1989, Basic Statistics: Oxford University Press, Delhi,
Chapter 9.
., 1991. Statistics for Business and Economics (Third Edition): Prentice
ersey, Chapters 6, 7, 8 and 9.
Keller, G, and B. Warrack, 1991, Essentials of Business Statistics, Wordsworth
Publishing Co., California, Chapters 7 and 8.

19.9 ANSWERSIHINTS TO CHECK YOUR


PROGRESS EXERCISES

Check Your Progress 1

1) Go through Sections 18.2 and 18.3, and answer.


2) It is large sample and a is unknown. It requires one-tail test. Thus rejection
region is to the right hand tail of standard normal curve. Accordingly draw the
diagram.

Check Your Progress 2

1) Since it is large sample with known variance, we apply z-statistic. Since alterna-
tive hypothesis is p > 78 ,we apply one-tail test. The observed value ofz is 2.28
and critical value of z at 5% level of significance is 1.65. Since the observed
value is greater than the critical value we reject the null hypothesis. Therefore,
we conclude that the average marks was more than 78.
2) It is a large sample with unknown variance. It requires two-tail test. The observed
value ofz is 17.68 and critical value ofzat 1% level of significance is 2.58. Since
the observed value is greater than the critical value, the null hypothesis is re-
jected.
3) It is a large sample with unknown variance. Requires two-tail test with z-statistic.
Observed value of z is 3.37. Null hypothesis is rejected.
4) Since it is large sample with known standard deviation, we applyz-statistic. Re-
quires two-tail test. Observed value ofz is 3.00. critical value ofz at 5% level of
significance is 2.58. null hypothesis is rejected. Therefore, the national average
of annual income of government employees is different from Rs.24532.

Check Your Progress 3

1) The samples sizes are large and population standard deviations are known. Hence
we apply z-statistic and observed value ofz is 2.58. Since the alternative hypoth-
esis is p, z p , ,we have a two tail test at a = 0.05 the critical value ofz is 1.96.
Null hypothesis is rejected.
2) The sample sizes are small and a is not known. Hence we apply t-statistic and
observed value oft is 0.61. The hypothesis are Ho: p, = p, against HA:p, z p , .
Thus, it requires a two-tail test. For 23 d.f., at 1% level of significance, the
critical value o f t is 2.50. Ho is not'rejected.
3) The sample sizes are small and a is not known, t-statistic is applied. Observed
value of t is 0.72. Ho is p, = p, and HAis p, < p, . Hence one-tail test is
required. Thus critical value o f t at 33 d.f. for 5% level of-significanceis 2.00.
Ho is not rejected.

You might also like