
Hypothesis Testing Basics Guide

The document defines key terminology used in hypothesis testing:

• The null and alternative hypotheses represent the two opposing possibilities being tested. The null hypothesis is assumed true by default.

• The test statistic summarizes the sample data so that its value can be judged plausible or implausible under the null hypothesis.

• The rejection region is the range of test statistic values considered too extreme to support the null hypothesis; a critical value separates this region from the rest.

• The significance level sets the probability of rejecting the null hypothesis when it is actually true, with smaller levels making rejection less likely.


Terminology

A hypothesis test takes two opposing possibilities and checks which one is better supported by the available data.
Specifically, the data is summarized by a single value which is judged to be either a plausible or implausible outcome
using probability. A plausible value supports one possibility, while an implausible value supports the other. This logic is
consistent with the intuition expressed in the introductory coin example.

To define the boundary that separates "plausible" from "implausible", we need to be familiar with the terminology
associated with hypothesis testing.

Null and Alternative Hypotheses


The two "opposing possibilities" mentioned are called the null hypothesis and the alternative hypothesis. They are often
denoted as H_0 and H_1 , respectively. These hypotheses are usually mathematical statements about parameters of
interest. For example, "a coin is fair" can be expressed as the hypothesis: a Bernoulli parameter p = 0.5 .

The null hypothesis often takes a "status quo" position, meaning it is the statement assumed to be true by default. In
turn, the alternative hypothesis is typically the statement that a researcher has interest in proving.

In conducting a hypothesis test, the calculations are performed assuming the null hypothesis is true. After weighing the
evidence, the researcher decides to either:

• Fail to reject the null hypothesis, or

• Reject the null hypothesis in favor of the alternative hypothesis.

In other words, without sufficient evidence supporting H_1 , we keep assuming the default of H_0 . Otherwise, sufficient
evidence favoring H_1 would suggest that H_0 ought to be rejected.

MORE INFORMATION

Note that the first decision does not say "accept the null hypothesis". Strongly affirming that something is true is
arguably beyond the scope and ability of a hypothesis test. Thus, the phrase "fail to reject" is more accurate and
preferred.



Test Statistic
The test statistic is a statistic as defined in Section 1.2, which is used to reject or not reject the null hypothesis. This is
achieved by summarizing the sample observations while assuming the null hypothesis is true. Using its sampling
distribution, we can determine whether the calculated test statistic from the data is considered "plausible" or
"implausible".

COACH'S REMARKS

The literature on statistics tends to use certain terms rather loosely. For example, both \bar{X} and \bar{x} are often simply referred
to as "sample mean", in spite of the inherent difference between the two, as previously discussed.

The term "test statistic" is no different. However, less emphasis is placed on a test statistic being a random variable
in hypothesis testing. Therefore, we hereby use "test statistic" to refer only to the value calculated from the data.

If a test statistic is near either tail of the sampling distribution, then the data appears to be a rare occurrence (i.e.
implausible). Conversely, a test statistic closer to the center of the sampling distribution suggests that the data appears
to be a typical occurrence (i.e. plausible). Keep in mind that the sampling distribution is based on the null hypothesis
being true.

When a test statistic is in either tail of the sampling distribution, perhaps it is not true that the data was a rare
occurrence. Instead, the data may have come from a different distribution altogether. This implies that the null
hypothesis is actually incorrect. In other words, an extreme test statistic would support the alternative hypothesis more
than the null hypothesis.

To assist in learning the rest of the jargon, we will assume the following trivial setup throughout this subsection:

• There is one sample observation, X , with mean μ and variance σ² .

• The hypothesis test investigates the value of μ .

• The test statistic is x , the observed value of X .

• The sampling distribution is a normal distribution.

Rejection Region and Critical Value


The rejection region is the range of test statistic values that we consider "too extreme" and thus decide to reject H_0 in
favor of H_1 . A critical value is a value that separates the rejection region from the rest of the possible test statistic
values.



COACH'S REMARKS

Before continuing, it is important to distinguish between two-tailed tests and one-tailed tests.

• Two-tailed: both tails of the sampling distribution are included in the rejection region.

• One-tailed: only one tail of the sampling distribution is included in the rejection region.

We limit our discussion to two-tailed tests for now. The rejection region can be written as

[x ≤ a] ∪ [x ≥ b]

meaning a test statistic x is "too extreme" if it is smaller than a or greater than b , in which case H_0 would be rejected.
The critical values a and b are chosen such that both tails are symmetrical. Since X is normally distributed, the rejection
region can also be written in terms of the standard normal distribution, such as

[z \le -c] \cup [z \ge c] = \left\vert\, z \,\right\vert \ge c

where the test statistic is now z = \dfrac{x - \mu}{\sigma} . In this case, notice that we may avoid keeping track of two critical values, -c
and c , by taking the absolute value of z .
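As a minimal sketch of this standardization (the function name and the example numbers are illustrative assumptions, not from the text):

```python
# Minimal sketch: standardizing an observation into a z test statistic.
# The function name and the example numbers are illustrative assumptions.
def z_statistic(x, mu, sigma):
    """Return z = (x - mu) / sigma for an observation x under H0."""
    return (x - mu) / sigma

# An observation of 12 under H0: mu = 10, with sigma = 2
z = z_statistic(12, 10, 2)
print(abs(z))  # 1.0, to be compared against a critical value c
```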

Significance Level
The critical value sets the boundary for how extreme the test statistic must be in order to reject the null hypothesis. A
critical value is determined by setting a significance level denoted by α , where

\Pr(\left\vert\, Z \,\right\vert \ge c \mid H_0 \text{ is true}) = \alpha

The significance level is the probability of rejecting H_0 , assuming it is true. Clearly, we would prefer not to reject H_0
if it is true, hence \alpha is typically a small percent. The closer \alpha is to 0, the less likely that H_0 will be rejected; a
test statistic would need to be more extreme to provide evidence against H_0 .
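Under the normal setup we have been assuming, the critical value can be recovered from \alpha with an inverse-CDF call. A sketch using Python's standard library (`statistics.NormalDist`, available since Python 3.8; the function name is an assumption):

```python
from statistics import NormalDist

def two_tailed_critical_value(alpha):
    """Critical value c satisfying Pr(|Z| >= c) = alpha for standard normal Z."""
    # Each tail holds alpha/2, so c is the (1 - alpha/2) quantile.
    return NormalDist().inv_cdf(1 - alpha / 2)

print(round(two_tailed_critical_value(0.05), 2))  # 1.96
```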

The following graph illustrates the concepts. It shows a standard normal distribution assuming H_0 is true.



From start to finish, a hypothesis test looks like this:

• Determine an appropriate significance level for the test. A common value is \alpha = 0.05 .

• From the sampling distribution which assumes H_0 is true, determine the critical value that corresponds to the
chosen \alpha .

• Collect data, and use it to calculate the test statistic.

• Compare the test statistic to the critical value based on the rejection region. For a two-tailed test under the setup
we have been assuming:

◦ If \left\vert\, z \,\right\vert \ge c , then it is in the rejection region; reject H_0 .

◦ If \left\vert\, z \,\right\vert < c , then it is not in the rejection region; do not reject H_0 .
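The steps above can be sketched end to end for the two-tailed normal setup. This helper function is an assumption built for illustration, not code from the text:

```python
from statistics import NormalDist

def two_tailed_z_test(x, mu0, sigma, alpha=0.05):
    """Two-tailed z-test of H0: mu = mu0 given one observation x.

    Returns (z, c, reject): H0 is rejected exactly when |z| >= c.
    """
    z = (x - mu0) / sigma                      # step 3: test statistic
    c = NormalDist().inv_cdf(1 - alpha / 2)    # step 2: critical value
    return z, c, abs(z) >= c                   # step 4: compare

# Illustrative numbers: observing x = 7 when H0 claims mu = 5, with sigma = 1
z, c, reject = two_tailed_z_test(7, 5, 1)
print(round(z, 2), round(c, 2), reject)  # 2.0 1.96 True
```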

Let's try performing a complete hypothesis test.

EXAMPLE 4.1.1

It is believed that the mean age of licensed drivers in 2017 is 43.7. To test whether this is the case, one licensed
driver's age is observed to be 60 in 2017. These ages are normally distributed with variance 80.

Test whether the mean age of licensed drivers in 2017 differs from 43.7 at the 5% significance level.



SOLUTION

Start by formally stating the null and alternative hypotheses. Letting \mu represent the mean age of licensed drivers in
2017,

H_0: \mu = 43.7


H_1: \mu \ne 43.7

Next, determine the critical value. Since \alpha = 0.05 ,

\Pr(\left\vert\, Z \,\right\vert \ge c) = 0.05

The graph indicates that c is the (2.5 + 95) = 97.5th percentile of Z . From the Z -table, obtain c = 1.96 .

Next, calculate the test statistic.

z = \dfrac{x - \mu}{\sigma} = \dfrac{60 - 43.7}{\sqrt{80}} = 1.82

Since \left\vert\, 1.82 \,\right\vert < 1.96 , the test statistic does not fall in the rejection region, and thus we do not reject
the null hypothesis of \pmb{\mu} \mathbf{\, = 43.7} at the 5% significance level. This means the observation of a 60-
year-old licensed driver is plausible if the mean age is in fact 43.7.
\tag*{$\blacksquare$}
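The arithmetic in this solution can be double-checked with a short script, using `statistics.NormalDist` from the Python standard library in place of a Z-table:

```python
from statistics import NormalDist

# Re-deriving the numbers of Example 4.1.1
z = (60 - 43.7) / 80 ** 0.5          # test statistic
c = NormalDist().inv_cdf(0.975)      # 97.5th percentile of Z
print(round(z, 2), round(c, 2))      # 1.82 1.96
print(abs(z) >= c)                   # False -> fail to reject H0
```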

While no additional terminology is required to perform a hypothesis test, there are a few others worth mentioning.



p-Value
A p -value is the probability of observing the test statistic or a value more extreme, assuming H_0 is true. Thus, for a
test statistic z , the p -value is

\Pr(\left\vert\, Z \,\right\vert \ge \left\vert\, z \,\right\vert \mid H_0 \text{ is true})

This is similar to the definition of a significance level, with \left\vert\, z \,\right\vert replacing the critical value c .
Therefore, instead of comparing \left\vert\, z \,\right\vert with c to make a decision, we may compare the p -value with
the significance level \alpha .

• If p -value \le \alpha , then reject H_0 .

• If p -value > \alpha , then do not reject H_0 .

It should be evident that the two comparisons are equivalent. A larger \left\vert\, z \,\right\vert results in a smaller p -
value, and vice versa.

What is the p -value for Example 4.1.1?

\begin{align} \Pr(\left\vert\, Z \,\right\vert \ge \left\vert\, z \,\right\vert) & = 2 \cdot \Pr(Z \ge \left\vert\, z \,\right\vert) \\ & = 2
\cdot \Pr(Z \ge 1.82) \\ & = 2(1 - \Pr(Z < 1.82)) \\ & = 2(1 - 0.9656) \\ & = \mathbf{0.0688} \end{align}

As expected, we arrive at the same conclusion to not reject H_0 , since the p -value of 6.88% is greater than the 5%
significance level.
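The p-value calculation can likewise be sketched in code (same standard-library normal distribution as before; the function name is an assumption):

```python
from statistics import NormalDist

def two_tailed_p_value(z):
    """p-value Pr(|Z| >= |z|) for a two-tailed z-test."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

p = two_tailed_p_value(1.82)
print(round(p, 4))  # 0.0688
print(p > 0.05)     # True -> do not reject H0
```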



Type I and Type II Errors
While we are unable to know for certain if a hypothesis is true, we try to make an informed decision with a hypothesis
test. This does not mean hypothesis test decisions are always right, even when all the necessary test assumptions are
met. However, we can assume H_0 to be either true or false, and then consider the impact of making a wrong decision.

A type I error occurs when H_0 is rejected while it is true. The probability of making this error is the significance level
\alpha . In other words, we are willing to make a wrong decision 100\alpha \% of the time when H_0 is true, balanced
by the possibility that H_0 is actually false.

A type II error occurs when H_0 fails to be rejected while it is false. The probability of making this error is denoted as
\beta .

                        H_0 is True         H_0 is False

Reject H_0              Type I Error        Correct Decision

Fail to Reject H_0      Correct Decision    Type II Error

In layman's terms, a type I error is a false positive, while a type II error is a false negative. Hypothesis test decisions are
not error-free; they merely provide sensible judgment given the data.
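A quick Monte Carlo sketch (the simulation setup is an illustration, not from the text) shows that when H_0 is true, a 5%-level two-tailed z-test commits a type I error about 5% of the time:

```python
import random
from statistics import NormalDist

random.seed(0)  # reproducible illustration
c = NormalDist().inv_cdf(0.975)  # 5%-level two-tailed critical value

# Simulate many tests in which H0 really is true (Z ~ standard normal);
# the fraction of rejections estimates the type I error rate.
trials = 100_000
rejections = sum(abs(random.gauss(0, 1)) >= c for _ in range(trials))
print(rejections / trials)  # close to alpha = 0.05
```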

Power of a Test
We would like to make the right decision of rejecting H_0 when it is false. The probability of making this right decision is
called the power of a test, which is denoted by 1 - \beta . This is because rejecting H_0 when it is false is the
complement of a type II error.

Since the condition is that H_0 is false, a power can only be calculated when an explicit alternative to H_0 is provided.

What is the power of the test in Example 4.1.1 if in reality \mu = 48 ?

To calculate this probability, we first need the critical values in the original unit of age.

\dfrac{a - 43.7}{\sqrt{80}}= -1.96 \quad \Rightarrow \quad a = 26.169


\dfrac{b - 43.7}{\sqrt{80}}= 1.96 \quad \Rightarrow \quad b = 61.231

In summary, the rejection region for the test is

[x \le 26.169] \cup [x \ge 61.231]

Therefore, if \mu = 48 , the power of the test is



\begin{align} \Pr([X \le 26.169] \cup [X \ge 61.231]) & = \Pr\left(\left[Z \le \dfrac{26.169 - 48}{\sqrt{80}}\right] \cup \left[Z
\ge \dfrac{61.231 - 48}{\sqrt{80}}\right]\right) \\ & = \Pr([Z \le -2.44] \cup [Z \ge 1.48]) \\ & = \Pr(Z \le -2.44) + \Pr(Z \ge
1.48) \\ & = [1 - \Pr(Z < 2.44)] + 1 - \Pr(Z < 1.48) \\ & = 1 - 0.9927 + 1 - 0.9306 \\ & = \mathbf{0.0767}\end{align}

Intuitively, this low power makes sense. If \mu = 48 , then H_0 should be rejected, since \mu \ne 43.7 . However, there
is not a major difference between \mu = 43.7 and \mu = 48 . As a result, the test is unlikely to detect the distinction, and
thus unlikely to correctly reject H_0 .
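The power computation can be verified numerically. Note that the exact answer (about 0.0769) differs slightly from 0.0767 because the calculation above rounds the z-values to two decimals before consulting the Z-table:

```python
from statistics import NormalDist

Z = NormalDist()
mu_true, sigma = 48, 80 ** 0.5
a, b = 26.169, 61.231  # critical values in the original unit (years)

# Power = Pr(reject H0 | mu = 48) = Pr(X <= a) + Pr(X >= b)
power = Z.cdf((a - mu_true) / sigma) + (1 - Z.cdf((b - mu_true) / sigma))
print(round(power, 4))  # ~0.0769 (0.0767 in the text after rounding z-values)
```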

OTHER INFORMATION

You may be curious what the rejection region is for the coin toss scenario. The null hypothesis is that p = 0.5 for a
binomial sampling distribution with 10 "trials". Note that

\Pr(\le 1 \text{ head}) + \Pr(\ge 9 \text{ heads}) = 0.0215

while \Pr(\le 2 \text{ heads}) + \Pr(\ge 8 \text{ heads}) would exceed 0.05. Therefore, at the 5% significance level, we
reject the null hypothesis if 0, 1, 9, or 10 heads are observed from 10 coin tosses.
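The binomial tail probabilities quoted here are easy to verify with `math.comb`:

```python
from math import comb

def binom_pmf(k, n=10, p=0.5):
    """Pr(k heads in n fair coin tosses)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

narrow = sum(binom_pmf(k) for k in (0, 1, 9, 10))
wide = sum(binom_pmf(k) for k in (0, 1, 2, 8, 9, 10))
print(round(narrow, 4))  # 0.0215
print(round(wide, 4))    # 0.1094, which exceeds 0.05
```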
