SLR Prediction

Uploaded by

hanandeh0791

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views21 pages

SLR Prediction

Uploaded by

hanandeh0791

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Simple Linear

Regression - Prediction
Research Objective
Research Question: What is the average student height for students whose mother is 64
inches tall?

How would you figure this out?

Prediction in Regression
Research Question: What is the average student height for students whose mother is 64
inches tall?
Answer: Use the best fit regression line to tell you the answer.

y^ = β^0 + β^1 x = 35.653 + 0.503 × 64 = 67.845

Confidence Intervals for Averages
Using similar principles as we have used in the past to build confidence intervals:

⋆ 1 (x − x̄)
^√ +
y^ ± t σ
n ∑ ni=1 (x i − x̄) 2

Is a confidence interval for the average value of y given an x (the population average
student height for 64 inch tall mothers) where the value of t ⋆ is determined by the
confidence level.
For our analysis, this comes out to be (67.662, 68.054) for a 95% interval.
Notes:

1. Don’t worry about the formula (computer will calculate this for you).
2. Interpetation: We are 95% confident that the average height of all students whose
mothers are 64 inches tall is between 67.662 and 68.054.
Prediction in Regression
Research Question: Shaylee’s mom is 64 inches tall, what will her height be?
Thought Questions:

1. Is this the same question as above? If not, what is the diﬀerence?

It’s not the same. One is asking about an average while one is asking about a
specific person.
The “average” is the line while specific people are the “dots”.
Prediction in Regression
Research Question: Shaylee’s mom is 64 inches tall, what will her height be?
Thought Questions:

2. Should our point prediction (1 number prediction) be the same or diﬀerent?

The point prediction should be the same because “dots” could either fall above or
below the line. In this case, we still think Shaylee’s height will be 67.845.
Prediction in Regression
Research Question: Shaylee’s mom is 64 inches tall, what will her height be?

3. Should our interval for the prediction be the same or diﬀerent? Why or why not?
It should be wider because heights vary a lot from person to person
Prediction Intervals for Individuals
Using similar principles as we have used in the past to build confidence intervals:

⋆ 1 (x − x̄)
^ √1 + +
y^ ± t σ
n ∑ ni=1 (x i − x̄) 2

is a prediction interval for the value of y given an x (for example, Shaylee’s height if her
mom is 64 inches tall) where the value of t ⋆ is determined by the confidence level.
For our analysis, this comes out to be (60.449, 75.268) for a 95% interval.
Notes:

1. Don’t worry about the formula (computer will calculate this for you).
2. Interpetation is similar: We are 95% confident that Shaylee’s height, given her mom is
64 inches tall, should be between 60.449 and 75.268.
Prediction vs Confidence Intervals
Confidence interval for prediction: An interval estimate for the average of y given an x.
Prediction interval for prediction: An interval estimate for the value of a single y given an
x.

Prediction intervals are ALWAYS wider than confidence intervals. Why?

There is more variability from student to student than with the average heights for
students.
Using the Analysis Tool
All previous steps in the tool are the same as covered in previous lecture notes:
Nuances of Predictions
Research Question: Lucy’s mom is 82 inches tall, what will her height be?

Answer:

Don’t do the prediction because its outside of the data range! This is referred to as
extrapolation.
Nuances of Predictions
1. Extrapolation - trying to predict outside of the range of the data.
Nuances of Predictions
2. How do we know if our predictions are any good? For example, how do we know if our
prediction for Shaylee’s height was good or bad?
Issue: To evaluate how well we do at predicting, we essentially need to know the
true answer of the thing we are predicting for.
Solution: Cross-validation
Principles of K-Fold CV
Purpose: Assess how well your model does at predicting
General Idea: Fit your model to part of your data then see how well your model predicts
the remainder of your data
Using the Analysis Tool
Nuances of Cross Validation
1. Randomly split the data into folds → every run of cross-validation will give slightly
diﬀerent results
2. Lots of performance metrics but most common is root mean square error

 n validation
1
RMSE = ∑ (y i − y^i ) 2
⎷ n validation i=1

where y i is an observation in the validation set and y^i is the corresponding prediction.
3. The intuitive interpretation of RMSE is the average error across our predictions.
4. What constitutes a “small” RMSE is relative to the problem.
Additional Prediction Practice
Measuring possum head size can be diﬀicult. However, measuring total possum length is
easier. What is the relationship between possum length and head size? Use a simple linear
regression model (and the course app) to answer the following questions:

1. Sydney found a huge 96 cm possum. What is your predicted head length for this
possum?
95% prediction interval is (92.431, 102.986).
2. Sydney found a huge 96 cm possum. What is the average head length for possums of
this size?
95% confidence interval is (96.545, 98.872).
3. Sydney found a baby 70 cm possum. What is your predicted head length for this
possum?
EXTRAPOLATION!
4. Is your model good or bad at predicting possum head sizes?
The RMSE of a 104 fold CV is 2.0132492.
Key Terminology
Confidence Intervals for Averages Prediction Intervals for Individuals
Extrapoloation Cross validation
Root mean square error (RMSE)

MLR Prediction
No ratings yet
MLR Prediction
16 pages
Math Modeling for Upper Sec Students
50% (2)
Math Modeling for Upper Sec Students
21 pages
Report
No ratings yet
Report
9 pages
EP102 7 InferenceFromAsampleMean
No ratings yet
EP102 7 InferenceFromAsampleMean
88 pages
sssCHAPTER 5. Introduction To Estimation 23
No ratings yet
sssCHAPTER 5. Introduction To Estimation 23
5 pages
Estimation
No ratings yet
Estimation
14 pages
Central Limit Theorem Sample Size Determination Confidence Interval
No ratings yet
Central Limit Theorem Sample Size Determination Confidence Interval
19 pages
Lesson: Estimation of Parameters
No ratings yet
Lesson: Estimation of Parameters
3 pages
Inferences Based On A Single Sample: Confidence Intervals and Tests of Hypothesis (9 Hours)
No ratings yet
Inferences Based On A Single Sample: Confidence Intervals and Tests of Hypothesis (9 Hours)
71 pages
Stat4 Confidence Intervals
No ratings yet
Stat4 Confidence Intervals
34 pages
CH 8
No ratings yet
CH 8
20 pages
Unit-4 - Confidence Interval and CLT
No ratings yet
Unit-4 - Confidence Interval and CLT
29 pages
Unit II
No ratings yet
Unit II
3 pages
Multiple Regression Analysis Guide
No ratings yet
Multiple Regression Analysis Guide
7 pages
Week 11 Lecture
No ratings yet
Week 11 Lecture
17 pages
Stat 3 RD
No ratings yet
Stat 3 RD
91 pages
1 Estimation of Parameters - Part 1
No ratings yet
1 Estimation of Parameters - Part 1
49 pages
Confidence Interval
No ratings yet
Confidence Interval
4 pages
Answers
No ratings yet
Answers
7 pages
10 Estimation and Confidence Intervals
No ratings yet
10 Estimation and Confidence Intervals
33 pages
Chapter 8
No ratings yet
Chapter 8
40 pages
SPSS Hypothesis Testing
No ratings yet
SPSS Hypothesis Testing
8 pages
Estimation
No ratings yet
Estimation
35 pages
SLR Inference
No ratings yet
SLR Inference
33 pages
Answer 4 MGT 2 3rd Year
No ratings yet
Answer 4 MGT 2 3rd Year
8 pages
Inferential Statistics Part 1
No ratings yet
Inferential Statistics Part 1
10 pages
Lecture Notes Confidence Intervals
No ratings yet
Lecture Notes Confidence Intervals
7 pages
Confidence Interval Estimation
No ratings yet
Confidence Interval Estimation
13 pages
3 Confidence Intervals
No ratings yet
3 Confidence Intervals
16 pages
Term Project Stats
No ratings yet
Term Project Stats
5 pages
Lecture 6
No ratings yet
Lecture 6
28 pages
Statistics for Analysts
100% (3)
Statistics for Analysts
27 pages
SMA 4.2 Hypothesis Testing
No ratings yet
SMA 4.2 Hypothesis Testing
21 pages
CH 4 - Estimation & Hypothesis One Sample
No ratings yet
CH 4 - Estimation & Hypothesis One Sample
139 pages
Stat Chapter 4
No ratings yet
Stat Chapter 4
19 pages
SEE5211 Chapter5 P2017
No ratings yet
SEE5211 Chapter5 P2017
48 pages
Point Estimation of Process Parameters
No ratings yet
Point Estimation of Process Parameters
64 pages
Step-by-Step PR-WPS Office
No ratings yet
Step-by-Step PR-WPS Office
6 pages
Statistics Full Notes
No ratings yet
Statistics Full Notes
14 pages
2nd Demo
No ratings yet
2nd Demo
4 pages
Statistics For Management-Single Mean Method
No ratings yet
Statistics For Management-Single Mean Method
10 pages
Stats Reviewer
No ratings yet
Stats Reviewer
8 pages
Sta301 Lec40
No ratings yet
Sta301 Lec40
59 pages
Probability and Statistics 3 - INFERENCE STATISTICS
No ratings yet
Probability and Statistics 3 - INFERENCE STATISTICS
15 pages
Inbound 6396611661788218607
No ratings yet
Inbound 6396611661788218607
123 pages
Module - 6 PROB
No ratings yet
Module - 6 PROB
145 pages
Chapter 7 8
No ratings yet
Chapter 7 8
32 pages
Engineering Data Analysis Guide
No ratings yet
Engineering Data Analysis Guide
36 pages
Confidence Interval
No ratings yet
Confidence Interval
22 pages
Distribution of Data
No ratings yet
Distribution of Data
32 pages
MLR Inference
No ratings yet
MLR Inference
39 pages
BIOE Transes
No ratings yet
BIOE Transes
8 pages
Chapter 6
No ratings yet
Chapter 6
43 pages
BRMS13 - 14 - AC - Jan-Mar 2024
No ratings yet
BRMS13 - 14 - AC - Jan-Mar 2024
116 pages
Sample Size Calculations
No ratings yet
Sample Size Calculations
3 pages
Analysis of Sample Mean
No ratings yet
Analysis of Sample Mean
43 pages
Chapter 8
No ratings yet
Chapter 8
45 pages
Problems 1
No ratings yet
Problems 1
3 pages
Question 1 (10 PTS) : Write The Final Answer ONLY For Each of The Following 5 Questions: 1) Not O 2)
No ratings yet
Question 1 (10 PTS) : Write The Final Answer ONLY For Each of The Following 5 Questions: 1) Not O 2)
1 page
MLR Eda Model
No ratings yet
MLR Eda Model
32 pages
MLR Ethics
No ratings yet
MLR Ethics
19 pages
Ica 6
No ratings yet
Ica 6
4 pages
CTSDG06516XDM PDF
No ratings yet
CTSDG06516XDM PDF
2 pages
Safety Feature of RNPP
No ratings yet
Safety Feature of RNPP
29 pages
Medicine Box
100% (2)
Medicine Box
19 pages
Purifier Disc Cleaner
No ratings yet
Purifier Disc Cleaner
2 pages
Human Resource Management: Stephen P. Robbins Mary Coulter
No ratings yet
Human Resource Management: Stephen P. Robbins Mary Coulter
45 pages
Annual Report 2017 en
No ratings yet
Annual Report 2017 en
202 pages
Literature Review On Indian FMCG Industry
No ratings yet
Literature Review On Indian FMCG Industry
23 pages
Coaching and Mentoring Form
100% (4)
Coaching and Mentoring Form
8 pages
Automatic Controls, Electronic Controls, Compressors, Condensing Units and Packages For All Refrigerants
100% (2)
Automatic Controls, Electronic Controls, Compressors, Condensing Units and Packages For All Refrigerants
0 pages
Tutorial 6 - Domain Modelling
No ratings yet
Tutorial 6 - Domain Modelling
2 pages
1.3 Energy Management & Audit
No ratings yet
1.3 Energy Management & Audit
25 pages
Dbms Query Evaluation
No ratings yet
Dbms Query Evaluation
28 pages
Aspiring Journalists' Mastery Guide
No ratings yet
Aspiring Journalists' Mastery Guide
13 pages
HUB-Cloud Network Architect
No ratings yet
HUB-Cloud Network Architect
4 pages
Unit Test 2
No ratings yet
Unit Test 2
2 pages
116 SA1130 Hi Fi Choice English Nov 1998
No ratings yet
116 SA1130 Hi Fi Choice English Nov 1998
1 page
Complaint Offence Under Section 493-A PPC
No ratings yet
Complaint Offence Under Section 493-A PPC
9 pages
2022 ALS Geochemistry Fee Schedule USD 2022
No ratings yet
2022 ALS Geochemistry Fee Schedule USD 2022
52 pages
Displacement Velocity Acceleration
No ratings yet
Displacement Velocity Acceleration
6 pages
SEO Complete Guide by Surojit
No ratings yet
SEO Complete Guide by Surojit
55 pages
Return-Oriented Programming Attacks
No ratings yet
Return-Oriented Programming Attacks
2 pages
Introduction To The Python Programming Language
No ratings yet
Introduction To The Python Programming Language
41 pages
Advanced Binary Trading Guide - Quotex Edition: 1. Understand The Platform (Quotex)
No ratings yet
Advanced Binary Trading Guide - Quotex Edition: 1. Understand The Platform (Quotex)
4 pages
Ngspice 38 Manual
No ratings yet
Ngspice 38 Manual
715 pages
Broker Setup Packet
No ratings yet
Broker Setup Packet
6 pages
Q2 Project Instructions
No ratings yet
Q2 Project Instructions
12 pages
MID1 Materials
No ratings yet
MID1 Materials
4 pages
Public Liability Insurance Guide
No ratings yet
Public Liability Insurance Guide
8 pages
Fitness Studio Business Plan
No ratings yet
Fitness Studio Business Plan
9 pages
50-SDMS-01 Page 9
No ratings yet
50-SDMS-01 Page 9
1 page

SLR Prediction

Uploaded by

SLR Prediction

Uploaded by

Simple Linear

How would you figure this out?

y^ = β^0 + β^1 x = 35.653 + 0.503 × 64 = 67.845

1. Is this the same question as above? If not, what is the diﬀerence?

2. Should our point prediction (1 number prediction) be the same or diﬀerent?

Prediction intervals are ALWAYS wider than confidence intervals. Why?

You might also like