36-401 Modern Regression HW #7 Solutions
DUE: 11/3/2017 at 3PM
Problem 1 [40 points]
(a) (5 pts.)
[Figure: scatterplot matrix (pairs plot) of Sex, Ht, Wt, LBM, RCC, WCC, Hc, Hg, Ferr, BMI, and Bfat.]
Figure 1: Data on 102 male and 100 female athletes collected at the Australian Institute of Sport
(b) (5 pts.)
I have provided quite a few sample (pre-outlier) residual diagnostic summaries up to this point, so I omit the discussion here.
[Figure 2 panels: residuals from the full regression plotted against Sex, Ht, Wt, RCC, WCC, Hc, Hg, Ferr, BMI, and Bfat, and against the fitted values; observations 11, 37, and 160 are labeled in the residuals-vs-fitted panel.]
Figure 2: Linear Regression Residual Plots
(c) (5 pts.)
Again, omitting the discussion. See past HW solutions.
Table 1: Summary of LBM Regression on Australian Institute of
Sport Data
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.9980681 5.8990540 0.5082286 0.6118795
Sex 0.2974007 0.2264383 1.3133848 0.1906289
Ht 0.0424954 0.0329911 1.2880873 0.1992739
Wt 0.8456297 0.0407385 20.7575246 0.0000000
RCC 0.0351007 0.2690925 0.1304411 0.8963547
WCC -0.0158286 0.0269263 -0.5878501 0.5573273
Hc 0.0138507 0.0505976 0.2737415 0.7845791
Hg -0.0788514 0.1206357 -0.6536325 0.5141347
Ferr 0.0003470 0.0011303 0.3070358 0.7591506
BMI 0.0700461 0.1341848 0.5220119 0.6022669
Bfat -0.7766341 0.0147278 -52.7325075 0.0000000
(d) (5 pts.)
Table 2: Eigenvalues of Gram Matrix
Eigenvalues
9568797.81
382253.71
20508.53
8523.50
2367.41
585.92
319.61
27.73
8.83
5.77
[Figure: bar plot of the eigenvalues (in units of 10^6) against the sorted eigenvalue indices 1 through 10.]
(e)
We construct a 90% confidence rectangle for the regression parameters by using a Bonferroni correction: the endpoints for each parameter correspond to a 99% marginal confidence interval, so that all ten intervals hold simultaneously with probability at least 0.90. The interval endpoints are shown in Table 3.
Table 3: 90% Confidence Rectangle for Regression Coefficients
0.5 % 99.5 %
Sex -0.29 0.89
Ht -0.04 0.13
Wt 0.74 0.95
RCC -0.67 0.74
WCC -0.09 0.05
Hc -0.12 0.15
Hg -0.39 0.24
Ferr 0.00 0.00
BMI -0.28 0.42
Bfat -0.81 -0.74
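The marginal level used in Table 3 follows from splitting the overall error rate evenly across the coefficients. With $p = 10$ parameters and familywise level $1 - \alpha = 0.90$,
$$
1 - \frac{\alpha}{p} = 1 - \frac{0.10}{10} = 0.99,
$$
which is why the table's columns are labeled 0.5% and 99.5%.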
(f) (5 pts.)
Table 4: Summary of LBM Regression on Australian Institute of
Sport Data
Estimate Std. Error t value Pr(>|t|)
(Intercept) -1.7432696 5.9836490 -0.2913389 0.7710987
Sex -8.3863142 0.5930703 -14.1405054 0.0000000
Ht 0.1048551 0.0328314 3.1937406 0.0016353
Wt 0.6408123 0.0226820 28.2520776 0.0000000
RCC 0.8090598 0.5756953 1.4053612 0.1614890
Again, omitting a discussion here.
(g) (5 pts.)
[Figure: 95% joint confidence ellipse for the Ht and Wt coefficients, with Height on the horizontal axis (about 0.05 to 0.15) and Weight on the vertical axis (about 0.60 to 0.70).]
Figure 3: 95% Confidence Ellipsoid for Height and Weight
(h) (5 pts.)
Table 5: Analysis of Variance Table
                  Res.Df         RSS   Df   Sum of Sq          F   Pr(>F)
Reduced (model2)     197  1457.42797   NA          NA         NA       NA
Full (model1)        191    82.25216    6    1375.176   532.2222        0
The F-test yields a p-value of $2.492207 \times 10^{-116}$, strong evidence that the larger model includes additional valuable information for predicting lean body mass.
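As a sanity check, the F statistic and p-value can be reproduced from the RSS and degrees-of-freedom entries in Table 5; a minimal sketch in R:

# Partial F-test computed from the Table 5 entries
rss_reduced <- 1457.42797; df_reduced <- 197
rss_full    <- 82.25216;   df_full    <- 191
F_stat <- ((rss_reduced - rss_full) / (df_reduced - df_full)) / (rss_full / df_full)
F_stat                                                         # approximately 532.22
pf(F_stat, df_reduced - df_full, df_full, lower.tail = FALSE)  # approximately 2.49e-116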
Problem 2 [30 points]
(a) (10 pts.)
$$
X^T X =
\begin{pmatrix}
\|v_1\|^2 & v_1^T v_2 & \cdots & v_1^T v_q \\
v_2^T v_1 & \|v_2\|^2 & \cdots & v_2^T v_q \\
\vdots & \vdots & \ddots & \vdots \\
v_q^T v_1 & v_q^T v_2 & \cdots & \|v_q\|^2
\end{pmatrix}
=
\begin{pmatrix}
\|v_1\|^2 & 0 & \cdots & 0 \\
0 & \|v_2\|^2 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & \|v_q\|^2
\end{pmatrix},
$$
since the columns $v_1, \ldots, v_q$ of $X$ are mutually orthogonal, so $v_i^T v_j = 0$ whenever $i \neq j$.
If $\|v_j\| > 0$ for all $j$, then $\det(X^T X) = \prod_{j=1}^{q} \|v_j\|^2 > 0$. Therefore, $X^T X$ is non-singular.
(b) (10 pts.)
$$
(X^T X)^{-1} =
\begin{pmatrix}
\frac{1}{\|v_1\|^2} & 0 & \cdots & 0 \\
0 & \frac{1}{\|v_2\|^2} & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & \frac{1}{\|v_q\|^2}
\end{pmatrix}
$$
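As a quick numerical illustration of parts (a) and (b), assuming a small made-up design matrix with mutually orthogonal columns (the values are arbitrary, not from the problem):

# Design matrix with mutually orthogonal (but not orthonormal) columns
X <- cbind(c(1, 1, 0, 0),
           c(1, -1, 0, 0),
           c(0, 0, 3, 0))
G <- t(X) %*% X                         # off-diagonal entries are v_i' v_j = 0
G                                       # diag(2, 2, 9): the squared column norms
all.equal(solve(G), diag(1 / diag(G)))  # TRUE: the inverse has reciprocal entries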
(c) (10 pts.)
There are many ways to show this. Let
$$
\hat\beta = (\hat\beta_1, \ldots, \hat\beta_q)^T
$$
be any parameter vector estimate, yielding predictions $\hat Y = X \hat\beta$. Now define
$$
\tilde\beta = (\hat\beta_1 + t, \hat\beta_2, \ldots, \hat\beta_q)^T
$$
for some $t \neq 0$. Since $v_1 = (0, 0, \ldots, 0)^T$, the first column of $X$ contributes nothing to the predictions, so
$$
X \tilde\beta = X \hat\beta = \hat Y,
$$
and the two estimates yield equal residuals and thus equal squared errors.
We have shown that, given any estimate, there are infinitely many distinct estimates yielding the
same MSE. Therefore, there cannot be a unique minimizer of squared error.
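A minimal numerical check of this argument, assuming a made-up design matrix whose first column is identically zero (all names and values here are illustrative):

set.seed(1)
X <- cbind(0, matrix(rnorm(20), nrow = 10))   # first column is all zeros
Y <- rnorm(10)
rss <- function(b) sum((Y - X %*% b)^2)
beta_hat   <- c(1, 0.5, -0.3)                 # an arbitrary estimate
beta_tilde <- beta_hat + c(5, 0, 0)           # shift the zero column's coefficient
rss(beta_hat) == rss(beta_tilde)              # TRUE: identical squared error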
Problem 3 [30 points]
(a) (15 pts.)
$$
\begin{aligned}
\hat\beta_\lambda &= (X^T X + \lambda I)^{-1} X^T Y \\
&= \left( \lambda (\lambda^{-1} X^T X + I) \right)^{-1} X^T Y \\
&= \lambda^{-1} (\lambda^{-1} X^T X + I)^{-1} X^T Y \\
&= \underbrace{(\lambda^{-1} X^T X + I)^{-1}}_{\to\, I}
   \underbrace{\begin{pmatrix} v_1^T Y / \lambda \\ \vdots \\ v_q^T Y / \lambda \end{pmatrix}}_{\to\, 0} \\
&\to \begin{pmatrix} 0 \\ \vdots \\ 0 \end{pmatrix}
\quad \text{as } \lambda \to \infty.
\end{aligned}
$$
Here we used the continuity of the matrix inverse operator.
(b) (15 pts.)
$$
\begin{aligned}
\lambda \hat\beta_\lambda &= \lambda (X^T X + \lambda I)^{-1} X^T Y \\
&= \lambda \left( \lambda (\lambda^{-1} X^T X + I) \right)^{-1} X^T Y \\
&= \underbrace{(\lambda^{-1} X^T X + I)^{-1}}_{\to\, I} X^T Y \\
&\to X^T Y \quad \text{as } \lambda \to \infty.
\end{aligned}
$$
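Both limits can be checked numerically; a small sketch with simulated data (the data and dimensions are arbitrary assumptions for illustration):

set.seed(2)
X <- matrix(rnorm(50 * 3), nrow = 50)
Y <- rnorm(50)
ridge <- function(lambda) solve(t(X) %*% X + lambda * diag(3), t(X) %*% Y)
ridge(1e8)          # essentially the zero vector (part a)
1e8 * ridge(1e8)    # essentially t(X) %*% Y (part b)
t(X) %*% Y          # the limit in part (b)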
Appendix
addTrans <- function(color,trans)
{
# This function adds transparency to a color.
# Define transparency with an integer between 0 and 255,
# 0 being fully transparent and 255 being fully visible.
# Works with either color and trans a vector of equal length,
# or one of the two of length 1.
if (length(color)!=length(trans)&!any(c(length(color),length(trans))==1)){
stop("Vector lengths not correct")
}
if (length(color)==1 & length(trans)>1) color <- rep(color,length(trans))
if (length(trans)==1 & length(color)>1) trans <- rep(trans,length(color))
num2hex <- function(x)
{
hex <- unlist(strsplit("0123456789ABCDEF",split=""))
return(paste(hex[(x-x%%16)/16+1],hex[x%%16+1],sep=""))
}
rgb <- rbind(col2rgb(color),trans)
res <- paste("#",apply(apply(rgb,2,num2hex),2,paste,collapse=""),sep="")
return(res)
}
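For example, a quick check of the helper (this call is illustrative, not part of the original script):

addTrans("orange", 120)   # "#FFA50078": orange with alpha 120 out of 255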
Problem 1 [40 points]
sports <- read.table("http://stat.cmu.edu/~larry/=stat401/sports.txt", header = TRUE)
# Drop the columns not used in this analysis
sports$Sport <- sports$Label <- sports$SSF <- NULL
(a) (5 pts.)
pairs(sports, pch = 19, cex = 0.4, cex.axis = 1.4)
(b) (5 pts.)
model1 <- lm(LBM ~ ., data = sports)
# Round x down (floor = TRUE) or up (floor = FALSE) to the nearest multiple of 5.
nearest5 <- function(x, floor = TRUE){
if ( x%%5 == 0 ){
return(x)
} else {
if ( floor ){
tmp <- x - x%%5
} else {
tmp <- x - x%%5 + 5
}
return(tmp)
}
}
# Plot the residuals of `model` against the index-th column of `sports`,
# with reference gridlines and a loess smooth.
resid_plot <- function(model, index){
plot(sports[[index]], residuals(model), col = NA, axes = FALSE,
xlab= names(sports)[index], ylab = "Residuals", font.lab = 3)
xax <- seq(nearest5(min(sports[[index]])), nearest5(max(sports[[index]]),
FALSE), by = 5)
cand_increm <- c(0.5,1,2.5,5,10,15,20)
lens <- rep(NA,length(cand_increm))
for (itr in 1:length(cand_increm)){
lens[itr] <- length(seq(min(xax),max(xax), by = cand_increm[itr]))
}
xax <- seq(min(xax),max(xax), by = cand_increm[which.min(abs(lens - 10))])
yax <- seq(-4,2,1)
axis(side = 1, at = xax, as.character(xax), font = 5)
axis(side = 2, at = yax, labels = as.character(yax), font = 5)
abline(h = yax, v = xax, col = "gray70", lty = 2)
abline(0,0, lty = 2, col = "gray45")
points(sports[[index]], residuals(model), col = addTrans("orange",120),
pch = 19, cex = 1.25)
points(sports[[index]], residuals(model), col = "orange", cex = 1.25)
panel.smooth(sports[[index]], residuals(model), col = NA, cex = 0.5,
col.smooth = "seagreen", span = 0.5, iter = 3)
}
par(mfrow=c(4,2))
par(oma=c(0,0,0,0))
par(mar = c(4,4,2,1)+0.1)
boxplot(residuals(model1) ~ sports[[1]],
col = addTrans(c("seagreen","orange"),120),
border = c("seagreen","orange"), xlab = "Sex", font.lab = 5,
ylab = "Residuals", pch = 19, boxwex = 0.5)
abline(0,0, lty = 2, col = "gray45")
for (itr in c(2:3,5:9)){
resid_plot(model1, itr)
}
par(mfrow=c(4,2))
par(oma=c(0,0,0,0))
par(mar = c(4,4,2,1)+0.1)
for (itr in 10:11){
resid_plot(model1, itr)
}
plot(model1, which = 1, col = NA, pch = 19, axes = FALSE,
add.smooth = FALSE, caption = "", sub.caption = "",
font.lab = 3)
xax <- seq(nearest5(min(fitted(model1))),
nearest5(max(fitted(model1)),FALSE), by = 10)
yax <- seq(-4,2,1)
abline(h = yax, col = "gray70", lty = 2)
abline(v = xax, col = "gray70", lty = 2)
abline(0,0, lty = 2, col = "gray45")
axis(side = 1, at = xax, as.character(xax), font = 5)
axis(side = 2, at = yax, labels = as.character(yax), font = 5)
points(fitted(model1), residuals(model1), col = addTrans("orange",120), pch = 19)
points(fitted(model1), residuals(model1), col = "orange")
panel.smooth(fitted(model1), residuals(model1),
col = "orange",cex = 1, col.smooth = "seagreen", span = 0.5, iter = 3)
(c) (5 pts.)
library(knitr)
kable(summary(model1)$coefficients,
caption = "Summary of LBM Regression on Australian Institute of Sport Data")
(d) (5 pts.)
X <- as.matrix(sports[,c(1:3,5:11)])
G <- t(X) %*% X
eig <- eigen(G)
tmp <- data.frame(Eigenvalues = eig$values)
kable(tmp, digits = 2,
caption = "Eigenvalues of Gram Matrix")
barplot(eig$values, col = NA, xlab = "", ylab = "", ylim = c(0,10000000),
xaxt = "n", yaxt = "n")
abline(h = seq(0,10000000,1000000),col = "gray70", lty = 2)
barplot(eig$values, col = "orange", xlab = "", ylab = "", add = TRUE,
ylim = c(0,10000000), xaxt = "n", yaxt = "n")
mids <- barplot(eig$values, col = "orange", xlab = "", ylab = "",
add = TRUE, ylim = c(0,10000000), xaxt = "n", yaxt = "n", plot = FALSE)
axis(side = 1, at = mids, labels = 1:10, font = 5, tick = FALSE,
line = -0.75)
axis(side = 2, at = seq(0,10000000,1000000), labels = FALSE, font = 5)
text(par("usr")[1] - 0.65, seq(0,10000000,1000000) + 500000,
labels = as.character(seq(0,10,1)), srt = 0, pos = 1, xpd = TRUE)
mtext(side = 1, text = "Sorted Eigenvalue Indices", font = 3, line = 1.5)
mtext(side = 2, text = "Eigenvalue (times 1e6)", font = 3, line = 3)
(e)
kable(confint(model1, level = 0.99, parm = 2:11), digits = 2,
caption = "90% Confidence Rectangle for Regression Coefficients")
(f) (5 pts.)
model2 <- lm(LBM ~ Sex + Ht + Wt + RCC, data = sports)
kable(summary(model2)$coefficients,
caption = "Summary of LBM Regression on Australian Institute of Sport Data")
(g) (5 pts.)
library(ellipse)
plot(ellipse(model2,which=c(3,4),level=0.95), type = "l", axes = FALSE,
xlab = "Height", ylab = "Weight",
font.lab = 3)
yax <- seq(0.56,0.7,0.02)
xax <- seq(0,0.2,0.05)
abline(h = yax, col = "gray70", lty = 2)
abline(v = xax, col = "gray70", lty = 2)
abline(0,0, lty = 2, col = "gray45")
axis(side = 1, at = xax, as.character(xax), font = 5)
axis(side = 2, at = yax, labels = as.character(yax), font = 5)
lines(ellipse(model2,which=c(3,4),level=0.95), lwd = 3.5, col = "orange")
(h) (5 pts.)
kable(anova(model2,model1), caption = "Analysis of Variance Table")