0% found this document useful (0 votes)
30 views6 pages

Solution

The document provides solutions to three questions involving regression analysis. It interprets regression coefficients, tests hypotheses, and discusses descriptive statistics and model significance. Additionally, it examines the goodness of fit and normality of residuals in the context of multiple regression models.

Uploaded by

Tazeen Zehra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views6 pages

Solution

The document provides solutions to three questions involving regression analysis. It interprets regression coefficients, tests hypotheses, and discusses descriptive statistics and model significance. Additionally, it examines the goodness of fit and normality of residuals in the context of multiple regression models.

Uploaded by

Tazeen Zehra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

Question 01: solution

a) Interpret the regression coefficients:

Interpretation of constant/intercept: The estimate of the intercept is 0.2033. This


value implies that when labour force participation rate of women in 1968 is zero,
labour force participation rate of women in 1972 will be positive 0.2033

Interpretation of regression coefficient (B2): The estimated value of the regression


coefficient is 0.6560. The estimated value implies that when labour force participation
rate of women in 1968 increases by 1, labour force participation rate of women in
1972 would increase by approximately 0.6560 labour force.

b) Are the regression coefficients statistically significant at the 5% level.


P value: missing

c) Explain the value of r-squared.


R2= 0.397

d) Test the null hypothesis H0: B2=1 using the t-test at the 1% level of significance

T-test approach: H0: B2=1


1% level of significance
Criticat value: 2.567
T value: constant = 2.0829
Coefficient = 3.3452

Question 02: solution (Table 5.5)

a) Prepare a scatter plot with salary on the Y axis and spending on the X axis. Also
fit a regression line through the scatter plot.

b) Generate and discuss the


descriptive statistics of the
variables
c) Generate and discuss the correlations between the variables.

d) Estimate the regression of salary on spending and interpret the results.

Interpretation of intercept: The estimated value of the intercept term is12129.37. The
estimated value implies that when both the spending is equal to zero, the salary would be
12129.37 million.
Interpretation of regression coefficient: The estimated value of the regression coefficient is
3.307585. The estimated value implies that when spending increases by 1 dollor, salary
would increase by 3.307585 dollars.
Generate the ANOVA table and explain how it can be used.
e) Use the 95% confidence interval to test the null hypothesis that the slope
coefficient is equal to zero.
The lower limit of confidence interval is 2.681192 and the upper limit is 3.933978. As the
null hypothesis doest not lies within the confidence interval, we can reject the null hypothesis
at 95% confidence interval.
Moreover, the T statistic and the P value also indicate the null hypothesis H0: B2=0 can be
rejected at 5% level of significance.

f) Use the 99% confidence interval to test the null hypothesis that the slope
coefficient is equal to one.

Question 03: solution (Table 7.7)


Y = number of oil wells drilled
X2= price of the wellhead in the previous period
X3= domestic output
X4= GNP in dollars
X5= trend variable

a) Generate descriptive statistics and the pairwise correlations.


b) Estimate a multiple regression of Y on the X variables. Also interpret the
regression results in detail.

Interpretation of intercept (B1) The estimated value of the intercept term is -9.854598.
The estimated value implies that when the number of oil wells drilled X2= price of
the wellhead in the previous period X3= domestic output X4= GNP in dollars X5=
trend variable are equal to zero, number of oil wells drilled would be -9.854598.

Interpretation of regression coefficient (X2) The estimated value of the regression


coefficient is 2.701012. The estimated value implies that when price of the wellhead
in the previous period increases by 1 million, number of oil wells drilled would
increase by 2.701012.

Interpretation of regression coefficient (X3) The estimated value of the regression


coefficient is 3.059606. The estimated value implies that when domestic output
increases by 1, number of oil wells drilled would increase by 3.059606.

Interpretation of regression coefficient (X4) The estimated value of the regression


coefficient is -0.0160601. The estimated value implies that when GNP increases by 1,
number of oil wells drilled would increase by approximately -0.0160601 million
dollars

Interpretation of regression coefficient (X5) the estimated value of the regression


coefficient is -0.0227016. The estimated value implies that when trend variable input
increases by 1 million days, number of oil wells drilled would increase by
-0.0227016.

Test the overall significance of the regression model using the F-test.
The regression results indicate that the F-statistic is equal to 8.99 and its p-value is
0.001. numerater is 4 and denominator is 26.

5% level of significance: Therefore, we can conclude that the overall model is


statistically significant at the 5% level of significance as the F statistic 8.99 is greater
As the overall model is significant, all the independent variables collectively explain
the variation in the dependent variable.

c) Comment on the goodness of fit of the regression model. How would you explain
Adjusted r-squared.

The regression results indicate that the R2 value is 0.5804. The R2 value implies that
59.04% of the variation.. The adjusted R2 value iss lightly lower than R2 as expected

d) Use the residuals of the regression to examine the hypothesis that the error term
is normally distributed. Explain your answer.

The following results were generated form Shapiro wilk normality test.
The P value of the test is 0.10081which is greater than 0.05. Hence, we concluded that the
residual (error term) is normally distributed.

You might also like