STAT 516 hw 2

Author

Karl Gregory

Twenty-four students recorded their heights and the lengths of their index fingers. The measurements, in centimeters and millimeters, respectively, are read into R using the code below:

# height (cm)
Y <- c(162.56, 162.56, 172.72, 165.10, 167.64, 
       193.04, 172.72, 177.80, 185.42, 165.10, 
       175.26, 170.18, 172.72, 203.20, 167.64, 
       168.91, 157.48, 180.34, 160.02, 187.96, 
       187.96, 180.34, 160.02, 193.04)

# length of index finger (mm)
x <- c(75, 70, 70, 68, 71, 78, 80, 73, 68, 67, 
       75, 75, 74, 88, 80, 70, 70, 74, 70, 81, 
       77, 78, 60, 80)

# sample size
n <- length(Y)

It is of interest to use the simple linear regression model to predict the height of a person based on the length of his or her index finger.

1)

Make a scatterplot of the heights versus the index finger lengths with the least-squares line overlaid. Report the intercept and slope of the least-squares line as well as the value of Pearson’s correlation coefficient.

2)

Based on the fitted simple linear regression model, to what difference in height does an additional millimeter of index finger length correspond?

3)

Give an estimate of the error term variance.

4)

Make a normal quantile-quantile plot of the residuals as well as a residuals versus fitted values plot. Then carefully explain whether you believe the assumptions of the simple linear regression model are satisfied for these data.

5)

Give a 99% confidence interval for the slope parameter β1.

6)

State whether you would reject H0: β1=0 versus H1: β10 at the α=0.05 significance level.

7)

Give an estimate of the mean height of persons with index finger length equal to 72 mm. In addition, give a 95% confidence interval for this mean height.

8)

A hand print is found made by a hand with an index finger length of 72 mm. Give an interval which will contain with 95% probability the height of the person who made the print.

9)

Do you think the above interval would be useful in identifying the person who made the print?

10)

Give the value of the coefficient of determination for these data. Interpret the value.

11)

Give the value of the test statistic Ttest=β^1σ^/Sxx as well as the value of Ftest=MSRegMSError.

12)

Give the p-value for testing H0: β0 versus H1: β1>0 based on the value of the test statistic Ttest. Interpret your answer.

13)

Comment on whether there are any outliers in the data set. Show a plot to support your answer.