STAT 704 Fall 2008 -------------------- Homework 2 ------------ Please write your answers neatly and clearly! KNNL = Kutner, Nachtsheim, Neter, Li 1. A random sample of 893 teenagers revealed that in this sample, the mean number of hours per week of TV watching was 12.6, with a standard deviation of 1.8. Find (AND INTERPRET) a 90% confidence interval for the true mean weekly TV-watching time for teenagers. Why can we use a t CI procedure in this problem? 2. An engineer wants to calibrate a pH meter. She uses the meter to measure the pH in 12 neutral substances (pH = 7.0), obtaining the following data: 7.02, 7.04, 7.03, 6.96, 7.01, 6.99, 6.98, 7.06, 7.03, 7.04, 7.01, 6.99. (a) Use a graph to determine whether the assumption of normality for these data is reasonable. (b) Test (at a 0.05 significance level) whether the true mean pH reading for neutral substances differs from 7.0. Use R or SAS and report the P-value of your test. 3. Suppose a sample of 10 types of compact cars reveals the following one-day rental prices (in dollars) for Hertz and Thrifty, respectively: Hertz: 30.61, 18.09, 15.95, 17.64, 28.85, 23.92, 27.95, 24.80, 27.02, 40.05 Thrifty: 29.49, 12.19, 15.07, 15.17, 24.52, 22.32, 25.30, 22.74, 19.35, 34.44 (a) Explain why this is a paired-sample problem. (b) Use a graph to determine whether the assumption of normality is reasonable. (c) Test (at a 0.05 significance level) with a t-test whether Thrifty has a lower true mean rental rate than Hertz. Use R or SAS and report the P-value of your test. 4. Examine the data in Problem 16.7 on page 723 of the KNNL textbook. We will only deal with the data on the first two lines ("Low" and "Moderate"). (a) Use R or SAS to prepare side-by-side box plots for the two samples. Do the spreads seems to differ across samples? (b) Test (at a 0.05 significance level) with a t-test whether the firms rated "Moderate" have a significantly higher mean productivity improvement than those rated "Low". Use R or SAS and report the P-value of your test. (c) Using R or SAS, find (and interpret) a 90% CI for the difference in mean productivity improvement between firms rated "Moderate" and those rated "Low". 5. A cereal company claims its boxes contain 445 grams of cereal. A random sample of 14 boxes produces the following measurements: 441.82 437.38 445.92 444.17 444.89 445.93 443.97 445.40 445.95 443.35 441.95 444.86 438.96 439.38 (a) Use a graph to determine whether the assumption of normality is reasonable. (b) Using an appropriate test (at a 0.05 significance level), determine whether the center of the distribution of cereal weights is 445 grams. Use R or SAS and report the P-value of your test. 6. For the special case of n1 = n2 = n, show that the test statistic for the two-independent samples t-test (assuming equal population variances) has a t-distribution under the null hypothesis. What are the degrees of freedom for this t-distribution? (You can use the fact that the sum of two independent chi-square r.v.'s is a chi-square with degrees of freedom equaling the sum of the d.f. for the individual r.v.'s.)