Cash-back offer from May 7th to 12th, 2024: Get a flat 10% cash-back credited to your account for a minimum transaction of $50.Post Your Questions Today!

Question DetailsNormal
$ 25.00

STAT 501 – Homework 10 (covers Lesson 11) | Complete Solution

Question posted by
Online Tutor Profile
request

STAT 501 – Homework 10 (covers Lesson 11) – Spring 2015 – due 5 April


Instructions: Use Word to type your answers within this document. Then, submit your answers in the appropriate dropbox in ANGEL by the due date. The point distribution is located next to each question. If there are multiple parts, then the points are divided equally over the subparts.

________________________________________________________________


1.      (45 points) Use the “Female Bears Data.” Data from n = 19 female bears of varying ages are used to develop an equation for estimating Y = female bear's weight from X = female bear's neck circumference.


a.       Fit a simple linear regression model with Y = female bear's weight and X = female bear's neck circumference. Click the “Storage” button in the Minitab Regression Dialog and select each of the items in the left-hand list (i.e., Fits, Residuals, Standardized residuals, Deleted residuals, Leverages, Cook’s distance, DFITS). Write down the estimated regression equation and the MSE for this model.

b.      Which bear number has the highest leverage and what is that leverage? [Leverages are in the column labeled “HI1”]

c.       Is the leverage in the previous part higher than the threshold 3(p/n)?

d.      Use the estimated regression equation from part (a) to calculate the fitted value for bear #6. [You can check your answer with the one Minitab provides in the column labeled “FITS1”.]

e.       Use your answer from the previous part together with the actual weight of bear #6 to calculate the residual for this bear. [You can check your answer with the one Minitab provides in the column labeled “RESI1”.]

f.       What is the leverage for bear #6?

g.      Use the residual from part (e), the MSE from part (a), and the leverage from part (f) to calculate the internally studentized residual for bear #6. [You can check your answer with the one Minitab provides in the column labeled “SRES1” – remember Minitab calls these “Standardized residuals.”]

h.      Delete bear #6 from the dataset as follows: select Data > Subset Worksheet, click “Specify which rows to exclude,” click “Row numbers,” and type “6” into the adjoining box. Then refit the simple linear regression model with Y = female bear's weight and X = female bear's neck circumference. Write down the estimated regression equation and the MSE for this model.

i.        Use the residual from part (e), the MSE from part (h), and the leverage from part (f) to calculate the externally studentized residual for bear #6. [You can check your answer with the one Minitab provides in the column labeled “TRES1” in the original worksheet – remember Minitab calls these simply “Deleted residuals.”]

j.        Use the estimated regression equation from part (h) to calculate the predicted value for bear #6 (i.e., based on the model fit to the subset worksheet excluding bear #6). [Note: the answer won’t make a whole lot of sense, but don’t worry about this since we’re simply going to use this predicted value for part (k).]

k.      Use the fitted value from part (d), the predicted value from part (j), the MSE from part (h), and the leverage from part (f) to calculate the DFFITS for bear #6. [You can check your answer with the one Minitab provides in the column labeled “DFIT1” in the original worksheet.]

l.        Is the absolute value of DFFITS in the previous part higher than the threshold given in the online notes, ?

m.    Use the residual from part (e), the MSE from part (a), and the leverage from part (f) to calculate the Cook’s distance for bear #6. [You can check your answer with the one Minitab provides in the column labeled “COOK1” in the original worksheet.]

n.      Is the Cook’s distance from the previous part higher than the upper threshold given in the notes, 1?

o.      Briefly summarize your findings with respect to bear #6. You might want to consider graphical evidence too!


2.      (27 points) Use the “College GPA Data.” Data from n = 40 college students are used to develop an equation for estimating Y = grade point average (GPA) from X1 = verbal score on a college entrance exam (percentile) and X2 = math score on a college entrance exam (percentile).


a.       Fit a “full quadratic” multiple linear regression model with Y, X1, X2, X12, X22, and X1 X2. [In Minitab: Select Y as the Response, X1 and X2 as the Continuous predictors, click “Model,” select both X1 and X2 together in the Predictors box and click the Add buttons next to “Interactions through order 2” and “Terms through order 2.”] Also click the “Storage” button in the Minitab Regression Dialog and select Deleted residuals, Leverages, and Cook’s distance. Write down the estimated regression equation.

b.      Which student has the largest absolute externally studentized residual and what is that externally studentized residual?

c.       Is the externally studentized residual from the previous part greater in absolute value than 3? What do we call such points?

d.      Which student has the highest leverage and what is that leverage?

e.       Is the leverage from the previous part higher than the threshold 3(p/n)?

f.       What is it about the student identified in part (d) that gives him/her such a high leverage? (Hint: compare this student’s exam scores with other students’ scores.)

g.      Which student has the highest Cook’s distance and what is that Cook’s distance?

h.      Is the Cook’s distance from the previous part higher than the upper threshold given in the notes, 1?

i.        Investigate whether removing any of the observations identified in the previous parts dramatically alters the model results.


3.      (4+4+8+6+6=28 points) Use the “Brand Preference Data.” Here, n = 16 observations are used to develop an equation for estimating Y = Degree of brand liking from X1 = Moisture content of the product and X2 = Sweetness of the product. The results were obtained from an experiment based on a completely randomized design (the data is coded).


a.       Obtain the studentized deleted residuals and identify any outlying Y observations using the Bonferroni outlier test procedure with α = 0.10. State the decision rule and your conclusion. (In Minitab: Use “Storage” and check “Deleted residuals” under “Stat > Regression > Regression > Fit Regression Model …” to get studentized deleted residuals).

b.      Use the leverage values to explain if any of the observations outlying with regard to their X-values according to the rule of thumb 3(p/n)?

(In Minitab, use “Storage” and check “Leverages” under “Stat > Regression > Regression > Fit Regression Model …” to get leverage values).

c.       The Management wishes to estimate the mean degree of brand liking for moisture content X1 = 10 and sweetness X2 = 3. Construct a scatter plot of X2 against X1 and determine visually whether this prediction involves an extrapolation beyond the range of the data. Also, use equation (10.29) of the textbook to determine whether an extrapolation is involved. Do your conclusions from the two methods agree?

d.      The largest absolute studentized deleted residual is for case 14 (see part (a)). Obtain the DFFlTS, and Cook's distance values for this case to assess the influence of this case. What do you conclude from each of the above values?

e.       Calculate the average absolute percent difference in the fitted values with and without the case 14. What does this measure indicate about the influence of case 14?

 

Available Answer
$ 25.00

[Solved] STAT 501 – Homework 10 (covers Lesson 11) | Complete Solution

  • This Solution has been Purchased 1 time
  • Submitted On 27 May, 2015 01:21:41
Answer posted by
Online Tutor Profile
solution
From the above scatter pot we can see that the range of values lie...
Buy now to view the complete solution
Other Similar Questions
User Profile
Exper...

STAT 501 – Homework 10 (covers Lesson 11) | Complete Solution

From the above scatter pot we can see that the range of values lies in the same interval thus visually there is no sign of an extrapolation beyond the range of the data. From the given data we can see that, 1 4 2 1 4 4 1...
User Profile
Homew...

STAT 501 Final Exam | 3 Questions solved

The intercept test β_0 may or may not have any practical interpretation depending on the range of the predictors, it has the usual interpretation that if all the predictors are 0 then the value of the dependent variable. Th...
User Profile
smart...

STAT 501 Mid-Term Exam 2 | Solution

Analysis of Variance Source DF Adj SS Adj MS F-Value P-Value Regression 4 112.612 28.1529 33.54 0.000 Stay 1 15.703 15.7032 18.71 0.000 Cultures 1 19.536 19.5358 23.27 0.00...

The benefits of buying study notes from CourseMerits

homeworkhelptime
Assurance Of Timely Delivery
We value your patience, and to ensure you always receive your homework help within the promised time, our dedicated team of tutors begins their work as soon as the request arrives.
tutoring
Best Price In The Market
All the services that are available on our page cost only a nominal amount of money. In fact, the prices are lower than the industry standards. You can always expect value for money from us.
tutorsupport
Uninterrupted 24/7 Support
Our customer support wing remains online 24x7 to provide you seamless assistance. Also, when you post a query or a request here, you can expect an immediate response from our side.
closebutton

$ 629.35