You are on page 1of 6

Week

 7  Handout    -­‐  Fall  2015  -­‐  Linear  Regression  Analysis    


Professor  Silvestrini    
 
 
Question  1:  Using  the  plots  on  slide  13,  do  any  of  the  residual  assumptions  appear  to  
be  violated?  If  yes,  which?    
 
 
   
   
   
 
 
Question  2:  Using  the  Residual  v.  Leverage  plot  from  the  cigarette  data  answer  each:    
 

Residuals vs Leverage
2

WinstonLights 1

0.5
1
Standardized residuals

0
-1

0.5

Now 1
-2

BullDurham
-3

Cook's distance

0.0 0.1 0.2 0.3 0.4 0.5

Leverage
lm(CarbonMonoxide ~ .)  
 
a. Are  there  any  high  leverage  points?    
 
 
b. Are  there  any  influential  observations?      
 
 
 
 
 

  1  
Question  3:  Comment  on  the  differences  in  the  outputs  with  and  without  the  Bull  
Durham  observation.    
 

 
 

 
 
 
 

  2  
Question  4:  Are  there  any  terms  that  shouldn’t  be  included  in  this  model?  Base  your  
answer  on  the  Cig.lmNBD  output  and  explain.    
 
 
 
 
 
 
Question  5:  Is  there  any  indication  of  non-­‐linearity  in  the  cigarette  model  plots  
shown  on  slide  30?  Explain.      
 
 
 
 
 
 
 
 
 
 
Question  6:  Do  the  partial  residual  plots  on  slide  32  indicate  an  incorrect  model  
specification?  Explain.      
 
 
 
   

  3  
Question  7:  Does  the  pairs  plots  on  slide  41  indicate  multicollinearity  in  the  
Cigarette  data?  Explain.  
 
 
 
5 10 15 20 0 5 10 15 20 25 30 0.5 1.0 1.5 2.0 0.8 0.9 1.0 1.1

20

20
15

15
CarbonMonoxide

10

10
5

5
10 15 20 25 30
Tar

5
2.0 0
1.5
Nicotine

1.0
0.5
1.1
1.0
Weight

0.9
0.8
0.8 0.9 1.0 1.1

 
 
 
 
 
Question  8:  Does  the  summary  of  the  linear  model  in  conjunction  with  the  pairs  plot  
indicate  mulitcollinearity?  Explain.    
 
 
 
 
Question  9:  Does  the  correlation  of  variables  in  the  data  set  indicate  a  problem  with  
multicollinearity?    See  slide  44.  Explain.    
 
 
 
 
Question  10:  Do  the  VIFs  on  slide  47  indicate  a  problem  with  multicollinearity?    

  4  
Participation  Class  Portion  (10/9/15)  
 
Question  7:  Watch/following  along  with  Lab  3  Segments  1,  2,  and  3.    
 
 
Lab  Segment   Important  R  Functions  
colors(),  levels(),    
1.  Plots  and  Legends    
as.numeric(),  legend(),    
2.  Regression  Models  with   predict(),  lines()  
Categorical  Variables    
3.  Diagnostic  Plots     plot(),  termplot()  
 
Downloads  Associated  with  Lab  3:  Lab3.R,  BodyFat  Data,  and  Soft  Drink  Data    
 
 
For  each  lab  segment  write  a  few  bullet  points  or  paragraph  describing  what  you  
learned  how  to  do  during  the  lab  segment.    
 
 
Lab  Segment  1:  Plots  and  Legends    
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

  5  
Lab  Segment  2:  Regression  Models  with  Categorical  Variables    
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Lab  Segment  3:  Diagnostic  plots    
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Signed  by  Prof  S,  Paulina  or  Quincey:  ____________________________________________________  
 

  6  

You might also like