尊敬的 微信汇率:1円 ≈ 0.046166 元 支付宝汇率:1円 ≈ 0.046257元 [退出登录]
SlideShare a Scribd company logo
Statistics for
Business and Economics
7th Edition
Chapter 12
Multiple Regression
Ch. 12-1Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Chapter Goals
After completing this chapter, you should be able to:
 Apply multiple regression analysis to business decision-
making situations
 Analyze and interpret the computer output for a multiple
regression model
 Perform a hypothesis test for all regression coefficients
or for a subset of coefficients
 Fit and interpret nonlinear regression models
 Incorporate qualitative variables into the regression
model by using dummy variables
 Discuss model specification and analyze residuals
Ch. 12-2Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
The Multiple Regression Model
Idea: Examine the linear relationship between
1 dependent (Y) & 2 or more independent variables (Xi)
εXβXβXββY kk22110  
Multiple Regression Model with k Independent Variables:
Y-intercept Population slopes Random Error
Ch. 12-3
12.1
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Multiple Regression Equation
The coefficients of the multiple regression model are
estimated using sample data
kik2i21i10i xbxbxbby  ˆ
Estimated
(or predicted)
value of y
Estimated slope coefficients
Multiple regression equation with k independent variables:
Estimated
intercept
In this chapter we will always use a computer to obtain the
regression slope coefficients and other regression
summary measures.
Ch. 12-4Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Multiple Regression Equation
Two variable model
y
x1
x2
22110 xbxbby ˆ
(continued)
Ch. 12-5Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Multiple Regression Model
Two variable model
y
x1
x2
yi
yi
<
ei = (yi – yi)
<
x2i
x1i The best fit equation, y ,
is found by minimizing the
sum of squared errors, e2
<
Sample
observation
22110 xbxbby ˆ
Ch. 12-6Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Standard Multiple Regression
Assumptions
 The values xi and the error terms εi are
independent
 The error terms are random variables with
mean 0 and a constant variance, 2.
(The constant variance property is called
homoscedasticity)
n),1,(iforσ]E[εand0]E[ε 22
ii 
Ch. 12-7Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
 The random error terms, εi , are not correlated
with one another, so that
 It is not possible to find a set of numbers, c0,
c1, . . . , ck, such that
(This is the property of no linear relation for
the Xj’s)
(continued)
jiallfor0]εE[ε ji 
0xcxcxcc KiK2i21i10  
Ch. 12-8
Standard Multiple Regression
Assumptions
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Example:
2 Independent Variables
 A distributor of frozen desert pies wants to
evaluate factors thought to influence demand
 Dependent variable: Pie sales (units per week)
 Independent variables: Price (in $)
Advertising ($100’s)
 Data are collected for 15 weeks
Ch. 12-9Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Pie Sales Example
Sales = b0 + b1 (Price)
+ b2 (Advertising)
Week
Pie
Sales
Price
($)
Advertising
($100s)
1 350 5.50 3.3
2 460 7.50 3.3
3 350 8.00 3.0
4 430 8.00 4.5
5 350 6.80 3.0
6 380 7.50 4.0
7 430 4.50 3.0
8 470 6.40 3.7
9 450 7.00 3.5
10 490 5.00 4.0
11 340 7.20 3.5
12 300 7.90 3.2
13 440 5.90 4.0
14 450 5.00 3.5
15 300 7.00 2.7
Multiple regression equation:
Ch. 12-10Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Estimating a Multiple Linear
Regression Equation
 Excel will be used to generate the coefficients and
measures of goodness of fit for multiple regression
 Data / Data Analysis / Regression
Ch. 12-11
12.2
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Multiple Regression Output
Regression Statistics
Multiple R 0.72213
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341
Observations 15
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404
Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392
Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888
ertising)74.131(Advce)24.975(Pri-306.526Sales 
Ch. 12-12Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
The Multiple Regression Equation
ertising)74.131(Advce)24.975(Pri-306.526Sales 
b1 = -24.975: sales
will decrease, on
average, by 24.975
pies per week for
each $1 increase in
selling price, net of
the effects of changes
due to advertising
b2 = 74.131: sales will
increase, on average,
by 74.131 pies per
week for each $100
increase in
advertising, net of the
effects of changes
due to price
where
Sales is in number of pies per week
Price is in $
Advertising is in $100’s.
Ch. 12-13Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Coefficient of Determination, R2
 Reports the proportion of total variation in y
explained by all x variables taken together
 This is the ratio of the explained variability to
total sample variability
squaresofsumtotal
squaresofsumregression
SST
SSR
R2

Ch. 12-14
12.3
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Coefficient of Determination, R2
Regression Statistics
Multiple R 0.72213
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341
Observations 15
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404
Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392
Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888
.52148
56493.3
29460.0
SST
SSR
R2

52.1% of the variation in pie sales
is explained by the variation in
price and advertising
(continued)
Ch. 12-15Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Estimation of Error Variance
 Consider the population regression model
 The unbiased estimate of the variance of the errors is
where
 The square root of the variance, se , is called the
standard error of the estimate
1Kn
SSE
1Kn
e
s
n
1i
2
i
2
e





iKiK2i21i10i εxβxβxββY  
iii yye ˆ
Ch. 12-16Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Standard Error, se
Regression Statistics
Multiple R 0.72213
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341
Observations 15
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404
Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392
Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888
47.463se 
The magnitude of this
value can be compared to
the average y value
Ch. 12-17Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Adjusted Coefficient of
Determination,
 R2 never decreases when a new X variable is
added to the model, even if the new variable is
not an important predictor variable
 This can be a disadvantage when comparing
models
 What is the net effect of adding a new variable?
 We lose a degree of freedom when a new X
variable is added
 Did the new X variable add enough
explanatory power to offset the loss of one
degree of freedom?
2
R
Ch. 12-18Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Adjusted Coefficient of
Determination,
 Used to correct for the fact that adding non-relevant
independent variables will still reduce the error sum of
squares
(where n = sample size, K = number of independent variables)
 Adjusted R2 provides a better comparison between
multiple regression models with different numbers of
independent variables
 Penalize excessive use of unimportant independent
variables
 Smaller than R2
(continued)
2
R
1)(n/SST
1)K(n/SSE
1R2



Ch. 12-19Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Regression Statistics
Multiple R 0.72213
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341
Observations 15
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404
Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392
Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888
.44172R2

44.2% of the variation in pie sales is
explained by the variation in price and
advertising, taking into account the sample
size and number of independent variables
2
R
Ch. 12-20Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Coefficient of Multiple
Correlation
 The coefficient of multiple correlation is the correlation
between the predicted value and the observed value of
the dependent variable
 Is the square root of the multiple coefficient of
determination
 Used as another measure of the strength of the linear
relationship between the dependent variable and the
independent variables
 Comparable to the correlation between Y and X in
simple regression
2
Ry),yr(R  ˆ
Ch. 12-21Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Evaluating Individual
Regression Coefficients
 Use t-tests for individual coefficients
 Shows if a specific independent variable is
conditionally important
 Hypotheses:
 H0: βj = 0 (no linear relationship)
 H1: βj ≠ 0 (linear relationship does exist
between xj and y)
Ch. 12-22
12.4
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Evaluating Individual
Regression Coefficients
H0: βj = 0 (no linear relationship)
H1: βj ≠ 0 (linear relationship does exist
between xi and y)
Test Statistic:
(df = n – k – 1)
jb
j
S
0b
t


(continued)
Ch. 12-23Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Evaluating Individual
Regression Coefficients
Regression Statistics
Multiple R 0.72213
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341
Observations 15
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404
Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392
Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888
t-value for Price is t = -2.306, with
p-value .0398
t-value for Advertising is t = 2.855,
with p-value .0145
(continued)
Ch. 12-24Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
H0: βj = 0
H1: βj  0
d.f. = 15-2-1 = 12
 = .05
t12, .025 = 2.1788
The test statistic for each variable falls
in the rejection region (p-values < .05)
There is evidence that both
Price and Advertising affect
pie sales at  = .05
From Excel output:
Reject H0 for each variable
Coefficients Standard Error t Stat P-value
Price -24.97509 10.83213 -2.30565 0.03979
Advertising 74.13096 25.96732 2.85478 0.01449
Decision:
Conclusion:
Reject H0Reject H0
/2=.025
-tα/2
Do not reject H0
0
tα/2
/2=.025
-2.1788 2.1788
Example: Evaluating Individual
Regression Coefficients
Ch. 12-25Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Confidence Interval Estimate
for the Slope
Confidence interval limits for the population slope βj
Example: Form a 95% confidence interval for the effect of
changes in price (x1) on pie sales:
-24.975 ± (2.1788)(10.832)
So the interval is -48.576 < β1 < -1.374
jbα/21,Knj Stb 
Coefficients Standard Error
Intercept 306.52619 114.25389
Price -24.97509 10.83213
Advertising 74.13096 25.96732
where t has
(n – K – 1) d.f.
Here, t has
(15 – 2 – 1) = 12 d.f.
Ch. 12-26Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Confidence Interval Estimate
for the Slope
Confidence interval for the population slope βi
Example: Excel output also reports these interval endpoints:
Weekly sales are estimated to be reduced by between 1.37 to
48.58 pies for each increase of $1 in the selling price
Coefficients Standard Error … Lower 95% Upper 95%
Intercept 306.52619 114.25389 … 57.58835 555.46404
Price -24.97509 10.83213 … -48.57626 -1.37392
Advertising 74.13096 25.96732 … 17.55303 130.70888
(continued)
Ch. 12-27Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Test on All Coefficients
 F-Test for Overall Significance of the Model
 Shows if there is a linear relationship between all
of the X variables considered together and Y
 Use F test statistic
 Hypotheses:
H0: β1 = β2 = … = βk = 0 (no linear relationship)
H1: at least one βi ≠ 0 (at least one independent
variable affects Y)
Ch. 12-28
12.5
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
F-Test for Overall Significance
 Test statistic:
where F has k (numerator) and
(n – K – 1) (denominator)
degrees of freedom
 The decision rule is
1)KSSE/(n
SSR/K
s
MSR
F 2
e 

α1,Knk,0 FFifHReject 
Ch. 12-29Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
F-Test for Overall Significance
6.5386
2252.8
14730.0
MSE
MSR
F 
Regression Statistics
Multiple R 0.72213
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341
Observations 15
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404
Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392
Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888
(continued)
With 2 and 12 degrees
of freedom
P-value for
the F-Test
Ch. 12-30Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
F-Test for Overall Significance
H0: β1 = β2 = 0
H1: β1 and β2 not both zero
 = .05
df1= 2 df2 = 12
Test Statistic:
Decision:
Conclusion:
Since F test statistic is in
the rejection region (p-
value < .05), reject H0
There is evidence that at least one
independent variable affects Y
0
 = .05
F.05 = 3.885
Reject H0Do not
reject H0
6.5386
MSE
MSR
F 
Critical
Value:
F = 3.885
(continued)
F
Ch. 12-31Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Tests on a Subset of
Regression Coefficients
 Consider a multiple regression model involving
variables xj and zj , and the null hypothesis that the z
variable coefficients are all zero:
r)1,...,(j0αofoneleastat:H
0ααα:H
j1
r210

 
irir1i1KiK1i10i εzαzαxβxββy  
Ch. 12-32Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Tests on a Subset of
Regression Coefficients
 Goal: compare the error sum of squares for the
complete model with the error sum of squares for the
restricted model
 First run a regression for the complete model and obtain SSE
 Next run a restricted regression that excludes the z variables
(the number of variables excluded is r) and obtain the
restricted error sum of squares SSE(r)
 Compute the F statistic and apply the decision rule for a
significance level 
(continued)
α1,rKnr,2
e
0 F
s
r/)SSESSE(r)(
FifHReject 


Ch. 12-33Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Prediction
 Given a population regression model
 then given a new observation of a data point
(x1,n+1, x 2,n+1, . . . , x K,n+1)
the best linear unbiased forecast of yn+1 is
 It is risky to forecast for new X values outside the range of the data used
to estimate the model coefficients, because we do not have data to
support that the linear model extends beyond the observed range.
n),1,2,(iεxβxβxββy iKiK2i21i10i  
1nK,K1n2,21n1,101n xbxbxbby   ˆ
^
Ch. 12-34
12.6
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Using The Equation to Make
Predictions
Predict sales for a week in which the selling
price is $5.50 and advertising is $350:
Predicted sales
is 428.62 pies
428.62
(3.5)74.131(5.50)24.975-306.526
ertising)74.131(Advce)24.975(Pri-306.526Sales



Note that Advertising is
in $100’s, so $350
means that X2 = 3.5
Ch. 12-35Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Nonlinear Regression Models
 The relationship between the dependent
variable and an independent variable may
not be linear
 Can review the scatter diagram to check for
non-linear relationships
 Example: Quadratic model
 The second independent variable is the square
of the first variable
εXβXββY 2
12110 
Ch. 12-36
12.7
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Quadratic Regression Model
 where:
β0 = Y intercept
β1 = regression coefficient for linear effect of X on Y
β2 = regression coefficient for quadratic effect on Y
εi = random error in Y for observation i
i
2
1i21i10i εxβxββy 
Model form:
Ch. 12-37Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Linear vs. Nonlinear Fit
Linear fit does not give
random residuals
Nonlinear fit gives
random residuals

X
residuals
X
Y
X
residuals
Y
X
Ch. 12-38Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Quadratic Regression Model
Quadratic models may be considered when the scatter
diagram takes on one of the following shapes:
X1
Y
X1X1
YYY
β1 < 0 β1 > 0 β1 < 0 β1 > 0
β1 = the coefficient of the linear term
β2 = the coefficient of the squared term
X1
β2 > 0 β2 > 0 β2 < 0 β2 < 0
i
2
1i21i10i εXβXββY 
Ch. 12-39Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Testing for Significance:
Quadratic Effect
 Testing the Quadratic Effect
 Compare the linear regression estimate
 with quadratic regression estimate
 Hypotheses
 (The quadratic term does not improve the model)
 (The quadratic term improves the model)
2
12110 xbxbby ˆ
110 xbby ˆ
H0: β2 = 0
H1: β2  0
Ch. 12-40Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Testing for Significance:
Quadratic Effect
 Testing the Quadratic Effect
Hypotheses
 (The quadratic term does not improve the model)
 (The quadratic term improves the model)
 The test statistic is
H0: β2 = 0
H1: β2  0
(continued)
2b
22
s
βb
t


3nd.f. 
where:
b2 = squared term slope
coefficient
β2 = hypothesized slope (zero)
Sb = standard error of the slope
2
Ch. 12-41Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Testing for Significance:
Quadratic Effect
 Testing the Quadratic Effect
Compare R2 from simple regression to
R2 from the quadratic model
 If R2 from the quadratic model is larger than
R2 from the simple model, then the
quadratic model is a better model
(continued)
Ch. 12-42Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Example: Quadratic Model
 Purity increases as filter time
increases:
Purity
Filter
Time
3 1
7 2
8 3
15 5
22 7
33 8
40 10
54 12
67 13
70 14
78 15
85 15
87 16
99 17
Purity vs. Time
0
20
40
60
80
100
0 5 10 15 20
Time
Purity
Ch. 12-43Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Example: Quadratic Model
 Simple regression results:
y = -11.283 + 5.985 Time
(continued)
Regression Statistics
R Square 0.96888
Adjusted R Square 0.96628
Standard Error 6.15997
Coefficients
Standard
Error t Stat P-value
Intercept -11.28267 3.46805 -3.25332 0.00691
Time 5.98520 0.30966 19.32819 2.078E-10
F Significance F
373.57904 2.0778E-10
^
Time Residual Plot
-10
-5
0
5
10
0 5 10 15 20
Time
Residuals
t statistic, F statistic, and
R2 are all high, but the
residuals are not random:
Ch. 12-44Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Example: Quadratic Model
Coefficients
Standard
Error t Stat P-value
Intercept 1.53870 2.24465 0.68550 0.50722
Time 1.56496 0.60179 2.60052 0.02467
Time-squared 0.24516 0.03258 7.52406 1.165E-05
Regression Statistics
R Square 0.99494
Adjusted R Square 0.99402
Standard Error 2.59513
F Significance F
1080.7330 2.368E-13
 Quadratic regression results:
y = 1.539 + 1.565 Time + 0.245 (Time)2^
(continued)
Time Residual Plot
-5
0
5
10
0 5 10 15 20
Time
Residuals
Time-squared Residual Plot
-5
0
5
10
0 100 200 300 400
Time-squared
Residuals
The quadratic term is significant and
improves the model: R2 is higher and se is
lower, residuals are now random
Ch. 12-45Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
The Log Transformation
 Original multiplicative model
 Transformed multiplicative model
The Multiplicative Model:
εXXβY 21 β
2
β
10
)log(ε)log(Xβ)log(Xβ)log(βlog(Y) 22110 
Ch. 12-46Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Interpretation of coefficients
For the multiplicative model:
 When both dependent and independent
variables are logged:
 The coefficient of the independent variable Xk can
be interpreted as
a 1 percent change in Xk leads to an estimated bk
percentage change in the average value of Y
 bk is the elasticity of Y with respect to a change in Xk
i1i10i εlogXlogββlogYlog 
Ch. 12-47Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Dummy Variables
 A dummy variable is a categorical independent
variable with two levels:
 yes or no, on or off, male or female
 recorded as 0 or 1
 Regression intercepts are different if the
variable is significant
 Assumes equal slopes for other variables
 If more than two levels, the number of dummy
variables needed is (number of levels - 1)
Ch. 12-48
12.8
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Dummy Variable Example
Let:
y = Pie Sales
x1 = Price
x2 = Holiday (X2 = 1 if a holiday occurred during the week)
(X2 = 0 if there was no holiday that week)
210 xbxbby 21
ˆ
Ch. 12-49Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Dummy Variable Example
Same
slope
(continued)
x1 (Price)
y (sales)
b0 + b2
b0
1010
12010
xbb(0)bxbby
xb)b(b(1)bxbby
121
121


ˆ
ˆ Holiday
No Holiday
Different
intercept
If H0: β2 = 0 is
rejected, then
“Holiday” has a
significant effect
on pie sales
Ch. 12-50Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Interpreting the
Dummy Variable Coefficient
Sales: number of pies sold per week
Price: pie price in $
Holiday:
Example:
1 If a holiday occurred during the week
0 If no holiday occurred
b2 = 15: on average, sales were 15 pies greater in
weeks with a holiday than in weeks without a
holiday, given the same price
)15(Holiday30(Price)-300Sales 
Ch. 12-51Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Interaction Between
Explanatory Variables
 Hypothesizes interaction between pairs of x
variables
 Response to one x variable may vary at different
levels of another x variable
 Contains two-way cross product terms

)x(xbxbxbb
xbxbxbby
21322110
3322110

ˆ
Ch. 12-52Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Effect of Interaction
 Given:
 Without interaction term, effect of X1 on Y is
measured by β1
 With interaction term, effect of X1 on Y is
measured by β1 + β3 X2
 Effect changes as X2 changes
21322110
1231220
XXβXβXββ
)XXβ(βXββY


Ch. 12-53Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Interaction Example
x2 = 1:
y = 1 + 2x1 + 3(1) + 4x1(1) = 4 + 6x1
x2 = 0:
y = 1 + 2x1 + 3(0) + 4x1(0) = 1 + 2x1
Slopes are different if the effect of x1 on y depends on x2 value
x1
4
8
12
0
0 10.5 1.5
y
Suppose x2 is a dummy variable and the estimated
regression equation is 2121 x4x3x2x1y ˆ
^
^
Ch. 12-54Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Significance of Interaction Term
 The coefficient b3 is an estimate of the difference
in the coefficient of x1 when x2 = 1 compared to
when x2 = 0
 The t statistic for b3 can be used to test the
hypothesis
 If we reject the null hypothesis we conclude that there is
a difference in the slope coefficient for the two
subgroups
0β0,β|0β:H
0β0,β|0β:H
2131
2130


Ch. 12-55Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Multiple Regression Assumptions
Assumptions:
 The errors are normally distributed
 Errors have a constant variance
 The model errors are independent
ei = (yi – yi)
<
Errors (residuals) from the regression model:
Ch. 12-56
12.9
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Analysis of Residuals
in Multiple Regression
 These residual plots are used in multiple
regression:
 Residuals vs. yi
 Residuals vs. x1i
 Residuals vs. x2i
 Residuals vs. time (if time series data)
<
Use the residual plots to check for
violations of regression assumptions
Ch. 12-57Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Chapter Summary
 Developed the multiple regression model
 Tested the significance of the multiple regression model
 Discussed adjusted R2 ( R2 )
 Tested individual regression coefficients
 Tested portions of the regression model
 Used quadratic terms and log transformations in
regression models
 Explained dummy variables
 Evaluated interaction effects
 Discussed using residual plots to check model
assumptions
Ch. 12-58Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall

More Related Content

What's hot

Regression analysis
Regression analysisRegression analysis
Regression analysis
Shameer P Hamsa
 
Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentation
Carlo Magno
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Ravi shankar
 
Correlation
CorrelationCorrelation
Correlation
James Neill
 
Linear regression
Linear regressionLinear regression
Linear regression
Karishma Chaudhary
 
Chap04 discrete random variables and probability distribution
Chap04 discrete random variables and probability distributionChap04 discrete random variables and probability distribution
Chap04 discrete random variables and probability distribution
Judianto Nugroho
 
Multiple Regression Analysis (MRA)
Multiple Regression Analysis (MRA)Multiple Regression Analysis (MRA)
Multiple Regression Analysis (MRA)
Naveen Kumar Medapalli
 
Correlation and regression analysis
Correlation and regression analysisCorrelation and regression analysis
Correlation and regression analysis
_pem
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
cbt1213
 
Applications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipApplications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationship
Rithish Kumar
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Birinder Singh Gulati
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Salim Azad
 
multiple regression
multiple regressionmultiple regression
multiple regression
Priya Sharma
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysis
Mahak Vijayvargiya
 
Probit and logit model
Probit and logit modelProbit and logit model
Probit and logit model
Jithmi Roddrigo
 
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn LottierRegression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
Al Arizmendez
 
Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression ppt
Santosh Bhaskar
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
Rabin BK
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
Seth Anandaram Jaipuria College
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
alok tiwari
 

What's hot (20)

Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentation
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Correlation
CorrelationCorrelation
Correlation
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Chap04 discrete random variables and probability distribution
Chap04 discrete random variables and probability distributionChap04 discrete random variables and probability distribution
Chap04 discrete random variables and probability distribution
 
Multiple Regression Analysis (MRA)
Multiple Regression Analysis (MRA)Multiple Regression Analysis (MRA)
Multiple Regression Analysis (MRA)
 
Correlation and regression analysis
Correlation and regression analysisCorrelation and regression analysis
Correlation and regression analysis
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Applications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipApplications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationship
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
multiple regression
multiple regressionmultiple regression
multiple regression
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysis
 
Probit and logit model
Probit and logit modelProbit and logit model
Probit and logit model
 
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn LottierRegression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
 
Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression ppt
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
 

Similar to Chap12 multiple regression

Newbold_chap13.ppt
Newbold_chap13.pptNewbold_chap13.ppt
Newbold_chap13.ppt
cfisicaster
 
Bbs11 ppt ch14
Bbs11 ppt ch14Bbs11 ppt ch14
Bbs11 ppt ch14
Tuul Tuul
 
Multiple Regression
Multiple RegressionMultiple Regression
Multiple Regression
Khawaja Naveed
 
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docxInstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
dirkrplav
 
Applied Business Statistics ,ken black , ch 3 part 2
Applied Business Statistics ,ken black , ch 3 part 2Applied Business Statistics ,ken black , ch 3 part 2
Applied Business Statistics ,ken black , ch 3 part 2
AbdelmonsifFadl
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
TanyaWadhwani4
 
Some study materials
Some study materialsSome study materials
Some study materials
SatishH5
 
QuantitativeDecisionMaking
QuantitativeDecisionMakingQuantitativeDecisionMaking
QuantitativeDecisionMaking
Eik Den Yeoh
 
Ch15
Ch15Ch15
Chap13 intro to multiple regression
Chap13 intro to multiple regressionChap13 intro to multiple regression
Chap13 intro to multiple regression
Uni Azza Aunillah
 
Statistics homework help
Statistics homework helpStatistics homework help
Statistics homework help
Expertsmind IT Education Pvt Ltd.
 
Lecture - 8 MLR.pptx
Lecture - 8 MLR.pptxLecture - 8 MLR.pptx
Lecture - 8 MLR.pptx
iris765749
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Avijit Famous
 
Lesson07_new
Lesson07_newLesson07_new
Lesson07_new
shengvn
 
Multiple Linear Regression.pptx
Multiple Linear Regression.pptxMultiple Linear Regression.pptx
Multiple Linear Regression.pptx
BHUSHANKPATEL
 
Multinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisMultinomial Logistic Regression Analysis
Multinomial Logistic Regression Analysis
HARISH Kumar H R
 
Chap02 describing data; numerical
Chap02 describing data; numericalChap02 describing data; numerical
Chap02 describing data; numerical
Judianto Nugroho
 
Trend analysis
Trend analysisTrend analysis
Trend analysis
Zahedul Islam
 
Newbold_chap14.ppt
Newbold_chap14.pptNewbold_chap14.ppt
Newbold_chap14.ppt
cfisicaster
 
simple regression-1.pdf
simple regression-1.pdfsimple regression-1.pdf
simple regression-1.pdf
BabulShaikh1
 

Similar to Chap12 multiple regression (20)

Newbold_chap13.ppt
Newbold_chap13.pptNewbold_chap13.ppt
Newbold_chap13.ppt
 
Bbs11 ppt ch14
Bbs11 ppt ch14Bbs11 ppt ch14
Bbs11 ppt ch14
 
Multiple Regression
Multiple RegressionMultiple Regression
Multiple Regression
 
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docxInstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
InstructionsView CAAE Stormwater video Too Big for Our Ditches.docx
 
Applied Business Statistics ,ken black , ch 3 part 2
Applied Business Statistics ,ken black , ch 3 part 2Applied Business Statistics ,ken black , ch 3 part 2
Applied Business Statistics ,ken black , ch 3 part 2
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
Some study materials
Some study materialsSome study materials
Some study materials
 
QuantitativeDecisionMaking
QuantitativeDecisionMakingQuantitativeDecisionMaking
QuantitativeDecisionMaking
 
Ch15
Ch15Ch15
Ch15
 
Chap13 intro to multiple regression
Chap13 intro to multiple regressionChap13 intro to multiple regression
Chap13 intro to multiple regression
 
Statistics homework help
Statistics homework helpStatistics homework help
Statistics homework help
 
Lecture - 8 MLR.pptx
Lecture - 8 MLR.pptxLecture - 8 MLR.pptx
Lecture - 8 MLR.pptx
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Lesson07_new
Lesson07_newLesson07_new
Lesson07_new
 
Multiple Linear Regression.pptx
Multiple Linear Regression.pptxMultiple Linear Regression.pptx
Multiple Linear Regression.pptx
 
Multinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisMultinomial Logistic Regression Analysis
Multinomial Logistic Regression Analysis
 
Chap02 describing data; numerical
Chap02 describing data; numericalChap02 describing data; numerical
Chap02 describing data; numerical
 
Trend analysis
Trend analysisTrend analysis
Trend analysis
 
Newbold_chap14.ppt
Newbold_chap14.pptNewbold_chap14.ppt
Newbold_chap14.ppt
 
simple regression-1.pdf
simple regression-1.pdfsimple regression-1.pdf
simple regression-1.pdf
 

More from Judianto Nugroho

Chap14 en-id
Chap14 en-idChap14 en-id
Chap14 en-id
Judianto Nugroho
 
Chap19 en-id
Chap19 en-idChap19 en-id
Chap19 en-id
Judianto Nugroho
 
Chap18 en-id
Chap18 en-idChap18 en-id
Chap18 en-id
Judianto Nugroho
 
Chap16 en-id
Chap16 en-idChap16 en-id
Chap16 en-id
Judianto Nugroho
 
Chap15 en-id
Chap15 en-idChap15 en-id
Chap15 en-id
Judianto Nugroho
 
Chap17 en-id
Chap17 en-idChap17 en-id
Chap17 en-id
Judianto Nugroho
 
Chap13 en-id
Chap13 en-idChap13 en-id
Chap13 en-id
Judianto Nugroho
 
Chap12 en-id
Chap12 en-idChap12 en-id
Chap12 en-id
Judianto Nugroho
 
Chap11 en-id
Chap11 en-idChap11 en-id
Chap11 en-id
Judianto Nugroho
 
Chap10 en-id
Chap10 en-idChap10 en-id
Chap10 en-id
Judianto Nugroho
 
Chap09 en-id
Chap09 en-idChap09 en-id
Chap09 en-id
Judianto Nugroho
 
Chap08 en-id
Chap08 en-idChap08 en-id
Chap08 en-id
Judianto Nugroho
 
Chap05 en-id
Chap05 en-idChap05 en-id
Chap05 en-id
Judianto Nugroho
 
Chap07 en-id
Chap07 en-idChap07 en-id
Chap07 en-id
Judianto Nugroho
 
Chap06 en-id
Chap06 en-idChap06 en-id
Chap06 en-id
Judianto Nugroho
 
Chap04 en-id
Chap04 en-idChap04 en-id
Chap04 en-id
Judianto Nugroho
 
Chap03 en-id
Chap03 en-idChap03 en-id
Chap03 en-id
Judianto Nugroho
 
Chap02 en-id
Chap02 en-idChap02 en-id
Chap02 en-id
Judianto Nugroho
 
Chap01 en-id
Chap01 en-idChap01 en-id
Chap01 en-id
Judianto Nugroho
 
Spss session 1 and 2
Spss session 1 and 2Spss session 1 and 2
Spss session 1 and 2
Judianto Nugroho
 

More from Judianto Nugroho (20)

Chap14 en-id
Chap14 en-idChap14 en-id
Chap14 en-id
 
Chap19 en-id
Chap19 en-idChap19 en-id
Chap19 en-id
 
Chap18 en-id
Chap18 en-idChap18 en-id
Chap18 en-id
 
Chap16 en-id
Chap16 en-idChap16 en-id
Chap16 en-id
 
Chap15 en-id
Chap15 en-idChap15 en-id
Chap15 en-id
 
Chap17 en-id
Chap17 en-idChap17 en-id
Chap17 en-id
 
Chap13 en-id
Chap13 en-idChap13 en-id
Chap13 en-id
 
Chap12 en-id
Chap12 en-idChap12 en-id
Chap12 en-id
 
Chap11 en-id
Chap11 en-idChap11 en-id
Chap11 en-id
 
Chap10 en-id
Chap10 en-idChap10 en-id
Chap10 en-id
 
Chap09 en-id
Chap09 en-idChap09 en-id
Chap09 en-id
 
Chap08 en-id
Chap08 en-idChap08 en-id
Chap08 en-id
 
Chap05 en-id
Chap05 en-idChap05 en-id
Chap05 en-id
 
Chap07 en-id
Chap07 en-idChap07 en-id
Chap07 en-id
 
Chap06 en-id
Chap06 en-idChap06 en-id
Chap06 en-id
 
Chap04 en-id
Chap04 en-idChap04 en-id
Chap04 en-id
 
Chap03 en-id
Chap03 en-idChap03 en-id
Chap03 en-id
 
Chap02 en-id
Chap02 en-idChap02 en-id
Chap02 en-id
 
Chap01 en-id
Chap01 en-idChap01 en-id
Chap01 en-id
 
Spss session 1 and 2
Spss session 1 and 2Spss session 1 and 2
Spss session 1 and 2
 

Recently uploaded

pol sci Election and Representation Class 11 Notes.pdf
pol sci Election and Representation Class 11 Notes.pdfpol sci Election and Representation Class 11 Notes.pdf
pol sci Election and Representation Class 11 Notes.pdf
BiplabHalder13
 
Observational Learning
Observational Learning Observational Learning
Observational Learning
sanamushtaq922
 
Opportunity scholarships and the schools that receive them
Opportunity scholarships and the schools that receive themOpportunity scholarships and the schools that receive them
Opportunity scholarships and the schools that receive them
EducationNC
 
Talking Tech through Compelling Visual Aids
Talking Tech through Compelling Visual AidsTalking Tech through Compelling Visual Aids
Talking Tech through Compelling Visual Aids
MattVassar1
 
BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...
BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...
BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...
Nguyen Thanh Tu Collection
 
nutrition in plants chapter 1 class 7...
nutrition in plants chapter 1 class 7...nutrition in plants chapter 1 class 7...
nutrition in plants chapter 1 class 7...
chaudharyreet2244
 
Creating Images and Videos through AI.pptx
Creating Images and Videos through AI.pptxCreating Images and Videos through AI.pptx
Creating Images and Videos through AI.pptx
Forum of Blended Learning
 
bryophytes.pptx bsc botany honours second semester
bryophytes.pptx bsc botany honours  second semesterbryophytes.pptx bsc botany honours  second semester
bryophytes.pptx bsc botany honours second semester
Sarojini38
 
Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...
Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...
Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...
biruktesfaye27
 
Cross-Cultural Leadership and Communication
Cross-Cultural Leadership and CommunicationCross-Cultural Leadership and Communication
Cross-Cultural Leadership and Communication
MattVassar1
 
Keynote given on June 24 for MASSP at Grand Traverse City
Keynote given on June 24 for MASSP at Grand Traverse CityKeynote given on June 24 for MASSP at Grand Traverse City
Keynote given on June 24 for MASSP at Grand Traverse City
PJ Caposey
 
The Rise of the Digital Telecommunication Marketplace.pptx
The Rise of the Digital Telecommunication Marketplace.pptxThe Rise of the Digital Telecommunication Marketplace.pptx
The Rise of the Digital Telecommunication Marketplace.pptx
PriyaKumari928991
 
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptxCapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapitolTechU
 
Information and Communication Technology in Education
Information and Communication Technology in EducationInformation and Communication Technology in Education
Information and Communication Technology in Education
MJDuyan
 
Creativity for Innovation and Speechmaking
Creativity for Innovation and SpeechmakingCreativity for Innovation and Speechmaking
Creativity for Innovation and Speechmaking
MattVassar1
 
Creation or Update of a Mandatory Field is Not Set in Odoo 17
Creation or Update of a Mandatory Field is Not Set in Odoo 17Creation or Update of a Mandatory Field is Not Set in Odoo 17
Creation or Update of a Mandatory Field is Not Set in Odoo 17
Celine George
 
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT KanpurDiversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
Quiz Club IIT Kanpur
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
Kalna College
 
How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17
Celine George
 
Decolonizing Universal Design for Learning
Decolonizing Universal Design for LearningDecolonizing Universal Design for Learning
Decolonizing Universal Design for Learning
Frederic Fovet
 

Recently uploaded (20)

pol sci Election and Representation Class 11 Notes.pdf
pol sci Election and Representation Class 11 Notes.pdfpol sci Election and Representation Class 11 Notes.pdf
pol sci Election and Representation Class 11 Notes.pdf
 
Observational Learning
Observational Learning Observational Learning
Observational Learning
 
Opportunity scholarships and the schools that receive them
Opportunity scholarships and the schools that receive themOpportunity scholarships and the schools that receive them
Opportunity scholarships and the schools that receive them
 
Talking Tech through Compelling Visual Aids
Talking Tech through Compelling Visual AidsTalking Tech through Compelling Visual Aids
Talking Tech through Compelling Visual Aids
 
BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...
BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...
BỘ BÀI TẬP TEST THEO UNIT - FORM 2025 - TIẾNG ANH 12 GLOBAL SUCCESS - KÌ 1 (B...
 
nutrition in plants chapter 1 class 7...
nutrition in plants chapter 1 class 7...nutrition in plants chapter 1 class 7...
nutrition in plants chapter 1 class 7...
 
Creating Images and Videos through AI.pptx
Creating Images and Videos through AI.pptxCreating Images and Videos through AI.pptx
Creating Images and Videos through AI.pptx
 
bryophytes.pptx bsc botany honours second semester
bryophytes.pptx bsc botany honours  second semesterbryophytes.pptx bsc botany honours  second semester
bryophytes.pptx bsc botany honours second semester
 
Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...
Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...
Ethiopia and Eritrea Eritrea's journey has been marked by resilience and dete...
 
Cross-Cultural Leadership and Communication
Cross-Cultural Leadership and CommunicationCross-Cultural Leadership and Communication
Cross-Cultural Leadership and Communication
 
Keynote given on June 24 for MASSP at Grand Traverse City
Keynote given on June 24 for MASSP at Grand Traverse CityKeynote given on June 24 for MASSP at Grand Traverse City
Keynote given on June 24 for MASSP at Grand Traverse City
 
The Rise of the Digital Telecommunication Marketplace.pptx
The Rise of the Digital Telecommunication Marketplace.pptxThe Rise of the Digital Telecommunication Marketplace.pptx
The Rise of the Digital Telecommunication Marketplace.pptx
 
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptxCapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
 
Information and Communication Technology in Education
Information and Communication Technology in EducationInformation and Communication Technology in Education
Information and Communication Technology in Education
 
Creativity for Innovation and Speechmaking
Creativity for Innovation and SpeechmakingCreativity for Innovation and Speechmaking
Creativity for Innovation and Speechmaking
 
Creation or Update of a Mandatory Field is Not Set in Odoo 17
Creation or Update of a Mandatory Field is Not Set in Odoo 17Creation or Update of a Mandatory Field is Not Set in Odoo 17
Creation or Update of a Mandatory Field is Not Set in Odoo 17
 
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT KanpurDiversity Quiz Prelims by Quiz Club, IIT Kanpur
Diversity Quiz Prelims by Quiz Club, IIT Kanpur
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
 
How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17
 
Decolonizing Universal Design for Learning
Decolonizing Universal Design for LearningDecolonizing Universal Design for Learning
Decolonizing Universal Design for Learning
 

Chap12 multiple regression

  • 1. Statistics for Business and Economics 7th Edition Chapter 12 Multiple Regression Ch. 12-1Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 2. Chapter Goals After completing this chapter, you should be able to:  Apply multiple regression analysis to business decision- making situations  Analyze and interpret the computer output for a multiple regression model  Perform a hypothesis test for all regression coefficients or for a subset of coefficients  Fit and interpret nonlinear regression models  Incorporate qualitative variables into the regression model by using dummy variables  Discuss model specification and analyze residuals Ch. 12-2Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 3. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (Xi) εXβXβXββY kk22110   Multiple Regression Model with k Independent Variables: Y-intercept Population slopes Random Error Ch. 12-3 12.1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 4. Multiple Regression Equation The coefficients of the multiple regression model are estimated using sample data kik2i21i10i xbxbxbby  ˆ Estimated (or predicted) value of y Estimated slope coefficients Multiple regression equation with k independent variables: Estimated intercept In this chapter we will always use a computer to obtain the regression slope coefficients and other regression summary measures. Ch. 12-4Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 5. Multiple Regression Equation Two variable model y x1 x2 22110 xbxbby ˆ (continued) Ch. 12-5Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 6. Multiple Regression Model Two variable model y x1 x2 yi yi < ei = (yi – yi) < x2i x1i The best fit equation, y , is found by minimizing the sum of squared errors, e2 < Sample observation 22110 xbxbby ˆ Ch. 12-6Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 7. Standard Multiple Regression Assumptions  The values xi and the error terms εi are independent  The error terms are random variables with mean 0 and a constant variance, 2. (The constant variance property is called homoscedasticity) n),1,(iforσ]E[εand0]E[ε 22 ii  Ch. 12-7Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 8.  The random error terms, εi , are not correlated with one another, so that  It is not possible to find a set of numbers, c0, c1, . . . , ck, such that (This is the property of no linear relation for the Xj’s) (continued) jiallfor0]εE[ε ji  0xcxcxcc KiK2i21i10   Ch. 12-8 Standard Multiple Regression Assumptions Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 9. Example: 2 Independent Variables  A distributor of frozen desert pies wants to evaluate factors thought to influence demand  Dependent variable: Pie sales (units per week)  Independent variables: Price (in $) Advertising ($100’s)  Data are collected for 15 weeks Ch. 12-9Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 10. Pie Sales Example Sales = b0 + b1 (Price) + b2 (Advertising) Week Pie Sales Price ($) Advertising ($100s) 1 350 5.50 3.3 2 460 7.50 3.3 3 350 8.00 3.0 4 430 8.00 4.5 5 350 6.80 3.0 6 380 7.50 4.0 7 430 4.50 3.0 8 470 6.40 3.7 9 450 7.00 3.5 10 490 5.00 4.0 11 340 7.20 3.5 12 300 7.90 3.2 13 440 5.90 4.0 14 450 5.00 3.5 15 300 7.00 2.7 Multiple regression equation: Ch. 12-10Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 11. Estimating a Multiple Linear Regression Equation  Excel will be used to generate the coefficients and measures of goodness of fit for multiple regression  Data / Data Analysis / Regression Ch. 12-11 12.2 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 12. Multiple Regression Output Regression Statistics Multiple R 0.72213 R Square 0.52148 Adjusted R Square 0.44172 Standard Error 47.46341 Observations 15 ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12 27033.306 2252.776 Total 14 56493.333 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888 ertising)74.131(Advce)24.975(Pri-306.526Sales  Ch. 12-12Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 13. The Multiple Regression Equation ertising)74.131(Advce)24.975(Pri-306.526Sales  b1 = -24.975: sales will decrease, on average, by 24.975 pies per week for each $1 increase in selling price, net of the effects of changes due to advertising b2 = 74.131: sales will increase, on average, by 74.131 pies per week for each $100 increase in advertising, net of the effects of changes due to price where Sales is in number of pies per week Price is in $ Advertising is in $100’s. Ch. 12-13Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 14. Coefficient of Determination, R2  Reports the proportion of total variation in y explained by all x variables taken together  This is the ratio of the explained variability to total sample variability squaresofsumtotal squaresofsumregression SST SSR R2  Ch. 12-14 12.3 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 15. Coefficient of Determination, R2 Regression Statistics Multiple R 0.72213 R Square 0.52148 Adjusted R Square 0.44172 Standard Error 47.46341 Observations 15 ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12 27033.306 2252.776 Total 14 56493.333 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888 .52148 56493.3 29460.0 SST SSR R2  52.1% of the variation in pie sales is explained by the variation in price and advertising (continued) Ch. 12-15Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 16. Estimation of Error Variance  Consider the population regression model  The unbiased estimate of the variance of the errors is where  The square root of the variance, se , is called the standard error of the estimate 1Kn SSE 1Kn e s n 1i 2 i 2 e      iKiK2i21i10i εxβxβxββY   iii yye ˆ Ch. 12-16Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 17. Standard Error, se Regression Statistics Multiple R 0.72213 R Square 0.52148 Adjusted R Square 0.44172 Standard Error 47.46341 Observations 15 ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12 27033.306 2252.776 Total 14 56493.333 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888 47.463se  The magnitude of this value can be compared to the average y value Ch. 12-17Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 18. Adjusted Coefficient of Determination,  R2 never decreases when a new X variable is added to the model, even if the new variable is not an important predictor variable  This can be a disadvantage when comparing models  What is the net effect of adding a new variable?  We lose a degree of freedom when a new X variable is added  Did the new X variable add enough explanatory power to offset the loss of one degree of freedom? 2 R Ch. 12-18Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 19. Adjusted Coefficient of Determination,  Used to correct for the fact that adding non-relevant independent variables will still reduce the error sum of squares (where n = sample size, K = number of independent variables)  Adjusted R2 provides a better comparison between multiple regression models with different numbers of independent variables  Penalize excessive use of unimportant independent variables  Smaller than R2 (continued) 2 R 1)(n/SST 1)K(n/SSE 1R2    Ch. 12-19Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 20. Regression Statistics Multiple R 0.72213 R Square 0.52148 Adjusted R Square 0.44172 Standard Error 47.46341 Observations 15 ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12 27033.306 2252.776 Total 14 56493.333 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888 .44172R2  44.2% of the variation in pie sales is explained by the variation in price and advertising, taking into account the sample size and number of independent variables 2 R Ch. 12-20Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 21. Coefficient of Multiple Correlation  The coefficient of multiple correlation is the correlation between the predicted value and the observed value of the dependent variable  Is the square root of the multiple coefficient of determination  Used as another measure of the strength of the linear relationship between the dependent variable and the independent variables  Comparable to the correlation between Y and X in simple regression 2 Ry),yr(R  ˆ Ch. 12-21Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 22. Evaluating Individual Regression Coefficients  Use t-tests for individual coefficients  Shows if a specific independent variable is conditionally important  Hypotheses:  H0: βj = 0 (no linear relationship)  H1: βj ≠ 0 (linear relationship does exist between xj and y) Ch. 12-22 12.4 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 23. Evaluating Individual Regression Coefficients H0: βj = 0 (no linear relationship) H1: βj ≠ 0 (linear relationship does exist between xi and y) Test Statistic: (df = n – k – 1) jb j S 0b t   (continued) Ch. 12-23Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 24. Evaluating Individual Regression Coefficients Regression Statistics Multiple R 0.72213 R Square 0.52148 Adjusted R Square 0.44172 Standard Error 47.46341 Observations 15 ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12 27033.306 2252.776 Total 14 56493.333 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888 t-value for Price is t = -2.306, with p-value .0398 t-value for Advertising is t = 2.855, with p-value .0145 (continued) Ch. 12-24Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 25. H0: βj = 0 H1: βj  0 d.f. = 15-2-1 = 12  = .05 t12, .025 = 2.1788 The test statistic for each variable falls in the rejection region (p-values < .05) There is evidence that both Price and Advertising affect pie sales at  = .05 From Excel output: Reject H0 for each variable Coefficients Standard Error t Stat P-value Price -24.97509 10.83213 -2.30565 0.03979 Advertising 74.13096 25.96732 2.85478 0.01449 Decision: Conclusion: Reject H0Reject H0 /2=.025 -tα/2 Do not reject H0 0 tα/2 /2=.025 -2.1788 2.1788 Example: Evaluating Individual Regression Coefficients Ch. 12-25Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 26. Confidence Interval Estimate for the Slope Confidence interval limits for the population slope βj Example: Form a 95% confidence interval for the effect of changes in price (x1) on pie sales: -24.975 ± (2.1788)(10.832) So the interval is -48.576 < β1 < -1.374 jbα/21,Knj Stb  Coefficients Standard Error Intercept 306.52619 114.25389 Price -24.97509 10.83213 Advertising 74.13096 25.96732 where t has (n – K – 1) d.f. Here, t has (15 – 2 – 1) = 12 d.f. Ch. 12-26Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 27. Confidence Interval Estimate for the Slope Confidence interval for the population slope βi Example: Excel output also reports these interval endpoints: Weekly sales are estimated to be reduced by between 1.37 to 48.58 pies for each increase of $1 in the selling price Coefficients Standard Error … Lower 95% Upper 95% Intercept 306.52619 114.25389 … 57.58835 555.46404 Price -24.97509 10.83213 … -48.57626 -1.37392 Advertising 74.13096 25.96732 … 17.55303 130.70888 (continued) Ch. 12-27Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 28. Test on All Coefficients  F-Test for Overall Significance of the Model  Shows if there is a linear relationship between all of the X variables considered together and Y  Use F test statistic  Hypotheses: H0: β1 = β2 = … = βk = 0 (no linear relationship) H1: at least one βi ≠ 0 (at least one independent variable affects Y) Ch. 12-28 12.5 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 29. F-Test for Overall Significance  Test statistic: where F has k (numerator) and (n – K – 1) (denominator) degrees of freedom  The decision rule is 1)KSSE/(n SSR/K s MSR F 2 e   α1,Knk,0 FFifHReject  Ch. 12-29Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 30. F-Test for Overall Significance 6.5386 2252.8 14730.0 MSE MSR F  Regression Statistics Multiple R 0.72213 R Square 0.52148 Adjusted R Square 0.44172 Standard Error 47.46341 Observations 15 ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12 27033.306 2252.776 Total 14 56493.333 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993 57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732 2.85478 0.01449 17.55303 130.70888 (continued) With 2 and 12 degrees of freedom P-value for the F-Test Ch. 12-30Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 31. F-Test for Overall Significance H0: β1 = β2 = 0 H1: β1 and β2 not both zero  = .05 df1= 2 df2 = 12 Test Statistic: Decision: Conclusion: Since F test statistic is in the rejection region (p- value < .05), reject H0 There is evidence that at least one independent variable affects Y 0  = .05 F.05 = 3.885 Reject H0Do not reject H0 6.5386 MSE MSR F  Critical Value: F = 3.885 (continued) F Ch. 12-31Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 32. Tests on a Subset of Regression Coefficients  Consider a multiple regression model involving variables xj and zj , and the null hypothesis that the z variable coefficients are all zero: r)1,...,(j0αofoneleastat:H 0ααα:H j1 r210    irir1i1KiK1i10i εzαzαxβxββy   Ch. 12-32Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 33. Tests on a Subset of Regression Coefficients  Goal: compare the error sum of squares for the complete model with the error sum of squares for the restricted model  First run a regression for the complete model and obtain SSE  Next run a restricted regression that excludes the z variables (the number of variables excluded is r) and obtain the restricted error sum of squares SSE(r)  Compute the F statistic and apply the decision rule for a significance level  (continued) α1,rKnr,2 e 0 F s r/)SSESSE(r)( FifHReject    Ch. 12-33Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 34. Prediction  Given a population regression model  then given a new observation of a data point (x1,n+1, x 2,n+1, . . . , x K,n+1) the best linear unbiased forecast of yn+1 is  It is risky to forecast for new X values outside the range of the data used to estimate the model coefficients, because we do not have data to support that the linear model extends beyond the observed range. n),1,2,(iεxβxβxββy iKiK2i21i10i   1nK,K1n2,21n1,101n xbxbxbby   ˆ ^ Ch. 12-34 12.6 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 35. Using The Equation to Make Predictions Predict sales for a week in which the selling price is $5.50 and advertising is $350: Predicted sales is 428.62 pies 428.62 (3.5)74.131(5.50)24.975-306.526 ertising)74.131(Advce)24.975(Pri-306.526Sales    Note that Advertising is in $100’s, so $350 means that X2 = 3.5 Ch. 12-35Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 36. Nonlinear Regression Models  The relationship between the dependent variable and an independent variable may not be linear  Can review the scatter diagram to check for non-linear relationships  Example: Quadratic model  The second independent variable is the square of the first variable εXβXββY 2 12110  Ch. 12-36 12.7 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 37. Quadratic Regression Model  where: β0 = Y intercept β1 = regression coefficient for linear effect of X on Y β2 = regression coefficient for quadratic effect on Y εi = random error in Y for observation i i 2 1i21i10i εxβxββy  Model form: Ch. 12-37Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 38. Linear vs. Nonlinear Fit Linear fit does not give random residuals Nonlinear fit gives random residuals  X residuals X Y X residuals Y X Ch. 12-38Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 39. Quadratic Regression Model Quadratic models may be considered when the scatter diagram takes on one of the following shapes: X1 Y X1X1 YYY β1 < 0 β1 > 0 β1 < 0 β1 > 0 β1 = the coefficient of the linear term β2 = the coefficient of the squared term X1 β2 > 0 β2 > 0 β2 < 0 β2 < 0 i 2 1i21i10i εXβXββY  Ch. 12-39Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 40. Testing for Significance: Quadratic Effect  Testing the Quadratic Effect  Compare the linear regression estimate  with quadratic regression estimate  Hypotheses  (The quadratic term does not improve the model)  (The quadratic term improves the model) 2 12110 xbxbby ˆ 110 xbby ˆ H0: β2 = 0 H1: β2  0 Ch. 12-40Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 41. Testing for Significance: Quadratic Effect  Testing the Quadratic Effect Hypotheses  (The quadratic term does not improve the model)  (The quadratic term improves the model)  The test statistic is H0: β2 = 0 H1: β2  0 (continued) 2b 22 s βb t   3nd.f.  where: b2 = squared term slope coefficient β2 = hypothesized slope (zero) Sb = standard error of the slope 2 Ch. 12-41Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 42. Testing for Significance: Quadratic Effect  Testing the Quadratic Effect Compare R2 from simple regression to R2 from the quadratic model  If R2 from the quadratic model is larger than R2 from the simple model, then the quadratic model is a better model (continued) Ch. 12-42Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 43. Example: Quadratic Model  Purity increases as filter time increases: Purity Filter Time 3 1 7 2 8 3 15 5 22 7 33 8 40 10 54 12 67 13 70 14 78 15 85 15 87 16 99 17 Purity vs. Time 0 20 40 60 80 100 0 5 10 15 20 Time Purity Ch. 12-43Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 44. Example: Quadratic Model  Simple regression results: y = -11.283 + 5.985 Time (continued) Regression Statistics R Square 0.96888 Adjusted R Square 0.96628 Standard Error 6.15997 Coefficients Standard Error t Stat P-value Intercept -11.28267 3.46805 -3.25332 0.00691 Time 5.98520 0.30966 19.32819 2.078E-10 F Significance F 373.57904 2.0778E-10 ^ Time Residual Plot -10 -5 0 5 10 0 5 10 15 20 Time Residuals t statistic, F statistic, and R2 are all high, but the residuals are not random: Ch. 12-44Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 45. Example: Quadratic Model Coefficients Standard Error t Stat P-value Intercept 1.53870 2.24465 0.68550 0.50722 Time 1.56496 0.60179 2.60052 0.02467 Time-squared 0.24516 0.03258 7.52406 1.165E-05 Regression Statistics R Square 0.99494 Adjusted R Square 0.99402 Standard Error 2.59513 F Significance F 1080.7330 2.368E-13  Quadratic regression results: y = 1.539 + 1.565 Time + 0.245 (Time)2^ (continued) Time Residual Plot -5 0 5 10 0 5 10 15 20 Time Residuals Time-squared Residual Plot -5 0 5 10 0 100 200 300 400 Time-squared Residuals The quadratic term is significant and improves the model: R2 is higher and se is lower, residuals are now random Ch. 12-45Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 46. The Log Transformation  Original multiplicative model  Transformed multiplicative model The Multiplicative Model: εXXβY 21 β 2 β 10 )log(ε)log(Xβ)log(Xβ)log(βlog(Y) 22110  Ch. 12-46Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 47. Interpretation of coefficients For the multiplicative model:  When both dependent and independent variables are logged:  The coefficient of the independent variable Xk can be interpreted as a 1 percent change in Xk leads to an estimated bk percentage change in the average value of Y  bk is the elasticity of Y with respect to a change in Xk i1i10i εlogXlogββlogYlog  Ch. 12-47Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 48. Dummy Variables  A dummy variable is a categorical independent variable with two levels:  yes or no, on or off, male or female  recorded as 0 or 1  Regression intercepts are different if the variable is significant  Assumes equal slopes for other variables  If more than two levels, the number of dummy variables needed is (number of levels - 1) Ch. 12-48 12.8 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 49. Dummy Variable Example Let: y = Pie Sales x1 = Price x2 = Holiday (X2 = 1 if a holiday occurred during the week) (X2 = 0 if there was no holiday that week) 210 xbxbby 21 ˆ Ch. 12-49Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 50. Dummy Variable Example Same slope (continued) x1 (Price) y (sales) b0 + b2 b0 1010 12010 xbb(0)bxbby xb)b(b(1)bxbby 121 121   ˆ ˆ Holiday No Holiday Different intercept If H0: β2 = 0 is rejected, then “Holiday” has a significant effect on pie sales Ch. 12-50Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 51. Interpreting the Dummy Variable Coefficient Sales: number of pies sold per week Price: pie price in $ Holiday: Example: 1 If a holiday occurred during the week 0 If no holiday occurred b2 = 15: on average, sales were 15 pies greater in weeks with a holiday than in weeks without a holiday, given the same price )15(Holiday30(Price)-300Sales  Ch. 12-51Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 52. Interaction Between Explanatory Variables  Hypothesizes interaction between pairs of x variables  Response to one x variable may vary at different levels of another x variable  Contains two-way cross product terms  )x(xbxbxbb xbxbxbby 21322110 3322110  ˆ Ch. 12-52Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 53. Effect of Interaction  Given:  Without interaction term, effect of X1 on Y is measured by β1  With interaction term, effect of X1 on Y is measured by β1 + β3 X2  Effect changes as X2 changes 21322110 1231220 XXβXβXββ )XXβ(βXββY   Ch. 12-53Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 54. Interaction Example x2 = 1: y = 1 + 2x1 + 3(1) + 4x1(1) = 4 + 6x1 x2 = 0: y = 1 + 2x1 + 3(0) + 4x1(0) = 1 + 2x1 Slopes are different if the effect of x1 on y depends on x2 value x1 4 8 12 0 0 10.5 1.5 y Suppose x2 is a dummy variable and the estimated regression equation is 2121 x4x3x2x1y ˆ ^ ^ Ch. 12-54Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 55. Significance of Interaction Term  The coefficient b3 is an estimate of the difference in the coefficient of x1 when x2 = 1 compared to when x2 = 0  The t statistic for b3 can be used to test the hypothesis  If we reject the null hypothesis we conclude that there is a difference in the slope coefficient for the two subgroups 0β0,β|0β:H 0β0,β|0β:H 2131 2130   Ch. 12-55Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 56. Multiple Regression Assumptions Assumptions:  The errors are normally distributed  Errors have a constant variance  The model errors are independent ei = (yi – yi) < Errors (residuals) from the regression model: Ch. 12-56 12.9 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 57. Analysis of Residuals in Multiple Regression  These residual plots are used in multiple regression:  Residuals vs. yi  Residuals vs. x1i  Residuals vs. x2i  Residuals vs. time (if time series data) < Use the residual plots to check for violations of regression assumptions Ch. 12-57Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  • 58. Chapter Summary  Developed the multiple regression model  Tested the significance of the multiple regression model  Discussed adjusted R2 ( R2 )  Tested individual regression coefficients  Tested portions of the regression model  Used quadratic terms and log transformations in regression models  Explained dummy variables  Evaluated interaction effects  Discussed using residual plots to check model assumptions Ch. 12-58Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
  翻译: