Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Analysis U4320
Lecture 13: Explaining
Variation
Prof. Sharyn OHalloran
I. Explaining Variation: R2
Money Spent
on Health Care
x
x
6
5
4
20
30
x
Y=a+bx
x
Y
10
40
50 60 70
Income
I. Explaining Variation: R2
Total Variation
$ Spent on
Health Care
Y
unexplained by regression
x Y-Y=deviation
x
8
(x,y)
7
6
5
4
5.9
x
Y=a+bx
10
20
30
40
50 60 70
Income
I. Explaining Variation: R2
Total Deviation
Y Y
(Y$ Y )
Total =
Explained +
Deviation Deviation
(Y Y$) .
Unexplained
Deviation
Y to is the
The total distance from any point
sum of the distance from Y to the regression
Y
line plus the distance from the regression
line
to .
I. Explaining Variation: R2
B. Sums of Squares
(Y Y )
2
$
$
(Y Y ) (Y Y )
I. Explaining Variation: R2
I. Explaining Variation: R2
I. Explaining Variation: R2
C. Definition of R2
I. Explaining Variation: R2
Y
Money Spent
on Health Care
x
x
6
5
4
x
Y
10
20
30
40
50
60
70
Income
I. Explaining Variation: R2
Money Spent
on Health Care
6
5
4
10
20
30
40
50
60
70
Income
I. Explaining Variation: R2
3. Formula
R2
SSR
SST
I. Explaining Variation: R2
I. Explaining Variation: R2
D. Example
1. Dependent variable
TRUSTTV = 1 if has a lot of confidence
= 2 if somewhat confidence
= 3 if the individual has no confidence.
2. Independent variables
I. Explaining Variation: R2
3. Calculating R2
a) Correlation matrix
TRUSTTV TUBETIME
TRUSTTV
1.000
TUBETIME -.177
SKOOL
.112
LIKEJPAN
.043
YELOWSTN .003
MYSIGN
-.038
-.177
1.000
-.272
.080
-.137
.053
.112
-.272
1.000
-.072
-.016
.012
.043
.080
-.072
1.000
.040
-.001
.003
-.137
-.016
.040
1.000
-.020
-.038
.053
.012
-.001
-.020
1.000
I. Explaining Variation: R2
Analysis of Variance
DF Sum of Squares
Regression
1
5.90547
Residual
468
183.61793
Mean Square
5.90547
0.39235
I. Explaining Variation: R2
SSR
SST
5.90
189.54
.031
R2
I. Explaining Variation: R2
I. Explaining Variation: R2
F. Using R2 in Practice
1. Useful Tool
2. Measure of Unexplained Variance
3. Not a Statistical Test
4. Don't Obsess about R2
5. You can always improve R2 by adding
variables
I. Explaining Variation: R2
G. Example
II. Adjusted R2
II. Adjusted R2
A. Definition of Adjusted R2
n 1
(1 R2 );
nk
n number of observations,
k number of independent variables.
R2 1
II. Adjusted R2
1. Adjusted R2
II. Adjusted R2
2. Calculating Adjusted R2
Example: Equation 2
Multiple R
.18856
R Square
.03555
Adjusted R Square .03142
Standard Error
.62562
Analysis of Variance
DF
Sum of Squares
Regression
2
6.73848
Residual
F=
467
8.60813
182.78492
Mean Square
3.36924
.39140
Signif F = .0002
II. Adjusted R2
Variable
SKOOL
B
.015524
SE B
.010641
Beta
.068897
T
1.459
Sig T
.1453
TUBETIME
-.048228
.014436 -.157772
-3.341
.0009
(Constant)
2.127991
.158260
13.446
.0000
We calculate:
n 1
R2 1
(1 R2 )
nk
470 1
1
(1.03555)
470 3
.0314
II. Adjusted R2
C. Stepwise Regression
III. F Tests
III. F Tests
III. F Tests
B. Equations
III. F Tests
III. F Tests
D. Calculating an F-Test
If the adjusted R2 goes up, then you need to do
a more complicated test, F-Test.
1. Ratio
SSE1 SSE2
SSE2
III. F Tests
2. Correction
III. F Tests
2. Correction (cont.)
Fnm k
III. F Tests
E. Example
III. F Tests
H0: 4 = 5 = 0.
III. F Tests
N = 470
182.07 18182
.
2
F
18182
.
470 6
= 0.319
k=6
III. F Tests
B=0 0.319
3.0
III. F Tests
Equation 2
Multiple R
.18856
R Square
.03555
Adjusted R Square .03142
Standard Error
.62562
Analysis of Variance
DF
Sum of Squares
Regression
2
6.73848
Residual
F=
467
8.60813
182.78492
Signif F = .0002
Mean Square
3.36924
.39140
III. F Tests
Variable
SE B
Beta
Sig T
SKOOL
.015524
.010641
.068897
1.459
.1453
TUBETIME
-.048228
.014436
-.157772
-3.341
.0009
(Constant)
2.127991
.158260
13.446
.0000
a) Hypothesis:
H0: 1 = 2 = 0.
SSE1 SSE2
m
F
,
SSE2
nk
III. F Tests
b) Calculate F-statistic
SSE = 182.78
2
189.54 182.78
2
F
182.78
470 3
= 8.61.
III. F Tests
F470
The critical value at the 5% level,
3
from your table, is 3.00.
So this time we can reject the null
hypothesis that 1 = 2 = 0.