William G. Zikmund
All rights reserved. Requests for permission to make copies of any part of the work should be mailed to the following address: Permissions Department, Harcourt, Inc., 6277 Sea Harbor Drive, Orlando, Florida 32887-6777.
Measures of association
A general term that refers to a number of bivariate statistical techniques used to measure the strength of a relationship between two variables.
Type of Measurement / Measure of Association
Ordinal Scales
Nominal
Correlation coefficient
A statistical measure of the covariation or association between two variables. Are dollar sales associated with advertising dollar expenditures?
The correlation coefficient $r$ (written $r_{xy}$ for variables X and Y) ranges from +1 to -1:
r = +1: a perfect positive linear relationship
r = -1: a perfect negative linear relationship
r = 0: no correlation
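$$ r_{xy} = r_{yx} = \frac{\sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y})}{\sqrt{\sum_{i=1}^{n} (X_i - \bar{X})^2 \, \sum_{i=1}^{n} (Y_i - \bar{Y})^2}} $$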
Copyright 2000 by Harcourt, Inc. All rights reserved.
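$$ r_{xy} = r_{yx} = \frac{\sigma_{xy}}{\sqrt{\sigma_x^2 \, \sigma_y^2}} $$
where
$\sigma_x^2$ = Variance of X
$\sigma_y^2$ = Variance of Y
$\sigma_{xy}$ = Covariance of X and Y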
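The deviation form of the correlation coefficient can be sketched in a few lines of Python. This is a minimal illustration, not the text's own procedure: the function name `pearson_r` and the advertising and sales figures are hypothetical and chosen only to show the arithmetic.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation coefficient r_xy computed from deviations about the means."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long samples with at least 2 points")
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Numerator: sum of cross-products of deviations (covariation of X and Y).
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    # Denominator: square root of the product of the two sums of squared deviations.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("r is undefined when a variable has zero variance")
    return sxy / sqrt(sxx * syy)

# Hypothetical data (assumption, not from the text).
advertising = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [2.0, 4.1, 5.9, 8.2, 9.8]
r = pearson_r(advertising, sales)
print(f"r = {r:.3f}")
```

Swapping the two arguments gives the same value, which is the sense in which $r_{xy} = r_{yx}$.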
CORRELATION PATTERNS
[Figures: scatter diagrams plotted against X illustrating correlation patterns. The only surviving panel title is NO CORRELATION.]
Calculation of r
$$ |r| = \frac{6.3389}{\sqrt{99.712}} \approx 0.635 $$
Coefficient of Determination
Correlation matrix
The standard form for reporting correlation results.
CORRELATION MATRIX
        Var1    Var2    Var3
Var1    1.0     0.45    0.31
Var2    0.45    1.0     0.10
Var3    0.31    0.10    1.0
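A correlation matrix of this shape can be built by computing r for every pair of variables. The sketch below assumes three short hypothetical columns named Var1 to Var3. It does not reproduce the values in the matrix above, and the helper names are illustrative only.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation coefficient computed from deviations about the means."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def correlation_matrix(columns):
    """Return {row_name: {col_name: r}} for every pair of variables."""
    names = list(columns)
    return {a: {b: pearson_r(columns[a], columns[b]) for b in names} for a in names}

# Hypothetical observations (assumption); they do not match the matrix in the text.
data = {
    "Var1": [1, 2, 3, 4, 5],
    "Var2": [2, 1, 4, 3, 5],
    "Var3": [5, 3, 4, 1, 2],
}
matrix = correlation_matrix(data)

# Print in the usual square layout: 1.0 on the diagonal, symmetric off it.
names = list(data)
print(" " * 6 + "".join(f"{name:>8}" for name in names))
for row in names:
    print(f"{row:<6}" + "".join(f"{matrix[row][col]:>8.2f}" for col in names))
```

Because $r_{xy} = r_{yx}$, the matrix is symmetric, so reports often show only the lower triangle.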
Law No. 3: Unless you can think of a logical reason why two variables should be connected as cause and effect, it doesn't help much to find a correlation between them. In Columbus, Ohio, the mean monthly rainfall correlates very nicely with the number of letters in the names of the months!
REGRESSION
DICTIONARY DEFINITION: GOING OR MOVING BACKWARD
BIVARIATE REGRESSION
A MEASURE OF LINEAR ASSOCIATION THAT INVESTIGATES A STRAIGHT-LINE RELATIONSHIP
USEFUL IN FORECASTING
Y intercept
a: An intercepted segment of a line. The point at which a regression line intercepts the Y-axis.
Slope
b: The inclination of a regression line as compared to a base line. Rise over run, $\Delta Y / \Delta X$, where $\Delta$ is the notation for "a change in".
[Figures: scatter diagrams with a fitted regression line, X on the horizontal axis (70 to 190) and Y on the vertical axis (80 to 160). Surviving annotations: the line $\hat{Y} = a + bX$ with slope $\Delta Y / \Delta X$; "Y hat for Dealer 3"; "Deviation not explained".]
$$ \sum_{i=1}^{n} e_i^2 \ \text{is minimum} $$
$$ e_i = Y_i - \hat{Y}_i \quad \text{(the residual)} $$
$Y_i$ = actual value of the dependent variable
$\hat{Y}_i$ = estimated value of the dependent variable (Y hat)
n = number of observations
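Bivariate Regression
$$ a = \bar{Y} - b\bar{X} $$
$$ b = \frac{n \sum XY - \sum X \sum Y}{n \sum X^2 - \left(\sum X\right)^2} $$
where
a = estimated Y intercept
b = estimated slope of the regression line
Y = dependent variable
$\bar{Y}$ = mean of the dependent variable
X = independent variable
$\bar{X}$ = mean of the independent variable
n = number of observations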
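The slope and intercept formulas translate directly into code. The following is a minimal sketch: the function name `fit_line` and the four data points are hypothetical, chosen so the fitted line is easy to check by hand.

```python
def fit_line(xs, ys):
    """Least-squares intercept a and slope b for the line Y-hat = a + b*X."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long samples with at least 2 points")
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    denominator = n * sum_x2 - sum_x ** 2
    if denominator == 0:
        raise ValueError("slope is undefined when all X values are equal")
    # b = (n*SUM(XY) - SUM(X)*SUM(Y)) / (n*SUM(X^2) - (SUM(X))^2)
    b = (n * sum_xy - sum_x * sum_y) / denominator
    # a = Y-bar - b * X-bar
    a = sum_y / n - b * (sum_x / n)
    return a, b

# Hypothetical data (assumption): the points lie exactly on Y = 1 + 2X.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
a, b = fit_line(xs, ys)
print(f"Y-hat = {a:.1f} + {b:.1f} X")
```

Because these points fall exactly on a line, every residual is zero and the sum of squared residuals is at its minimum of zero.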
$$ a = 99.8 - 0.54638(125) = 99.8 - 68.3 = 31.5 $$
$$ \hat{Y} = 31.5 + 0.546X = 31.5 + 0.546(89) = 31.5 + 48.6 = 80.1 $$
Other predicted values shown: $\hat{Y} = 121.6$ and $\hat{Y} = 83.4$
$$ \hat{Y}_9 = 31.5 + 0.546(119) = 96.5 $$
$$ e_9 = Y_9 - \hat{Y}_9 = 97 - 96.5 = 0.5 $$
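The prediction and residual arithmetic in this example can be checked with a short script. The intercept 31.5, slope .546, the X values 89 and 119, and the actual value 97 come from the example. The function names are illustrative assumptions.

```python
A = 31.5   # estimated intercept from the worked example
B = 0.546  # estimated slope from the worked example

def predict(x):
    """Predicted value Y-hat = a + b*X."""
    return A + B * x

def residual(y_actual, x):
    """Residual e = Y - Y-hat for one observation."""
    return y_actual - predict(x)

print(round(predict(89), 1))        # 80.1
print(round(predict(119), 1))       # 96.5
print(round(residual(97, 119), 1))  # 0.5
```

The unrounded residual is about 0.53. The example reports 0.5 because it subtracts the already rounded prediction, 96.5, from 97.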
F-test (regression)
A procedure to determine whether there is more variability explained by the regression or unexplained by the regression.
Analysis of variance summary table
"We are always acting on what has just finished happening. It happened at least 1/30th of a second ago. We think we're in the present, but we aren't. The present we know is only a movie of the past." Tom Wolfe, in The Electric Kool-Aid Acid Test
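$$ (Y_i - \bar{Y}) = (\hat{Y}_i - \bar{Y}) + (Y_i - \hat{Y}_i) $$
Total deviation = deviation explained by the regression + deviation unexplained by the regression
$\bar{Y}$ = Mean of the total group
$\hat{Y}_i$ = Value predicted with regression equation
$Y_i$ = Actual value
SUM OF SQUARES
$$ \sum (Y_i - \bar{Y})^2 = \sum (\hat{Y}_i - \bar{Y})^2 + \sum (Y_i - \hat{Y}_i)^2 $$
Total variation = explained variation + unexplained variation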
Coefficient of Determination ($r^2$)
The proportion of variance in Y that is explained by X (or vice versa). A measure obtained by squaring the correlation coefficient: the proportion of the total variance of a variable that is accounted for by knowing the value of another variable.
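Both descriptions of the coefficient of determination lead to the same number, which a small check makes concrete. The sketch below uses hypothetical data and an illustrative function name. It computes $r^2$ once by squaring r and once as explained variation divided by total variation.

```python
from math import sqrt

def r_squared_two_ways(xs, ys):
    """Return (r squared, explained variation / total variation)."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)  # total variation in Y
    r = sxy / sqrt(sxx * syy)
    # Least-squares line; sxy / sxx is algebraically the same slope as the
    # n*SUM(XY) form of the formula.
    b = sxy / sxx
    a = y_bar - b * x_bar
    explained = sum((a + b * x - y_bar) ** 2 for x in xs)
    return r ** 2, explained / syy

# Hypothetical data (assumption, not from the text).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
squared_r, share_explained = r_squared_two_ways(xs, ys)
print(f"r^2 = {squared_r:.2f}, explained share = {share_explained:.2f}")
```

For these points both values come to 0.60, meaning 60 percent of the variation in Y is accounted for by X.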
Source of variation          Degrees of freedom   Sum of squares   Mean squared
Explained by regression      k - 1                SSr              SSr/(k - 1)
Unexplained by regression    n - k                SSe              SSe/(n - k)
where k = number of estimated constants (variables) and n = number of observations
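The summary table can be filled in for any bivariate data set. The sketch below is an illustration under stated assumptions: the data are hypothetical, the function name is invented for this example, and k = 2 because a bivariate regression estimates two constants (intercept and slope). The F statistic is the ratio of the two mean squares.

```python
def regression_anova(xs, ys, k=2):
    """Sums of squares, mean squares, and F for a least-squares line."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx
    a = y_bar - b * x_bar
    y_hat = [a + b * x for x in xs]
    ss_r = sum((yh - y_bar) ** 2 for yh in y_hat)          # explained by regression
    ss_e = sum((y - yh) ** 2 for y, yh in zip(ys, y_hat))  # unexplained (residual)
    ms_r = ss_r / (k - 1)
    ms_e = ss_e / (n - k)
    if ms_e == 0:
        raise ValueError("F is undefined when the line fits the data perfectly")
    return {"SSr": ss_r, "SSe": ss_e, "MSr": ms_r, "MSe": ms_e, "F": ms_r / ms_e}

# Hypothetical data (assumption, not from the text).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
table = regression_anova(xs, ys)
for name, value in table.items():
    print(f"{name}: {value:.2f}")
```

With these five points, SSr = 3.6 on 1 degree of freedom and SSe = 2.4 on 3 degrees of freedom, so F = 3.6 / 0.8 = 4.5. The result would then be compared with a critical F value for (1, 3) degrees of freedom.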
$r^2$ in the example
MULTIPLE REGRESSION
EXTENSION OF BIVARIATE REGRESSION
MULTIDIMENSIONAL WHEN THREE OR MORE VARIABLES ARE INVOLVED
SIMULTANEOUSLY INVESTIGATES THE EFFECT OF TWO OR MORE VARIABLES ON A SINGLE DEPENDENT VARIABLE
DISCUSSED IN CHAPTER 24