Questions
Give a concrete example (names of variables, context) where it makes sense to compute a partial correlation. Why a partial rather than a semipartial?
Why is the squared semipartial always less than or equal to the squared partial?
Give a concrete example where it makes sense to compute a semipartial correlation. Why semi rather than partial?
Why is regression more closely related to semipartials than partials?
How could you use ordinary regression to compute 3rd order partials?
Partial Correlation
People differ in many ways. When one difference is correlated with an outcome, we cannot be sure the correlation is not spurious. We would like to hold third variables constant, but we cannot manipulate them. We can use statistical control instead. Statistical control is based on residuals. If we regress X2 on X1 and take the residuals of X2, this part of X2 will be uncorrelated with X1, so anything the X2 residuals correlate with cannot be explained by X1.
Example of Partials
Use SAT to predict grades (high school GPA and college freshman GPA): HS = .8557 + .0043*SAT; F = .9563 + .0038*SAT.

Person  SAT-V  HSGPA  FGPA  PFGPA  E1 (HS)  E2 (F)
1       500    3.0    2.8   2.86   -0.01    -0.06
2       550    3.2    3.0   3.05   -0.02    -0.05
3       450    2.8    2.8   2.67    0.01     0.13
4       400    2.5    2.2   2.48   -0.08    -0.28
5       600    3.2    3.3   3.24   -0.24     0.06
6       650    3.8    3.3   3.43    0.15    -0.13
7       700    3.9    3.5   3.61    0.03    -0.12
8       550    3.8    3.7   3.05    0.58     0.65
9       650    3.5    3.4   3.43   -0.15    -0.03
10      550    3.1    2.9   3.05   -0.12    -0.15

PFGPA is the FGPA predicted from SAT; E1 and E2 are the residuals of HSGPA and FGPA after each is regressed on SAT.
Note that P and SAT are perfectly correlated. P & SAT do not correlate with E1 or E2 (residuals).
             FGPA   SAT (or P)   E1     E2
FGPA         1      .81          .45    .58
SAT (or P)          1            .00    .00
E1                               1      .74
E2                                      1
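The residual logic of this example can be checked numerically. The sketch below uses only the Python standard library: it refits the two regressions on the ten cases in the table, forms the residuals E1 and E2, and correlates them. The helper names (`pearson`, `fit_line`, `residuals`) are illustrative choices and do not come from the slides. Because the code works with unrounded residuals, its results can differ from the two-decimal slide values in the second decimal place.

```python
import math

# The ten cases from the SAT / GPA table.
sat   = [500, 550, 450, 400, 600, 650, 700, 550, 650, 550]
hsgpa = [3.0, 3.2, 2.8, 2.5, 3.2, 3.8, 3.9, 3.8, 3.5, 3.1]
fgpa  = [2.8, 3.0, 2.8, 2.2, 3.3, 3.3, 3.5, 3.7, 3.4, 2.9]

def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def fit_line(x, y):
    """Least-squares (intercept, slope) for predicting y from x."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return my - slope * mx, slope

def residuals(x, y):
    """The part of y that is left after predicting it from x."""
    intercept, slope = fit_line(x, y)
    return [b - (intercept + slope * a) for a, b in zip(x, y)]

hs_intercept, hs_slope = fit_line(sat, hsgpa)   # close to .8557 and .0043
f_intercept, f_slope = fit_line(sat, fgpa)      # close to .9563 and .0038

e1 = residuals(sat, hsgpa)   # HSGPA with SAT removed
e2 = residuals(sat, fgpa)    # FGPA with SAT removed

r_sat_e1 = pearson(sat, e1)  # zero apart from floating-point error
r_sat_e2 = pearson(sat, e2)  # zero apart from floating-point error
r_e1_e2 = pearson(e1, e2)    # correlation between the two sets of residuals

print(round(hs_intercept, 4), round(hs_slope, 4))
print(round(r_sat_e1, 6), round(r_sat_e2, 6), round(r_e1_e2, 3))
```

The correlation between the two residual columns is the quantity the slides call the partial correlation between HSGPA and FGPA with SAT held constant.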
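A partial correlation is the correlation between the residuals of the two GPAs, that is, the correlation between HS GPA and FGPA holding SAT constant. With 1 = HSGPA, 2 = FGPA, and 3 = SAT:

$$r_{12.3} = \frac{r_{12} - r_{13} r_{23}}{\sqrt{(1 - r_{13}^2)(1 - r_{23}^2)}}$$

The partial correlation can be a little or a lot bigger or smaller than the original.

$$r_{12.3} = \frac{.92 - (.87)(.81)}{\sqrt{(1 - .87^2)(1 - .81^2)}} = .74$$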
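The same partial can be obtained without raw data, from the three zero-order correlations alone. This is a minimal sketch of the standard first-order formula; the function name `partial_r` is a hypothetical label, and the inputs are the two-decimal correlations quoted in this example (r12 = .92, r13 = .87, r23 = .81, with 1 = HSGPA, 2 = FGPA, 3 = SAT).

```python
import math

def partial_r(r12, r13, r23):
    """First-order partial: correlation of 1 and 2 with 3 held constant for both."""
    return (r12 - r13 * r23) / math.sqrt((1 - r13 ** 2) * (1 - r23 ** 2))

# 1 = HSGPA, 2 = FGPA, 3 = SAT (rounded correlations from the example).
r_hs_f_given_sat = partial_r(0.92, 0.87, 0.81)
print(round(r_hs_f_given_sat, 2))  # 0.74
```

Here the zero-order correlation of .92 drops to about .74 once SAT is held constant, which illustrates how far a partial can move away from the original correlation.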
$$r_{Y1.2}^2 = \frac{R_{Y.12}^2 - R_{Y.2}^2}{1 - R_{Y.2}^2}$$

[Figure: Venn diagrams of Y, X1, and X2 with numbered regions. Labeled areas: Shared Y, UY:X1, $R_{Y.12}^2$, $R_{Y.12}^2 - R_{Y.2}^2$, and $1 - R_{Y.2}^2$.]
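Find a Partial
What is the correlation between trait anxiety and the number of doctor visits, controlling for family medical history?

                 1      2      3
1 ANX            1      .20    .35
2 Fam History           1      .15
3 DOC Visit                    1

$$r_{13.2} = \frac{r_{13} - r_{12} r_{23}}{\sqrt{(1 - r_{12}^2)(1 - r_{23}^2)}} = \frac{.35 - (.20)(.15)}{\sqrt{(1 - .20^2)(1 - .15^2)}} = .33$$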
Semipartial Correlation
With partial correlation, we find the correlation between X and Y holding Z constant for both X and Y. Sometimes we want to hold Z constant for just X or just Y. Because Z is held constant for only one of the two variables instead of both, the result is a semipartial correlation rather than a partial. With a semipartial, we find the residuals of X on Z or of Y on Z, but the other variable stays in its original, raw form: we correlate one raw variable with one residual. In our example, we found the correlation between E1 (HSGPA) and FGPA to be .45. This is the semipartial correlation between HSGPA and FGPA holding SAT constant for HSGPA only.
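$$r_{1(2.3)} = \frac{r_{12} - r_{13} r_{23}}{\sqrt{1 - r_{23}^2}} \quad \text{and} \quad r_{2(1.3)} = \frac{r_{12} - r_{13} r_{23}}{\sqrt{1 - r_{13}^2}}$$

Note that r1(2.3) means the semipartial correlation between variables 1 and 2 where 3 is partialled only from 2. In our example:

$$r_{1(2.3)} = \frac{.92 - (.87)(.81)}{\sqrt{1 - .81^2}} = .37 \qquad r_{2(1.3)} = \frac{.92 - (.87)(.81)}{\sqrt{1 - .87^2}} = .44$$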
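Both semipartials share one numerator and differ only in which variable's residual variance appears in the denominator. The sketch below makes that explicit; `semipartial_r` is a hypothetical helper name, and the inputs are the rounded correlations from the example (1 = HSGPA, 2 = FGPA, 3 = SAT).

```python
import math

def semipartial_r(r_ab, r_ac, r_bc):
    """Semipartial r_a(b.c): c is removed from b only, while a stays raw."""
    return (r_ab - r_ac * r_bc) / math.sqrt(1 - r_bc ** 2)

r12, r13, r23 = 0.92, 0.87, 0.81  # 1 = HSGPA, 2 = FGPA, 3 = SAT

sr_1_23 = semipartial_r(r12, r13, r23)  # SAT removed from FGPA only
sr_2_13 = semipartial_r(r12, r23, r13)  # SAT removed from HSGPA only

print(round(sr_1_23, 2), round(sr_2_13, 2))  # 0.37 0.44
```

Swapping the second and third arguments is all it takes to switch which variable is residualized, which is why the two results differ even though the three input correlations are the same.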
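Partial:

$$r_{Y1.2}^2 = \frac{R_{Y.12}^2 - R_{Y.2}^2}{1 - R_{Y.2}^2}$$

Semipartial:

$$r_{Y(1.2)}^2 = R_{Y.12}^2 - R_{Y.2}^2$$

[Figure: Venn diagrams of Y, X1, and X2 for the partial and the semipartial. Labeled areas: Shared Y, Shared X, UY:X1, and UY:X2.]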
Why is the squared partial larger than the squared semipartial? Look at the respective areas for Y.
[Figure: Venn diagram with X1 and X2; labeled area $1 - R_{Y.2}^2$.]
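The comparison of the two squared coefficients can be made concrete with R-square values. Both have the same numerator, the increment in R-square when X1 is added after X2, but the squared partial divides that increment by 1 - R-square(Y.2), which is never larger than 1, so it can never be the smaller of the two. The sketch below assumes a relabeling of the GPA example (Y = FGPA, X1 = HSGPA, X2 = SAT) and a hypothetical helper `r2_two_predictors` that uses the standard two-predictor formula for R-square.

```python
def r2_two_predictors(r_y1, r_y2, r_12):
    """Squared multiple correlation of Y with X1 and X2, from the three correlations."""
    return (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)

# Assumed labeling for this sketch: Y = FGPA, X1 = HSGPA, X2 = SAT.
r_y1, r_y2, r_12 = 0.92, 0.81, 0.87

r2_full = r2_two_predictors(r_y1, r_y2, r_12)  # R-square with X1 and X2
r2_x2_only = r_y2 ** 2                         # R-square with X2 alone

sq_semipartial = r2_full - r2_x2_only                    # share of all of Y
sq_partial = (r2_full - r2_x2_only) / (1 - r2_x2_only)   # share of Y not explained by X2

print(round(sq_semipartial, 3), round(sq_partial, 3))
```

The two values are equal only when X2 explains none of Y, in which case the denominator of the squared partial is exactly 1.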
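Semipartial correlation:

$$r_{Y(1.2)} = \frac{r_{Y1} - r_{Y2} r_{12}}{\sqrt{1 - r_{12}^2}}$$

Standardized regression coefficient (two predictors):

$$\beta_1 = \frac{r_{Y1} - r_{Y2} r_{12}}{1 - r_{12}^2}$$

The difference is the square root in the denominator. The regression coefficient can exceed 1.0 in absolute value; the correlation cannot.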
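A small numeric check shows how the standardized regression weight and the semipartial diverge. The correlations below are hypothetical values chosen only so that the weight exceeds 1.0; they are not taken from the slides. The sketch assumes the standard two-predictor formulas, in which the beta weight and the semipartial share a numerator and the beta weight divides by 1 - r12 squared rather than by its square root.

```python
import math

# Hypothetical correlations, chosen for illustration only.
# They form an admissible correlation matrix (its determinant is positive).
r_y1, r_y2, r_12 = 0.8, 0.4, 0.8

numerator = r_y1 - r_y2 * r_12

beta_1 = numerator / (1 - r_12 ** 2)                   # standardized regression weight
semipartial_1 = numerator / math.sqrt(1 - r_12 ** 2)   # semipartial r_Y(1.2)

print(round(beta_1, 3), round(semipartial_1, 3))
```

With these inputs the beta weight is about 1.33 while the semipartial is .80, so the weight leaves the range of a correlation and the semipartial does not.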
The semipartial is most often used when we want to show that some variable explains incremental variance in Y above and beyond the other X variables.
Pilot performance and cognitive ability, motor skills
Patient well-being and surgery, social support
Review
Give a concrete example (names of variables, context) where it makes sense to compute a partial correlation. Why a partial rather than a semipartial?
Give a concrete example where it makes sense to compute a semipartial correlation. Why semi rather than partial?
Suppressor Effects
Hard to understand, but:
Inspection of r not enough to tell value
Need to know to avoid looking dumb
Show problems with Venn diagrams
Think of observed variable as composite of different stuff, e.g., satisfaction with car (price, prestige, etc.)
Note that X2 is correlated with X1 but NOT with Y. Will X2 be useful in a regression equation?
If we solve for beta weights, we find beta1 = .667 and beta2 = -.333. Notice that the beta weight for the first is actually larger than its r (.50), and the second has become negative. It can also happen that r is (usually slightly) positive and beta is negative. This is a suppressor effect. Always inspect your correlations along with your regression weights to see if this is happening.
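What does it mean that beta2 is negative? Sometimes people forget that there are other X variables in the equation and interpret the weight in isolation, for example: "The results mean that we should feed people more to get them to lose weight." A beta weight describes its X with the other X variables held constant, so it should not be read as if that X stood alone.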
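The beta weights in the suppressor example can be reproduced from the standard two-predictor formulas. The text gives rY1 = .50 and says X2 is uncorrelated with Y (rY2 = 0). The correlation between X1 and X2 does not survive in the text, so r12 = .50 below is an assumption: it is the value that reproduces the reported weights of .667 and -.333.

```python
# rY1 = .50 and rY2 = 0 are stated in the example.
# r12 = .50 is ASSUMED: it is inferred from the reported beta weights.
r_y1, r_y2, r_12 = 0.50, 0.00, 0.50

beta_1 = (r_y1 - r_y2 * r_12) / (1 - r_12 ** 2)
beta_2 = (r_y2 - r_y1 * r_12) / (1 - r_12 ** 2)

# With standardized variables, R-square is the sum of beta times r over predictors.
r2_both = beta_1 * r_y1 + beta_2 * r_y2
r2_x1_only = r_y1 ** 2

print(round(beta_1, 3), round(beta_2, 3))
print(round(r2_x1_only, 3), round(r2_both, 3))
```

Under that assumption, R-square rises from .25 with X1 alone to about .33 with both predictors, so X2 is useful in the equation even though it has no correlation with Y: it removes variance in X1 that is unrelated to Y.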
Review
Why is the squared semipartial always less than or equal to the squared partial? Why is regression more closely related to semipartials than partials? How could you use ordinary regression to compute 3rd order partials?
Find a Semipartial
What is the correlation between Y and X1 holding X2 constant only for X1?

      Y      X1     X2
Y     1      .20    .30
X1           1      .40
X2                  1

$$r_{Y(1.2)} = \frac{r_{Y1} - r_{Y2} r_{12}}{\sqrt{1 - r_{12}^2}} = \frac{.20 - (.30)(.40)}{\sqrt{1 - .40^2}} = .087$$

The correlation of X1 with Y after controlling for X2 (from X1 only) is rather small.
Computer Exercise
Go to labs and download 2IV Example. Find the partial correlation between hassles and well being holding gender and anger constant (2nd order partial). Find the squared semipartial for anger when well being is the DV and gender and hassles are the other IVs, that is, find the increment in R-square when anger is added to the equation after gender and hassles.
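One possible way to approach both parts of the exercise with ordinary regression is sketched below. A higher-order partial is the correlation between two sets of residuals, each taken from a regression on all of the control variables, and the squared semipartial is the increment in R-square when the variable of interest enters last. The scores are made up for illustration and are NOT the lab's 2IV Example data; the helpers `ols_residuals` and `r_squared` are hypothetical names, written in plain Python so the sketch has no dependencies.

```python
import math

def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def ols_residuals(y, predictors):
    """Residuals of y after least-squares regression on the predictors plus an intercept."""
    n = len(y)
    cols = [[1.0] * n] + [[float(v) for v in p] for p in predictors]
    k = len(cols)
    # Augmented normal equations [X'X | X'y].
    aug = [[sum(a * b for a, b in zip(ci, cj)) for cj in cols]
           + [sum(a * b for a, b in zip(ci, y))] for ci in cols]
    # Gauss-Jordan elimination with partial pivoting.
    for i in range(k):
        pivot = max(range(i, k), key=lambda r: abs(aug[r][i]))
        aug[i], aug[pivot] = aug[pivot], aug[i]
        for r in range(k):
            if r != i:
                factor = aug[r][i] / aug[i][i]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[i])]
    coefs = [aug[i][k] / aug[i][i] for i in range(k)]
    return [y[t] - sum(c * col[t] for c, col in zip(coefs, cols)) for t in range(n)]

def r_squared(y, predictors):
    """R-square for the regression of y on the predictors."""
    res = ols_residuals(y, predictors)
    my = sum(y) / len(y)
    return 1 - sum(e * e for e in res) / sum((v - my) ** 2 for v in y)

# Made-up scores for eight people (hypothetical, for illustration only).
gender    = [0, 1, 0, 1, 0, 1, 0, 1]
anger     = [2, 5, 3, 6, 1, 4, 5, 2]
hassles   = [3, 7, 4, 6, 2, 5, 8, 3]
wellbeing = [8, 5, 6, 5, 9, 7, 4, 6]

# 2nd-order partial: residualize BOTH variables on gender and anger, then correlate.
partial_hw = pearson(ols_residuals(hassles, [gender, anger]),
                     ols_residuals(wellbeing, [gender, anger]))

# Squared semipartial for anger: increment in R-square when anger enters last.
sr2_anger = (r_squared(wellbeing, [gender, hassles, anger])
             - r_squared(wellbeing, [gender, hassles]))

print(round(partial_hw, 3), round(sr2_anger, 3))
```

A 3rd order partial follows the same pattern with three control variables in the predictor list. Statistical packages report these quantities directly, so this sketch is meant only to show what the package is doing.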