Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
• Research Question
• A series of 11 questions were answered at the end of
the experiment after the completion of the text editing
task.
• Could they be grouped into a smaller set of
meaningful constructs?
• Here we see all the 11 items. They were all asked on a 1 to 7 scale.
• For items 1 to 8, 7 represented strongly agree; 1 strongly disagree.
• For items 9 to 11, the high score is shown
1. Looking at items 1 to 8, which two variables have the lowest means and
what does this suggests?
2. Looking at items 1 to 8, which two variables have the highest means and
what does this suggest?
4.1 Communalities
• I Extracted 3 components Communalities
Initial Extraction
Q1) During the tas k I often s et a s pecific goal for mys elf 1.000 .730
Q2) I always focus ed my attention on the text-editing tas k 1.000 .594
Q3) I often worried about not being able to reach my best poss ible
1.000 .634
s core
Q4) I often forgot about trying to reach my bes t poss ible s core 1.000 .548
Q5) I wondered about how my performance compared with others 1.000 .453
Q6) I cons tantly focus ed on trying to do my bes t 1.000 .732
Q7) I often daydreamed while doing the tas k 1.000 .834
Q8) I often let my mind wander while doing the task 1.000 .852
Q9) How difficult was this tas k? (High=Extremely Difficult) 1.000 .649
Q10) How much of your attention was required to perform this task?
1.000 .620
(High = A great deal of attention)
Q11) How press ured did you feel while performing this task?
1.000 .571
(High=Extremely Press ured)
Extraction Method: Principal Component Analys is .
1. What is the interpretation of the numbers in the extracted column?
2. Which variable/s have less than half their variance explained by the extracted
components?
3. How much variance in Q1 was explained by the three components?
4. Looking at the content of the items, which items might you expect to group
together?
How many factors to extract?
Total Variance Explained
Component
1 2 3
Q1) During the task I often s et a specific goal for myself .637 -.182 .540
Q2) I always focused my attention on the text-editing task .735 -.153 .173
Q3) I often worried about not being able to reach my bes t pos s ible s core .625 .288 .400
Q4) I often forgot about trying to reach my bes t pos s ible s core -.728 -.125 -.060
Q5) I wondered about how my performance compared with others .312 .340 .490
Q6) I cons tantly focus ed on trying to do my best .826 -.223 -.009
Q7) I often daydreamed while doing the tas k -.710 .253 .515
Q8) I often let my mind wander while doing the tas k -.620 .193 .656
Q9) How difficult was this task? (High=Extremely Difficult) -.118 .769 -.208
Q10) How much of your attention was required to perform this tas k? (High =
.369 .573 -.395
A great deal of attention)
Q11) How pres s ured did you feel while performing this tas k?
.313 .686 -.054
(High=Extremely Pres s ured)
Extraction Method: Principal Component Analys is.
1. Which
a. variables load highly on component
3 components extracted.
1? Which of them are positive and which are
negative?
2. How would you describe someone who was high on component 1?
3. Which variables load highly on component 2? Which of them are positive and which are
negative?
4. How would you describe someone who was high on component 2?
5. Which variables load highly on component 3? Which of them are positive and which are
negative?
6. How would you describe someone who was high on component 3?
Total Variance Explained
& Variance 3
4
5
1.625
.798
.618
14.768
7.253
5.621
65.630
72.883
78.504
1.625 14.768 65.630 1.834
Explained 6
7
8
.593
.537
.451
5.395
4.882
4.099
83.898
88.780
92.879
9 .336 3.052 95.931
10 .312 2.835 98.766
11 .136 1.234 100.000
Extraction Method: Principal Component Analys is .
a. When components are correlated, s ums of s quared loadings cannot be added to obtain a total variance.
Total Variance Explained
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Component Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 3.783 34.387 34.387 3.783 34.387 34.387 2.733 24.847 24.847
2 1.812 16.474 50.862 1.812 16.474 50.862 2.652 24.110 48.956
3 1.625 14.768 65.630 1.625 14.768 65.630 1.834 16.674 65.630
4 .798 7.253 72.883
5 .618 5.621 78.504
6 .593 5.395 83.898
7 .537 4.882 88.780
8 .451 4.099 92.879
9 .336 3.052 95.931
10 .312 2.835 98.766
11 .136 1.234 100.000
Extraction Method: Principal Component Analys is .
1. Which table shows the oblique rotation and which shows the orthogonal rotation?
2. What is the sum of eigenvalues for three components prior to rotation?
3. What is the sum of eigenvalues after rotation for the top table?
4. What is the sum of eigenvalues after rotation for the bottom table?
5. Why are the two sums different?
4.1 Orthogonal (Varimax) Rotation
Rotated Component Matrixa
Component
1 2 3
Q1) During the task I often s et a specific goal for myself .807 -.137 -.245
Q2) I always focused my attention on the text-editing task .628 -.436 -.093
Q3) I often worried about not being able to reach my bes t pos s ible s core .756 -.073 .238
Q4) I often forgot about trying to reach my bes t pos s ible score -.577 .419 -.202
Q5) I wondered about how my performance compared with others .600 .217 .216
Q6) I cons tantly focus ed on trying to do my best .560 -.641 -.091
Q7) I often daydreamed while doing the tas k -.125 .905 -.018
Q8) I often let my mind wander while doing the tas k .030 .917 -.104
Q9) How difficult was this task? (High=Extremely Difficult) -.140 .185 .772
Q10) How much of your attention was required to perform this tas k? (High = A
.057 -.333 .712
great deal of attention)
Q11) How pres s ured did you feel while performing this tas k? (High=Extremely
.265 -.034 .707
Press ured)
Extraction Method: Principal Component Analys is.
Rotation Method: Varimax with Kaiser Normalization.
a. Rotation converged in 6 iterations.
Component
1 2 3
Q1) During the task I often s et a specific goal for myself .839 .048 -.271
Q2) I always focused my attention on the text-editing tas k .587 -.314 -.116
Q3) I often worried about not being able to reach my bes t poss ible score .789 .093 .214
Q4) I often forgot about trying to reach my bes t poss ible score -.531 .314 -.181
Q5) I wondered about how my performance compared with others .678 .364 .199
Q6) I cons tantly focus ed on trying to do my best .475 -.546 -.113
Q7) I often daydreamed while doing the tas k .042 .929 -.007
Q8) I often let my mind wander while doing the tas k .212 .980 -.099
Q9) How difficult was this task? (High=Extremely Difficult) -.126 .146 .778
Q10) How much of your attention was required to perform this task? (High =
-.014 -.355 .708
A great deal of attention)
Q11) How pres s ured did you feel while performing this tas k?
.265 .009 .699
(High=Extremely Pres s ured)
1. What is
Extraction 2
.839Principal
Method: 2
+ .048 + .271 ?
Component 2Analys is .
Rotation Method: Promax with Kaiser Normalization. Component Correlation Matrix
2. Why is it not
a. Rotation the same as the communality?
converged in 6 iterations .
3. For each of the three components say which variables Component 1 2 3
1 1.000 -.385 .039
load highly and what a person who was high on that 2 -.385 1.000 .004
component would look like? 3 .039 .004 1.000
4. Give a clever one or two word name to each component Extraction Method: Principal Component Analysis .
Rotation Method: Promax with Kaiser Normalization.
that reflects what high scores on the variable represent
5. Using the names you created state which two
components correlated?
6. Does this correlation make sense based on your
understanding of the components?
Example 4.2
• Research Question
• Now that we have a measure of self-reported Task
Focus, we can use it as a dependent variable to think
about what might predict it. The variable was created
by getting the average score on the items that load on
it.
• Is there an effect of gender and/or age group as
defined by being either over or under 25 years of age
on Task Focus? Do these two variables interact?
1. Describe the three variables in terms of being binary, nominal, ordinal, interval
or ratio?
2. Which statistical procedure could we use?
3. What would be the null hypotheses?
4. What is your expectation?
UNIANOVA
qt_fac1 BY agegroup gender
/METHOD = SSTYPE(3)
/INTERCEPT = INCLUDE
/PLOT = PROFILE( agegroup*gender )
/PRINT = DESCRIPTIVE HOMOGENEITY
4.2 Descriptive Statistics
/CRITERIA = ALPHA(.05)
/DESIGN = agegroup gender agegroup*gender .
Descriptive Statistics
1. Looking at the graph, which gender was more task focused? Which age group
was more task focused? And does there appear to be an interaction?
2. What is the marginal mean for males? What is the marginal mean for females?
What is the difference between the two?
3. What is the approximate difference between the marginal means for males
and females in terms of one of the standard deviations?
4. Does the difference between the groups seem large?
5. What might be deceptive about the default graph from SPSS?
• Note that I filtered the cases to give equal group sizes. This would not typically be done
in real research but is purely done to make it clearer as a learning tool for you.
4.2 Homogeneity of Variance
Descriptive Statistics
1. Write out the F statistics, degrees of freedom and p value for the three effects.
2. What’s the answer to our research questions?
3. Using your formula sheet to assist, what three values make up SS between?
4. What is SS between?
5. What is SS within?
6. What is SS between + SS within (i.e., SS Total)?
7. Which source does SS Total correspond to in the table?
8. Without looking back at earlier slides, what was the total sample size?
9. What two values were used to form the F ratio for Gender?
10.What is the critical F value from Howell’s Table for the hypotheses tested (α=.05)
11.If all you could see was the F values, and not the significance values, how would you know that all the
effects were not statistically significant?
12.What is the value for MS error?
13.What is the square root of MS error? How does this value compare to the standard deviations shown
on the previous slide?
Example 4.3
• Research Question
– Do the three abilities (z-scores) and prior learning as
defined by typing speed (words per minutes) and
knowledge of text editing short cut keys (% correct)
predict task performance (seconds) at the end of
practice over and above initial performance
(seconds)?
1. Describe the three variables in terms of being binary, nominal, ordinal, interval
or ratio?
2. Which statistical procedure could we use?
3. What would be the null hypotheses?
4. What is your expectation?
4.3 Descriptive Statistics
Descriptive Statistics
• Note that the 54 trials were grouped into blocks made up of six trials each. Thus, initial
performance is defined by block 1 (i.e., trials 1 to 6) and block 9 is made up of trials 49 to 54)
• Also remember that faster times imply better performance
4.3 Correlations & Multicollinearity
Correlations
Pears on Correlation
TEP: Text TEP: Text
Editing Editing Zscore: Typing
Performance Performance Zscore: Zscore: PMA: QBK: Total Tes t:
(Block 9) (Block 1) GA: PSA: Psycho Knowledge Speed
(Average (Average General Perceptual motor of Text (Words
Seconds to Seconds to Ability Speed Ability Editing Keys Per
Complete) Complete) Total Ability Total Total (% correct) Minute)
TEP: Text Editing Performance
(Block 9) (Average Seconds to 1.000 .725 -.312 -.403 -.571 -.389 -.492
Complete)
TEP: Text Editing Performance
(Block 1) (Average Seconds to .725 1.000 -.318 -.464 -.558 -.473 -.509
Complete)
Zscore: GA: General Ability Total -.312 -.318 1.000 .347 .216 .077 .207
Zscore: PSA: Perceptual Speed
-.403 -.464 .347 1.000 .425 .054 .358
Ability Total
Zscore: PMA: Ps ychomotor Ability
-.571 -.558 .216 .425 1.000 .138 .297
Total
QBK: Total Knowledge of Text
-.389 -.473 .077 .054 .138 1.000 .394
Editing Keys (% correct)
Typing Test: Speed (Words Per
-.492 -.509 .207 .358 .297 .394 1.000
Minute)
1. Which variable has the largest correlation with block 9 text editing performance?
2. Do all the variables have moderate or large correlations with the dependent variable?
3. Give an interpretation to the correlation between typing speed and block 9 text editing
performance
4. Are there any very large correlations between predictors (e.g., above .7)?
4.3 Model Assessment
Model Summaryc
Model
Equation B
Coefficients
Std. Error
Coefficients
Beta t Sig.
95% Confidence Interval for B
Lower Bound Upper Bound Zero-
1 (Cons tant) -.360 3.440 -.105 .917 -7.188 6.468
TEP: Text Editing Performance (Block 1)
.651 .063 .725 10.360 .000 .527 .776
(Average Seconds to Complete)
2 (Cons tant) 19.819 6.895 2.875 .005 6.126 33.512
TEP: Text Editing Performance (Block 1)
.403 .091 .449 4.421 .000 .222 .584
(Average Seconds to Complete)
Zscore: GA: General Ability Total -1.040 .962 -.078 -1.082 .282 -2.950 .870
Zscore: PSA: Perceptual Speed Ability
-.114 1.076 -.009 -.106 .916 -2.251 2.024
Total
Zscore: PMA: Ps ychomotor Ability Total -3.474 1.162 -.248 -2.990 .004 -5.782 -1.167
QBK: Total Knowledge of Text Editing Keys
-4.086 4.057 -.081 -1.007 .316 -12.143 3.970
(% correct)
Typing Tes t: Speed (Words Per Minute) -.158 .092 -.139 -1.720 .089 -.340 .024
a. Dependent Variable: TEP: Text Editing Performance (Block 9) (Average Seconds to Complete)
1. Write out the unstandardised regression equation for the final model
2. Which regression coefficients have lower and upper bound 95% confidence intervals that are
either both positive or both negative?
3. Which regression coefficients are statistically significant?
4. What is the degrees of freedom associated with the Block 1 text editing performance regression
coefficient in model 1? What about model 2?
5. What is the unstandardised regression coefficient for General Ability? What is the standard
error? What is the regression coefficient divided by the standard error?
6. Look up Howell’s table and see what critical t is associated with 92 degrees of freedom alpha =
.05 (two tailed)?
7. What is 1 - .05 in percentage terms?
8. What is the critical t (.05 two-tailed) multiplied by the standard error for General ability?
9. What is -1.04 plus the value you obtained in Q8? What is -1.04 minus the value in Q8?
4.3 Correlations & Multicollinearity
Coefficientsa
1.158
2.784
3.974
.000
2.210
1.000
.584
99
99
Predicted Value
Adjus ted Predicted Value 8.2281 64.8080 33.9752 10.18449 99
Res idual -22.08496 30.63221 .00000 8.32525 99
Std. Res idual -2.570 3.565 .000 .969 99
Stud. Res idual -2.661 3.630 .002 1.005 99
Deleted Res idual -23.66952 31.75273 .03602 8.96232 99
Stud. Deleted Residual -2.755 3.900 .005 1.026 99
Mahal. Dis tance .791 19.968 5.939 3.713 99
Cook's Dis tance .000 .145 .011 .023 99
Centered Leverage Value .008 .204 .061 .038 99
a. Dependent Variable: TEP: Text Editing Performance (Block 9) (Average Seconds to
Complete)
1. What is the rule of thumb suggested by Paul for studentised residuals in large samples? Are any cases
outside that rule of thumb?
2. What is the rule of thumb suggested by Paul for leverage in large samples? Are any cases outside this
rule of thumb?
3. What is the rule of thumb suggested by Paul for identifying influential cases? Are any cases outside the
rule of thumb?
4. If cases are identified as falling outside the rule of thumb, what is the next step?
5. What is the mean of the predicted value? Going back to descriptive statistics, what was the mean of
the DV?
6. If leverage is defined by extreme scores on predicted Y, based on the table above, what are the two
possible predicted values that correspond to the case with the largest leverage value?
Casewise Diagnosticsa
TEP: Text
1. Going back to the casewise diagnostics what is the mean and standard deviation of the
predicted value?
2. What would be the unstandardised predicted time at minus 1 standardised predicted
values? What about plus 1 standardised predicted values
3. Is there a difference in the spread of residuals at one versus minus one standardised
predicted values? What might this mean?
4. Is there any evidence of non-linearity?
4.3 Linearity
1 2 3 4 5
Uns tandardized Predicted Value 50.81 39.20 19.81 32.49 39.00
Uns tandardized Res idual 8.21 .22 -4.34 -7.17 -.50
Studentized Res idual .99 .03 -.53 -.88 -.06
Cook's Distance .011 .000 .004 .012 .000
Centered Leverage Value .06 .08 .09 .09 .02
TEP: Text Editing Performance (Block 9) (Average Seconds to Complete) 59.03 39.42 15.47 25.32 38.50
TEP: Text Editing Performance (Block 1) (Average Seconds to Complete) 76.91 57.25 30.00 46.69 60.54
Typing Test: Speed (Words Per Minute) 19.3 24.7 51.0 47.6 21.0
QBK: Prior Text Editing Knowledge Overall (% correct) .57 .57 1.00 .29 .43
Zscore: GA: General Ability Total -1.47 1.52 -.13 1.05 .72
Zscore: PSA: Perceptual Speed Ability Total -.66 1.14 -.76 .58 -.20
Zscore: PMA: Ps ychomotor Ability Total -1.08 -1.22 .06 -1.06 -.16
a. Limited to first 5 cases .
• Remember that the model regression equation looked like this:
• [Predicted Block 9] = 19.8 + .40Block1 – 1.0GA - .1PSA – 3.5PMA – 4.1QBK -.16WPM
1. Confirm for a couple of cases that when you apply the model regression equation to
the case’s value that it corresponds to the unstandardised predicted value?
2. Describe case 1 in words, you may wish to make reference to the means and standard
deviations in the descriptive statistics
3. Case 1 and 4 have very different profiles of scores. Compare case 4 to 1? Why do they
have similar leverage values, which are clearly larger than cases 2, 3 and 5?
Example 4.4
• Initial Note
• Using a method that you wont learn about until 4th year
psychology called cluster analysis, 3 groups of participants
were created based on their ability scores.
• 3 Ability Groups
– Lower Ability Group
– Higher Ability Group
– Lower General Ability / Higher Psychomotor Ability Group
• Research Question
• Do the three groups differ in terms of their overall performance on
the text editing task?
• Overall did performance improve from block 1 (i.e., start of the task)
to block 9 (i.e., end of the task) on the text editing task?
• Did any of the three groups improve more or less between block 1
and block 9?
1. Describe the three variables in terms of being binary, nominal, ordinal, interval
or ratio?
2. Which statistical procedure could we use?
3. What would be the null hypotheses?
4. What is your expectation?
• Note that to simplify the example, I filtered
some cases to give equal group sizes.
4.4 Descriptives
Within-Subjects Factors