Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
The t statistic
Pooling Variances
The previous equation is appropriate when sample size are equal, it can be improved for unequal sample sizes. This equation will provide a better estimate of the population variances. One of the assumption for the t test is that the variances are equal (homogeneity of variance)
5
Pooling Variances
If we want a better estimate of , namely and , it seems appropriate to attain an average of these two values. But a simple average is not suitable because it gives equal weight to both values. (not suitable because sample size not the same)
4/30/2013
Example 1
Kalau kita bahagikan dua kumpulan kepada 2 jenis diet yang berbeza:
diet nasi lemak diet teh tarik
Example 1 (cont.)
Pada akhir minggu, kita mengukur perubahan berat badan. Diet yang mana menyebabkan peningkatan berat badan yang lebih? Maka, hipotesis nol ialah: Ho: wt. gain diet nasi lemak =wt. gain diet teh tarik
Subjek dimasukkan secara rawak dalam kump diet nasi lemak dan kump teh tarik untuk satu minggu. Ini mungkin tidak beretika kerana nasi lemak mestilah makan bersama teh tarik! Tetapi ini hanyalah contoh.
Example 1 (cont.)
Why? The null hypothesis is the opposite of what we hope to find. In this case, our research hypothesis is that there ARE differences between the 2 diets. Therefore, our null hypothesis is that there are NO differences between these 2 diets.
11
4/30/2013
Formula
The formula for the independent samples t-test is:
Example 1 (cont.)
The first step in calculating the independent samples t-test is to calculate the variance and mean in each condition. In the previous example, there are a total of 10 people, with 5 in each condition. Since there are different people in each condition, these samples are independent of one another; giving rise to the name of the test.
14
, df = (n1-1) + (n2-1)
13
Example 1 (cont.)
The variances and means are calculated separately for each condition (nasi lemak and teh tarik). In short, we take each observed weight gain for the nasi lemak condition, subtract it from the mean gain of the nasi lemak dieters and square the result.
15
X1 : nasi lemak 1 2 2 2 3
X2 : teh tarik 3 4 4 4 5
( 1 1 ) 2 ( 2 2 ) 2
1 0 0 0 1
2
1 0 0 0 1
1 =
2 =
2 sx =
( )
n 1
0.5
0.5
16
Formula
The formula for the independent samples t-test is:
, df = (n1-1) + (n2-1)
After calculating the t value, we need to know if it is large enough to reject the null hypothesis.
17 18
4/30/2013
Some theory
The t is calculated under the assumption, called the null hypothesis, that there are no differences between the nasi lemak and teh tarik diet. If this were true, when we repeatedly sample 10 people from the population and put them in our 2 diets, most often we would calculate a t of 0.
19
4/30/2013
Example 1 (cont.)
The calculated t-value of 4.47 is larger in magnitude than the C.V. of 2.31, therefore we can reject the null hypothesis. Even for a results section of journal article, this language is a bit too formal and general. It is more important to state the research result, namely:
Participants on the teh tarik diet (M=4.000.5) gained significantly more weight than those on the nasi lemak diet (M=2.000.5), t(8) = -4.47, p < 0.05, 95%CI= -0.968, -3.031.
25 26
Example 2
IQ score after training is given to a special class (smart students) and normal class students.
Special Class 24.0 148.87 35 Normal Class 16.5 139.16 29
Participants on the teh tarik diet (M = 4.00) gained significantly more weight than those on the nasi lemak diet (M = 2.00), t(8) = 4.47, p < 0.05. Making this conclusion requires inspection of the t tables.
mean Var n
27
28
4/30/2013
Empathy Scores
Person
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Psychology
10 12 13 10 8 15 13 14 10 12 10 12 13 10 8
Physics
8 14 12 8 12 9 10 11 12 13 8 14 12 8 12
Output SPSS
34
Check your answers Now Lets use SPSS to run our Analysis
35
36
4/30/2013
Participants on the the tarik diet (M = 4.00) gained significantly more weight than those on the nasi lemak diet (M = 2.00), t(8) = 4.47, p < 0.01 (two-tailed).
In APA style we normally only display significance to 2 significant digits. Therefore, the probability is displayed as p<0.01, which is the smallest probability within this range of accuracy.
38
END.
41
42
4/30/2013
Introduction
So what if we have two related data set? Pre and post test data? Level of love felt among husband and wife? Repeated measures Matched/related samples
Twins, husband-wife, father-son, motherdaughter, mother-son Two scores for one case.
43
44
45
46
Contoh 1
Suatu kajian terapi untuk masalah anorexia telah dijalankan. Sampel kajian adalah 17 budak perempuan. Berat badan telah dicatatkan sebelum dan selepas menjalani terapi tersebut. Data adalah seperti berikut:
Before Mean S 83.23 5.02 After 90.49 8.48 Diff Score 7.26 7.16
47
4/30/2013
Hipotesis
= 0.05
Tetapkan alpha
49
50
Buat Pengiraan
51
52
Latihan 1
Subject A B C D Mean S
53 54
Before 10 15 12 11
After 14 13 15 12
Diff score