Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
3
T-Test of two independent samples
Situation
The Dean of the College of Arts and Sciences of the university was amazed to
find out that there were more female students majoring in mathematics. Their
choice of majors was based on an aptitude test. Because of this observation, he
wanted to find out if gender has an influence on the achievement of the
mathematics among the math major students. He got hold of the test results in basic
math and compared the mean scores between male and female students. Test the
hypothesis of no significant difference between the male and female score
(achievement) in mathematics at =0.05 using the data below.
Male Female
n 45 75
Mean 67.4 71.5
SD 12.3 13.8
1. Problem statement
Ha: The female students have significantly higher achievement in basic math
than male students. (Ha: μ2 > μ1, directional or one tailed hypothesis)
X1 – X2
T-Value =
(SD1)² + (SD2)²
n1 n2
Where:
X1 = mean of first group X2 = mean of the second group
D1 = SD of first group SD2 = SD of second group
n1 = number of cases of first group n2 = number of cases of second group
T-value = -1.69
P-value = 0.095
There is no significant difference in the mean scores between male and female
students in mathematics. The performance of both gender groups in the
mathematics test is not comparable (meaning the mean scores of the two
groups are statistically equal).
8. Implications
9. Conclusion
Classroom Exercise
Two photocopying machines were observed in a span of five years to find out
which brand was better. An investigation on the number of repair times was
recorded and the data are shown below.
A 60 84.2 19.4
B 60 91.6 18.8
Test at the 0.05 level of significance whether the difference between these
two sample means repair time is significant.
Learning Module No. 4
T–test of two correlated samples or
T-test for paired/matched samples
Situation
A certain Music Teacher of City Central School wanted to find out if her new method
of teaching music was effective. She administered a test in Music in June and administered
the same test at the end of the first grading period. The new method called Kudali
approach is teaching music using indigenous musical instruments made from bamboos or
wood. Music concepts were introduced using indigenous concepts of music from our
ancestors or from several ethnic groups. Fifteen pupils were selected at random. Test the
hypothesis of no significant improvement in the pupils' scores from the pre-test to the
post-test using the following results:
1. Problem statement
Is there a significant improvement of the students’ music scores from the pre-test to
the post-test?
2. Hypotheses
Ho: There is no significant improvement in the students’ scores from the pre-test to
the post-test.
Ha: There is a significant improvement in the students’ scores from the pre-test to
the post-test.
Xd where:
T-Value = Xd = mean of the difference
SDd SDd = SD of the difference
N N = number of cases
T-value = 5.70
P-value = 0.00
There was a significant improvement of the students’ scores in music from the pre-
test to the posttest. Generally, the students’ scores increased in the test administered
in August.
8. Implications
Students enjoyed the use of indigenous musical instruments. The concepts were new
so they were excited to be exposed to the music of our ancestors and the different
ethnic groups.
9. Conclusion
The Kudali approach was an effective approach in the teaching of music among the
Grade Six pupils.
Classroom Exercise
Find out the effect of daily exercise in the reducing program of 20 ladies using their
weights before and after the program at = 0.05.
The chi-square goodness-of-fit test is always a right-tailed test. Whether or not the
Ho is rejected depends on how much the observed (Fo) and expected (Fe) frequencies
significantly differ from each other. It should be noted that to make a goodness-of-fit test,
the sample size should be large enough so that the expected frequency for each category
is at least five. When there is a category with Fe less than five, either increase the sample
size or combine two or more categories to make each Fe at least five. For more than the
2x2 contingency table, the chi-square test should not be used if more than 20 percent of
the expected frequencies are less than five. The chi-square test statistic can only be
applied to either nominal or ordinal (categorical) data.
There are two situations of expected frequencies (Fe) where you are going to use
and apply the chi-square test statistic, the first is with equal expected frequency and the
second one is with unequal expected frequency. For the latter, you compare the observed
and the predicted frequencies where equal frequency cannot be expected. Here, you test
the null hypothesis that the observed frequencies for an experiment/research follow a
certain pattern or theoretical distribution.
A random sample of 120 high school teachers was drawn from a Cebuano
population. The sample teachers were asked to rate the job satisfaction items using the
Likert scale. The self-perceptions made by the subjects were categorized into three
satisfaction levels. The resulting data of teachers’ degree of job satisfaction is presented
below.
Very Satisfied 42 40 2
Satisfied 75 40 35
Less Satisfied 3 40 37
Total 120
Test the null hypothesis that the relative frequency of teachers who are very
satisfied is the same as those who are satisfied or less satisfied. What do the results
indicate?
1. Problem statement
Does the sampling distribution differ significantly from the hypothesized population
distribution (equal expected frequency)?
Ho: There is no significant difference between the observed and expected frequency
of the teachers’ job satisfaction groups?
(Ho: fo = fe or Ho: fo – fe = 0)
Ho: The sampling distribution did not differ significantly from the hypothesized
population distribution. (Ho: fo = fe or Ho: fo – fe = 0)
Ha: There is a significant difference between the observed frequency and expected
frequency of the teachers’ job satisfaction groups. (Ha: fo ≠ fe)
Ha: The sampling distribution significantly differs from the hypothesized population
distribution. (Ho: fo ≠ fe or Ho: fo – fe ≠ 0)
3. Choice of Test Statistic:
Chi-square test of equal expected frequency, = 0.05.
X2-value = 64.95
P-value = 0.00
There is a significant difference between the observed and the expected frequency.
The sampling distribution differs significantly from the hypothesized equal
distribution. There were more satisfied teachers and very satisfied teachers than
expected.
8. Implications Ranges for the Weighted Mean
For three levels
2.34 – 3.00 – Very Satisfied (High)
Teachers being satisfied on their job could be 1.67 – 2.33 – Satisfied (Average)
attributed to the school environment and 1.00 – 1.66 – Less Satisfied (Low)
the collegial peer relationship. The
administration may be supportive of their For five levels
professional needs and provide assistance in 4.21 – 5.00 – Very High
3.41 – 4.20 – High
solving their problems.
2.61 – 3.40 – Average
1.81 – 2.60 – Low
9. Conclusion
On the average, the teachers’ ratings were on the satisfied level. In other words, based
on the findings considering all of teachers’ rating, they were satisfied with their job.
∑fw
WM =
n
WM = 2.33
Classroom Exercise
Eighty-two High School Principals in Cebu Province were asked about their opinions
on the desirability of integrating Sex Education in the HS curriculum. Thirty-one of the
sample School Administrators agree, 22 tend to agree, 18 tend to disagree while 11
disagree. Is there a significant difference in the opinion of HS Principals regarding the
integration of Sex Education in the HS curriculum? Test at 0.05 level of significance.
5.1 Situation 2: Unequal Expected Frequency
On divorce issue in the Philippines, it can never be expected that there will be equal
number of those who agree and disagree. Since the Philippines is predominantly Christian,
it is expected that there will be more women who will disagree than those who will agree.
Thus an expected 30–70 ratio is hypothesized in favor for the no response.
Opinion n
Yes 42
No 108
1. Problem
Is there a significant difference between the observed and the expected frequency
(30–70 distribution) among the mothers’ opinions on legalizing divorce in the
Philippines?
Ho: There is no significant difference between the observed and the expected
frequency (30–70 distribution) among the mothers’ opinions legalizing divorce in
the Philippines.
Ho: There is no significant difference between women who disagree and agree.
Ha: There is a significant difference between the observed and the expected
frequency (30–70 distribution) among the mothers’ opinions legalizing divorce in
the Philippines.
OR (Take note! Only one Ha is required.)
Ha: There are significantly more women who disagree than those who agree.
X2-value = 0.286
P-value = 0.593
There was no significant difference between the mothers’ opinion on legalizing divorce
in the Philippines and the theorized 30 – 70 distribution. This means that the sample
population come up with the expected 30 – 70 distribution in favor of the “no” group.
As expected, there were few mothers who agreed to legalize divorce in the country.
8. Implications
Most of the mothers’ negative opinions may be attributed to religious beliefs that only
God can separate what He has joined in marriage.
9. Conclusion
Classroom Exercise
A mixing nut machine was designed to mix peanuts, hazelnuts, cashews and pecans
following a 50:20:20:10 grams ratio. Twenty-five randomly selected 500 grams mixed-nut
cans revealed, on the average, 269 grams of peanuts, 112 grams of hazelnuts, 74 grams of
cashews, and 45 grams of pecans. Test at 5 percent level of significance whether the
machine is fitted to produce the designed ratio based on the given sample data.
Learning Module No. 6
Chi-square test of independence
Relationship between two variables with nominal data is determined using the chi-
square test of independence. Significant chi-square value means a presence of
relationship. The degree of relationship could be computed using the coefficient of
contingency. Here one tests the Ho that the two attributes (characteristics) of the elements
of a given population are not related (independent) against the Ha that two characteristics
are related. (Mann, 2004; Doane and Seward, 2009)
Forty-four high school students were categorized as drinkers and non-drinkers (See
data below). It was hypothesized that the proportions of drinkers among college bound
and non-college bound high school students are the same as the proportions of non-
drinkers who are college bound and not college bound. Using the data, find out if status of
drinking and college disposition are related at 0.01 level of significance.
1. Problem statement
2. Hypotheses
N(AD – BC) ²
2
X -value =
(A+B) (C+D) (A+C) (B+D)
X2-value = 4.227
P-value = 0.040
There is no significant relationship between the students’ status of drinking and their
college disposition. College disposition does not relate on their status of drinking. The
results indicate that the proportion of high school alcohol drinkers who will study in
college is not significantly different from those who will not study in college. This also
indicates that the proportions in the present samples are not significantly different
from the proportions in the population from where these subjects drawn from.
8. Implications
9. Conclusion
Classroom Exercise
Two drugs were administered to two groups of randomly assigned patients to cure
the same disease. One hundred twenty patients were involved in the research with the
data given below revealing the number of patients who were cured and not cured by each
of the two drugs. Test at 0.01 level of significance whether the two drugs are similar (or
otherwise) in curing and not curing the patients.
1. Problem statement
2. Hypotheses
8. Implications
Classroom exercises
Mr. Juan Dela Cruz is a statistics Professor who teaches graduate statistics courses
to English and Social Sciences masters students. He has observed that many of his students
do not like the rigors of mathematical computing during class sessions especially when
they have to use calculators instead of computers. He fielded a questionnaire to examine
if students’ attitude has bearing on their midterm and final examination results in his
statistics subject. With the data he gathered as shown in the table below, test whether
attitude relates to statistics examination performance at 0.05 level of significance.
Test Performance
Attitude AA A BA
Positive 27 32 8
Negative 18 25 36
Legend: AA – above average
A – average
BA – below average
Situation
It should be noted that the situation is given only for the sake of illustrating how to
compute and analyze a statistical problem which calls for Pearson r statistic. The 20-sample
observations are small that in actual research, this should be increased to at least 30
observations.
1. Problem statement
Does relationship exist between the work environment and efficiency ratings of the
clerks in CNU?
2. Hypotheses
∑ xy
r-value =
(SDx )(SDy)(N)
r-value = -0.007
P-value = 0.976
There is no significant relationship between the work environment and the efficiency
ratings of the clerks in CNU. Clerks with better work environment may or may not be
efficient workers.
8. Implications
Efficiency of work is a result of one’s capability and commitment to job, so even with
a marginalized work environment still a person can work efficiently if he/she has the
capabilities and the desire to work better.
9. Conclusion
Classroom Exercise
Shown on the table below are the average daily consumption of saturated fat (in
grams) and the cholesterol level (in milligrams per hundred millimeters) of 10 randomly
chosen sample of patients at the Cebu City Medical Center. Is there a significant correlation
between fat consumption and cholesterol level? Test at 0.05 level of significance.
Fat consumption 65 78 60 44 53 68 87 46 55 73
Cholesterol level 190 225 205 175 180 214 245 160 210 215
You should take note that even if these variables (fat consumption and cholesterol
level) may be highly correlated, it is not a sufficient proof of causation. To be able to
establish causation, all predicting variables for cholesterol level must be considered like
health, age, nutritional status and stress level, among others. To conclude causation, the
causal variables must precede the variable it causes, and several conditions must be met
(e.g., reversibility, strength, exposure response, etc.).
The F-Distribution
(Doane and Seward, 2009; Lind, Marchal and Wathen, 2006)
The F-Distribution is named in honor of Sir Ronald A. Fisher (1890-1962), one of the
founders of modern day statistics. This probability distribution is used to test whether two
(or more) samples are from the same populations having equal variances. It is also applied
when you want to compare several population means simultaneously. This simultaneous
comparison of several population means is called analysis of variance (ANOVA).
When it is determined that differences exist among treatment means, post hoc
range tests and pairwise multiple comparisons are further employed to uncover which
among the treatment means differ from each other. Range tests ascertain homogeneous
subsets of means that are not different from each other. Using the Minitab to run the
ANOVA and opting to use the Tukey’s Honestly Significant Difference (HSD), the pairwise
multiple comparisons test the difference between each pair of means and yield a grouping
where letters denote significantly different group means at a given alpha level (i.e. α =
0.05).
Situation
Four groups of Cebu City Medical Center (CCMC) patients (3 person in each group)
were treated with different medications for the same ailments. The data below show the
number of average days it took the patients to recover from the start of ailment. Is there
enough evidence to conclude that there was significant difference (α = 0.01) in the number
of recovery days (hence significant difference in the effect of medications)?
1. Problem Statement
Is there a significant difference in the mean number of recovery days among patient
groups?
OR (Take note! You are only one required to state one problem.)
2. Hypotheses
Ho: There is no significant difference in the mean number of recovery days among
patient groups.
Ha: There is significant difference in the mean number of recovery days among patient
groups.
F-value = 11.46
P-value = 0.003
The resulting ANOVA table with computed values is shown below.
There is significant difference in the mean number of days of patient group recovery.
There is significant difference between types of medications as to their effects on
patients’ recovery.
Based on the post-hoc analysis employing the Tukey statistic, treatment means having
the same grouping letters do not significantly differ from each other.
8. Implications
Although the treatments are of the same type of medicines which is paracetamol, their
effects in terms of number of recovery days are different for the same ailment of
patients. It could be that some patient groups are more receptive to paracetamol
treatment than the other patient groups.
9. Conclusion
There is enough evidence to conclude that there were differences in the number of
recovery days as affected by the treatment medications. The type of paracetamol has
effect on the number of recovery days of patients. Biogesic is the most effective
paracetamol with regard to its effect on the patients’ recovery from the ailments.
Classroom Exercise
A course on inferential statistics was taught in the CNU graduate school to three
different classes of students by three different professors. At the end of the semester, the
students were given the same departmental examination of 100 pts. The scores of
randomly selected students are given below. Is there any significant difference in the
scores between the three groups of students?