Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
research
Major Categories of Data
2 Major Categories
1. Primary
2. Secondary Data
Major Categories of Data
Questionnaire survey
It is distributed—or made accessible if online—to a
predetermined selection of individuals.
Individuals complete and return the questionnaire or
submit online.
Face-to-face interview
Interviewer asks questions, usually following a guide
or protocol.
Interviewer records answers.
Sources of Primary data
Telephone interview
Interviewer asks questions, usually following a guide or protocol.
Interviewer records responses.
Group techniques (interview, facilitated workshop, focus group)
This involves group discussion of predetermined issue or topic in
person or through teleconferencing.
Group members share certain common characteristics.
Facilitator or moderator leads the group.
Assistant moderator usually records responses.
Sources of Secondary Data
Document review
Researchers review documents, and identify relevant information.
They keep track of the information retrieved from documents.
Continuous Discrete
Examples: Examples:
• The height of a horse (could be any • The number of people in your class (no
value within the range of horse heights). fractional parts of a person).
• Time to complete a task (which could be • The number of TV sets in a home (no
measured to fractions of seconds). fractional parts of a TV set).
• The outdoor temperature at noon (any • The number of puppies in a liter (no
value within possible temperatures fractional puppies).
ranges.) • The number of questions on a math test
• The speed of a car on Route 3 (no incomplete questions).
(assuming legal speed limits).
NOTE: Continuous data usually requires a NOTE: Discrete data is counted. In whole
measuring device. (Ruler, stop watch, numbers. The description of the task is
thermometer, speedometer, etc.). It usually preceded by the words "number
Types of Measurement Scales
Age
Educational Attainment
Occupation
Religion
Examples of nominal scale
1. Mean
the mathematical average
the most popular and well known measure of central
tendency
It can be used with both discrete and continuous data,
although its use is most often with continuous data
Measures of Central Tendency:
the MEAN
Measures of Central Tendency:
the MEAN
2. Median
is the middle score for a set of data that has
been arranged in order of magnitude.
Measures of Central Tendency:
the MEDIAN
We first need to rearrange that data into order of magnitude (smallest first):
14 35 45 55 55 56 56 65 87 89 92
Our median mark is the middle mark - in this case, 56 (highlighted in bold). It is
the middle mark because there are 5 scores before it and 5 scores after it. This
works fine when you have an odd number of scores,
Measures of Central Tendency:
the MEDIAN
But what happens when you have an even number of scores? What if you had only 10
scores? Well, you simply have to take the middle two scores and average the result. So, if
we look at the example below:
65 55 89 56 35 14 56 55 87 45
We again rearrange that data into order of magnitude (smallest first):
14 35 45 55 55 56 56 65 87 89
Only now we have to take the 5th and 6th score in our data set and average them to
get a median of 55.5.
Measures of Central Tendency:
the MODE
3. Mode
The mode is the most frequent score in our data set.
Therefore, sometimes consider the mode as being the most popular
option.
It is used for categorical data where we wish to know which is the
most common category.
Measures of Central Tendency:
the MODE
1. chi-square test
2. T-test
3. Z-test
4. ANOVA
The Chi-Square Test
Here,
O = Observed frequency
E = Expected frequency
∑ = Summation
X 2 = Chi Square value
The Paired T-test
Where,
d bar is the mean difference between two samples,
s² is the sample variance,
n is the sample size and
t is a paired sample t-test with n-1 degrees of
freedom
The Paired T-test
Where,
F = Anova Coefficient
MST = Mean sum of squares due to treatment
MSE = Mean sum of squares due to error.
The ANOVA
Where, Where,
SST = Sum of squares due to treatment SSE = Sum of squares due to error
p = Total number of populations S = Standard deviation of the
n = Total number of samples in a population. samples
N = Total number of observations.