Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Numerical Data
Properties & Measures
Numerical Data
Properties
Central
Tendency
Mean
Variation
Shape
Median
Range
Variance
Mode
Standard Deviation
Skew
Mean
Sum of scores divided by the number of
people. Population mean is (mu)
and sample mean is X (X-bar).
We calculate the sample mean by:
Arit
Geo
X
N
X n
X n
FX
Ungrouped Data
Number of a family
children in Sleman
No of Child
Frequency
20
15
33
33
31
17
20
46
39
29
38
34
37
Grouped Data
Example
The heights (in cm) of a group of
students are summarized below. Draw a
histogram and polygon to illustrate
these data
Mean
i 1
X
n
10 11
-1
-1
0
0
0
1
1
Deviation scores
-2
Median
Score that separates top 50% from bottom 50%
Ungrouped Data
Even number of scores, median is half way between two
middle scores.
Letak Med1= n /2
Letak Med2 = (n+2)/2
Med = (Med1+Med2)/2
1 4 6 8 9 10 17 18 Median is (8+9)/2 = 8.5
Median
1. Measure of Central Tendency
2. Middle Value In Ordered Sequence
If Odd n, Middle Value of Sequence
If Even n, Average of 2 Middle Values
Positioning Point
n 1
2
Median Example
Odd-Sized Sample
21.5
22.6
22.6
23.7
24.1
Position:
Positioning
Median = 22.6
Point
n +1
2
5 +1
2
Median Example
Even-Sized Sample
4.9
1
Positioning
Median
6.3
2
Point
7.7 + 8.9
2
7.7
3
n +1
2
8.3
8.9
4
10.3
5
6 +1
2
11.7
6
3 .5
Mode
1. Measure of Central Tendency
2. Value That Occurs Most Often
3. Not Affected by Extreme Values
4. May Be No Mode or Several Modes
5. May Be Used for Numerical &
Categorical Data
Grouped/Classified Data
Mode Example
No Mode
Raw Data:
One Mode
Raw Data: 6.3 4.9 8.9 6.3 4.9 4.9
More Than 1 Mode
Raw Data: 21 28
28
41
43
43
Thinking Challenge
Youre a financial analyst.
You have collected the
following closing stock
prices of new stock issues:
17, 16, 21, 18, 13, 16, 12,
11.
Describe the stock prices
in terms of central
tendency.
Classified Data
Median
Good for bad distributions
Good for distributions with arbitrary
ceiling or floor
Shape of a Distribution
Describes how data are distributed
Measures of shape
Symmetric or skewed
Left-Skewed
Symmetric
Right-Skewed
Mean = Median
Chap 3-26
Influence of Distribution
Shape
Review
Review
Range
Average deviation
Variance
Standard Deviation
Z score
Variation
Numerical Data
Properties & Measures
Numerical Data
Properties
Central
Tendency
Variation
Shape
Mean
Range
Median
Variance
Standard Deviation
Mode
Skew
AD
N
Note difference between Hays & undergrad textdeviation from Median vs. Mean
Variance
2
(
X
Population Variance:
N
Where 2means population variance,
means population mean, and the other
terms have their usual meaning.
The variance is equal to the average squared
deviation from the mean.
To compute, take each score and subtract the
mean. Square the result. Find the average
over scores. Ta da! The variance.
2
15
10
X X (X X )
-10
100
15
-5
25
15
15
20
15
25
25
15
10
100
Total:
75
250
Mean:
Variance
Is
50
Standard Deviation
Variance is average squared deviation
from the mean.
To return to original, unsquared units,
we just take the square root of the
variance. This is the standard
deviation.
2
Population formula:
( X )
Standard Deviation
Sometimes called the root-mean-square
deviation from the mean. This name
says how to compute it from the inside
out.
Find the deviation (difference between
the score and the mean).
Find the deviations squared.
Find their mean.
Take the square root.
5
10
15
20
25
Total:
Mean:
Sqrt
15
15
15
15
15
75
Variance
SD
X X
-10
-5
0
5
10
0
Is
Is
100
25
0
25
100
250
50
50 7.07
Mode = 21
12
Frequency
Mean=25.73
SD = 6.47
0
10
20
30
age
40
50
Standard or z score
A z score indicates distance from the
mean in standard deviation units.
Formula:
X X
z
S
X
z
SKEWNESS
KURTOSIS
EQUATION
skewness: g1 = m3 / m23/2
kurtosis: a4 = m4 / m22
Example
Calculation of Skewness ON
CLASSIFIED DATA
Interpretation
If skewness = 0, the data are perfectly symmetrical. But a skewness of exactly
zero is quite unlikely for real-world data, so how can you interpret the
skewness number?
Bulmer, M. G., Principles of Statistics (Dover, 1979) a classic suggests this
rule of thumb:
If skewness is less than 1 or greater than +1, the distribution is highly skewed.
If skewness is between 1 and or between + and +1, the distribution is
moderately skewed.
If skewness is between and +, the distribution is approximately
symmetric.
With a skewness of 0.1098, the sample data for student heights are
approximately symmetric.
Calculation of Kurtosis
Influence of Distribution
Shape