Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Chapter 3. DESCRIPTION
OF DATA
Frequency Distributions
Grouped Data
Percentiles, Deciles & Quartiles
Graphical Representations
Symmetry and Skewness
Statistical data collected should be
arranged in such a manner that will
allow a reader to distinguish their
essential features. Depending on the
type and the objectives of the person
presenting the information, data may
be presented using one or a
combination of three forms.
Three Forms of Presenting Data
4000
3500
3000 1991
form
r
ve r
r
ch
ce r
Fe ry
ay
ril
ne
ly
t
ry
be
e
be
De be
Se gus
Ap
Ju
a
ob
ua
ar
Ju
nu
m
em
m
M
br
ct
Au
Ja
O
pt
No
When that data include a large
number of observations, it is
convenient to group the values into
mutually exclusive classes and show
the number of observations occurring
in each class in a tabular form.
Frequency Distribution
MEAN
Methods :
1. Midpoint Method
2. Short Method
Midpoint Method
After the f column, make another column and
enter the midpoint (Xm) of each class. Multiply the
frequency with the midpoint and enter it in the
next column. Label the column f Xm. Get the sum.
Use the formula:
x
( fX m )
N
Short Method
x z
( fxd) a lg
i
N
where z = midpt. of class chosen as origin
Problem:
Classes f
Classes f Xm fXm
x
( fX m )
54-50 4 52 208 N
49-45 7 47 329
1905
44-40 12 42 504 x
39-35 10 37 370 50
34-30 9 32 288
29-25 6 27 162 x 38.1
24-20 2 22 44
N = 50 fX m 1905
Using Short Method
Classes f d fd
x z
( fxd) a lg
i
54-50 4 3 12
N
49-45 7 2 14
11
44-40 12 1 12 x 37 (5)
39-35 10 0 0 50
34-30
29-25
9
6
-1
-2
-9
-12
x 38.1
24-20 2 -3 -6
N = 50 fd 11
MEDIAN
Steps:
N
Find
2
Find the accumulated sum of the
frequencies up to the sum that
contains N
2
Use the formula:
( N cf )
Md L 2 i
f
Median
R. Mode
T. Mode
Solution: Computing for the Median
Classes f cf i=5
54-50 4 N 50
25
49-45 7 2 2
44-40 12 ( N cf )
39-35 10 27
Md L 2 i
f
34-30 9 17 (25 17)
29-25 6 8 Md 35 (5)
10
24-20 2 2
Md 39
N = 50
Computing for the Mode
Classes f R. Mode = 42
54-50 4
49-45 7 T . Mode 3Md 2 x
44-40 12 since x 38.1
39-35 10 Md 39
34-30 9
29-25 6
24-20 2
T . Mode 3(39) 2(38.1)
N = 50 T . Mode 40.8
Other Measures of Position
Quartiles
Deciles
Percentiles
Quartiles - those which divide the
distribution into 4 parts
( kN cf )
Qk L 4 i
f
Deciles - those which divide the
distribution into 10 parts
( kN cf )
Dk L 10 i
f
Percentiles - those which divide the
distribution into 100
parts
( kN cf )
Pk L 100 i
f
Problem:
Q1
D3
P88
Solution: Computing for Q1
i=5
Classes f cf
kN (1)50
54-50 4 12.5
49-45 7
4 4
(kN cf )
44-40 12
Qk L 4 i
39-35 10 f
34-30 9 17 (12.5 8)
29-25 6 8 Q1 30 (5)
9
24-20 2 2 Q1 32.5
N = 50
Computing for D3
i=5
Classes f cf
kN (3)50
54-50 4 15
10 10
49-45 7
44-40 12 (kN cf )
Dk L 10 i
39-35 10 f
34-30 9 17
(15 8)
29-25 6 8 D3 30 (5)
24-20 2 2
9
D3 33.89
N = 50
Computing for P88
i=5
Classes f cf
kN (88)50
54-50 4 44
100 100
49-45 7 46
44-40 12 39 (kN cf )
Pk L 100 i
39-35 10 27 f
34-30 9 17
(44 39)
29-25 6 8 P88 45 (5)
24-20 2 2
7
P88 48.57
N = 50
For Grouped Data ( > 30 values)
MEASURES OF VARIATION
RANGE
The range is computed as the
difference between the upper limit of
the highest class interval and the
lower limit of the lowest class interval.
VARIANCE
2
f (x m x) 2
N
STANDARD DEVIATION
f (x m x) 2
N
MEAN DEVIATION
D
f x m x
N
QUARTILE DEVIATION
Q3 Q1
Q
2
Problem:
variance
standard deviation
mean deviation
quartile deviation
Classes f Classes f
89-85 1 59-55 7
84-80 1 54-50 6
79-75 2 49-45 6
74-70 3 44-40 6
69-65 4 39-35 3
64-60 4 34-30 1
Solution: Computing for the Mean
Classes f Xm fXm
89-85 1 87 87
84-80 1 82 82 x
( fX m )
79-75 2 77 154
N
74-70 3 72 216
2443
69-65 4 67 268
x
64-60 4 62 248 44
59-55 7 57 399
54-50 6 52 312 x 55.5
49-45 6 47 282
44-40 6 42 252
39-35 3 37 111
34-30 9 32 32
N = 44 fX m 2443
Computing for the Variance
Classes f xm – X (xm - X )2 f(xm - X )2
89-85 1 31.5 992.25 992.25
2
84-80 1 26.5 702.25 702.25
f ( xm x ) 2
79-75 2 21.5 462.25 924.50
74-70 3 16.5 272.25 816.75 N
69-65 4 11.5 132.25 529.00
7329
64-60 4 6.5 42.25 169.00
2
Since 166.57
2
2
166.57
12.906
Computing for the Mean Deviation
Classes f /xm –X / f /xm - X /
89-85 1 31.5 31.5
84-80 1 26.5 26.5
79-75
74-70
2
3
21.5
16.5
43.0
49.5 D
f x m x
N = 44 fx m x 465
Computing for the Quartile Deviation
Classes f cf
Qk
kN cf
L 4 i
89-85 1 44 f
84-80 1 43 kN 1( 44)
11
79-75 2 42 4
Q1
4
45
11 10 5
45.83
74-70 3 40
6
69-65 4 37
64-60 4 33 kN 3( 44)
33
4 4
33 335
59-55 7 29
54-50 6 22 Q3 60 60
49-45 6 16
4
44-40 6 10
Q3 Q1 60 45.83
39-35 3 4 Q
34-30 1 1 2 2
N = 44 Q 7.085
Types
of
Graphs
BAR GRAPH
Classes f
For the following 54-50 4
frequency distribution, 49-45 7
construct: 44-40 12
bar graph 39-35 10
histogram 34-30 9
frequency polygon
29-25 6
24-20 2
BAR GRAPH
15
Fre que ncy
10
0
20-24 25-29 30-34 35-39 40-44 45-49 50-54
Class Marks
HISTOGRAM
15
12
10
Frequency
9
10
7
6
4
5
2
0
Class Boundaries
FREQUENCY POLYGON
15
Frequency
10
0
20-24 25-29 30-34 35-39 40-44 45-49 50-54
Classes
PIE CHART
Engineering 5280
Engineering
Commerce 3000 Commerce
Education 1800 Education
Arts & Sciences
Arts & Sciences 1320 Law
Law 600
CUMULATIVE FREQUENCY
CURVE (Ogive Curve)
N 376
120
100
80
Ogive
CP
60
Curve
40
20
0
0 9 14 19 24 29 34 39 44 49 54 59 64
UL
Q
ku
P90 P10
Types of Kurtosis
3( x md )
sk
s
If sk = 0, the distribution is normal.
X Md Mo
If sk < 0, the distribution is
negatively skewed.
X Md Mo
( Mo Md X )
If sk > 0, the distribution is
positively skewed.
Mo Md X
( X Md Mo )
Problem:
For a certain frequency distribution,
the ff. data are given:
s 13.7 Q3 155.8 md 147
x 147 P90 167.5 Q1 138
D1 128.8
Determine the kurtosis and skewness of the
distribution. Is it a normal distribution?
Solution:
Q3 Q1
Q 2
ku
P90 P10 P90 D1
155.8 138
ku 2 0.23
167.5 128.8
Distribution is leptokurtic.
3( x md )
sk
s
3(147 147.25)
sk 0.05
13.7
Distribution is negatively
skewed.
Student
Activity
Part I. Answer the following: