Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Dante V. Partosa
Mathematics Department
College of Science and Information Technology
Ateneo de Zamboanga University
Preliminaries
A 5 20
B 7 28
O 9 36
AB 4 16
Ungrouped Frequency
Distributions
Ungrouped frequency distributions - can
be used for data that can be enumerated
and when the range of values in the data
set is not large.
Examples - number of miles your
instructors have to travel from home to
campus, number of girls in a 4-child family
etc.
Number of Miles Traveled -
Example
Class Frequency
5 24
10 16
15 10
Grouped Frequency Distributions
C l as s C l as s F r e q u e n c y C u m u l a ti v e
l i m i ts Bo u n d a r i e s fr e q u e n c y
24 - 30 2 3 .5 - 3 7 .5 4 4
38 - 51 3 7 .5 - 5 1 .5 14 18
52 - 65 5 1 .5 - 6 5 .5 7 25
Terms Associated with a Grouped
Frequency Distribution
10 8 6 14
22 13 17 19
11 9 18 14
13 12 15 15
5 11 16 11
Grouped Frequency Distribution -
Example
05 t o 07 4.5 - 7.5 2 2
08 t o 10 7.5 - 10.5 3 5
11 t o 13 10.5 - 13.5 6 11
14 t o 16 13.5 - 16.5 5 16
17 t o 19 16.5 - 19.5 3 19
20 t o 22 19.5 - 22.5 1 20
Histograms, Frequency Polygons,
and Ogives
5
Frequency
5 8 11 14 17 20
N u m b e r o f C ig a re tte s S m o k e d p e r D a y
Histograms, Frequency Polygons,
and Ogives
Frequency Polygon
5
Frequency
2 5 8 11 14 17 20 23 26
10
2 5 8 11 14 17 20 23 26
Pareto C hart for the num ber of Crim es Inves tigated by Law
Enforcement Officers in U.S. National Parks During 1995.
250 100
200 80
Percent
Count
150 60
100 40
50 20
0 0
Defec t
Count 164 34 29 13
Perc ent 68.3 14.2 12.1 5.4
Cum % 68.3 82.5 94.6 100.0
Other Types of Graphs
P O R T AU T H O R IT Y T R AN S IT R ID E R S H IP
89
Ridership (in millions)
87
85
83
81
79
77
75
199 0 19 91 1992 1993 19 94
Y ear
Other Types of Graphs
Assaults
(164,
68.3%)
Organizing Data
Describing Data
Measures of Central Tendency
A statistic is a characteristic or
measure obtained by using the data
values from a sample.
A parameter is a characteristic or
measure obtained by using the data
values from a specific population.
The Mean (arithmetic average)
The mean is defined to be the sum
of the data values divided by the
total number of values.
We will compute two means: one
for the sample and one for a finite
population of values.
The mean, in most cases, is not an
actual data value.
The Sample Mean
X + X + ... + X
X= 1 2 n
n
X.
=
n
The Sample Mean - Example
T h e a g es i n w eek s o f a r a n d o m sa m p l e
o f s i x k i tte n s a t a n a n i m a l s h e l te r a r e
3 , 8 , 5 , 1 2 , 1 4 , a n d 1 2 . F i n d th e
a v e r a g e a g e o f t h i s s a m p l e.
T h e sa m p l e m ea n i s
X = X
=
3 + 8 + 5 +12 +14 +12
n 6
54
= = 9 w e e k s.
6
The Population Mean
X + X + ... + X
m=
1 2 N
N
X.
=
N
The Population Mean - Example
(f X)
X= .
n
H ere f i s the frequency for the
correspondi ng val ue of X , and n = f .
The Sample Mean for an Ungrouped
Frequency Distribution - Example
SSccoorree,,XX FFrreeqquueennccyy,,ff
00 22
11 44
22 1122
33 44
5
44 33
5
The Sample Mean for an Ungrouped
Frequency Distribution - Example
f X 52
X= = = 2.08.
n 25
The Sample Mean for a Grouped
Frequency Distribution
( f X m)
X= .
n
Here X is thecorresponding
m
class midpoint.
The Sample Mean for a Grouped
Frequency Distribution - Example
CCllaassss FFrreeqquueennccyy,,ff
1155.5
.5--2200.5.5 33
2200.5
.5--2255.5
.5 55
2255.5
.5--3300.5
.5 44
3300.5
.5--3355.5
.5 33
3355.5
.5--4400.5
.5 22
5
5
The Sample Mean for a Grouped
Frequency Distribution - Example
f X m = 54 + 115 + 112 + 99 + 76
= 456
and n = 17. So
f Xm
X=
n
456
= = 26.82.
17
The Median
NNoo..SSeetstsSSoold
ld FFrreeqquueennccyy CCuum muulalatitv
ivee
FFrreeqquueennccyy
11 44 44
22 99 1133
33 66 1199
44 22 2211
55 33 2244
5
3355.5
.5--4400.5
.5 22
5
The Median for a Grouped
Frequency Distribution - Example
n =17
cf = 8
f =4
w = 25.520.5=5
Lm = 25.5
(n 2) - cf (17/ 2) 8
MD = (w) + Lm = (5) + 25.5
f 4
= 26.125.
The Mode
5
The Mode - Grouped Frequency
Distribution
The mode for grouped data is the
modal class.
The modal class is the class with the
largest frequency.
Sometimes the midpoint of the class
is used rather than the boundaries.
The Mode for a Grouped Frequency
Distribution - Example
5
The Midrange
w + w +...+ wn
1 2 w
where w , w , ..., wn are the wei ghts
1 2
Y
Positively Skewed
X
Mode < Median < Mean
Symmetrical
Y
Symmetrical
X
Mean = Median = Mode
Negatively Skewed
Negatively Skewed
X
Mean < Median < Mode
Measures of Variation - Range
( X - m ) , where
2
s =
2
N
X = i ndi vi dual val ue
m = popul ati on mean
N = popul ati on si ze
Measures of Variation - Population
Standard Deviation
( X - m) 2
s = s = .
2
N
Measures of Variation - Example
(X - X ) 2
s = , and
2
n-1
X = sample mean
n = sample size
Measures of Variation - Sample
Standard Deviation
( X - X )2
s = s =
2
.
n-1
Shortcut Formula for the Sample
Variance and the Standard Deviation
X - ( X ) / n
2 2
s=
2
n-1
X - ( X ) / n
2 2
s=
n-1
Sample Variance - Example
X - ( X ) / n
2 2
s =
2
n-1
1263- (79)/ 5
2
= = 3.7
4
s = 3.7 = 1.9.
Sample Variance for Grouped and
Ungrouped Data
f X - [( f X ) / n]
2 2
s = .
2 m m
n-1
For ungrouped data, replace Xm
with the observe X value.
Sample Variance for Grouped Data
- Example
XX ff ffX
X ffX 2
X 2
55 22 1010 5050
66 33 18
18 108
108
77 88 56
56 392
392
88 11 88 64
64
99 66 54
54 486
486
10
10 44 40
40 400
400
nn= 24
f X
=
= 24 f X = 186 186
f
fX=
X
22
=1500
1500
Sample Variance for Ungrouped
Data - Example
f X 2 - [( f X )2 / n]
s =
2
n-1
1500- [(186)/ 24] =
2
= 2.54.
23
s = 2.54 = 1.6.
Coefficient of Variation
m s -- m s -- 95% m s --
m -s m -s m -s m m +s m +s m +s
Measures of Position z score
Given the data set 5, 6, 12, 13, 15, 18, 22, 50,
can the value of 50 be considered as an
outlier?
Q1 = 9, Q3 = 20, IQR = 11. Verify.
(1.5)(IQR) = (1.5)(11) = 16.5.
9 16.5 = 7.5 and 20 + 16.5 = 36.5.
The value of 50 is outside the range 7.5 to
36.5, hence 50 is an outlier.
Exploratory Data Analysis - Stem
and Leaf Plot
0 2
1 3 4
2 0 3 5
3 1 2 2 2 2 3 6
4 3 4 4 5
5 1 2 7
Exploratory Data Analysis
Box Plot
LH UH
MINIMUM MAXIMUM
MEDIAN
0 10 20 30 40 50 60
Information Obtained from a
Box Plot