Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
STATISTICS
STATISTICS STATISTICS
• Other situations involving Statistics:
▫ Sports Two Meanings
▫ Poll Ratings during elections
▫ Product Testing
Set of numbers (small ‘s’)
▫ Census of households Method of analysis (capital ‘S’)
statistics Statistics
Specific number Method of analysis
numerical measurement determined by a a branch of science which deals with the
set of data collection, presentation, analysis, and
interpretation of data.
Examples: vital statistics, BMI, shooting average,
monthly sales, daily exchange rate
1
6/22/2017
2
6/22/2017
What’s wrong with this graph? What’s wrong with this graph?
Frequency Distribution
• Suppose you want to find out how well 11Z
students did in the last 5-point seatwork. The
Frequency Distribution following are the scores of 20 students:
5 5 4 3 3
Table and Histogram 2 2 1 2 5
3 3 1 2 4
4 4 5 4 3
BONUS [size 1]:
Answer item no. 26 (p. 53) and
construct its histogram. • It is difficult to make any inference based on
just the raw data.
3
6/22/2017
Example1: Example1:
Frequency Distribution for the Weights Frequency Distribution for the Weights
of 50 Pieces of Luggage of 50 Pieces of Luggage
4
6/22/2017
Example1: Example1:
Frequency Distribution for the Weights Frequency Distribution for the Weights
of 50 Pieces of Luggage of 50 Pieces of Luggage
Example1: Example1:
Frequency Distribution for the Weights Frequency Distribution for the Weights
of 50 Pieces of Luggage of 50 Pieces of Luggage
Example1: Example1:
Frequency Distribution for the Weights of 50
Frequency Distribution for the Weights Pieces of Luggage
of 50 Pieces of Luggage
5
6/22/2017
Example1:
Example1:
Frequency Distribution for the Weights
Frequency Distribution for the Weights
of 50 Pieces of Luggage
of 50 Pieces of Luggage
Example1: SW: #1 p. 26
Find the class boundaries, class marks, and
Frequency Distribution for the Weights
class widths given the following intervals:
of 50 Pieces of Luggage
a) 5 - 9 4.5 – 9.5; 7; 5
Answer:
SW # 2
Here is a set of marks obtained by a class on
a mathematics test.
Here is a set of marks obtained by a class on
a mathematics test. 72 53 73 59 68 83 71 67
77 78 70 67 63 65 56 86
72 53 73 59 68 83 71 67 47 78 72 79 67 74 62 84
77 78 70 67 63 65 56 86 92 88 71 74 81 70 66 64
47 78 72 79 67 74 62 84 75 65 46
92 88 71 74 81 70 66 64
75 65 46 a. Identify the highest and lowest value 92 and 46
b. Subtract. 92 – 46 = 46 (this is the Range)
Construct a frequency table c. Divide the range by the number of intervals.
and use 6 class intervals. 46 6 7.6 8 class width/size
6
6/22/2017
72 53 73 59 68 83 71 67 77
78 70 67 63 65 56 86 47 78
72 79 67 74 62 84 92 88 71 Histogram – graphical representation
74 81 70 66 64 75 65 46 of a frequency distribution table
Math Scores of Students
Class Class Class Frequency, Cumulative
Interval Boundaries Mark f Frequency (cf) 15
10 11
10
Freq
6
46 – 53 45.5 – 53.5 49.5 3 3
5 3 2 3
54 – 61 53.5 – 61.5 57.5 2 5
Z
62 – 69 61.5 – 69.5 65.5 10 15 0
73.5 46.5-53.5 53.5-61.5 61.5-69.5 69.5-77.5 77.5-85.5 85.5-93.5
70 – 77 69.5 – 77.5 11 26
81.5 Scores
78 – 85 77.5 – 85.5 6 32
86 – 93 85.5 – 93.5 89.5 3 35
SEATWORK
• The following set of raw data shows the lengths
in millimeters, measured to the nearest
millimeter of 40 leaves taken from plants of a STATISTICS:
certain species. Measures of Central
40 54 25 50 58 45 47 49 30 28
52 31 52 41 47 44 46 39 41 59
Tendency
Objectives:
49 38 43 48 43 43 40 51 40 56 •identify the three Measures of Central tendency
31 53 44 37 35 37 33 38 46 36 •describe the ways to obtain the Measures of
Central Tendency for ungrouped data
• Create a frequency distribution table with 7
class intervals.
• Construct a histogram.
• Mode Example:
The recorded rainfall, in inches, for seven consecutive days
in Rizal is given.
0.25 0.38 0.03 0.03 0.03 0.25 0.56
7
6/22/2017
Arithmetic mean (or simply mean) Arithmetic mean (or simply mean)
The mean of a set of numbers is the Example:
arithmetical average of the numbers. The recorded rainfall, in inches, for seven consecutive days
Example: in Rizal is given.
0.25 0.38 0.03 0.03 0.03 0.25 0.56 0.25 0.38 0.03 0.03 0.03 0.25 0.56
sum of all numbers
mean =
number of numbers
0.25 0.38 0.03 0.03 0.03 0.25 0.56 What do you think will happen to the mean if I
7 change 0.25 to 0.90?
1.53
0.21857...
7
0.22
Median Median
The median of a set of numbers is the middle The median of a set of numbers is the
number when the numbers are arranged in order middle number when the numbers are
(array). arranged in order.
If there is an even number of numbers, the median Example:
is the mean of the two middle numbers. 0.25 0.38 0.03 0.03 0.03 0.25 0.56
Example: Arrange the numbers in order (array)
The recorded rainfall, in inches, for seven consecutive days 0.03 0.03 0.03 0.25 0.25 0.38 0.56
in Rizal is given.
0.25 0.38 0.03 0.03 0.03 0.25 0.56
median = middle number
0.25
8
6/22/2017
Mode
Notice that The mode of a set of numbers is the most
frequently occurring number. There may be
The calculation of the median more than one mode, or there may be no
involves just one or two mode.
the POSITION of each item 0.25 0.38 0.03 0.03 0.03 0.25 0.56
in the series.
Mode
The mode of a set of numbers is the most
Notice that
frequently occurring number. There may be
more than one mode, or there may be no The mode is not affected by
mode.
Example: extreme values
0.25 0.38 0.03 0.03 0.03 0.25 0.56
It does not always exist; and
mode = value that occurs most often
if it does, it may not be
0.03
unique.
9
6/22/2017
Do as indicated.
10
6/22/2017
Example1: Example1:
Frequency Distribution for the Weights of 50 Pieces of Luggage Frequency Distribution for the Weights of 50 Pieces of Luggage
Class Class Class Frequency, Cumulative Class Class Class Frequency, Cumulative
Mark Mark
Interval Boundaries (x) f Frequency (cf) Interval Boundaries (x) f Frequency (cf)
7 - 9 6.5 – 9.5 8 2 2 7 - 9 6.5 – 9.5 8 2 2
10 – 12 9.5 –12.5 11 8 10 10 – 12 9.5 –12.5 11 8 10
13 – 15 12.5 – 15.5 14 14 24 13 – 15 12.5 – 15.5 14 14 24
16 - 18 15.5 – 18.5 17 19 43 16 - 18 15.5 – 18.5 17 19 43
19 - 21 18.5 – 21.5 20 7 50 19 - 21 18.5 – 21.5 20 7 50
highest frequency
Groupe classmark of the interval The mean of a set of numbers is the
dMode 17 arithmetical average of the numbers.
= with highest frquency
Example1: Example1:
Frequency Distribution for the Weights of 50 Pieces of Luggage Frequency Distribution for the Weights of 50 Pieces of Luggage
Example1: Example1:
Frequency Distribution for the Weights of 50 Pieces of Luggage Frequency Distribution for the Weights of 50 Pieces of Luggage
Class Frequency, Cumulative Class Class Class Frequency, Cumulative
Class Class
Mark Mark
Interval Boundaries (x) f Frequency (cf) Interval Boundaries (x) f Frequency (cf)
7 - 9 6.5 – 9.5 8 2 2 7 - 9 6.5 – 9.5 8 2 2
10 – 12 9.5 –12.5 11 8 10 10 – 12 9.5 –12.5 11 8 10
13 – 15 12.5 – 15.5 14 14 24 13 – 15 12.5 – 15.5 14 14 24
16 - 18 15.5 – 18.5 17 19 43 16 - 18 15.5 – 18.5 17 19 43
19 - 21 18.5 – 21.5 20 7 50 19 - 21 18.5 – 21.5 20 7 50
Groupe 16 88 196 323 140 763 Median: The median of a set of numbers is the
dMean 15.26 middle number when the numbers are
50 50
= arranged in order.
11
6/22/2017
Class Class Class Frequency, Cumulative Class Class Class Frequency, Cumulative
Mark Mark
Interval Boundaries (x) f Frequency (cf) Interval Boundaries (x) f Frequency (cf)
7 - 9 6.5 – 9.5 8 2 2 7 - 9 6.5 – 9.5 8 2 2
10 – 12 9.5 –12.5 11 8 10 10 – 12 9.5 –12.5 11 8 10
13 – 15 12.5 – 15.5 14 14 24 13 – 15 12.5 – 15.5 14 14 24
16 - 18 15.5 – 18.5 17 19 43 16 - 18 15.5 – 18.5 17 19 43
19 - 21 18.5 – 21.5 20 7 50 19 - 21 18.5 – 21.5 20 7 50
L – lower boundary of median L – lower boundary of median
class class
Class Class Class Frequency, Cumulative Class Class Class Frequency, Cumulative
Mark Mark
Interval Boundaries (x) f Frequency (cf) Interval Boundaries (x) f Frequency (cf)
7 - 9 6.5 – 9.5 8 2 2 7 - 9 6.5 – 9.5 8 2 2
10 – 12 9.5 –12.5 11 8 10 10 – 12 9.5 –12.5 11 8 10
13 – 15 12.5 – 15.5 14 14 24 13 – 15 12.5 – 15.5 14 14 24
16 - 18 15.5 – 18.5 17 19 43 16 - 18 15.5 – 18.5 17 19 43
19 - 21 18.5 – 21.5 20 7 50 19 - 21 18.5 – 21.5 20 7 50
L – lower boundary of median Grouped HN L – lower boundary of median
Grouped L H N C
class L C class
Median = f 2 H - size (median class)
f 2
Median = 3N
15.5 C
HN f 2
15.5 C
f 2 18.5-15.5 =
3
Class Class Class Frequency, Cumulative Class Class Class Frequency, Cumulative
Mark Mark
Interval Boundaries (x) f Frequency (cf) Interval Boundaries (x) f Frequency (cf)
7 - 9 6.5 – 9.5 8 2 2 7 - 9 6.5 – 9.5 8 2 2
10 – 12 9.5 –12.5 11 8 10 10 – 12 9.5 –12.5 11 8 10
13 – 15 12.5 – 15.5 14 14 24 13 – 15 12.5 – 15.5 14 14 24
16 - 18 15.5 – 18.5 17 19 43 16 - 18 15.5 – 18.5 17 19 43
19 - 21 18.5 – 21.5 20 7 50 19 - 21 18.5 – 21.5 20 7 50
Grouped HN L – lower boundary of median Grouped HN L – lower boundary of median
L C class L C class
Median = f 2 H - size (median class)
Median = f 2 H - size (median class)
3 N f – frequency (median class) 3 50 f – frequency (median class)
15.5 C 15.5
19 2 C N – total frequency
19 2
2+8+14+19+7=50
12
6/22/2017
Class Class Class Frequency, Cumulative Class Class Class Frequency, Cumulative
Mark Mark
Interval Boundaries (x) f Frequency (cf) Interval Boundaries (x) f Frequency (cf)
7 - 9 6.5 – 9.5 8 2 2 7 - 9 6.5 – 9.5 8 2 2
10 – 12 9.5 –12.5 11 8 10 10 – 12 9.5 –12.5 11 8 10
13 – 15 12.5 – 15.5 14 14 24 13 – 15 12.5 – 15.5 14 14 24
16 - 18 15.5 – 18.5 17 19 43 16 - 18 15.5 – 18.5 17 19 43
19 - 21 18.5 – 21.5 20 7 50 19 - 21 18.5 – 21.5 20 7 50
L – lower boundary of median Grouped HN L – lower boundary of median
Grouped HN class L C class
L C H - size (median class)
Median = f 2 H - size (median class)
Median = f 2
f – frequency (median class) 3 50 f – frequency (median class)
3 50 N – total frequency 15.5 24 N – total frequency
15.5 24 19 2
19 2 C – cumulative frequency C – cumulative frequency
(before the median class) 15.657... 15.66 (before the median class)
Example2: SEATWORK:
Find the grouped mean, median and the mode of Find the grouped mean, median and the mode of
each set of data. Give the answers to 2 decimal each set of data. Give the answers to 2 decimal
places where necessary. places where necessary.
Rating Class Class f Cf
Mark Class Class Class Frequency, Cumulative
Boundaries
Mark
0-2 -0.5 – 2.5 1 20 20 Interval Boundaries (x) f Frequency (cf)
3-5 2.5 – 5.5 4 14 34 46 – 53 45.5 – 53.5 49.5 3 3
6-8 5.5 – 8.5 7 15 49 54 – 61 53.5 – 61.5 57.5 2 5
62 – 69 61.5 – 69.5 65.5 10 15
9 - 11 8.5 – 11.5 10 2 51 70 – 77 69.5 – 77.5 73.5 11 26
12 - 14 11.5 – 14.5 13 1 52 78 – 85 77.5 – 85.5 81.5 6 32
86 – 93 85.5 – 93.5 89.5 3 35
Class Class
Class Frequency, Cumulative
Mark
Interval Boundaries (x) f Frequency (cf)
25 – 29 24.5 – 29.5 27 2 2
30 – 34 29.5 – 34.5 32 4 6 Statistics and Probability
35 – 39 34.5 – 39.5 37 7 13
40 – 44 39.5 – 44.5 42 10 23
45 – 49 44.5 – 49.5 47 8 31
50 – 54 49.5 – 54.5 52 6 37
55 – 59 54.5 – 59.5 57 3 40
13
6/22/2017
Range Range
= highest value – lowest value = highest value – lowest value
Example: Example:
The recorded rainfall, in inches, for seven consecutive The salaries for the staff of the XYZ Manufacturing Co.
days in Rizal is given. are shown below. Find the range.
0.25 0.38 0.03 0.03 0.03 0.25 0.56 Staff Salary
Owner 100,000 Range
Range = 0.56 – 0.03 = 0.53
Manager 40,000 = 100,000 – 15,000
Sales Representative 30,000 = 85,000
Worker 15,000
14
6/22/2017
To calculate the Variance, take each difference, square
it, and then average the result:
15
6/22/2017
Source: http://www.mathsisfun.com/data/standard-deviation.html
SCORE FREQUENCY
5 5
4 6
3 4
2 4
1 2
0 1
16
6/22/2017
Remark
Dividing the squared deviation by n, underestimates
the population variance.
The division of n–1, instead of n gives a larger value for
the sample variance which will no longer
underestimate the population variance.
In a statistical sense, the formula for the sample
variance is an unbiased estimator of the population
variance.
17