Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
I. Range
The range for a set of data items is the difference between the largest and smallest
values. Although the range is the easiest of the numerical measures of variability
to compute, it is not widely used because it is based on only two of the items in the
data set and thus is influenced too much by extreme data values.
A form of the range that avoids the dependence on extreme values in the data set is
the interquartile range (IQR), or Q-spread. This descriptive measure of variability
is simply the difference between the third quartile (Q3 ) , or 75%-tile data item, and
the first quartile (Q1 ) , or 25%-tile data item. In effect, it is showing the range for
the middle 50% of the data and, as such, is not affected by the extreme values in the
3
data set. To calculate Q3 , let i = N where N is the number of data items. If i is
4
not an integer, then the next integer greater than i denotes the position of the 75%-tile;
if i is an integer, then the 75%-tile is the average of the data values in positions i and
1
i + 1. Similarly, to calculate Q1 , let i = N and follow the same guidelines as
4
above.
Example 1: Given the following data: 2, 3, 5, 7, 11, 13, 17, 19, 23, 29. Find the
IQR.
3
N = 10 ⇒ i = (10) = 7.5 ⇒ Q3 is the 8th data item ⇒ Q3 = 19 . Next,
4
1
i= (10 ) = 2.5 ⇒ Q1 is the 3rd data item ⇒ Q1 = 5 . Therefore, IQR = 19-5 =
4
14.
Example 2: Given the following data: 2, 3, 5, 7, 11, 13, 17, 19. Find the IQR.
3
N =8⇒i = (8) = 6 ⇒ Q3 is the average of the data values in the 6th and 7th
4
13 + 17 1
positions ⇒ Q3 = = 15 . Next, i = (8) = 2 ⇒ Q1 is the average of the
2 4
3+5
values in the 2nd and 3rd positions ⇒ Q1 = = 4. Therefore, IQR = 15-4 = 11.
2
1
ΣX −X
Average Absolute Deviation =
N
This average absolute deviation gives the average distance of any data item from the
mean and thus is a good measure of spread.
If you were to calculate the average absolute deviation of a distribution using a value
other than the mean, you could possibly get a smaller average absolute deviation.
This result is one of the reasons that the average absolute deviation is not the best
measure of variability. Instead, calculate the average of the squared differences from
the mean; this is the variance of a distribution. If you were to calculate the average
of the squared differences of a distribution by using a value other than the mean,
you would always get a larger value. The mean is the one number that minimizes
the average of the squared differences in a distribution.
Σ( X − X ) 2
Variance = σ 2 =
N
There are still two slight inconveniences in using variance as our measure of
variability. First, variance does not give an estimate of the distance of a typical data
from the mean; it is too big. Second, if the data items have a unit of measurement
associated with them, then the variance would not have the same unit of measure-
ment; it would have square units. By taking the square root of variance, we get
standard deviation, which is the measure of variability that we want.
2
2
Standard Deviation = σ = Σ( X − X )
N
2
Standard Deviation = σ = ΣX − X 2
N
%/cig
2 30%
40%
(.5) 20%
10%
0
0 10 20 40 80
Number of cigarettes
3
Important Note:
Some textbooks will give the following formulas for variance and standard deviation:
2 2 2
Variance = s 2 = Σ( X − X ) = ΣX − N X
N −1 N −1
2 2 2
Standard Deviation = s = Σ( X − X ) = ΣX − N X
N −1 N −1
These formulas should be used when N data items are taken as a sample from a
larger population in which the variance and standard deviation of that population are
unknown. These formulas give good approximations of the variance and standard
deviation of the population.
46 64 72 79 89
49 66 74 79 91
53 66 75 80 94
60 67 76 83 95
61 71 79 88 98
4
III. In the histogram given below, the class intervals include the right endpoint, not the
left:
%/$1000
1.25
1.00
0.75
0.50
0.25
0
0 20 40 80 100 120
Income (in $1000)
II. (1) 11
(2) 3 1
3
(3) 11.5
(4) 3 1
3
(5) 4.2
(6) 4.2
(7) 4.2
(8) 29.4
(9) .84
III. (1) 56
(2) 26
(3) 76 – 35 = 41