Sei sulla pagina 1di 34

Chapter 01 - Introduction and Descriptive Statistics

CHAPTER 1
INTRODUCTION AND DESCRIPTIVE STATISTICS
1-1. 1. quantitative/ratio
2. qualitative/nominal
3. quantitative/ratio
4. qualitative/nominal
5. quantitative/ratio
. quantitative/interval
!. quantitative/ratio
". quantitative/ratio
#. quantitative/ratio
10. quantitative/ratio
11. quantitative/ordinal
1-2. Data are $ased on numeric measurements o% some varia$le& either %rom a data set
comprisin' an entire population o% interest& or else o$tained %rom onl( a sample )su$set*
o% the %ull population. Instead o% doin' the measurements ourselves& +e ma( sometimes
o$tain data %rom previous results in pu$lished %orm.
1-3. ,he +ea-est is the .ominal Scale& in +hich cate'ories o% data are 'rouped $( qualitative
di%%erences and assi'ned num$ers simpl( as la$els& not usa$le in numeric comparisons.
.e/t in stren'th is the 0rdinal Scale1 data are ordered )ran-ed* accordin' to relative si2e
or qualit(& $ut the num$ers themselves don3t impl( speci%ic numeric relationships.
Stron'er than this is the Interval Scale1 the ordered data points have meanin'%ul distances
$et+een an( t+o o% them& measured in units. 4inall( is the 5atio Scale& +hich is li-e an
Interval Scale $ut +here the ratio o% an( t+o speci%ic data values is also measured in units
and has meanin' in comparin' values.
1-4. .ame1 6ualitative
7ealth1 6uantitative
8'e1 6uantitative
Industr(1 6ualitative
Countr( o% Citi2enship1 6ualitative
1-5. 0rdinal.
1-. 8 qualitative varia$le descri$es di%%erent cate'ories or qualities o% the mem$ers o% a data set&
+hich have no numeric relationships to each other& even +hen the cate'ories happen to $e coded
as num$ers %or convenience. 8 quantitative varia$le 'ives numericall( meanin'%ul in%ormation& in
terms o% ran-in'& di%%erences& or ratios $et+een individual values.
1-!. ,he people %rom one particular nei'h$orhood constitute a non-random sample )dra+n
%rom the lar'er to+n population*. ,he 'roup o% 100 people +ould $e a random sample.
1-1
Chapter 01 - Introduction and Descriptive Statistics
1-". 8 sample is a su$set o% the %ull population o% interest& %rom +hich statistical in%erences
are dra+n a$out the population& +hich is usuall( too lar'e to permit the varia$les to $e
measured %or all the mem$ers.
1-#. 8 random sample is a sample dra+n %rom a population in a +a( that is not a priori $iased
+ith respect to the -inds o% varia$les $ein' measured. It attempts to 'ive a representative
cross-section o% the population.
1-10. .ationalit(1 qualitative. 9en'th o% intended sta(1 quantitative.
1-11. 0rdinal. ,he colors are ran-ed& $ut no units o% di%%erence $et+een an( t+o o% them are
de%ined.
1-12. Income1 quantitative& ratio
.um$er o% dependents1 quantitative& ratio
4ilin' sin'l(/:ointl(1 qualitative& nominal
Itemi2ed or not1 qualitative& nominal
9ocal ta/es1 quantitative& ratio
1-13. 9o+er quartile ; 25th percentile ; data point in position )n < 1*)25/100* ;
34)25/100* ; position ".5. )=ere n ; 33.* 9et us order our o$servations1 10#& 110&
114& 11& 11"& 11#& 120& 121& 121& 123& 123& 125& 125& 12!& 12"& 12"& 12"& 12"& 12#& 12#&
130&
131& 132& 132& 133& 134& 134& 134& 134& 13& 13& 13& 13.
9o+er quartile ; 121
>?sin' the %ormula 'iven in the te/t1 )n<1*)p/100*@
Aiddle quartile is in position1 34)50/100* ; 1!. Boint is 12".
?pper quartile is in position1 34)!5/100* ; 25.5. Boint is 133.5
10th percentile is in position1 34)10/100* ; 3.4. Boint is 114.".
15th percentile is in position1 34)15/100* ; 5.1. Boint is 11".1.
5th percentile is in position1 34)5/100* ; 22.1. Boint is 131.1.
I65 ; 133.5 - 121 ; 12.5.
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
Percentile and Percentile Rank Calculations

x-t
Percentile
Percentile
rank o! " x "
10 116.4 116.4 10
15 118.8 118.8 15
65 130.8 130.8 65
#uartiles
1st #uartile 121
$edian 128 I#R 12
%rd #uartile 133
1-2
Chapter 01 - Introduction and Descriptive Statistics
1-14. 4irst& order the data1
-1.2& 3.#& ".3& #& #.5& 10& 11& 11.& 12.5& 13& 14."& 15.5& 1.2& 1.!& 1"
>?sin' the %ormula 'iven in the te/t1 )n<1*)p/100*@
,he median& or 50th percentile& is the point in position 1)50/100* ; ". ,he point is 11..
4irst quartile is in position 1)25/100* ; 4. Boint is #.
,hird quartile is in position 1)!5/100* ; 12. Boint is 15.5.
55th percentile is in position 1)55/100* ; ".". Boint is 12.32.
"5th percentile is in position 1)"5/100* ; 13.. Boint is 1.5.
1-15. 0rder the data1
-1.3& -0.!& -0.!& -0.5& -0.4& 0.1& 0.2& 0.!& 0."& 1.
>?sin' the %ormula 'iven in the te/t1 )n<1*)p/100*@
Aedian is in position 11)50/100* ; 5.5. Boint is G0.15.
20th percentile is in position 11)20/100* ; 2.2. Boint is G0.!.
30th percentile is in position 11)30/100* ; 3.3. Boint is G0.4.
0th percentile is in position 11)0/100* ; .. Boint is 0.1.
#0th percentile is in position 11)#0/100* ; #.#. Boint is 1.52.
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
Percentile and Percentile Rank Calculations

x-t
Percentile x "
20 -0.7
30 -0.56
60 0.14
90 0.88
#uartiles
1st #uartile -0.65
$edian -0.15 I#R 1.225
%rd #uartile 0.575
1-1. 0rder the data1 1& 1& 1& 2& 2& 2& 3& 3& 4& 4& 5& 5& & & !.
9o+er quartile is the 25th percentile& in position 1)25/100* ; 4. Boint is 2.
,he median is in position 1)50/100* ; ". ,he point is 3.
?pper quartile is in position 1)!5/100* ; 12. Boint is 5.
I65 ; 5 - 2 ; 3.
0th percentile is in position 1)0/100* ; #.. Boint is 4.
1-3
Chapter 01 - Introduction and Descriptive Statistics
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
Percentile and Percentile Rank Calculations

x-t
Percentile x "
60 4 4.0
1 0
1 0
#uartiles
1st #uartile 2
$edian 3 I#R 3
%rd #uartile 5
1.1!. ,he data are alread( orderedH there are 1 data points.
>?sin' the %ormula 'iven in the te/t1 )n<1*)p/100*@
,he median is the point in position 1!)50/100* ; ".5 It is 51.
9o+er quartile is in position 1!)25/100* ; 4.25. It is 30.5.
?pper quartile is in position 1!)!5/100* ; 12.!5. It is 1#4.25.
I65 ; 1#4.25 - 30.5 ; 13.!5.
45th percentile is in position 1!)45/100* ; !.5. Boint is 42.2.
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
Percentile and Percentile Rank Calculations

x-t
Percentile x "
45 43 43.0
0
0
#uartiles
1st #uartile 31.5
$edian 51 I#R 131.25
%rd #uartile 162.75
1-1". ,he mean is a central point that summari2es all the in%ormation in the data. It is sensitive to
e/treme o$servations. ,he median is a point Iin the middleI o% the data set and does not contain
all the in%ormation in the set. It is resistant to e/treme o$servations. ,he mode is a value that
occurs most %requentl(.
1.1#. Aean& median& mode)s* o% the o$servations in Bro$lem 1-13:
= = = 4 . 12 Aean
i
x x
Aedian ; 12"
Aodes ; 12"& 134& 13 )all have 4 points*
1-4
Chapter 01 - Introduction and Descriptive Statistics
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
$easures o! Central tendenc"

$ean 126.63636 $edian 128 $ode 128
1-20. 4or the data o% Bro$lem 1-141
Aean ; 11.2533
Aedian ; 11.
Aode1 none
1-21. 4or the data o% Bro$lem 1-151
Aean ; .#55
Aedian ; !0
Aode ; 45
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
$easures o! Central tendenc"

$ean 66.954545 $edian 70 $ode 45
1-22. 4or the data o% Bro$lem 1-161
Aean ; 3.4
Aedian ; 3
Aode ; 1 and 2
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
$easures o! Central tendenc"

$ean 3.4666667 $edian 3 $ode 1
1-23. 4or the data o% Bro$lem 1-171
Aean ; 1##."!5
Aedian ; 51
Aode1 none
>?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
$easures o! Central tendenc"

$ean 199.875 $edian 51 $ode #N/A
1-5
Chapter 01 - Introduction and Descriptive Statistics
1-24. 4or the data o% C/ample 1-11
Aean ; 13&20
Aedian ; 1&"00
Aode1 none
1-25. )?sin' the template1 DEasic Statistics./lsF& enter the data in column J.*
&asic Statistics !ro' Ra( Data
$easures o! Central tendenc"

$ean 21.75 $edian 13 $ode 12
1-2. )?sin' the template1 DEasic Statistics./lsF*
-10
0
10
20
30
40
50
I
n
t
e
l
A
T
&
T
G
E
E
x
x
o
n
M
o
b
i
l
e
M
i

!
o
"
o
#
t
$
#
i
%
e
!
&
i
t
i
'
!
o
(
)
Aean ; 1!.5!1
Aedian ; 1.#
0utliers1 -.#& 4.5
1-2!. >?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
$easures o! Central tendenc"

$ean 18.35 $edian 19.1 $ode #N/A
1-2". Aeasures o% varia$ilit( tell us a$out the spread o% our o$servations.
1-2#. ,he most important measures o% varia$ilit( are the variance and its square root- the standard
deviation. Eoth re%lect all the in%ormation in the data set.
1-30. 4or a sample& +e divide the sum o% squared deviations %rom the mean $( n G 1& rather than $( n.
1-
Chapter 01 - Introduction and Descriptive Statistics
1-31. 4or the data o% Bro$lem 1-13, assumed a sample1 5an'e ; 13 G 10# ; 2!
Kariance ; 5!.!4 Standard deviation ; !.5#"
I! te data is o! a
Sa')le Po)ulation
Variance 57.7386364
St* De+* 7.59859437
1-32. 4or the data o% Bro$lem 1-141 5an'e ; 1" G )G1.2* ; 1#.2
Kariance ; 25.#0 Standard deviation ; 5.0"#
1-33. 4or the data o% Bro$lem 1-15: 5an'e ; #" G 3" ; 0
Kariance ; 321.3" Standard deviation ; 1!.#2!
I! te data is o! a
Sa')le Po)ulation
Variance 321.378788
St* De+* 17.9270407
1-34. 4or the data o% Bro$lem 1-161 5an'e ; ! G 1 ;
Kariance ; 3.#" Standard deviation ; 1.##5
I! te data is o! a
Sa')le Po)ulation
Variance 3.98095238
St* De+* 1.99523241
1-35. 4or the data o% Bro$lem 1-17: 5an'e ; 1&20# G 23 ; 1&1"
Kariance ; 110&2"!.45 Standard deviation ; 332.0#
I! te data is o! a
Sa')le Po)ulation
Variance 110287.45
St* De+* 332.095543
1-3. [ ] "4 . 141 & 44 . 111 2 / so & 0 . ! s & 4 . 12 / & 33 n = = = = s H this captures 31/33 o% the
data points& so Che$(shev3s theorem holds. ,he data set is not mound-shaped& so the empirical
rule does not appl(.
1-3!.
[ ] 433 . 21 & 0!3 . 1 s 2 / so & 0#0 . 5 s & 253 . 11 / & 15 n = = = =
H this captures 14/15 o% the
data points& so Che$(shev3s theorem holds. ,he data set is not mound-shaped& so the empirical
rule does not appl(
1-!
Chapter 01 - Introduction and Descriptive Statistics
1-3". [ ] "1 . 102 & 0# . 31 2 / so & #3 . 1! s & #5 . / & 22 n = = = = s H this captures all the data
points& so Che$(shev3s theorem holds. ,he data set is not mound-shaped& so the empirical rule
does not appl(.
1-3#.
[ ] 45! . ! & 523 . 0 s 2 / so & ##5 . 1 s & 4! . 3 / & 15 n = = = =
H this captures all the data
points& so Che$(shev3s theorem holds. ,he data set is not mound-shaped& so the empirical rule
does not appl(.
1-40. [ ] 1 . "4 & 3 . 44 s 2 / so & 1 . 332 s & # . 1## / & 1 n = = = = H this captures 15/1 o% the data points&
so Che$(shev3s theorem holds. ,he data set is not mound-shaped& so the empirical rule does not
appl(.
1-41.
1-42.
1-"
Electrolux
,E
$atsusita
-irl)ool
&-S
Pili)s
$a"ta.
/ 0 1/ 10 1/
Stock 1
Stock 1
Stock %
Stock 2
Stock 0
Chapter 01 - Introduction and Descriptive Statistics
1-43.
1-44. >?sin' the C/cel ,emplate1 DEasic Statistics./lsF@


To) Pri+ate E3uit" Deals
0
5
10
15
20
25
30
35
40
45
1 2 3 4 5 6 7 8 9 10
4
5
6
i
l
l
i
o
n
s
7
1-#
$easures o! Central tendenc"

$ean 24.13 $edian 23.65
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 70.6312222
St* De+* 8.40423835
Endo('ents 54 6illions7
/
1
1
%
2
H
a
r
+
a
r
d
T
e
x
a
s
P
r
i
n
c
e
t
o
n
8
a
l
e
S
t
a
n
!
o
r
d
C
o
l
u
'
6
i
a
T
e
x
a
s
A
9
$
Uni+ersit"
4

6
i
l
l
i
o
n
s
Chapter 01 - Introduction and Descriptive Statistics
1-45. >?sin' the C/cel ,emplate1 DEasic Statistics./lsF@

&!e*it +e# ,(lt -.,) /,l(e"
1-4.
1-4!. ?sin' AI.I,8E
Stem Leaves
4 5 5""
" 0123
14 !!!"#
)#* ! 002223334
11 ! 55!""#
3 " 224
1-10
$easures o! Central tendenc"

$ean 13.333333 $edian 12.5
Sales 547
/
1
1
%
2
0
:
;
<
/
=
0
0
=
1
/
1
/
=
1
0
1
0
=
1
/
1
/
=
1
0
Sales
!
r
e
3
u
e
n
c
i
e
s
Chapter 01 - Introduction and Descriptive Statistics
1-4".
,here are no outliers. Distri$ution is s-e+ed to the le%t.
1.4#. 8 stem-and-lea% displa( is a quic-l( dra+n t(pe o% histo'ram use%ul in anal(2in' data. 8 $o/ plot
is a more advanced displa( use%ul in identi%(in' outliers and the shape o% the distri$ution o% the
data.
1-50. Stem Leaves
1 0 5
1 1
1 2
! 3 2345!"
)13* 4 22345!!"""##
11 5 012235!"
2 3
1 ! "
1-11
Eo/ and 7his-er Blot
34 cases
".5
!.#
!.3
.!
.1
5.5
C
1
Chapter 01 - Introduction and Descriptive Statistics
1-51. ,he data are narro+l( and s(mmetricall( concentrated near the median )I65 and the +his-er
len'ths are small*& not countin' the t+o e/treme outliers.
1-52. 7ider dispersion in data set L2. .ot much di%%erence in the lo+er +his-ers or lo+er hin'es o% the
t+o data sets. ,he hi'h value& 24& in data set L2 has a si'ni%icant impact on the median& upper
hin'e and upper +his-er values %or data set L2 +ith respect to data set L1.
1-53. Aean ; 12!
Kar ; 13!
sd ; 11.!05
mode ; 12!
outliers1 ,78& 9u%thansa
1-12
Eo/ and 7his-er Blot
31 cases
"0
0
40
20
0
C
1
Chapter 01 - Introduction and Descriptive Statistics
100
110
120
130
140
150
160
1-54. Stem-and-lea% o% C2 . ; 45
9ea% ?nit ; 1.0
f Stem Leaves
13 1 0011111223444
1" 1 55"#
)* 2 022333
21 2 5!!"#
15 3 0122234
" 3 !"
4 012
3 4 !
2 5 23
1-55. 0utliers are detected $( loo-in' at the data set& constructin' a $o/ plot or stem-and-lea% displa(.
8n outlier should $e anal(2ed %or in%ormation content and not merel( eliminated.
1-5. ,he median is the line inside the $o/. ,he hin'es are the upper and lo+er quartiles. ,he inner
%ences are the t+o points at a distance o% 1.5 )I65* %rom the upper and lo+er quartiles. 0uter
%ences are similar to the inner %ences $ut at a distance o% 3 )I65*. ,he $o/ itsel% represents 50M
o% the data.
1-13
Chapter 01 - Introduction and Descriptive Statistics
1-5!. Mine A: Mine B:
f Stem Leaves f Stem Leaves
2 3 24 2 2 34
4 3 5! 4 2 "#
! 4 123 3 24
)5* 4 55"# # 3 5!"
! 5 123 )3* 4 034
4 5 ! 4 !"#
4 0 4 5 012
3 ! 3 1 5 #
1 " 5
Kalues %or Aine 8 are smaller than %or Aine E& ri'ht-s-e+ed& and there are three outliers. Kalues
%or Aine E are lar'er and the distri$ution is almost s(mmetric. ,here is lar'er variance in E.
1-5". .o. 0ne needs to use descriptive statistics and/or statistical in%erence.
1-5#. >?sin' the template1 DEo/ Blot./lsF@
&ox Plot +,il0 $e!ent,'e &1,n'e in -to2 $!ie"
>o(er
-isker
>o(er
Hin.e $edian
U))er
Hin.e
U))er
-isker
-0.3 0.275 0.6 1.15 1.6
1.0. >?sin' the C/cel ,emplate1 DEasic Statistics./lsF@
,he t+o measures are virtuall( equivalent.
>?sin' the template1 DEo/Blot./lsF@
1-14
$easures o! Central tendenc"

$ean 4.88 $edian 4.9
Chapter 01 - Introduction and Descriptive Statistics
&ox Plot 0 to 60 ti3e"
>o(er
-isker
>o(er
Hin.e $edian
U))er
Hin.e
U))er
-isker
4.2 4.725 4.9 5.1 5.3
1-1. 8ns+ers +ill var(.
a. I% +e add the value D5F to all the data points& then the avera'e& median& mode& %irst quartile&
third quartile and "0
th
percentile values +ill chan'e $( D5F. ,here is no chan'e in the
variance& standard deviation& s-e+ness& -urtosis& ran'e and interquartile ran'e values.
$. 8vera'e1 i% +e add D5F to all the data points& then the sum o% all the num$ers +ill increase $(
D5NnF& +here n is the num$er o% data points. ,he sum is divided $( n to 'et the avera'e. So
5Nn / n ; 51 the avera'e +ill increase $( D5F.
Aedian1 I% +e add D5F to all the data points& the median value +ill still $e the mid+a( point
in the ordered arra(. Its value +ill also increase $( D5F
Aode1 8ddin' D5F to all the data points chan'es the num$er that occurs most %requentl( $(
D5F
4irst 6uartile1 addin' D5F to all the data points does not chan'e the location o% the %irst
quartile in the ordered arra( o% num$ers& +hich is1 ).25*)n<1* +here n is the num$er o% data
points. 7hether the %irst quartile %alls on a speci%ic data point or $et+een t+o data points& the
resultin' value +ill have $een increased $( D5F.
,hird 6uartile1 addin' D5F to all the data points does not chan'e the location o% the third
quartile in the ordered arra( o% num$ers& +hich is1 ).!5*)n<1* +here n is the num$er o% data
points. 7hether the third quartile %alls on a speci%ic data point or $et+een t+o data points&
the resultin' value +ill have $een increased $( D5F.
"0
th
percentile1 addin' D5F to all the data points has the same e%%ect as in the calculation o%
the %irst or third quartile. ,he value +ill $e increased $( D5F
5an'e1 addin' D5F to the all the data points +ill have no e%%ect on the calculation o% the
ran'e. Since $oth the hi'hest value and the lo+est value have $een increase $( the same
num$er& the su$traction o% the lo+est value %rom the hi'hest value still (ields the same value
%or the ran'e.
Kariance1 addin' D5F to all the data points has no e%%ect on the calculation o% the variance.
Since each data point is increased $( D5F and the avera'e has also $een sho+n to increase $(
the same %actor& the di%%erences $et+een each individual ne+ data point and the ne+ avera'e
+ill not chan'e and +ill not $e a%%ected $( squarin' the di%%erence& summin' the squared
di%%erences and dividin' $( num$er o% data points.
1-15
Chapter 01 - Introduction and Descriptive Statistics
Standard Deviation1 since the variance is not a%%ected $( addin' D5F to each data point&
neither is the standard deviation.
S-e+ness1 Since each data point is increased $( D5F and the avera'e has also $een sho+n to
increase $( the same %actor& the di%%erences $et+een each individual ne+ data point and the
ne+ avera'e +ill not chan'e. ,here%ore& the numerator in the %ormula %or s-e+ness is not
a%%ected. Since the standard deviation is not a%%ected as +ell )the denominator*& there is no
chan'e in the value %or s-e+ness.
Jurtosis1 Since each data point is increased $( D5F and the avera'e has also $een sho+n to
increase $( the same %actor& the di%%erences $et+een each individual ne+ data point and the
ne+ avera'e +ill not chan'e. ,here%ore& the numerator in the %ormula %or -urtosis is not
a%%ected. Since the standard deviation is not a%%ected as +ell )the denominator*& there is no
chan'e in the value %or -urtosis.
Interquartile 5an'e1 'iven that $oth the %irst quartile and the third quartile increased $( the
same %actor& D5F& the di%%erence $et+een the t+o values remains the same.
c. Aultipl(in' each data point $( a %actor D3F results in the %ollo+in' chan'es. ,he mean&
median& mode& %irst quartile& third quartile and "0
th
percentile values +ill $e increased $( the
same %actor D3F. In addition& the standard deviation and the ran'e +ill also increase $( the
same %actor D3F. ,he variance +ill increase $( the %actor squared& and the s-e+ness and
-urtosis values +ill remain unchan'ed.
d. Aultipl(in' all data points $( a %actor D3F and addin' a value D5F to each data point has the
%ollo+in' results. ,he order o% operation is %irst to multipl( each data point and then add a
value to each data point. Cach data point is %irst multiplied $( the %actor D3F and then the
value D5F is added to each ne+l( multiplied data point. Aultipl(in' each data point $( the
%actor D3F (ields the results listed in c*. 8ddin' a value 5 to the ne+l( multiplied data points
(ields the results listed in a*.
1.2. >?sin' the template1 DEasic Statistics./lsF@
1-1
$easures o! Central tendenc"

$ean 41.01 $edian 23.8
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 1136.941
St* De+* 33.7185557
Chapter 01 - Introduction and Descriptive Statistics
1-3. ; 504."" ; #4.54!
$easures o! Central tendenc"

$ean 504.6875 $edian 501.5 $ode #N/A
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 8939.15234 Ran.e 346
St* De+* 94.5470906 I#R 149.5
1-4.
Step 11 Cnter the data %rom pro$lem 1-3 into cells O41O35 o% the template1 =isto'ram./ls %rom Chapter
1. ,he template +ill order the data automaticall(.
Step 21 7e need to select a startin' point %or the %irst class& an endin' point %or the last class& and
a class interval +idth. ,he startin' point o% the %irst class should $e a value less than the
smallest value in the data set. ,he smallest value in the data set is 344& so (ou +ould
+ant to set the %irst class to start +ith a value smaller than 344. 9etPs use 320. 7e also
selected !10 as the endin' value o% the last class& and selected 50 as the interval +idth.
,he data input column and the histo'ram output %rom the template are presented $elo+.
,he end-point %or each class is included in that classH i.e.& the %irst class o% data 'oes %rom
more than 320 up to and includin' 3!0& the second class starts +ith more than 3!0 up to
and includin' 420& etc.
1-1!
Chapter 01 - Introduction and Descriptive Statistics
1-5. 5an'e1 #0 G 344 ; 34
#0
th
percentile lies in position1 33)#0/100* ; 2#.! It is 32.!
4irst quartile lies in position1 33)25/100* ; ".25 It is 41#.25
Aedian lies in position1 33)50/100* ; 1.5 It is 501.5
,hird quartile lies in position1 33)!5/100* ; 24.!5 It is 5"5.!5
1-.
1-!. Stem Leaves
2 1 24
! 1 5!"#
)3* 2 023
2 55
4 3 24
2 3
2 4 01
1-".
1-1"
/
1
1
%
2
0
:
;
*
0
1
1
*
0
1
;
*
0
1
1
*
0
1
;
*
0
%
1
*
0
%
;
*
0
2
1
*
0
2
;
*
0
TV sets
!
r
e
3
Ogive: TV Sets
/
0
1/
10
1/
1/ 10 1/ 10 %/ %0 2/ 20
TV Sets
c
u
'

!
r
e
3
Eo/ and 7his-er Blot
42
3
30
24
1"
12
C
2
Chapter 01 - Introduction and Descriptive Statistics
,he data is s-e+ed to the ri'ht.
1-#. Stem Leaves
3 1 012
4 1 #
12 2 1122334
)#* 2 55!!""#
3 024
3 3 5!
1 4
1 4
1 5
1 5
1 2
,he data is s-e+ed to the ri'ht +ith one e/treme outlier )2* and three suspected outliers
)10&11&12*
1-!0.
1.!1. >?sin' the template1 DEasic Statistics./lsF@
$easures o! Central tendenc"

$ean 8.0666667 $edian 9 $ode 10
Eased on :ust these three measures& cheap +ine appears to +or- +ell in coo-in'
1-1#
Eo/ and 7his-er Blot
"0
0
40
20
0
C
1
Chapter 01 - Introduction and Descriptive Statistics
1-!2. >?sin' the template1 DEasic Statistics./lsF@
$easures o! Central tendenc"

$ean 20.3 $edian 20.2

$otorola?s Stock Prices
19.2
19.4
19.6
19.8
20
20.2
20.4
20.6
20.8
21
1 2 3 4 5 6 7 8 9 10 11 12
4
&ox Plot
Moto!ol,4" -to2
$!ie"
>o(er
-isker
>o(er
Hin.e $edian
U))er
Hin.e
U))er
-isker
19.8 20.075 20.2 20.525 20.8
1-20
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 0.10909091
St* De+* 0.33028913
Chapter 01 - Introduction and Descriptive Statistics
1-!3. Aean ; 33.2!1
sd ; 1.#45
var ; 2"!.15
69 ; 25.41
Aed ; 2.!1
6? ; 35
0utliers1 Aor'an Stanle( )#1.3M*
1-!4. Aean ; 3.1"
sd ; 1.34"
var ; 1."1!
69 ; 1.#!5
Aed ; 2.#5
6? ; 3.!5
0utliers1 ".!0
1-21
Eo/ and 7his-er Blot
100
"0
0
40
20
C
1
15 cases
Eo/ and 7his-er Blot
20 cases
#
!
5
3
1
C
1
Chapter 01 - Introduction and Descriptive Statistics
1-!5.
a. I65 ; 3.5
$. data is ri'ht-s-e+ed
c. #.5 is more li-el( to $e the mode& since the data is ri'ht-s-e+ed
d. 7ill not a%%ect the plot.
1.!. Ainita$ output1
7hile the avera'e o% the Chan'e in Brovisions is close to the 4.1 avera'e %or all $an-s& the
avera'e o% the Chan'e in Ead 9oans is considera$l( hi'her than the industr( avera'e o% 11.00.
,he $o/ plot %or chan'e in Ead 9oans does not sho+ an( outliers.
c
h
a
n
g
e

i
n

b
a
d

l
o
a
n
s
160
140
120
100
80
60
40
20
0
Boxplot of change in bad loans
1-22
Descri)ti+e Statistics@ can.e in 6ad loansA can.e in
Pro+isions
Variable Mean StDev Median
change bad loans 56.28 42.73 43.40
change Provisions 8.12 12.21 8.60
Chapter 01 - Introduction and Descriptive Statistics
,he $o/ plot %or chan'e in Brovisions does sho+ one possi$le outlier %or 7 =oldin' at 3!.31
c
h
a
n
g
e

i
n

P
r
o
v
i
s
i
o
n
s
40
30
20
10
0
-10
Boxplot of change in Provisions
1.!!. ,he Ainita$ output1
,he avera'e %or the $an- assets o% the 1# lendin' institutions is lar'er than the industr( avera'e o%
14#.30.
1-23
Descri)ti+e Statistics@ 6ank assets
Variable Mean StDev Median
ban assets 186.7 355.6 56.2
Chapter 01 - Introduction and Descriptive Statistics
,he $o/ plot o% $an- assets sho+ three possi$le outliers %or Ean- o% 8merica )145#*& 7achovia
)!0!.1*& and 7ells 4ar'o )4"1.#*
b
a
n
k

a
s
s
e
t
s
1600
1400
1200
1000
800
600
400
200
0
Boxplot of bank assets
1-!". >?sin' the template1 DEasic
Statistics./lsF@
$easures o! Central tendenc"

$ean 56.266667 $edian 57
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 164.780952 153.795556
St* De+* 12.8367033 12.4014336
,he mean and median %or the 15 selected countries are hi'her than the overall mean approval
ratin' o% 53M.
1.!". ,he chart indicates that there is a si'ni%icantl( lar'e di%%erence $et+een the annual sales per
square %oot %or 8pple Stores relative to the other %our companies listed.
>?sin' the template1 DEasic Statistics./lsF@
1-24
$easures o! Central tendenc"

$ean 1720.2 $edian 930
Chapter 01 - Introduction and Descriptive Statistics
1-"0. Aean ; ##.03#
sd ; .43
var ; .1#0!
Aedian ; ##.155
1-"1. Aean ; 1!.5"!
sd ; .4
var ; .21!2
$easures o! Central tendenc"

$ean 17.5875 $edian 17.5 $ode 18.3
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 0.21716667 0.20359375 Ran.e 1.4
St* De+* 0.46601144 0.45121364 I#R 0.75
1-"2. Aean ; 25#."2
sd ; 35!.24
)?sin' the template1 DEasic Statistics./lsF*
$easures o! Central tendenc"

$ean 259.82 $edian 9.5
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 127622.462
St* De+* 357.242861
1-25
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 1987680.96
St* De+* 1409.8514
Chapter 01 - Introduction and Descriptive Statistics
1-"3. Aean ; 3!.1!
sd ; 13.12"
Aedian ; 34
$easures o! Central tendenc"

$ean 37.166667 $edian 34
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 172.333333
St* De+* 13.1275791
1-"4. Stoc- Brices %or period1 8pril& 2001 throu'h Qune& 2001 >8ns+ers +ill var( due to dates
used.@
a*. Aean and Standard Deviation %or 7al-Aart
1-2
&asic Statistics !ro' Ra( Data -to2 $!ie"5 6,l-M,!t
$easures o! Central tendenc"

$ean 51.041478 $edian 51.1266 $ode 50.158
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 2.25711298 2.22128579 Ran.e 6.1911
St* De+* 1.50236912 1.49039786 I#R 1.9613
Hi.er $o'ents
I! te data is o! a
Sa')le Po)ulation
Ske(ness 0.07083784 0.06913994
5Relati+e7 Burtosis -0.711512 -0.7500338
Chapter 01 - Introduction and Descriptive Statistics
$*. Aean and Standard Deviation %or J-Aart
c*. Coe%%icient o% variation1
CK ; std. dev mean
4or 7al-Aart1 %or J-Aart1
considerin' the data as a population1
CK ; 1.49039786 / 51.041478 7 0.0292 &/ 7 0.9846645 / 10.450952 7 0.0942
on"i*e!in' t1e *,t, ," , ",3)le5
&/ 7 1.50236912 / 51.041478 7 0.02943 &/ 7 0.99257358 / 10.450952 7 0.09497
d*. ,here is a 'reater de'ree o% ris- in the stoc- prices %or J-Aart than %or 7al-Aart over
this three month period.
e8. 9o! +:IA
considerin' the data as a population1
CK ; 42!.#13!#1 / 10"1.11 ; 0.0400
on"i*e!in' t1e *,t, ," , ",3)le5
&/ 7 431.350905 / 10681.11 7 0.04038
7al-Aart stoc-s provided a less ris-( return %or this time period relative to DQI8 and J-
Aart.
%*. 100 Shares o% 7al-Aart stoc-s purchased 8pril 2& 20011
Brice ; R50.5!4 Cost ; R505.!4
Aean o% holdin' 100 shares1 R5104.15
1-2!
&asic Statistics !ro' Ra( Data -to2 $!ie"5 ;-M,!t
$easures o! Central tendenc"

$ean 10.450952 $edian 10.66 $ode 11.8
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 0.9852023 0.96956417 Ran.e 3.51
St* De+* 0.99257358 0.9846645 I#R 1.955
Hi.er $o'ents
I! te data is o! a
Sa')le Po)ulation
Ske(ness -0.4070262 -0.3972703
5Relati+e7 Burtosis -1.132009 -1.1378913
Chapter 01 - Introduction and Descriptive Statistics
Std dev o% holdin' 100 shares1 1.4#04 )rounded1 i% data considered a population*
1.5024 )rounded1 i% data considered a sample*
1-"5.
a*. %or a process mean ; 2004
K85B ; 8vera'e SSD2004 < o%%set
2
K85B ; 3.5 < o%%set
2
+here o%%set ; tar'et G process
$*. i% tar'et ; process& then o%%set ; 0
su$stitutin'1 K85B ; 3.5 < o%%set
2
; 3.5 < 0
2
; 3.5
1-".
a* S $*1 CBI and Tas prices %or period1 Qune #! throu'h Aa( 01. ).on-seasonall(
ad:usted series.*
CBI inde/ converted )$( 100* in order to compare $oth series on same chart. ,here is no
seasonal pattern present in the CBI inde/. Stead( trend present in CBIH considera$le
varia$ilit( in 'as prices. Tas prices increased considera$l( more than the overall CBI %or
the same time period.
1-2"
Chapter 01 - Introduction and Descriptive Statistics
1-"!.
a*. Bie Chart1 8IDS cases $( 8'e 'roups
A'e G!o() No. <
=n*e! 55 6812 0.90<
A'e" 5 to 125 1992 0.26<
A'e" 13 to 195 3865 0.51<
A'e" 20 to 245 26518 3.52<
A'e" 25 to 295 99587 13.21<
A'e" 30 to 345 168723 22.38<
A'e" 35 to 395 168778 22.39<
A'e" 40 to 445 124398 16.50<
A'e" 45 to 495 72128 9.57<
A'e" 50 to 545 38118 5.06<
A'e" 55 to 595 20971 2.78<
A'e" 60 to 645 11636 1.54<
A'e" 65 o! ol*e!5 10378 1.38<
A'e" 25 to 295 >13.21<8
A'e" 20 to 245 >3.52<8
A'e" 13 to 195 >0.51<8
A'e" 5 to 125 >0.26<8
=n*e! 55 >0.90<8
A'e" 45 to 495 >9.57<8
A'e" 50 to 545 >5.06<8
A'e" 55 to 595 >2.78<8
A'e" 60 to 645 >1.54<8
A'e" 65 o! ol*e!5 >1.38<8
A'e" 30 to 345 >22.38<8
A'e" 40 to 445 >16.50<8
A'e" 35 to 395 >22.39<8
AIDS cases 6" a.e
$*. Bie Chart1 8IDS cases $( 5ace
?,e No. <
61ite@ not Ai"),ni 324822 43.09<
Bl,2@ not Ai"),ni 282720 37.50<
Ai"),ni 137575 18.25<
A"i,n/$,i#i I"l,n*e! 5546 0.74<
A3e!i,n In*i,n/Al,"2, N,tiCe 2234 0.30<
?,e/et1niit0 (n2no.n 1010 0.13<
1-2#
Chapter 01 - Introduction and Descriptive Statistics
61ite@ not Ai"),ni >43.09<8
Ai"),ni >18.25<8
A"i,n/$,i#i I"l,n*e! >0.74<8
A3e!i,n In*i,n/Al,"2, N,tiCe >0.30<8
?,e/et1niit0 (n2no.n >0.13<8
Bl,2@ not Ai"),ni >37.50<8
AIDS cases 6" Race
1-"". )?sin' the template1 DEo/ Blot 2./lsF*
Co')arin. t(o data sets usin. &ox Plots -,l,!ie" 2004
>o(er
-isker
>o(er
Hin.e $edian
U))er
Hin.e
U))er
-isker
Cu6s 300000 650000 1550000 5750000 9500000
-ite Sox 301000 340000 775000 3875000 8000000
Cu6s
-ite Sox
0utliers1 Cu$s1 SosaPs salar( o% R1A
7hite So/1 0rdone2Ps salar( o% R14A
4urthermore& the median salar( o% the Cu$s is t+ice the median salar( o% the 7hite So/. ,here
are some pla(ers on $oth teams ma-in' the lea'ue minimum salar(.
Some+hat lo+er salar( ran'e %or the 7hite So/ relative to the Cu$s due to the %act that onl(
seven )!* pla(ers on the Cu$s +ere paid R500&000 or less +hile eleven )11* pla(ers earned less
than that amount on the 7hite So/.
1-30
Chapter 01 - Introduction and Descriptive Statistics
1-"# >?sin' the Easic Statistics./ls@
$easures o! Central tendenc"

$ean 5.1477778 $edian 5.35
$easures o! Dis)ersion
I! te data is o! a
Sa')le Po)ulation
Variance 0.36249444
St* De+* 0.60207512
Case 11 .8SD86 Kolatilit(
1* .8SD86 Com$ined Composite Inde/ %or 2000
>?sin' template1 ,ime Blot 2./ls@
NA-+AD &o3)o"ite In*ex
1///
0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
:,n 9eb M,! A)! M,0 :(n :(l A(' -e) Et NoC +e
:,n 3940.35
9eb 4696.69
M,! 4572.83
A)! 3860.66
M,0 3400.91
:(n 3966.11
:(l 3766.99
A(' 4206.35
-e) 3672.82
Et 3369.63
NoC 2597.93
+e 2470.52
1-31
Chapter 01 - Introduction and Descriptive Statistics

2* Compare 200 +ith 200!. >Blease note1 at the time o% printin'& data %or 200! +as availa$le onl(
throu'h close on 5U25/0!.@
Blots su''est there ma( $e more volatilit( in 200.
Standard deviation %or 200 ; 105.331!
Standard deviation %or 200! ; "2.300
1-32
Chapter 01 - Introduction and Descriptive Statistics
>?sin' template1 ,ime Blot 2./ls@
NA-+AD &o3)o"ite
In*ex
1//: 1//;
:,n 2305.82 2463.93
9eb 2281.39 2416.15
M,! 2339.79 2421.64
A)! 2322.57 2525.09
M,0 2178.88 2604.52
:(n 2172.09 2588.96
:(l 2091.47
A(' 2183.75
-e) 2258.43
Et 2366.71
NoC 2431.77
+e 2415.29



3* Comparison o% .8SD86 +ith SSB 500 Inde/ %or 200!
>?sin' template1 ,ime Blot 2./ls@
Co')arison usin. Ti'e Plot NA-+AD C" -&$ #o! 2007
S9P NASDA#
:,n 1438.24 2463.93
9eb 1406.82 2416.15
M,! 1420.86 2421.64
A)! 1482.37 2525.09
M,0 1530.62 2604.52
:(n 1502.56 2588.96
:(l
A('
-e)
Et
NoC
+e


,here +as more volatilit( in the .8SD86 Inde/ in 200! than in the SSB 500 Inde/ in 200!.
Standard deviation %or .8SD86 in 200! ; "2.300
Standard deviation %or SSB 500 in 200! ; 4#.1033
4* Comparison o% the .8SD86 +ith DQI8 %or 2000
1-33
Chapter 01 - Introduction and Descriptive Statistics
>?sin' template1 ,ime Blot 2./ls@
Co')arison usin. Ti'e Plot NA-+AD C" +:I #o! 2007
DCI NASDA#
:,n 12621.69 2463.93
9eb 12268.63 2416.15
M,! 12354.35 2421.64
A)! 13062.91 2525.09
M,0 13627.64 2604.52
:(n 13360.26 2588.96
:(l
A('
-e)
Et
NoC
+e


,here +as more volatilit( in the DQI Inde/ in 200! than in the .8SD86 Inde/.
Standard deviation %or .8SD86 in 200! ; "2.300
Standard deviation %or DQI8 in 200! ; 554.#4"
5*. 8ns+ers +ill var( 'iven date o% assi'nment.
1-34

Potrebbero piacerti anche