Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
USINGSTATISTICS
@,BLK Foods
In this chapteq you learn how to use hypothesistesting for comparing the difference between:
I The means of two independentpopulations
t The means of two related populations
I Two proportions
I The variancesoftwo independentpopulations
370 Tests
CHAPTERTENTwo-Sample
ZTESTFORTHE DIFFERENCE
BETWEEN
TWO MEANS
-_-( X r - X ) - ( p r - p z ) (10.1)
l a ?T -o 4
r/-
\nt n2
populations 3l I
10.l: Comparingthe Meansof Two Indepcndent
where
X, : mean of the sample taken from population I
:
F1 meon of population I
I
of : variance of population I
nr: size of the sample taken from population 1
X2: meanof the sample taken from population 2
HoiVt- F2orp,-Fz:0
POOLED-VARIANCE
t TESTFORTHEDIFFERENCE
BETWEEN
TWO MEANS
-trz)
., -_- ( X r - X ) - ( t r r (10.2)
L( t l)
. i r ; l - + n z )|
!'\nt
where
.s2- ( n r - \ S ? + @ z - D S j
(rr -l)+(n2-l)
*qp2 : pooled variance
10.1
FIGURE
Regionsof rejection
and nonrejectionfor
the oooled-variance
t test for the difference
betweenthe means
(two-tailtest)
TABLE 10.1
Display Location
ComparingBLKCola
WeeklySalesfrom Normal
Two DifferentDisplay
Locations(in Number 22 34 52 62 30 52 7l 76 54
of Cases) 40 64 84 56 59 83 66 90 77
10.2
test of
isfor the
betweenthe
at the 0.05 level
ficancewith
of freedom
-2.100e
t
r nofo fI
Region Regionof I Regionof
Reiection
:tion I Nonrejection lRejection
I
Critical Critical\r-f;,
Value Value
forthistestis-3.0446,
t statistla
lromFiguel0 3,thecomputa0 is0.0070.
andthep-value
10.3
:ft Excelttest
for the two
locations 5ttJ 72
3flr,6rt8 r5t3333
t0 t0
Verlancc 25frxt66
llypolhcdredtecn Dlfierenco 0
SecrionE10.3 to create
CT t8
t Stat 30{{6
Pft<'{ orrroll 0J035
Crldcal one{all 1.73t1
F{f<-{ tro{6ll 0"0d?0
?.rmt
- (-x t - x ) - ( t q - u z )
l ^ - , (r
tJ:l -+-l
l)
1 "\'t n z)
where
Therefore,
(s0.3-72.0\-0.0
=# = 4.0446
{50.801
zs+.ooso[!+ l)
( r 0 r 0)
E X A M P L E1 0 . 1 TESTING
FORTHEDIFFERENCE
IN THEMEANDELTVERY
TIMES
A local pizzarestaurantand a local branch ofa national chain are located acrossthe streetfi'
a college campus.The local pizza restaurantadvertisesthat it delivers to the dormitoriesfal
than the national chain. In order to determine whether this advertisementis valid, you andso
friends have decided to order 10 pizzas from the localpizza restaurantand l0 pizzasfromt
national chain, all at different times. The delivery times, in minutes (see the [[!!@l!fiI
are shown in Table 10.2.
At the 0.05 level of significance, is there evidencethat the mean delivery time for theloc
ptzza restaurantis less than the mean delivery time for the national pizza chain?
SOLUTION Becauseyou want to know whether the mean is lower for the local pizza resta
rant than for the national pizza chain, you have a one-tail test with the following null and alte
native hypotheses:
Ho:vt ) p, (The mean delivery time for the local pizza restaurantis equal to
or greater than the mean delivery time for the national pizza chain.)
Hi vr < F2 (The mean delivery time for the local pizza restaurantis lessthan
the mean delivery time for the national pizza chain.)
Figure 10.4 displays Microsoft Excel results of the pooled I test for these data.
FIGURE10.4
MicrosoftExcelresults
of the pooled t test
for the pizzadelivery Local Cilein
time data 4 :Mean 16.7 18.S
5 ,Variance 9.58228.2151
6 Obseryallons 10 10
7 ,PooledVarlance 8.8987
I HypotheslzedMean Olfierence 0
s jdf 18
See Section E10.3 to create 10 ,t Siat -1.6341
this. l1 ;Pff.-0 ono-tail 0.0598
1_2.t Crltlcalone-tall 1.7341
13 rP[f<-$ two-tall 0.1196
14 rt Crlticaltwo-tail ?.1009
10.I : Comparingthe Meansof Two IndependentPopulations 37 5
(10.2)on page371,
UsingEquation
(Xt-X)-(trr-pz)
Frr.
" l n r
rl
nz)
!
. from where
faster
some
m the (nr-l)Si+(rz-l)Sz2
e2 _
file), (rr-l)+(n2-l)
9 ( 9 . s 8 2+2 e) ( 8 . 2 l s l=)
8.8987
9 +9
Therefore.
( t 6 . 7- r 8 . 8 8 ) - 0 . 0 - 2 . 1 8
= -1.6341
sqszfl+I I ffi
(10 r0)
local
In testing for the difference between the means, you assumethat the populations are nor-
mally distribute4 with equal variances.For situations in which the two populations have equal
variances,the pooled-variance/ test is robust (or not sensitive) to moderate departuresfrom
the assumption of normality, provided that the sample sizes are large. ln such situations, you
can use the pooled-varianceI test without serious effects on its power. However, if you cannot
assumethat the data in each group are from normally distributed populations, you have two
choices.You can use a nonparametricprocedure,such as the Wilcoxon rank sum test (covered
in Section 12.5),that doesnot dependon the assumptionof normality for the two populations,
or you can use a normalizing transformation (seereference5) on each of the outcomesand then
use the pooled-varianceI test.
To check the assumptionof normality in each of the two groups, observethe box-and-
whisker plot of the salesfor the two display locationsin Figure 10.5.There appearsto be only
moderate departurefrom normality, so the assumptionof normality neededfor the / test is not
seriously violated.
37 6 CHAPTER Tests
TEN Two-sample
INTERVAL
CONFIDENCE OF THEDIFFERENCE
ESTIMATE
BETWEENTWO MEANS
1
where /,, +," -2 is the critical value of the I distribution with n, * nz - 2 degreesof freedom
for an area of alZ in the upper tail.
andr,, = 2.1009:
nr = 50.3,nr = 10,Xz = 72.n2 = 10.S; = 254.0056,
* Il
- 7 2 ) ! ( 2 . t 0 0 e ) 2s4.oos6(!
(50.3
(10 t0)
- 2t.7X (2.r00ex7.| 275)
- 2 1 . 7t 1 4 . 9 7
- 3 6 . 6 7S p r - S t 2{ - 6 . 7 3
10.1:Comparing
theMeansof TwoIndependent
Populations377
Therefore,you are 95% confident that the differencein meansalesbetweenthe normal shelf
locationandthe end-aislelocationis between-36.67 casesof cola and-6.73 casesof cola.In
otherwords,the end-aislelocation sells,on average,6.73 to 36.67casesmore than the normal
aisle location. From a hypothesistesting perspective,becausethe interval does not include
zero,you rejectthe null hypothesisof no differencebetweenthe meansof the rwo populations.
10.6 'e*
Excelresults Anmlng
Two-Scmplo Vcrhnccr
separate-variance
forthe display
data 350.6ilt0, tf/3EB
Oiccrntlom 10 10
llypottrdzed lcan Dlfirrencr 0
df 15
Stt 3-0tr6
P(f<-! onc{all OIIF9
E10.5to create Crltlcal onc-tcll r:i{rei
P(f<-Q mo{all o"mn,
Crhlcalrwo.rall 2.1$a
10.4 Assumethat you havea sampleof r, : 8, have led to a host of new products.A study by Dr.
with the samplemeanX1: 42 anda samplestan- Sternof the PhiladelphiaVeteransAdministration
dard deviationof ,S,: 4. and vou have an inde- comparedweight lossbetweenobesepatientsona
pendentsampleoflr: l5 from anotherpopulationwith a diet and obesepatientson a low-carb diet (Extracted
samplemeanof X 2 : 34 and the samplestandarddevia- R. Bazell. "Studv CastsDoubt on Advantasesof
tion S, : 5. Diet," msnbc.com,May 17,2004).Let p,
a. What is the value of the pooled-variance /-teststatistic meannumberof poundsobesepatientson a low-fat
for testingHo: 1tr: 1t"r? lose in six monthsand p, representthe mean
b. In finding the critical value ofthe test statistic/, how pounds obese patients on a low-carb diet lose i
manydegreesof freedomarethere? months.
c. Usingthe levelof significancecx,:0.01,whatis the crit- a. State the null and alternative hypothesesifyou
ical valuefor a one-tailtestof the hypothesisHo: trt,t trt, testwhetherthe meanweisht lossbetweenthetwo
againstthe alternativeHr: 1tr> 1tr? is equal.
d. What is your statisticaldecision? b. In the contextof this study,what is the meaning
TypeI error?
10.5 What assumptions aboutthe two populationsare
c. In the contextof this study,what is the meaning
in Problem10.4?
necessary
Type II error?
10.6 Referringto Problem10.4,constructa 95o/oconfi- d. Suppose thata sampleof 100obesepatientsona
dence interval estimate of the population difference diet losta meanof 7.6poundsin six months,witha
betweenF1andpr. dard deviationof 3.2 pounds,while a sampleof
obesepatientson a low-carbdiet lost a meanof
pounds in six months,with a standarddeviationof
Applying the Concepts
pounds.Assuming that the population variances
10.7 Theoperations managerat a light bulb fac- equalandusinga 0.05levelof significance, is there
tory wantsto determinewhetherthereis any dif- dence of a difference in the mean weight loss of
ferencein the mean life expectancyof bulbs patients between the low-fat and low-carb diets?
manufacturedon two differenttypesof machines.The pop-
10.10 Whendo childrenin the United Statesdevelop
ulationstandarddeviationof machineI is 110hoursandof
erencesfor brand-nameproducts?In a studyreportedin
machineII is 125hours.A randomsampleof 25 light bulbs
Journal of ConsumerPsychology(Extractedfrom G.
from machineI indicatesa samplemeanof 375 hours,and
AchenreinerandD. R. John,"The Meaningof Brand
a similarsampleof 25 from machineII indicatesa sample
to Children:A DevelopmentalInvestigation,"Journal
meanof 362 hours.
ConsumerPsychology,2003, 13(3), pp. 205-219),
a. Using the 0.05 level of significance,is there any evi-
ketersshowedchildren identicalpicturesof athletic
denceof a differencein the meanlife of bulbsproduced
One picture was labeled Nike, and one was labeled K-
by the two typesof machines?
The childrenwereaskedto evaluatethe shoesbasedon
b. Computethep-valuein (a) andinterpretits meaning.
appearance, qualiry price,prestige,favorableness,
and
10.8 The purchasingdirector for an industrial erencefor owning.A scorefrom 2 (highestproduct
parts factory is investigatingthe possibility of tion possible)to1(lowest productevaluationpossible)
purchasinga new type of milling machine.She recordedfor each child. The following table reports
determines that the new machinewill be boughtif thereis resultsof the study:
evidencethat the partsproducedhavea highermeanbreak-
ing strengththan thosefrom the old machine.The popula-
Age by Sample Sample SampleStandard
tion standarddeviationofthe breakingstrengthfor the old
Brand Size Mean Deviation
machineis l0 kilogramsandfor the newmachineis 9 kilo-
grams.A sampleof 100partstakenfrom the old machine Age8
indicatesa samplemeanof 65 kilograms,and a similar Nike 27 0.89 0.98
sampleof 100 from the new machineindicatesa sample K-Mart 22 0.86 r.07
meanof 72 kilograms. Age12
a. Usingthe0.01levelof significance,is thereevidencethat Nike 39 0.88 1.01
the purchasingdirectorshouldbuy the new machine? K-Mart 4l 0.09 1.08
b. Computethep-valuein (a) andinterpretits meaning. Age15
Nike 35 0.41 0.81
10.9 Millions of dollarsarespenteachyearon diet foods. -0.29
K-Mart 33 0.92
Trendssuchas the low-fat diet or the low-carbAtkins diet
'il
a. Conducta pooled-variance/ test for the difference Taper Locks Locking Pins
betweentwo meansfor eachof the three age groups.
x 1.262 0.561
Usea levelof significanceof 0.05.
,s 0.297 0.307
b. Whatassumptionsareneededto conductthe testsin (a)?
n20 20
c. Write a brief summaryof your findings.
Extracted
Source: fromJ.Duncan,"Ghosts WhoYa
in YourProcess?
10.11 According to a survey conducted in Goingto Call?" QualityProgress,
May2005,pp.52-57.
October2001,consumerswere trying to reduce
their credit card debt (Extractedfrom M. Price, a. Assumingthat the populationvariancesare equaland
"CreditDebtsGet Cut Down to Size,"Newsday,November the populationsare normally distributed,at the 0.05
25,2001,p. F3). Basedon a sampleof 1,000consumers levelofsignificance,is thereevidenceofa differencein
in October2001 and in October2000, the meancredit the meansbetweentaperlocks and locking pins?
carddebtwas $2,41I in October2001 as comparedto b. Repeat(a), assumingthat the population variancesare
not equal.
$2,814in October2000.Supposethat the standarddevia-
tionwas$847.43 in October2001and $976.93in October c. Comparethe resultsof (a) and(b).
2000. 10.14 A bankwith a branchlocatedin a commercialdis-
a. Assumingthat the populationvariancesfrom both years trict of a city has developedan improvedprocessfor serv-
areequal,is there evidencethat the mean credit card ing customersduring the noon-to-l p.m. lunch period.The
debtwaslower in October2001 than in October2000? waiting time (operationallydefined as the time elapsed
(Usethe cx: 0.05levelof significance.) from when the customerentersthe line until he or she
b. Determinethep-value in (a) and interpretits meaning. reachesthe teller window) of all customersduringthis hour
c. Assumingthat the populationvariancesfrom both years is recordedover a period of one week.A randomsample
are equal,constructand interpret a 95o/oconfidence of 15 customersis selected(and stored in the file
intervalestimateof the differencebetweenthe popula- El[trED, and the results(in minutes)are as follows:
tionmeansin October2001andOctober2000.
4.2r 5.55 3.02 5.13 4.77 2.34 3.54 3.20
10,12 The Computer Anxiety Rating Scale 4.s0 6.10 0.38 5.12 6.46 6.19 3.79
(CARS) measuresan individual'slevel of com-
puter anxiety,on a scalefrom 20 (no anxiety) to Supposethat anotherbranch,locatedin a residentialarea,
100(highest level of anxiety). Researchersat Miami is also concernedwith the noon-to-l p.m. lunchperiod.A
ty administeredCARS to l'12 businessstudents. randomsampleof l5 customersis selected(and storedin
of the objectives of the study was to determrne the file EE&[D, and the resultsare as follows:
thereis a differencein the level of computeranxr- 9.66 5.90 8.02 s.79 8.13 3.82 8.01 8.35
experiencedby female and male businessstudents.
10.49 6.68 5.64 4.08 6.r7 9.91 s.47
foundthe following:
a. Assumingthat the populationvariancesfrom both banks
Males Females are equal,is there evidenceof a differencein the mean
waitingtime betweenthe two branches? (Usea: 0.05.)
40.26 36.85
b. Determinethep-value in (a) and interpretits meaning.
13.35 9.42
c. What otherassumption is necessary in (a)?
100 72
d. Assuming that the population variancesfrom both
' ExtractedfromT Broome and D. Havelka, "Determinants
branchesare equal,constructand interpreta95o/oconfi-
ComputerAnxiety in BusinessStudents,"The Review of Business
denceinterval estimateof the differencebetweenthe
Spring2002,6(2),pp. 9-16
Systems,
populationmeansin the two branches.
At the 0.05level ofsignificance,is thereevidenceofa 10.15 RepeatProblem10.14(a), assumingthatthepopu-
differencein the meancomputeranxietyexperiencedby lation variances in the two branches are not equal.
femaleandmalebusinessstudents? Comparethe resultswith thoseof Probleml0.la (a).
Determinethep-value and interpretits meaning.
10.16 A problem with a telephoneline that preventsa
Whatassumptionsdo you have to make about the two
customerfrom receivingor making calls is disconcertingto
populations in orderto justify the useofthe r test?
both the customerand the telephonecompany.The datain
13 A companymakingplasticoptical componentswas the file EfiftllE represent samples of 20 problems
ing inconsistencies
in an optical measurementcalled reportedto two different offices of a telephonecompany
Twodifferenttypes of pins usedin the mold produced and the time to clear theseproblems(in minutes)from the
followingresults: customers'lines:
380 CHAPTERTEN
Two-SamoleTests
1O.2 COMPARING
THE MEANSOF TWO RELATED
POPULATIONS
The hypothesis-testing proceduresexaminedin Section10.1enableyou to make comparisons
and examinedifferencesin the meansof two independentpopulations.In this section,you will
learn about a procedurefor analyzingthe differencebetweenthe meansof two populations
when you collect sampledata from populationsthat are related-that is, when resultsof the
first populationarenol independentofthe resultsofthe secondpopulation.
There are two casesthat involve relateddata betweenpopulations.In the first case,you
take repeated measurementsfrom the sameset of items or individuals.In the secondcase,
items or individualsare matched accordingto somecharacteristic.In either case,the variable
of interestbecomesthe dffirence betweenthe valuesratherthan the valuesthemselves.
The first casefor analyzingrelatedsamplesinvolvestaking repeatedmeasurements on the
sameitemsor individuals.Underthe theorythat the sameitemsor individualswill behavealike
if treatedalike, the objectiveis to showthat any differencesbetweentwo measurements of the
sameitems or individuals are due to different treatmentconditions.For example,when per-
forming a taste-testingexperiment,you can use eachpersonin the sampleas his or her own
control so that you can haverepeatedmeasurements onthe sameindividual.
The secondapproachfor analyzingrelatedsamplesinvolvesmatchingitemsor individuals
accordingto somecharacteristicof interest.For example,in testmarketinga productundertwo
different advertisingcampaigns,a sampleof test marketscan be matchedon the basisof the
test marketpopulationsize and/ordemographicvariables.By controlling thesevariables,you
arebetterableto measurethe effectsof the two different advertisingcampaigns.
Regardlessof whetheryou havematchedsamplesor repeatedmeasurements, the objec-
tive is to study the differencebetweentwo measurements by reducingthe effect of the vari-
ability that is due to the items or individuals themselves.Table 10.3 showsthe differencesin
the individualvaluesfor two relatedpopulations.To readthis table,let X1, Xt2, . . . , Xrrrep-
resentthe n valuesfrom a sample.And let X2t, X22,. . . , X2, representeither the correspond-
ing n matchedvalues from a secondsampleor the correspondingn repeatedmeasurements
from the initial sample.Then,Dp D2, . . . , Dnwill representthe correspondingset of n differ-
encescoressuchthat
D t : X t t - X 2 t , D 2 : X 1 2 - X 2 2 , .. . , a n dD r : X r n - X r n
10.3 Group
ininqthe Value Difference
BetweenTwo
Populations I xtt Xzt Dr: Xtt - Xrt
2 xtz xzz Dz: xr..z-Xn
samplesize is large,
Centnl LimitTheorem To test for the meandifferencebetweentwo relatedpopulations,you treat the difference
page268)ensures
you scores,eachD,, as valuesfrom a singlesample.If you know the populationstandarddeviation
tt the sam pling di stributi o n of the differencescores,you usetheZtest definedin Equation(10.q.2This Ztest for themean
follows a normal differenceusing samplesfrom two relatedpopulationsis equivalentto the one-sampleZtest for
the meanof the differencescores[seeEquation(9.1)on page334].
382 CHAPTERTEN Two-Samole
Tesrs
Z TESTFORTHEMEAN DIFFERENCE
D-po
Z* (10.4)
-r
6D
,\ln
where
n
S n-I.
.L/
D:i=r
n
Fp: hypothesized
meandifference
or: populationstandarddeviationofthe differencescores
r : samplesize
The teststatisticZ followsa standardized
normaldistribution.
Paired t Test
In mostcases,the populationstandarddeviationis unknown.The only informationyou
havearethe samplemeanandthe samplestandarddeviation.
lf you assumethat the difference scores are randomly and independentlyselectedfiom
population that is normally distributed you can use the paired I test for the mean differenr
in relatedpopulationsto determinewhetherthereis a significantpopulationmeandiffe
Like the one-sampleI test developedin Section 9.4 [see Equation (9.2) on page 347], thet
statistic developedhere follows the r distribution, with n - I degreesof freedom.Although
mustassume thatthepopulationis normallydistribute{as longasthe samplesizeis not
smallandthe populationis not highly skewedyou canusethe pairedr test.
To test the null hypothesisthat there is no differencein the meansof two related
Hr: P'o+0
PAIRED
t TESTFORTHEMEAN DIFFERENCE
D -po
sp
(r0.s)
'""r1;
where
\ - ! -
/-/"i
D: ;-l
,
n
sa.^ :.,
).\u; -Df
so:
,-l
The teststatisticl followsa / distributionwith n - l desreesof freedom.
10.2:Comparing
theMeansof TwoRelatedPopulations 383
For a two-tail test with a given level of significance, o, you reject the null hypothesisif the
computed I test statisticis greater than the upper-tail critical value tr_, from the I distribution or
if the computed test statistic is less than the lower-tail critical value -t, , from the / distribu-
t i o n .T h e d e c i s i o nr u l e i s
R e j e c tH o i f t > t n _ l
or if / < -/, ,:
otherwise, do not reject 11n.
The following example illustrates the use of the / test for the mean difference. The
Automobile Assocation of America (AAA) conducted a mileage test to compare the gasoline
mileage from real-life driving done by AAA members and results of city-highway driving done
according to current (as of 2005) government standards(extracted from J. Healey, "Fuel
Economy Calculationsto Be Altered," USA Today,January 11,2006, p. 1B).
What is the best way to design an experiment to compare the gasoline mileage from real-
life driving done by AAA members and results of city-highway driving done according to cur-
rent (as of 2005) government standards?One approachis to take two independentsamplesand
then use the hypothesistestsdiscussedin Section I 0. | . In this approach,you would use one set
of automobiles to test the real-life driving done by AAA members.Then you would use a sec-
llv
ond set of different automobiles to test the results of city-highway driving done according to
current (as of2005) government standards.
na
However,becausethe first set of automobilesto test the real-life driving done by AAA
lce
membersmay get lower or higher gasoline mileage than the secondset of automobiles,this
ce.
is not a good approach.A better approach is to use a repeated-measurements experiment.
est
In this experiment, you use one set of automobiles. For each automobile, you conduct a
/ou
test of real-life driving done by an AAA member and a test of city-highway driving
3ry
done according to current (as of 2005) government standards.Measuring the two gaso-
l i n e m i l e a g e s f o r t h e s a m e a u t o m o b i l e s s e r v e st o r e d u c e t h e v a r i a b i l i t y i n t h e g a s o l i n e
NS:
mileagescomparedwith what would occur if you used two independentsetsof automobiles.
This approach focuses on the differences between the real-life driving done by an AAA
member and the city highway driving done according to current (as of 2005) government
standards.
Table10.4displaysresults(storedin the file EEE@EEffr) from a sampleof
n : 9 automobilesfrom such an exoeriment.
You want to determine whether there is any difference in the mean gasoline mileage
between the real-life driving done by an AAA member and the city-highway driving done
according to current (as of2005) government standards.In other words, is there evidencethat
384 Two-SampleTests
CHAPTERTEN
\sr{\ren--\t\\srursts\st\S$lt\\),$-t.rss$s,rsssN\\RsxsN\Rbls
Yo
Ll '
ar l
D=i=t _-Lt't=_2.3444
n9
and
\rn, - D)'
SD= = 2.893575
,-1
FIGURE10.7
Two-tailpairedttest
at the 0.05level
with
of significance
B degreesof freedom
Q / +2.306f f8
Regionof I Regionof I Regionof
Rejection I Nonrejection lRejection
I
10.2:Comparing
theMeans
ofTwoRelated
Populations385
You can computethis test statisticalong with the p-value by using Microsoft Excel (see
Figure10.8).Because thep-value:0.0412 < o:0.05, you rejectHo.The p-valueindicatesthat
if the two typesof driving havethe samemeangasolinemileage,the probabilitythat onetype of
driving would havea meanthat was 2.3444miles per gallon lessthan the othertype is 0.0412.
Becausethis probabilityis lessthancr:0.05, you concludethatthe alternativehypothesisis true.
10.8
A MicrosoftExcel
ultsof paired t test
thecarmileagedata
I
Condadon 0T613.
ilocn Dlferencr 0
e:
Jrrot.
onc{all 0@-
Crldcal onc.tcll 1.8S||Il
two{rll 0.0t1?1
Crldcal two{dl
B MicrosoftExcel Box-and-Whisker
Plot for casolaneMileageDifierences
iskerplot
thecarmileagedata
the ten times pizzas were ordered"you have one measurementfrom the local pizza
and one from the national chain. At the 0.05 level of significance, is the mean delivery time
the local pizza restaurantless than the mean delivery time for the national pizza chain?
SOLUTION Use the paired r test to analyzethe data in Table 10.5(seethe file [@@@.
Figure 10.9 illustrates Microsoft Excel paired I test results for the pizza delivery data.
FIGURE10.9
MicrosoftExceloaired
t test resultsfor the
pizzadeliverydata Local Cltp'in
4 Mean 16.7 18.S
t-l -5 Variance 9.58?2 8.2151
6 Observallors 10 10
l-ll 7 rPeartonCorrelatlon 0.714'l
l-l I Hypotheslzed MeanDlfference 0
SeeSectionE10.6to create gidf I
this. 10 t Stat 3.0448
1.'ltPff."q one-tail 0.0070
12.t Crltlcalone-tall 1.8331
13 Pfr.=q rro-rall 0.0139
14 rt Crlrlcalrwo-rail 2.2622
Ho: Vn > 0 (Mean delivery time for the local pizzarestaurant is greater than or equal tothe
mean delivery time for the national pizza chain.)
Hi Fp < 0 (Mean delivery time for the local pizzarestaurantis less than the mean delivery time
for the national pizza chain.)
Choosing the level of significance o(: 0.05 and assumingthat the differencesare normally dis-
tribute4 you use the paired / test [Equation(10.5) on page 382]. For a sampleof r : 10 deliv-
ery times, there are n - | -- 9 degreesof freedom. Using Table E.3, the decision rule is
R e j e c tH o i f t < t n : - 1 . 8 3 3 1 ;
otherwise, do not reject 11n.
tr,
- -7rR
D_,=, = -"" =_2.13
n10
10.2: Cornparingthe Meansof Two Relatecl
Populations 387
s1.'=
(10.5)
FromEquation onpage382,
D -Vn -) tR-0
s, 2.2641
t; tTt
Becauset - -3.0448 is less than - I .833I, you reject the null hypothesisHn (the 7r-valueis
0 . 0 0 7 0 < 0 . 0 5 ) .T h e r e i s e v i d e n c et h a t t h e m e a n d e l i v e r y t i m e i s l o w e r f o r t h e l o c a l p i z z a
restaurantthan for the national pizza chain.
This conclusionis differentfrom the one you reachedwhen you usedthe pooled-variance r
test for these data. By pairing the delivery tirnes, you are able to focus on the differences
betweenthe two pizza delivery servicesand not the variability createdby ordering pizzas at dif-
ferenttimes of day.The pairedI test is a more powerful statisticalprocedurethat is betterable
to detect the difference between the two pizza delivery services.
CONFIDENCE
INTERVAL
ESTIMATE
FORTHEMEAN DIFFERENCE
(10.6)
s^
*U
D - tr-t
P=u,< D + t . - , G
"!n
Dlflslrlnce
sJtrfi1
rslle
'5, for Xcatr
-t0,
Dovladon 115,?32
Varlancr 1T2:t0,,g 7t5(B
1.1038 E0"gt2537.16f7
Skawrer 0.1103, Olrorallom It 35
Rango fl-l Pearon Conelatlon o.r3{2
ttnlmrm 34i Hypoitrcdrrd liin Dlfrerencr . . . 0
{3xlmum u,: df
sr||
i3{
-2599n
Sum .:t&f, k-{ oncsft o"arr
Counf 3il Cdtlcal one{all t5s
llsq{0 n' obios
aa: ,rffi.
390 CHAPTERTEN
Two-SampleTests
10.30 The datain the file@EErepresent the com- a. At the 0.01 level of significance, is there evidence
pressivestrength,in thousandsof poundsper squareinch the mean strength is lower at two days than at seven
(psi),of 40 samplesof concretetakentwo and sevendays b. What assumptionis necessary to performthistest?
afterpouring. c. Findthep-valuein (a) andinterpretits meaning.
Source: Extracted from O. Caruillo-Gamboa and R. F Gunst,
"Measurement-Error- ModeI Col linearities," Technometrics,-34,
1992,pp. 454-464.
ZTESTFORTHE DIFFERENCE
BETWEEN
TWO PROPORTIONS
(10.7)
- ( r- + - 1 1 )
P)l
\rt n z)
with
Xr+X1 Y. x)
l)r=-
P2=-
\*n2 n,
' ' l n2
wnere
p1 : proportionof successes
in sampleI
X, : numberof successes
in sample1
n,: samplesizeof sample1
fil : proportionof successes
in population1
Under the null hypothesis,you assumethat the two population proportions are equal
(n, : n).Because the pooled estimatefor the populationproportion is basedon the null
hypothesis,you combine,or pool, the two sampleproportionsto computean overall esti-
mate of the commonpopulationproportion.This estimateis equal to the numberof suc-
cessesin the two samplescombined (Xt + X) divided by the total samplesize from the two
samplegroups(nr+ n2).
As shownin the following table,you can usethis Ztestfor the differencebetweenpopula-
tion proportionsto determinewhetherthereis a differencein the proportionof successes
in the
two groups(two-tail test) or whetherone group has a higher proportion of successes
than the
other group (one-tailtest):
where
fil : proportionofsuccesses
in populationI
7[2: proportionofsuccesses
in population2
Ho:nr: n,
thatthetwopopulation
againstthealternative proportions
arenotthesame:
Hl T\+T.2
usethe test statisticZ, givenby Equation(10.7). For a given level of significanceo, rejectthe
null hypothesisif the computedZ test statisticis greaterthan the upper-tailcritical value from
the standardizednormal distribution or if the computedtest statisticis lessthan the lower-tail
critical value from the standardizednormal distribution.
To illustratethe useof the Z test for the equality of two proportions,supposethat you are
the managerof T.C. ResortProperties,a collectionof five upscaleresorthotelslocatedon two
resort islands.On one of the islands,T.C. ResortPropertieshas two hotels,the Beachcomber
andthe Windsurfer.In tabulatingthe responses to the singlequestion,'Are you likely to choose
this hotel again?"163of 227 guestsat the Beachcomberrespondedyes,and 154of 262 guests
at the Windsurferrespondedyes.At the 0.05 level of significance,is thereevidenceof a signif-
icant differencein guestsatisfaction(as measuredby the likelihood to return to the hotel)
betweenthe two hotels?
The null and alternativehypothesesare
H o : n , : r 2 o r n .- n r : 0
Hr: n, + Tc2or fil - n2+ 0
RejectHoif Z<-1.96
or if Z> +1.96;
otherwise,do not reject.Flo.
392 Tests
CHAPTERTENTwo-SamPle
FI
10.10
FIGURE M
Reqions of reiection fo
a n i n o n r e j e c i i o nw h e n di
testing a hyPothesisfor tV
the difference between h
two proportions at the P
0.05 level of significance
Regionof
Rejection
Regionof
Rejection
l
S
tt
Critical Regionof Criticat
Value Nonrejection Value
UsingEquation(10.7)on Pagel q o
-.( r . ; )
p(r - u'li
where
x1 163 -0.7181
Pt rnL . = I z - 1 5 4 = 0 . 5 8 7 8
ny 227 fi2 262
and
so that
- (0)
( 0 . 7 1 8-1 0.s878)
0.6483(l _o64sr(*.#)
0 . 13 0 3
l(urrsxo.oo8r)
0 . 13 0 3
- o ' 1 3 0 3= + 3 . 0 0 8 8
0.0432
z: +3'0088> +1'96'
Using the 0.05 level of significance,rejectthe null hypothesisbecause
Thel-value is 0.0026(cai-culated from iable E.2 or from the MicrosoftExcelresultsof Figure
that aZtest statisticis
10.11).This meansttratlf the null hypothesisis true, the probability
-3.0088 is 0.0013,an4 similarly' the probabilitythat a Z teststatisticis greaterthan
lessthan
test,thep-value is 0'0013 + 0'0013:0'0026'
+3.0088is 0.0013.Thus,for this two-tail
: There is evidence to concludethat
Because0.0026< cr 0.05,you rejectthe null hypothesis.
guest satisfaction; a greaterprop0r'
the two hotelsare significantlydifferentwith respectto
tion of guestsarewilling to returnto the Beachcomber thanto the Windsurfer'
10.3: ComparingTwo PopulationProportions 393
FIGURE 10.11
1
MicrosoftExcelresults t
fortheZ test for the ?
4
differencebetween b
Using the 0.05 level of significance,for the one-tailtest in the uppertail, the criticalvalueis
+1.645.Thedecisionrule is
R e j e cH
t o i fZ > + 1 . 6 4 5 ;
otherwise,do not rejectF1o.
UsingEquation(10.7)on page390,
Z
96.
ure
:is
1an
where
26.
hat
ror- - 2ol = x2 - 1 7 8=
Pr = X t 0.609 rz 0.539
n1 330 n2 330
394 CHAPTERTEN Two-Samole
Tests
and
Xr+X, 2 0 1+ 1 7 8 379
=.........'.................'-=_-n<1L"'
n=
nl+n2 330+ 330 660
(0.60e-0.s3e)-(0)
Z_
\ \"0 330/
0.07
^l1o.z++s11o.00606
)
0.07
ffi
0.07 tr:t
= = +l'818
or*
mt
asl
- nr) p2(l - p2)
p,1(1
(n-pz)-Z T--1!i1 -ii2
I
mt
n1 n2 L.
n1 n2
X,
Dr= 1= _1 6 3= 0 . 7 1 8 1 x,
Dt=i=__-_
154
=0.5878
n1 zzl n2 262
10.3: ComparingTwoPopulationProportions 395
UsingEquation(10.8),
( 0 . 7 1 8- 10 . 5 8 7 8t )( 1 . 9 6 ) " / L
\l 227 262
0.1303 r (1.96x0.0426)
0.1303
r 0.0835
0.0468( (nr - TE)< 0.2138
rt : variance of sample 2
For a givenlevelofsignificance,
o, to testthenull hypothesis
ofequalityofvariances:
nn:ol : ol
againstthe alternativehypothesisthat the two population variancesare not equal:
)
t
H,'
" l'
ol +o)
you reject the null hypothesisif the computed tr test statistic is greater than the upper-tail
critical value, Fy, from the Fdistribution with n, - I degreesof freedom in the numerator
and n, - I degreesof freedom in the denominator,or if the computed Ftest statisticis less
than the lower-tail critical value, F., from the F distribution with r, - I and n, - 1 degreesof
freedom in the numerator and denominator, respectively.Thus, the decision rule is
R e j e c tH o l f F > F u
orifF<tr,:
n
otherwise.do not relecrH,,.
S
-t
FIGURE10.12
l
R e g i o n so f r e j e c t i o n
a n d n o n r e j e c t i o nf o r I
the two-tail F test
I.a-
o i FL rLJ 1 r
1
R e g i o no f R e g i o no f R e g i o no f
Rejection Nonrejection Rejection
To illustrate how to use the F test to determine whether the two variancesare equal,return
to the Using Statisticsscenarioconcerningthe salesof BLK cola in two differentaisleloca-
tions. To determinewhether to use the pooled-varianceI test or the separate-variance I testin
Section 10.I , you can test the equalityof the two populationvariances.The null and alternative
hypothesesare
)
I t
l).
ol o)
)
tII ol 6;
Becausethis is a two-tail test,the rejectionregion is split into the lower and uppertailsof thef
distribution.Using the level of significancecr : 0.05, each rejectionregion contains0.025of
the distribution.
Becausethere are samplesof l0 stores for each of the two display locations,thereare
l0 - I :9 degreesof freedom for group I and also for group 2. FL,, the upper-tailcritical
value of the Fdistribution, is found directly frorr-rTable E.5, a portion of which is presentedin
Table 10.6.Becausethere are 9 degreesoffreedom in the numeratorand 9 degreesoffreedon
in the denominator,you find the upper-tailcritical value,.Fr, by looking in the column labeled
9 and the row labeled9. Thus, the upper-tailcritical value of this Fdistribution is 4.03.
T A B L E1 0 . 6 Denominator Numeratord/,
F i n d i n g F , , ,t h e U p p e r - df1
f -
t a i lL n t | c a tv a t u eo T F
with9and9Degrees I 641.80 799.s0 864.20 948.20 956.70 30
o f F r e e d o mf o r U p p e r - 2 38.51 39.00 39.17 39.36 39.37 39
Tail Area of 0.025 3 t7.44 16.04 1s.44 14.62 14.54 47
VALUESFROMTHE F DISTRIBUTION
CRITICAL
FINDINGLOWER-TAIL
.t
I
f r =
LE'
- (10.10)
tu*
-
wheref'* is from an F distributionwith n, ! degreesof freedomin the numeratorand
-
n, I degreesof freedomin the denominator'
In the cola salesexample,the degreesof freedomare9 and9 for both the numeratorsam-
just takethe
ple anddenominatorru-pi., sothereis no "switching"of degreesof freedom;you
reciprocal.Therefore,to computethe lower-tail0.025criticalvalue,you needto find theupper-
free-
lrN tail b.ozs criticalvalueof F with 9 degreesof freedomin the numeratorand9 degreesof
shown in Table 10.6on page 398, this
iL- dom in the denominatorand take its reciprocal.As
;in upper-tailvalueis 4.03.UsingEquation(10.10),
ive
_l - 1
f r = - = 0.248
fU* 4.03
10.13
ionsof rejectionand
reiectionfor two-tail
forequalityof
variances at the
levelof significance
9 and9 degrees
Regionol
Rejection
colasalesdata(seeTablel0.l on page372),
UsingEquation(10.9)on page397 and,the
the F teststatisticis
F={
S;
= 3 5 0 ' 6 7 7 8= 2.228e
G;d
FTGURE 10.14
MicrosoftExcelFtest
results
for the BLKcola
salesdata
4t0.6778 1t.333
10 10
99
2.2,,'.9
o||G-tsll 0.1211
Crltlcal ono{.ll 3.1789
See Section E10.8 to create
this.
In testinsfor a differencebetweentwo variances
usinstheF testdescribedin this
you assumethat eachof the two populationsis normally distributed.The Ftest is very
to thenormalityassumption. lf box-and-whisker plotsor normalprobabilityplotssuggest
a mild departurefrom normalityfor eitherof thetwo populations,
you shouldnot usethe,F
If this happens,a nonparametric approachis moreappropriate (seereferencesI and2).
In testing for the equality of variances.as part of assessingthe validity of the
variancer test procedure,the F test is a two-tail test. However,when you are interestedin
variabilityitself,the F testis oftena one-tailtest.Thus.in testingtheequalityof two vari
you canuseeithera two-tailor one-tailtest,dependingon whetheryou aretestingwhether
two populationvariancesaredffirent or whetheronevarianceis greaterthan theothervari
Figure10.15illustratesthethreepossiblesituations.
,A
t \
/\
/\
oFr F
PanelA PanelB PanelC
Two-tailtest One-tailtest One-tail
test
* Regionof Rejection
Ho:cl, = 6f Ho:ol>ol uo: ol < o22
Regionof Nonrejection
H r :o l * o l H.,;ol < of H.,:ol> oj
FIGURE10.15 Determining
the rejectionregionfor testingthe equalityof two populationvariances
ol o7
6? otr
10.4:F Testfor theDifferenceBetweenTwoVariances 401
D - "- 1- -
s?
t
S;
You use Table E.5 to find the upper and lower critical valuesof the F distribution.With
nt- l:7 degrees of freedomin thenumerator,nr-l:9 degrees of freedomin thedenom-
inator, and o, : 0.05 split equally into the lower- and upper-tailrejectionregionsof 0.025
each,the uppercriticalvalue,Fu, is 4.20 (seeTable10.7).
F,
7e
To find the lower critical va|ue,Fp with 7 degreesof freedomin the numeratorand 9
degreesof freedomin the denominator,you takethe reciprocalof Fr* with degreesof freedom
switchedto 9 in the numeratorandT in the denominator. Thus,from Equation(10.10)on page
399 andTable10.7,
| - I =0.207
F,=
" Fu* 4.82
The decisionrule is
-si
S;
=16'q-211
24.0
the 0.05level of significance,is there evidenceof a Central Office ll Time to Clear Problems(Minutes)
ce in the variability of the computeranxiety 7. 5 5 3 . 7 5 0 . 1 0 l . l 0 0 . 6 0 0 . 5 2 3 . 3 0 2 . t 0 0 . 5 8 4 . 0 2
3.75 0.65 t.92 0.60 1.53 4.23 0.08 1.48 1.65 0.72
assumptiondo you need to make about the two
a. Is there evidenceof a differencein the variability
ionsin orderto justiff the useof the f'test?
of the waiting times betweenthe two offices? (Use
Basedon (a) and (b), which I test defined in Section
cr: 0.05.)
I shouldyou useto testwhetherthereis a significant
b. Determinethep-value in (a) and interpretits meaning.
ifferencein mean computeranxiety for female and
c. What assumptionis necessary in (a)?Is the assumption
students?
valid for thesedata?
A bankwith a branchlocatedin a commercialdis- d. Based on the results of (a), which / test defined in
of a city hasdevelopedan improvedprocessfor serv- Section10.1 shouldyou use to comparethe meansof
customers duringthe noon-to-1p.m. lunchperiod.The the two offices?
ing time (defined as the time elapsedfrom when the
10.53 The director of training for a companythat manu-
entersthe line until he or shereachesthe teller
factureselectronicequipmentis interestedin determining
) all customersduring this hour is recordedover
of
whether different training methodshave an effect on the
of oneweek.A randomsampleof 15 customersis
productivity of assembly-lineemployees.She randomly
(andstoredin the file !@), andthe results(in
assigns42 recentlyhired employeesto two groups of 21.
) areasfollows:
The first group receivesa computer-assisted,individual-
basedtraining program, and the other receivesa team-
4.21 5.55 3.02 5.13 4.77 2.34 3.s4 3.20
basedtraining program.Upon completionof the training,
4.50 6.10 0.38 5.12 6.46 6.t9 3.79 the employeesare evaluatedon the time (in seconds)it
takesto assemblea part. The resultsare in the data file
that anotherbranch,locatedin a residentialarea,
concerned with the noon-to-l p.m. lunchperiod.A
@.
a. Usinga 0.05levelof significance,is thereevidenceof a
sampleof l5 customersis selected(and storedin differencebetweenthe variancesin assemblytimes (in
file @@), and the results (in minutes) are as seconds)of employeestrainedin a computer-assisted,
individual-basedprogram and those trained in a team-
basedprogram?
9.66 5.90 8.02 5.79 8.73 3.82 8.01 8.35 b. On the basisof the resultsin (a), which r test defined in
10,496.68 5.64 4.08 6.17 9.91 5.47 Section10.1 shouldyou use to comparethe meansof
the two groups?Discuss.
Isthereevidenceof a differencein the variability of the
waitingtime betweenthe two branches? (Usea:0.05.) 10.54 Is therea differencein the variationof the yield of
Determine thep-value in (a) and interpretits meaning. different types of investmentbetweenbanks?The follow-
Whatassumptionis necessaryin (a)? Is the assumption ing data, from the fil" El@![f[, representthe nation-
mlid for thesedata? wide highest yields for money market accountsand one-
Basedon the resultsof (a), is it appropriateto use the yearCDs as ofJanuary24,2006:
pooled-variance I test to comparethe meansof the two
branches? Money MarketAccounts I One-YearCD
10.52 A problem with a telephoneline that 4.ss 4.s0 4.40 4.38 4.38 | 4.94 4.90 4.85 4.85 4.85
IJJ prevents a customerfrom receiving or making Source: Extractedfrom Bankrate.com,January 24, 2006.
is disconcertingto both the customerand the tele-
company.The following data (stored in the file At the 0.05levelof significance,is thereevidenceof a dif-
representsamplesof 20 problemsreportedto two ferencein the varianceof the yield betweenmoneymarket
t offices of a telephonecompanyand the time to accountsand one-yearCDs?Assumethat the population
theseproblems(in minutes)from the customers'lines: yields arenormally distributed.
TA
Su
In this chapter,you were introducedto a variety of two- select thc test that is most appropriate for a givensetof tn \
sar.nple tests.For situationsin which the samplesare inde- conditions and to critically investigatethe validity of the
pendent,you learnedstatisticaltest proceduresfor analyz- a s s u m p t i o n su n d e r l y i n g e a c h o f t h e h y p o t h e s i s - t e s t i n g
i n g p o s s i b l ed i f f e r e n c e sb e t w e e nm e a n s ,v a r i a n c e s .a n d procedures.
proportions.ln addition, you learneda test proceclurethat The roadmap in Figurc 10.l6 illustrates the stens
is frequentlyused when analyzingdifferencesbetweenthe neededin determiningwhich two-sar.nple test of hypothesis
meansof two relatedsamples.Rcmernberthat you needto to use:Thc fbllowing arethe questionsyou needto consider,
Two-Sample
Tests
IZ'
Categorical Type
Numerical
of
Data
CentraI
Tendency Variability
Focus
No FTest
o?=otrz Yes
r l =ol
roa
Paired
t Test
Separate-Variance Pooled-Variance
t Test f Test
l. What typc of datado you lrave'/If you are clealingwith 4 . If your focus is central tendency,determine whether
categorical variables, use the Z rest for the differencc you can assumethat the variancesof the two groups
betweentwo proportions.(This test assumesindeoen_ a r e e q u a l . ( T h i s a s s u r n p t i o nc a n b e t e s t e du s i n gt h e
d e n ts a m p l e s . . l F test.)
2. lf you have a numerical variable, deten.ninewhether If yoLrcan assumethat the two groups have equal vari-
you have independentsar.nples or related samples.lf a n c e s ,u s e t h e p o o l e d - v a r i a n c eI t e s t . I f y o u c a n n o t
you haverelatedsamples,use the pairedI test. assumethat the two groups have equal variances,use
3. Ifyou haveindependentsamples,is your focus on vari- the separate-variance / test.
ability or central tendency'?If the focus is variabilitv.
u s et h e , t r t e s t . Table 10.8providesa list of topics coveredin this chapter.
Kcy Equations 405
I
cf
TABLE 10.8
Summary of Topics
i nC h a p t e 1r 0
Type of Analysis Numerical
Typesof Data
Categorical
Comparingtwo Z and t tests for the difference Z test for the difference
:le
populations in the means of two betweentwo proportions
1g
independentpopulations ( S e c t i o n1 0 . 3 )
( S e c t i o n1 0 .I )
ps
;is
P a i r e dI t e s t( S e c t i o n1 0 . 2 )
ZTestfor the Difference Between Two Means ConfidenceInterval Estimatefor the Mean Difference
(Xt-X)-(pr -trz)
Z- (r0.1) (10.6)
f , )
loi o;
rl r
! /7t n2
Pooled-Variance
I Test for the Difference
D-t,,-r#=u,<D+,,-,h
BetweenTwo Means
- X t ) - ( $ r- p z ) Z Test for the Difference BetweenTwo Proportions
(r0.2)
, _(pt- pz\- (nr-/t.)
( r 0.7)
S(X, - Xrl+t,,,.,,._,
FEB a(h-pz)+Z
pt(l - pr)
t1l
p2(l- p2l
t7-t
t(
f(
ti
F distribution 397 pooled-variance
/ test 371 Z test for differencebetween I\
.Fteststatisticfor testingthe equality repeatedmeasurements 381 two means 370 n
of two variances 397 robust 375 Ztestfor the differencebetween S
and measuredthe effectivenessof both. The a. Interpretthe resultsofthe r test for question1.
concludedthat the stateadsweremore effec- b. Interpretthe resultsofthe t test for question2.
in relayingthe dangersof smoking than the Philip c. Interpretthe resultsofthe t testfor question3.
is ads.The article suggests,however,that the study is d. Interpretthe resultsofthe I test for question4.
statisticallyreliable becausethe samplesize was too e. Write a short summary about the differencesbetween
and becausethe study specifically selectedpartici- malesand femalesconcerninstheir views towardcause-
who areconsideredmore likelv to startsmokinethan relatedsponsorship.
How do you think the researchersmeasuredeffective- 10.66 Two professorswantedto study how studentsfrom
their two universitiescomparedin their capabilitiesof using
Definethe null and alternativehypothesesfor this study. Excel spreadsheets in undergraduateinformation systems
Explainthe risks associatedwith Type I and Type II courses(Extractedfrom H. Howe, and M. G. Simkin,
enorsin this study. "FactorsAffecting theAbility to DetectSpreadsheet Errors,'o
Whattype of test is most appropriatein this situation? Decision SciencesJournal of InnovativeEducation, January
Whatdo you think is meant by the phrasestatistically 2006, pp. l0l-122). A comparisonof the studentdemo-
reliable2 graphicswasalsoperformed.Oneschoolis a stateuniversity
in the WesternUnited States.and the otherschoolis a state
The FedEx St. JudeClassicprofessionalgolftour- universityin the EasternUnited States.The following table
t is held eachyear in Memphis,Tennessee. FedEx containsinformation regardingthe agesof the students:
this PGA tournament,and part of the proceedsgo
theSt. JudeChildren'sResearchHospital. In 2003, the Sample Mean Standard
raised$679,115for the hospital.This type of School Size Age Deviation
sponsorship is known ascause-related
marketing.
Western 93 23.28 6,29
sampleof spectatorsat the tournamentwere surveyed
askedto respondto a seriesof statements
Eastern r35 21.16 1.32
on a 5-point
(l = StronglyDisagree,2 : Disagree,3 : Neutral,
Agree,and 5 : StronglyAgree).The following are a. Using a 0.01levelof significance,is thereevidenceof a
ofthe questionsasked: differencebetweenthe variancesin age of studentsat
Cause-related marketing createsa positive company the Westernschooland at the Easternschool?
image. b. Discussthe practicalimplicationsof the test performed
I would be willing to pay more for a servicethat sup- in (a). Address, specifically, the impact equal (or
portsa causeI careabout. unequal)variancesin agehason teachingan undergrad-
Cause-related marketingshouldbe a standardpart of a uateinformation systemscourse.
company's activities. c. To test for a differencein the meanageof students,is it
Basedon its supportof St. Jude,I will be more likely to most appropriateto usethe pooled-variance/ test or the
useFedExservices. separate-variance I test?
eachquestion,the researchers testedthe null hypothe-
thatthe meanresponsefor malesand femalesis equal. The following table containsinformation regardingthe
alternativehypothesisis that the meanresponseis dif- yearsofspreadsheet usageofthe students:
for malesand females.The followine table summa-
theresults: Sample Mean Standard
School Size Years Deviation
10,77 The manufacturerof Bostonand Vermontasphalt 2005 return-Twelve-month return in 2005 ffi
M
'Managingthe SpringvilleHerald
OW,
A marketingdepartment team is charged with improving correspondingly,to increasedsubscription sales.The team
thetelemarketingprocess in order to increasethe number selected a sample of 30 female callers who staff the tele-
ual of home-deliverysubscriptions sold. After several brain- phone bank on Wednesday evenings and randomly
iof $ormingsessions,it was clear that the longer a caller assigned 15 of them to the "early" group and l 5 to the
lnd speaks to a respondent,the greater the chance that the "later" group. The callers knew that the team was observ-
I
callerwill sell a home-delivery subscription.Therefore,the ing their efforts that evening but didn't know which calls
teamdecidedto find ways to increase the length of the were monitored. The callers had been trained to make their
ual phonecalls. telephone presentationsin a structured manner.They were
hat Initially,the team investigatedthe impact that the time to read from a script, and their greeting was personal but
'ear
ofa call mrght have on the length of the call. Under current informal ("Hi, this is Mary Jones from the Springville
)ort arrangements, calls were made in the evening hours, Herald. May I speakto Bill Richards?").
between 5:00 p.m. and 9:00 p.m., Monday through Friday. Measurementswere taken on the length of the call
Theteamwantedto comparethe length of calls made early (defined as the difference, in seconds,betweenthe time the
'50 in the evening(before 7:00 p.m.) with those made later in person answeredthe phone and the time he or she hung
theevening(after 7:00 p.m.) to determine whether one of up). The results (stored in the file !fff[!) are presentedin
D. these time periods is more conducive to lengthier calls and, T a b l eS H l 0 . l .
rfa
rint
T A B L ES H 1 O . 1 Time of Call Time of Call
ive
Length
of Calls,in Early Late Early
rfa Basedon
Seconds,
r'
rililg vt
r - tl
9oil-Lqtty
41.3 37.1 40.6 40.7
rtO
Versus
Late in the 37.5 38.9 JJ ---) 38.0
ad- Evening 39.3 A"t 1
AL.L 39.6 43.6
ing 1a
) t.+
A
45.7 35.7 43.8
ing 33.6 42.4 3r . 3 34.9
38.5 39.0 36.8 35.7
32.6 40.9 36.3 41.4
50
31.3 40.5
f,ur
50 Web Case
BA
rts. Applyltour knowledge about hypothesis testing in this Web Even after the recent public experiment about cereal
Case,which continues the cereal-fill packaging dispute box weights, the Consumers Concerned About Cereal
WebCuse fiom Chapters 7 and 9. Cheaters(CCACC) remains convinced that Oxford Cereals
412 TEN Two-Sample
CHAPTER Tests
has misled the public. The group has createdand posteda l . Do the CCACC'sresultsprove that there is a stati
documentin which it claims that cerealboxesproducedat differencein the meanweightsof cerealboxes
PlantNumber 2 in Springvilleweigh lessthan the claimed at PlantNumbersI and2?
meanof 368 grams.Visit the CCACCMore Cheatingpage ) Performthe appropriateanalysisto test the CC
at www.prenhall.com/Springville/MoreCheating.htm (or hypothesis.What conclusionscanyou reachbasedon
openthis Web page file from the text CD's WebCasefolder) data?
andthenanswerthe following:
T A B L EE 1 0 . 1 UnsummarizedData SummarizedData
Two-SampleTest (ExcelCompanionSection) (ExcelCompanionSection)
Two-Sample Tests
inMicrosoftExcel Z test for the difference ToolPakz-Test:Two Samnle Z'lwo Means.xls or PHStat2
betweentwo means for Means(E 10.I ) Z Testfor Differencesin
TwoMeans(E10.2)
Pooled-variance/ test ToolPakt-Test:Two-Sample Pooled-varianceT.xlsor
for the difference AssumingEqualVariances PHStat2t Testfor Differences
betweentwo means (E10.3) BetweenTwo Means(E10.4)
Separate-variance I test ToolPakt-Test:Two-Sample Not included
for the difference AssumingUnequalVariances
betweentwo means (Er0.s)
Paired I test ToolPakt-Test:PairedTwo Not included
Samplefor Means(E10.6)
Z test for the difference Not included ZTwo Proportions.xls or PHStat2
between two Z Testfor DifferencesBetween
proportions Two Proportions(E 10.7)
,F test for the difference ToolPakF-TestTwo-Sample F TwoVariances.xlsor
between two variances (E10.8)
for Variances PHStat2F Testfor Differences
BetweenTwo Variances
wo Variances (E10.9)
t'E10.9
-l
4I4 EXCELCOMPANION
to Chanter
10
varribbaRerqc: lE t- c*d I
I Hdt
TestOptbns
t"Urothsii:ed
l"1a.n
Ddffefcrcor l
i; T$ro-TdTest
Va|l$lc 1Y6ri!nc6(knM);
i l.lppar-Tail
Test
Yafii?bh2 vqiffie (krnm):
i . Lorarer-Tail
Te*
mL&b
Aloha: 0,05
0r"rtp.rt
O$icns
i-uiiil! ifrtloftt
FIGUREE1O.2
1
Z T w o M e a n sw o r k s h e e t 2
3
4
5
'er- I
6
I 7
)_z !
F
9
ino 10
t1
12
l
l IJ
11
15
! to -88 " 812
I 17 -s0RT(89^ 2)rB7)+ (813^ 2)/811)
18 .(816 - 8{f417
19
20
21 -NOftllSltlV{85,21
n -lroRlitstNv(l - (852)|
l -2' (1 - N0RtlSDlSr(ABs(Bl8))
for ! -lF(823 < 85, "Rojecrth. trull hypothesb".
'0o
the I x not r.jocl tho null hyporhosisJ
Using Z Two Means.xls from the Data Analysis list and click OK. In the procedure's
dialog box (shown below), enterthe cell rangeof one sample
Open
to theZTwo Meansworksheet
of theEIEEEEEE as the Variable I Range and the cell rangeof the other sam-
workbook. (seeFigureE10.2)usesthe
Thisworksheet ple as the Variable 2 Range.Enterthe HypothesizedMean
NORMSINV?<h function to determine the lower and Difference,click Labels,and click OK. Resultsappearon a
uppercritical values and the NORMSDIST(Z value) func- new worksheet. Figure 10.3 on page 373 shows the results
tion to compute the p-values from the Z value calculatedin for the Table 10.I BLK cola salesdata.
c e l l B 1 8 . T h e w o r k s h e e tc o n t a i n s t h e e n t r i e s t o s o l v e
Probleml0.l on page 377. To adapt this worksheetto other
problerns,change,as is necessary,the hypothesizeddiffer-
enceand level of significancein cells 84 and B5 and the
samplesize,samplerneanand population standarddeviation
of eachsamplein the cell range87:89 and B 11:B 13.
Xit*r*"*", tq1 m(ll
vebueaRane.: tE t- c"*d I
lf you want to use this worksheetto show only one test, --l
T E'eh
firstmake a copy of the worksheet(seethe Excel Companion HypottErizdtvte.nutference:
to Chapter l). For a two-tail test, select and delete rows 26 nt beb
through34. For a lower-tail-test-onlyworksheet,selectand Abh.: 0.05
deleterows 31 through 34 and then selectand deleterows 20 -.lDut tFh'lr'r
'!-
through25.For an upper-tail-test-on1y worksheet,selectand {-i: Eutprf nonqc:
deleterows 20 through 30. ('.: Nes wwkshsetPly:
i-I Neei,gwkbook
E10.3USINGTHE POOLED-VARIANCE
t TEST(UNSUMMARIZEDDATA)
E10.4 USINGTHE POOLED.VARIANCE
For unsummarized data, you conduct a pooled-variance I
testby selectingthe ToolPak t-Test:Two-SampleAssuming
t TEST(SUMMARTZED
DATA)
EqualVariancesprocedure. F o r s u m r n a r i z e dd a t a , y o u c o n d u c t a p o o l e d - v a r i a n c e
Open to the worksheetthat containsthe unsummarized I test by either using the PHStat2 t Test for Differences
datafor the two samples.SelectTools t Data Analysis, in Two Means procedureor by making entries in the
selectt-Test: Two-Sample Assuming Equal Variances nnnFf,lElEil*inn workbook.
416 EXCELCoMPANIoNto Chaoter
lo
FIGURE 10.3
1
PVtworksheet 2
J
I
10
ll
't2
13
14
15
'16 -87-1
1a -811.1
18 -816 r 817
19 -(8i6' 89^2)+ (!17'813^2))i818
n -88 - 812
21 -{820 - 81/SARTG19' {t/87 + 1lB11)}
22
21 --(nNV(85,818))
x -nNvG5,818)
6 -TDIST(Atss{821},818, 2}
77 -lF{826 < 85, 'Roj.cl lhe null hypothcair",
a -Do
rot feJocttho ntrll hypolh$bJ
a
30 --rNV@'.85,810))
31 -lF(821 < 0, E32.E33)
12 -lF(B3l < 85, "Rejsd tho n{ll hypothGb-,
'Do
not rolost th. null hypotheeir']
InFut
Using PHStat2 Z Test for
F==;-".--l
VdladelRffsc: lLl \sdnin@rj Differences in Two Proportions
lrdiadceRtrq.r tE t c"*"1 I
Select PHStat t Two-Sample Tests ) Z Test for
l- H"h_l Differences in Two Proportions. In the Z Test for
il@hgdzcdl,bfiDifferc'rrar
trpact Differences in Two Proportions dialog box (shown on page
gPha: 0.6 418), enter values for the Hypothesized Difference and the
OutputoFtrorrs Level of Significance.Enter the Number of Successes and
Cguprrnrnc: L the Sample Size for the population 1 sample and the popu-
O iFr wortdnctgfyr lation 2 sample.Click one of the test options,entera title as
O tlawWatloot the Title. and click OK. To include a confidenceinterval
estimate of the difference between the two proportions
418 EXCELcoMPANIoN
to chanter
lo
(similar to one shown in Figure El0.6 below), click Using Z Two Proportions.xls
ConfidenceInterval Estimatebeforeyou click OK.
You open and use eitherthe ZTP_TT or the ZTP
worksheets
of theEEEEEEEEEE workbook
to
a Z test for the difference between two proportions.
0Sa worksheetsuse the NORMSINV@<$ function to
Hlrpotha$zedDaffererrc:
Lcvd d ggnficmcc:
r- mine the lower and upper critical values, and
NORMSDIST(Z value)functionto computethep-
lfis from the Z valuecalculatedin cell Bl8. To better
Populatbn156nr* stand how a messagegets displayedin these
lfumbcrd Srccsscsr read the 'About the IF function" part of SectionE9.l
page 364.
Sanple5be:
The ZTP_TT worksheet (seeFigure10.11on
Potrirtiln 2 sanplt 393) appliesthe two-tailZ testto the Section10.3
iflrber of Sr-acesasl guestsatisfactionexample.The ZTP_All worksheet
both the upper-tailand lower-tailtest (seeFigureE
Samplaftar f- plus a confidenceinterval estimateof the diffe
betweenthe two proportions(seeFigureE10.6).To
Test@tklns theseworksheetsto otherproblems,change,if n
i3 trr,n-tC test the hypothesized difference, level of significance,
r tffi-TdTest number of successesand sample size for each sample
f torarer-Teil c e l l s8 4 , 8 5 , 8 7 , 8 8 , B l 0 , a n d B l 1 .
Test
lf you want the ZTP_Allworksheet to show only
Qfrput Sions of the single-tail tests, first make a copy of that
sheet(seethe Excel Companionto Chapterl). For
Ttle: i
lower-tail-test-onlyworksheet,select and deleterows
[* Corfldence
lr*ervd E*inate
through 34 and then select and delete rows 20 through
For an upper-tail-test-only worksheet, select and
rows 20 through 30. If you do not want to includea
|e'-----'- --al I
F I G U R EE 1 0 . 5
Zf P_Allsingle-tail Its
t?c
test a rea Iu -iloRrslilv(a5,
-iORISDTSI/Atr,
-rF(W < Ao, .R.l.c.
h. natt htt"oth.*-,
la- -Do
not ,.1.d a/'. ,rutt ,rrrpottto&.1
321 -lloRilsll{v(iA5}
FjI -1 -l{oRllSDtST(Brsl
34i -lF(833 < 85, -Rojecilho
null hypoth$fs,.,
-Do
not rorocl the null hyporhedr.)
FIGUREE10.6
ZTP_All
confidence '1
interval estimate area t:
4
5
7
R
at
io -NoRISll{v(t . E7}z}
11
-sORT((81a'(t . Btll/EB).
i2 -ABS{E10'Eltl t8ts . n 8r5)/811)l
ti-
14
t-s
15
E I 0.9: Using the F Testfor the DifferenceBetweenTwo Variances(SummarizedData) 419
CI"tpt Options
E10.9USINGTHEF TESTFORTHE Ttler I
DIFFERENCEBETWEENTWO ffi
(SUMMARTZED
VAR|ANCES | ---"-'*'-*'-i! {
DATA)
_ryq*J l; OK il Cancd i
FIGUREElO.7
I
F Two Variances
worksheet
5
7
I
9
10
-87^2810^?
14 -86 .1
15 -89"1
16
17
18 -FltlV(l . Bl2, B'tl, 815l
19 -FINV(81/2, B1{,81O
n - l F ( 8 r 3> 1 ,2 ' 8 1 72, ' E l 8 )
21 'lF(820 < 85, 'Rejea rhonull hypottads',
1)o norrejed lhe null hypothcels"l
a
-Fltlvfl . B{, 811,815)
6 -Elt
26 -lF(825< 85, "ReJect the null hypothcsls',
27 'Do nol reied rhe null hlporhcCe"]
n
B -FlilV€r, 811,815)
30 -E17
31 rlF{830< 85, -Rojod rhenull hypothqds',
'Do not frlscl lh. null hypotheC6''l
t{ol drown
Cellt17: -F0|ST(813,811,
B15l
Cellt10: -1 - Et7