Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Chapter 10
Hypothesis Testing: Deciding between Reality and Coincidence
2/10/2012
Slide 10-2
Hypothesis Testing
e.g., Is it real? Or is it just coincidence?
Deciding between two possibilities based on data Hypothesis: a statement about the population
e.g., More than 30% of customers recognize our product e.g., You will win the election e.g., Strategy Z will make you rich in the stock market
Slide 10-3
Individual packages of Cascade may weigh slightly more or less than the marked weight due to normal variations incurred with high speed packaging machines, but each days production of Cascade will average slightly above the marked weight
Slide 10-4
If you toss 1,000 times, it is not surprising to find ten Heads in a row somewhere in the sequence But if you toss only 10 times and find ten Heads
You will rightly be suspicious!!!!! Reject the hypothesis that this is an ordinary coin?
2/10/2012
Slide 10-5
Null Hypothesis H0
The Default. Accept unless disproven x Often specific Often randomness A, B are independent (no connection) Ad has no effect on purchase
Q = Q0
Population mean equals reference value Long-run mean oven temperature, Q, equals the desired setting Q0=3250
2/10/2012
Q { Q0
Population mean does not equal reference value Long-run mean oven temperature, Q, does not equal the desired setting
Q0=3250
Slide 10-6
Decide
H0
UNLESS it is unlikely that the data could have come from the null hypothesis H0, so choose
H1
2/10/2012
Slide 10-7
Does Q = Q0?
Test by comparing X to Q0
Does Q = Q0?
The hypotheses
H0 : Q = Q 0 H1 : Q { Q 0
2/10/2012
Is X close to Q0?
Slide 10-8
t Statistic tstat ! X Q0
/ S X
Is tstat ttable ? Is X only randomly different from Q0? If YES If NO
ACCEPT H0 : Q = Q0 Accept H1 : Q { Q0 Reject H0 : Q = Q0
X IS significantly different
from Q0
Slide 10-9
Mean oven temperature (Q) should be Q0 = 3250 over the long run Adjust as needed, and only if needed
n = 20 X = 323.0 S = 6.0 S X = 1.34 measurements average temperature variability (of individual measurements) variability (of the sample average)
We are 95% sure that the unknown mean temperature Q is between 320.2 and 325.8
Slide 10-10
Example (continued)
We are 95% sure that the unknown mean temperature Q is between 320.2 and 325.8
Q0 = 325
Slide 10-11
So accept the null hypothesis H0: Q = 325 The sample average (323.0) is not significantly different from the reference value (Q0 = 325)
Slide 10-12
Reject H0 Accept H1
Because 1.5 is not in the interval
2/10/2012
Q0 = 1.5
1.4 X ! 1.466
1.5
Slide 10-13
Population mean weight Q could not reasonably be 1.5 ounces Sample average (1.466) is significantly different from claimed weight (Q0 = 1.5)
More than just randomly different. Less than a 5% chance of seeing such a large difference if Q were 1.5
Average weight (1.466) is significantly less than the claimed weight (1.5)
This is a one-sided conclusion
for a significant two-sided test
2/10/2012
Slide 10-14
Example: Binomial
T0 = 20% is the reference value T = ?, the population percent, is unknown X = 83 out of n = 268 interviewed said they would buy p = X/n = 31.0%, the sample percent
Slide 10-15
Conclusions
Slide 10-16
Interpretation
H0 could not reasonably have produced the data Either
H1 is true, or H0 is true, but you made a TYPE I ERROR
Happens 5% of the time when H0 is true
Slide 10-17
Interpretation (continued)
H0 could reasonably have produced the data Either
H0 is true, or H1 is true, but you made a TYPE II ERROR
Difficult to control This error is possible, and is very likely if Q0 is close to Q
If you accept H0
Slide 10-18
2/10/2012
The Truth
Slide 10-19
Assumptions
Slide 10-20
Set the probability of wrongly rejecting the null hypothesis H0 when it is true (Type I error rate)
The usual standard is the 5% test level
Corresponds to 95% confidence interval If H0 is rejected, the result is significant (p<0.05)
Slide 10-21
p-Values
Often provided by computer analysis
e.g., p = 0.0297
Slide 10-22
Slide 10-23
Example (continued)
The yield is significantly above the budget breakeven point Significant at the 5% level but not at the 1% level
If the long-term yield Q were 55, we would see such a large difference (Q !55 to X ! 68.3) less than 5% of the time but not less than 1% of the time