Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Section 2
Hypothesis Tests for a Population Mean
Assuming the Population Standard
Deviation is Known
Chapter 10 Section 2
Learning objectives
1
Chapter 10 Section 2
Learning objectives
1
Chapter 10 Section 2
We have the outline of a hypothesis test, just not
the detailed implementation
How do we quantify unlikely?
How do we calculate Type I and Type II errors?
What is the exact procedure to get to a do not
reject / reject conclusion?
Chapter 10 Section 2
There are three equivalent ways to perform a
hypothesis test
They will reach the same conclusion
The methods
The classical approach
The P-value approach
The confidence interval approach
Chapter 10 Section 2
The classical approach
If the sample value is too many standard deviations
away, then it must be too unlikely
Chapter 10 Section 2
The three methods all begin the same way
We have a null hypothesis, that the actual mean is
equal to a value 0
We have an alternative hypothesis
Chapter 10 Section 2
The three methods all need information
We run an experiment
We collect the data
We calculate the sample mean
Chapter 10 Section 2
In this section we assume that the population
mean is known (as in section 9.1)
We can apply our techniques if either
The population has a normal distribution
Our sample size n is large (n 30)
Chapter 10 Section 2
The three methods all compare the observed
results with the previous criterion
Classical how many standard deviations
P-value the size of the probability
Confidence interval inside or outside the interval
Chapter 10 Section 2
The three methods all conclude similarly
We do not reject the null hypothesis, or
We reject the null hypothesis
Chapter 10 Section 2
We now will cover how each of the
Classical
P-value, and
Confidence interval
Chapter 10 Section 2
Learning objectives
1
Chapter 10 Section 2
The classical approach
We compare the sample mean x to the
hypothesized population mean 0
Measure the difference in units of standard deviations
A lot of standard deviations is far few standard
deviations is not far
Just like using a general normal distribution
Chapter 10 Section 2
How far is too far?
For example, we can use = 0.05 as the level of
significance
Unlikely means that this difference occurs with
probability = 0.05 of the time, or less
This concept applies to two-tailed tests, lefttailed tests, and right-tailed tests
Chapter 10 Section 2
For two-tailed tests
The least likely 5% is the lowest 2.5% and highest
2.5%
2.5% (below
(below 1.96
1.96 and
and above
above +1.96
+1.96 standard
standard
deviations)
deviations)
1.96
1.96 and
and +1.96
+1.96 are
are the
the critical
critical values
values
The region outside this is the rejection region
Chapter 10 Section 2
For left-tailed tests
The least likely 5% is the lowest 5% (below 1.645
standard
standard deviations)
deviations)
1.645
1.645 is
is the
the critical
critical value
value
The region less than this is the rejection region
Chapter 10 Section 2
For right-tailed tests
The least likely 5% is the highest 5% (above 1.645
standard
standard deviations)
deviations)
+1.645
+1.645 is
is the
the critical
critical value
value
The region greater than this is the rejection region
Chapter 10 Section 2
An example of a two-tailed test
A bolt manufacturer claims that the diameter of
the bolts average 10.0 mm
H0: Diameter = 10.0
H1: Diameter 10.0
Chapter 10 Section 2
Do we reject the null hypothesis?
Our conclusion
We reject the null hypothesis
We have sufficient evidence that the population mean
diameter is not 10.0
Sullivan Fundamentals of Statistics 2nd Edition Chapter 10 Section 2 Slide 20 of 55
Chapter 10 Section 2
An example of a left-tailed test
A car manufacturer claims that the mpg of a
certain model car is at least 29.0
H0: MPG = 29.0
H1: MPG < 29.0
Chapter 10 Section 2
Do we reject the null hypothesis?
Our conclusion
We do not reject the null hypothesis
We have insufficient evidence that the population
mean mpg is less than 29.0
Chapter 10 Section 2
An example of a right-tailed test
A bolt manufacturer claims that the defective
rate of their product is at most 1.70 per 1,000
H0: Defect Rate = 1.70
H1: Defect Rate > 1.70
Chapter 10 Section 2
Do we reject the null hypothesis?
Our conclusion
We reject the null hypothesis
We have sufficient evidence that the population mean
rate is more than 1.70
Chapter 10 Section 2
Two-tailed test
The critical values are z/2 and z/2
The rejection region is {less than z/2} and {greater
than z1-/2}
Left-tailed test
The critical value is z
The rejection region is {less than z}
Right-tailed test
The critical value is z
The rejection region is {greater than z}
Sullivan Fundamentals of Statistics 2nd Edition Chapter 10 Section 2 Slide 25 of 55
Chapter 10 Section 2
The difference is
x 0
x 0
/ n
Chapter 10 Section 2
The general picture for a level of significance
Chapter 10 Section 2
Learning objectives
1
Chapter 10 Section 2
The P-value is the probability of observing a
sample mean that is as or more extreme than
the observed
The probability is calculated assuming that the
null hypothesis is true
We use the P-value to quantify how unlikely the
sample mean is
Chapter 10 Section 2
Just like in the classical approach, we calculate
the test statistic
x 0
z0
/ n
Chapter 10 Section 2
For the two-tailed test, the unlikely region are
values that are too high and too low
Small P-values corresponds to situations where
it is unlikely to be this far away
Chapter 10 Section 2
For the left-tailed test, the unlikely region are
values that are too low
Small P-values corresponds to situations where
it is unlikely to be this low
Chapter 10 Section 2
For the right-tailed test, the unlikely region are
values that are too high
Small P-values corresponds to situations where
it is unlikely to be this high
Chapter 10 Section 2
For all three models (two-tailed, left-tailed, righttailed)
The larger P-values mean that the difference is not
relatively large that its not an unlikely event
The smaller P-values mean that the difference is
relatively large that its an unlikely event
Chapter 10 Section 2
Larger P-values
A P-value of 0.30, for example, means that this value,
or more extreme, could happen 30% of the time
30% of the time is not unusual
Smaller P-values
A P-value of 0.01, for example, means that this value,
or more extreme, could happen only 1% of the time
1% of the time is unusual
Chapter 10 Section 2
The decision rule is
For a significance level
Do not reject the null hypothesis if the P-value is
greater than
Reject the null hypothesis if the P-value is less than
Chapter 10 Section 2
An example of a two-tailed test
A bolt manufacturer claims that the diameter of
the bolts average 10.0 mm
H0: Diameter = 10.0
H1: Diameter 10.0
Chapter 10 Section 2
Do we reject the null hypothesis?
Our conclusion
We reject the null hypothesis
We have sufficient evidence that the population mean
diameter is not 10.0
Chapter 10 Section 2
An example of a left-tailed test
A car manufacturer claims that the mpg of a
certain model car is at least 29.0
H0: MPG = 29.0
H1: MPG < 29.0
Chapter 10 Section 2
Do we reject the null hypothesis?
Our conclusion
We do not reject the null hypothesis
We have insufficient evidence that the population
mean mpg is less than 29.0
Chapter 10 Section 2
An example of a right-tailed test
A bolt manufacturer claims that the defective
rate of their product is at most 1.70 per 1,000
H0: Defect Rate = 1.70
H1: Defect Rate > 1.70
Chapter 10 Section 2
Do we reject the null hypothesis?
Our conclusion
We reject the null hypothesis
We have sufficient evidence that the population mean
rate is more than 1.70
Chapter 10 Section 2
Compare the rejection regions for the classical
approach and the P-value approach
They are the same
Classical
P-Value
Chapter 10 Section 2
Learning objectives
1
Chapter 10 Section 2
The confidence interval approach yields the
same result as the classical approach and as
the
P-value approach
We compare
A hypothesis test with a level of significance
to
A confidence interval with confidence (1 ) 100%
Chapter 10 Section 2
The relationship is
Not rejecting the
hypothesis
0 is inside the
Confidence interval
Rejecting the
hypothesis
0 is outside the
Confidence interval
Chapter 10 Section 2
An example of a two-tailed test
A bolt manufacturer claims that the diameter of
the bolts average 10.0 mm
H0: Diameter = 10.0
H1: Diameter 10.0
Chapter 10 Section 2
Do we reject the null hypothesis?
10.12 is 0.12 higher than 10.0
The standard error is (0.3 / 40) = 0.047
The confidence interval is 10.12 1.96 0.047, or
10.03 to 10.21
10.0 is outside (10.03, 10.21)
Our conclusion
We reject the null hypothesis
We have sufficient evidence that the population mean
diameter is not 10.0
Chapter 10 Section 2
An example of a left-tailed test
A car manufacturer claims that the mpg of a
certain model car is at least 29.0
H0: MPG = 29.0
H1: MPG < 29.0
Chapter 10 Section 2
Do we reject the null hypothesis?
28.89 is 0.11 lower than 29.0
The standard error is (0.5 / 40) = 0.079
The confidence interval limit is 28.89 + 1.645 0.079,
or 29.02
29.0 is inside (, 29.02)
Our conclusion
We do not reject the null hypothesis
We have insufficient evidence that the population
mean mpg is less than 29.0
Chapter 10 Section 2
An example of a right-tailed test
A bolt manufacturer claims that the defective
rate of their product is at most 1.70 per 1,000
H0: Defect Rate = 1.70
H1: Defect Rate > 1.70
Chapter 10 Section 2
Do we reject the null hypothesis?
1.78 is 0.08 higher than 1.70
The standard error is (0.06 / 40) = 0.009
The confidence interval limit is 1.78 1.645 0.009,
or 1.76
1.70 is outside (1.76, )
Our conclusion
We reject the null hypothesis
We have sufficient evidence that the population mean
rate is more than 1.70
Chapter 10 Section 2
Learning objectives
1
Chapter 10 Section 2
A significant statistical difference is one where
the hypothesis test, for equality, is rejected
A statistical significance does not necessarily
mean that it is practically significant
If we have a large sample size, we will be able to
pinpoint the rejectable values of the population
mean
Our analysis may be unnecessarily precise