Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Continuous Data
Objectives
• Evaluate the difference between a process (population) mean and a target value
using 1-sample and confidence intervals.
• Assess the power of a hypothesis test using power analysis.
• Test for a difference between two population means using a 2-sample t-test.
• Test for a difference between paired observations using a paired t-test.
Hypothesis Tests: Continuous Data
• In this laboratory
Examples and Exercises Purpose
Hypothesis Testing and Confidence Intervals
Example 1: Filling Cereal Boxes Evaluate the difference between a mean and a
target value using a 1-sample t-test
Exercise C: Filling Cereal Boxes: Evaluate the difference between a mean and a
Minitab Assistant target
Power and Sample Size
Example 2: Evaluating Power Assess the power of a t-test
Hypothesis testing
When to use a hypothesis test
• Use a hypothesis test to make inferences about one or more
populations when sample data are available.
Why use a hypothesis test
A hypothesis test answers questions such as:
• Is a process centered correctly?
• Is the product from one supplier better than the product from
another?
• Do treatment groups in an experiment differ?
Hypothesis testing
For example,
• Is the mean width of paper stock the desired 8.5 inches?
• Does fuel from one supplier burn more cleanly than fuel from another
supplier?
• Do customers prefer one formulation of soft drink over another?
1-sample t-
t-test
What is a 1-sample t-test
• Use 1-sample t-test to determine whether (the population mean) is
equal to the hypothesized mean.
• The test uses the standard deviation of the sample to estimate (the
population standard deviation).
• If the difference between the sample mean and the hypothesized
mean is large relative to the variability of the sample mean, then is
unlikely to be equal to the hypothesized mean.
1-sample t-
t-test
When to use a 1-sample t-test
• Use a 1-sample t-test with continuous data from a single random
sample.
• The test assumes the population is normally distributed.
• However, the test is robust to violations of this assumption, provided
the observations are collected randomly and the data are continuous,
unimodal, and reasonably symmetric (see[1]).
1-sample t-
t-test
Why use a 1-sample t-test
A 1-sample t-test answers questions such as:
• Is a process on target?
• Does a key characteristic of a supplier’s material have the desired
mean value?
For example,
• Is the mean width of razor blades higher or lower than target?
• Is the mean strength of the bolts from a supplier lower than the
required minimum?
Testing the null hypothesis
• The manufacturer needs to determine whether the mean weight for
the packaging process differs significantly from the target weight of
365 grams. In statistical terms, the process mean is the population
mean, or (mu).
One-Sample T: BoxWeigh
One-Sample T: BoxWeigh
Note
• The t-test is robust to violations of the normality assumption,
provided that observations are collected randomly and the data are
continuous, unimodal, and reasonably symmetric.
Testing the assumption of normality
Normality Test
1. Choose Stat > Basic Statistics >
Normality Test.
2. Complete the dialog box as
shown below.
3. Click OK.
Note
• When data are not normally distributed, you may be able to
transform them using a Box-Cox transformation or use a
nonparametric procedure such as the 1-sample sign test.
Interpreting your results
Probability Plot of BoxWeigh
Normal
99
Mean 366.7
StDev 2.403
95 N 6
AD 0.248
90
P-Value 0.599
80
70
Percent
60
50
40
30
20
10
5
1
360 362 364 366 368 370 372
BoxWeigh
Confidence intervals
What is a confidence interval
• A confidence interval is a range of likely values for a population
parameter (such as ) that is based on sample data.
For example:
• With a 95% confidence interval for , you can be 95% confidence that
the interval contain .
• In other word, 95 out of 100 intervals will contain upon repeated
sampling.
Confidence intervals
When to use a confidence interval
Use confidence intervals to:
• Make inferences about one or more populations from sample data.
• Quantify the precision of the estimate of a population parameter,
such as .
Confidence intervals
Why use a confidence interval
A confidence interval often answer the same questions as a hypothesis
test:
• Is on target?
• How precise is the estimate of ?
• How low or high might be?
For example:
• Is the mean length of a part longer than 5 cm?
• What is the range of likely values for the mean weight of bags of
coffee beans?
Using the confidence interval
• The Session window results for 1-Sample t include values for the
upper and lower bounds of the 95% confidence interval.
• Obtain a graphical representation of the interval by selecting
Individual value plot in the One-Sample t: Graphs subdialog box.
_
X
Ho
• If H is inside the interval, the p-value for the hypothesis test will be
greater than 0.05.
• You cannot reject the null hypothesis at the 0.05 -level.
Interpreting your results
• Because H falls within the confidence interval, you cannot reject the
null hypothesis.
• Not enough evidence is available to conclude that is different from
the target of 365 grams at the 0.05 significance level.
One-Sample T: BoxWeigh
Test of μ = 365 vs ≠ 365
_
X
Ho
Final considerations
Summary and conclusions
• Based on the sample data, you cannot reject the null hypothesis at
the 0.05 -level.
• Not enough evidence is available to suggest that the process mean fill
weight is different from 365 gram.
• Like the t-test, the 95% confidence interval does not provide enough
evidence to reject the null hypothesis that the population mean for
cereal box weights is 365 grams.
• However, the confidence interval provides extra information that with
95% confidence, the process mean is at least 364.2 g and at most
369.2 g.
Final considerations
Hypotheses
A hypothesis test begins with two opposing hypotheses.
Final considerations
Hypotheses
The alternative hypothesis ( ):
• States that the null hypothesis is wrong.
• Can also specify the direction of the difference.
Final considerations
Significance level
Choose the -level before conducting the test.
• Increasing increases the chance of detecting a difference, but it also
increases the chance of rejecting when it is actually true (a Type I
error).
• Decreasing decreases the chance of making a Type I error, but also
decreases the chance of detecting a difference when one actually
exists (a Type II error).
Final considerations
Assumptions
Each hypothesis test is based on one or more assumptions about the
data. If these assumptions are not met, the conclusions may not be
correct.
Final considerations
Confidence interval
The confidence interval provides a likely range of values for (or other
population parameters). You can conduct a 2-tailed hypothesis test
(alternative hypothesis of ≠) using a confidence interval.
For example:
• If the test value is not within a 95% confidence interval, you can reject
at the 0.05 -level.
• Likewise, if you construct a 99% confidence interval and it does not
include the test mean, you can reject at the 0.01 -level.
Exercise C: Filling Cereal Boxes: Minitab Assistant
Problem
• A cereal manufacturer wants to determine whether the box-filling
process is on target.
• The target fill weight for cereal boxes is 365 grams.
Data Collection
• Engineering choose six boxes of cereal at random, weigh them, and
use the sample data to estimate the mean of the population (the
process mean).
Power analysis
• The power of the test is the probability that you will correctly reject
the null hypothesis, given that the null hypothesis is false.
• Use a power analysis to determine how much power a test has or to
design a new test with adequate power.
Power analysis
When to use power analysis
Use a power analysis:
• Before collecting data, to determine the sample size.
• After collecting data, to evaluate the power to detect a difference.
Why use power analysis
A power analysis answer question such as:
• Is the sample size large enough?
• How large a difference can the test detect?
• Is the test powerful enough to give credibility to its conclusion?
Power analysis
For example,
• How many samples are needed to determine whether the mean
thickness of paper is greater than 0.04 mm?
• How large a difference will be detectable between the mean strength
of new steel beams and a historic mean, using eight samples?
• Can you trust the conclusion from a t-test that indicates the strengths
of two glue formulations are not different?
Determining the difference parameter
Values
To estimate power, you must specify values for any two of the following
parameters of the test; Minitab calculates the remaining parameter.
• Sample sizes – the number of observations in the sample.
• Differences – a meaningful shift away from the target that you are
interested in detecting with high probability.
• Power values – the power (probability of rejecting when it is false)
that you would like the test to have.
Determining power
Standard deviation
• Because variability in the data affects the power of a test, you must
provide an estimate of .
• Use a historical estimate or the sample standard deviation.
• For the cereal data, the standard deviation (2.403) comes from the t-
test results.
Determining power
Sample
Difference Size Power
-2.5 6 0.537662
2.5 6 0.537662
0.4
0.2
0.0
-5.0 -2.5 0.0 2.5 5.0
Difference
Determining power
• With 6 observations, the power of the test was only 0.5377.
• To have a better chance of detecting a difference, increase the power
of the test to at least 0.80 by increasing the sample size.
• Calculate the sample sizes that are required to achieve power levels
of 0.80, 0.85, 0.90, and 0.95.
Determining power
Sample Target
Difference Size Power Actual Power
-2.5 10 0.80 0.832695
-2.5 11 0.85 0.873928
-2.5 12 0.90 0.905836
-2.5 15 0.95 0.962487
2.5 10 0.80 0.832695
2.5 11 0.85 0.873928
2.5 12 0.90 0.905836
2.5 15 0.95 0.962487
Interpreting your results
• The power curves for the larger sample sizes increase more rapidly as
the difference moves away from zero.
• The power is reasonably good for detecting a difference of 2.5 grams
with sample sizes between 10 and 15.
Assumptions
0.6 α 0.05
Power
StDev 2.403
Alternative ≠
0.4
0.2
0.0
-3 -2 -1 0 1 2 3
Difference
Final considerations
Summary and conclusions
• The results for the first test of cereal fill weights may have been
insignificant simply because the power was too low.
• Based on the number of observations (6), the difference to detect
(2.5), and the variability in the process, the test had a power of only
0.537662.
• Using a larger sample will give more power, enabling you to detect a
difference if one exists.
Final considerations
Additional considerations
• To ensure that the test will have sufficient power, conduct a power
analysis prior to collecting data.
• To increase the power of a test:
• Increase the sample size.
• Decrease the variability that is not attributed to the effect of interest.
• Increase (although this increases the chance of Type I error).
Final considerations
Additional considerations
• Higher power means a greater probability of detecting a difference.
• However, it also increases the chance of detecting small effects that
may not be of practical interest.
• Use process knowledge to determine the smallest difference worth
detecting and the optimal level of power for a test.
1-Sample t-
t-Test
Example 3: Increased Power
Problem
• The results of the first power analysis suggest that a larger sample
would be useful in evaluating the process.
• Six observations did not have enough power to detect a 2.5-gram
difference.
Data collection
• Engineers randomly select 12 boxes of cereal and weigh them.
1-Sample t-
t-Test
Example 3: Increased Power
Tools
• 1-Sample t
• Normality Test
Data set
L01_Dataset_CerealBx.MPJ
Variable Description
MoreObs Cereal box weight (g)
_
X
Ho
One-Sample T: MoreObs
Test of μ = 365 vs ≠ 365
60
50
40
30
20
10
5
1
362 364 366 368 370 372
MoreObs
Final considerations
Summary and conclusions
• Based on a sample size of 12, = 0.05, and a p-value of 0.019, reject
and conclude that the process mean is not equal to the target
value of 365 grams.
Final considerations
Summary and conclusions
The sample size played an important role in evaluating this process
mean:
• When the sample size was 6, the test found no statistical difference.
• When the sample size was 12, the test found a statistically significant
difference.
Final considerations
Additional considerations
• Whenever possible, investigate the power and sample size before
collecting the data.
• Otherwise, if you discover later that the power is inadequate, it may
be difficult to obtain additional samples under the same conditions as
the original samples.
Final considerations
Additional considerations
• A 1-tailed test provides greater power to detect the specified
difference than does a 2-tailed test.
• However, a 1-tailed test cannot detect a difference in the opposite
direction specified by the alternative hypothesis.
Sample Target
Difference Size Power Actual Power
-1 17 0.8 0.807037
-1 23 0.9 0.912498
1 17 0.8 0.807037
1 23 0.9 0.912498
Assumptions
α 0.05
StDev 1
0.6 Alternative ≠
Power
0.4
0.2
0.0
-1.5 -1.0 -0.5 0.0 0.5 1.0 1.5
Difference
Final considerations
Summary and conclusions
• The company should use between 17 and 23 samples per supplier to
detect a difference of 1 standard deviation (1) between supplier
means with reasonably high power.
• The larger the sample size, the higher the power of the test.
• Collect enough observations to achieve adequate power, but not so
many that you waste time and money on unnecessary
measurements.
Final considerations
Additional considerations
• Sometimes a difference and standard deviation are not readily
available when computing power.
• Because the power calculation is based on the ratio of the difference
to the standard deviation, you do not need to specify them if you are
willing to accept the answer in terms of the number of standard
deviations that separate the two population means.
Final considerations
Additional considerations
• For example, if the difference is 2 and the standard deviation is 2, the
power will be the same as if the difference is 1 and the standard
deviation is 1, because both are 1 difference.
• As shown in this example, you can specify a 1 difference by setting
the difference and the standard deviation equal to one another.
• Similarly, you can specify a 2 difference by specifying a difference
that is two times the standard deviation.
2-Sample t-
t-Test
Example 5: Plastic Strength
Problem
• A calculator manufacturer is selecting a plastics supplier.
• Based on the results of the Power and Sample size, the manufacturer
decides to use a sample size of 20 from each supplier to compare the
breaking strength of the suppliers of plastic pellets.
2-Sample t-
t-Test
Example 5: Plastic Strength
Data collection
• A machine presses pellets from randomly selected batches of plastic
into wafers of uniform thickness.
• Technicians record the breaking strength (in Newtons) for each wafer.
2-Sample t-
t-Test
Example 5: Plastic Strength
Tools
• Dotplot
• Graphical Summary
• 2-Sample t
• 2 Variances
• Layout Tool
• Send to Microsoft PowerPoint
2-Sample t-
t-Test
Example 5: Plastic Strength
Data set
• L01_Dataset_Plastic.MPJ
Variable Description
SupplrA Breaking strength of wafers prepared from supplier A
plastic (Newfons)
SupplrB Breaking strength of wafers prepared from supplier B
plastic (Newfons)
Creating dotplots
• Graphs are an important preliminary step in data analysis.
• They enable you to examine the data and to identify patterns,
relationships, and potential problems.
Dotplots
• Use dotplots to examine the shape and spread of sample data.
• Begin the analysis by examining a dotplot of the breaking strength
data for each supplier.
Creating dotplots
Dotplots
1. Open Plastic.MPJ
2. Choose Graph > Dotplot
3. Under Multiple Y’s, choose
Simple, then click OK.
4. Complete the dialog box as
shown below.
5. Click OK.
Interpreting your results
• The graph shows the strength data from both suppliers.
• On average, the breaking strength from supplier A appears higher
than from supplier B.
• An unusually low value for supplier B appears on the far left side of
the plot.
SupplrA
SupplrB
0 25 50 75 100 125 150 175
Data
Updating a graph
Icon Indicates the graph is… Because…
Update Graph Automatically Updates the selected graph and all other graphs
created with the same dialog box or command
whenever the worksheet changes
Update All Graphs Now (under Editor > Update Updates all open graphs for current data changes
or Window menu)
Updating a graph
1. To choose the dotplot, click in the Project Manager toolbar.
Click the graph to make it the active window.
2. Choose Editor > Update > Update Graph Now, or right-click the
graph and choose Update Graph Now, as shown below.
Interpreting your results
• The graph updates to reflect the modified data
• In the graph on the right, the x-axis automatically adjusts to represent the
current data and the icon in the upper left corner is now a green check.
• In general, supplier A has more spread or variation than supplier B.
• The look of a dotplot or histogram is controlled by the number of bins.
• With the missing value, the number of bins changed from the previous
graph resulting in plots that make it easier to see the center and spread.
• However, they no longer show bell-shaped curves.
SupplrA
SupplrB
154.0 157.5 161.0 164.5 168.0 171.5 175.0
Data
Changing number of bins
• The default binning of the x-axis can be changed by selecting a certain
number of intervals or by entering the actual positions of the
midpoints or endpoints.
SupplrA
SupplrB
150 155 160 165 170 175
Data
Displaying a graphical summary
The graphical summary provides graphical representations of the data and
descriptive statistics.
The summary components include:
• Histogram with normal curve-Shows the distribution of the data overlaid with a
normal curve.
• Boxplot-Summarizes information about the shape, spread, and center of the data.
• Confidence intervals-Provide interval estimates of the population parameters of
the data.
• Anderson-Darling normality test – Tests whether the data follow a normal
distribution.
• Summary statistics – Describe important features of the data, such as the mean
and standard deviation.
Mean Mean
Median Median
161 162 163 164 165 166 167 158 159 160 161 162
Independent 2-
2-sample t-
t-test
What is an independent 2-sample t-test
• An independent 2-sample t-test helps determine whether two
population means are different.
• The test uses the sample standard deviations to estimate for each
population.
• If the difference between the sample means is large relative to the
estimated variability of the sample means, then the population
means are unlikely to be the same.
• You can also use an independent 2-sample t-test to evaluate whether
the means of two populations are different by a specific amount.
Independent 2-
2-sample t-
t-test
When to use an independent 2-sample t-test
• Use an independent 2-sample t-test with continuous data from two
independent random samples.
• Samples are independent if observations from one sample are not
related to the observations from the other sample.
• In this example, the strengths of pellets from supplier A’s process are
not influenced by the strengths of supplier B’s pellets; thus, samples
are independent.
Independent 2-
2-sample t-
t-test
When to use an independent 2-sample t-test
• The test also assumes that the data come from normally distributed
populations
• However, the test is robust to violations of this assumption, provided
the observations are collected randomly and the data are continuous,
unimodal, and reasonably symmetric.
Independent 2-
2-sample t-
t-test
Why use an independent 2-sample t-test
An independent 2-sample t-test answers questions such as:
• Are the means of a product characteristic between two suppliers
comparable?
• Is one formulation of a product better on average than another?
For example,
• Is the mean viscosity of oil from two different vendors similar?
• Is one ink formulation brighter on average than another?
Conducting 2-
2-sample t-
t-test
A 2-sample t-test will show whether the mean strengths of the plastic
from the two suppliers are different.
Conducting 2-
2-sample t-
t-test
Assume unequal variances
• The t-test will be more powerful if the variances of the two
populations are equal.
• However, assuming the variances are equal when they are not may
give misleading results.
• Thus, if you have any doubt, do not assume equality.
Conducting 2-
2-sample t-
t-test
2-Sample t
1. Choose Stat > Basic Statistics
> 2-Sample t.
2. Complete the dialog box as
shown below.
3. Click Graphs.
4. Check Boxplot.
5. Click OK in each dialog box.
175
170
165
Data
160
155
150
SupplrA SupplrB
Comparing variances
• You already created boxplots that suggest that the variability in the
strength of the two plastics is different.
• Now, conduct a format hypothesis test.
• The hypotheses for the 2 Variances test are:
H : σ /σ = 1
H : σ /σ ≠ 1
• To increase the power of the equality of variance test, use a larger
type I error rate, such as = 0.10.
Comparing variances
2 Variances
1. Choose Stat > Basic Statistics
> 2 Variances.
2. Complete the dialog box as
shown below.
3. Click Options and enter 90 in
Confidence level.
4. Click OK in each dialog box.
SupplrB
2 3 4 5 6 7 8
SupplrA
SupplrB
Mean Mean
Median Median
161 162 163 164 165 166 167 158 159 160 161 162
Final considerations
Additional considerations
When using a 2-sample t-test:
• Samples must be independent and random.
• Sample data must be continuous.
• Sample data are from normally distributed populations.
Anode Height
Exercise F: Anode Height
Data collection
• Samples from the anode insertion process are available but expensive
to measure, so the engineers must not collect more than necessary.
Anode Height
Anode Height (continued)
Instructions
1. Use Power and Sample Size to determine the number of observations required
to detect that the process has improved by at least 0.4 mm with a power of
0.80. Assume a standard deviation of 0.55 mm and specify a greater than
alternative hypothesis.
2. What is the required sample size if the t-test users a two-sided alternative
hypothesis?
3. Use Graphical Summary to check each sample for normality.
4. Check the two samples to verify the assumption of equal variance. Use alpha =
0.10.
Anode Height
Anode Height (continued)
Instructions
5. Use a 2-sample t-test to determine if the anode height has improved after the
process change. Specify the “greater than” alternative hypothesis, and create a
boxplot to visually compare the two sample’s data.
6. Use the Layout Tool to put the two graphical summaries and the boxplot on a
single graph for presentation.
7. Copy the output for the two sample t-test from the Session window. On the
Layout graph, click on the Graph Annotation toolbar to draw a text box in the
lower right corner that was left blank in step 6 and paste the Session Window
output for the 2-sample t-test.
8. Send the finished Layout graph to Microsoft PowerPoint.
Anode Height
Data set
• L01_Dataset_Anode.MPJ
Variable Description
Anode Height The height of an electrical anode above the
ceramic surface (mm)
Process Change Before or After the process change
Paired t-
t-test
Example 6: Parking Cars
Problem
• A consumer group wants to determine whether two popular cars
differ in handling ability.
• To measure the handling ability of the two cars, the group compares
the time it takes drivers to parallel park each car.
Data collection
• Twenty drivers park both cars in random order and parking time in
seconds is recorded.
Paired t-
t-test
Example 6: Parking Cars
Tools
• Open Excel
• Assistant – Paired t
• Calculator
• Boxplot
• Help
• Outlier Test
Paired t-
t-test
Example 6: Parking Cars
Data set
• L01_Dataset_Parking.xlsx
Variable Description
Car_A Park times driving car A (sec)
Car_B Park times driving car B (sec)
Importing data from Excel
Excel file structure
Before you import the data into
Minitab, consider the structure of
the Excel file:
• There are no variable names.
• The data begin in Row 1.
• The worksheet has no spaces or
blank rows.
Paired t-
t-test
What is a paired t-test
• A paired t-test helps determine whether the mean difference
between paired observations is significant.
• Statistically, the paired t-test is equivalent to performing a 1-sample t-
test on the differences.
• A paired t-test also helps you evaluate whether the mean difference is
equal to a specific value.
Paired t-
t-test
Paired observations are related. Examples include:
• Weights recorded for individuals before and after an exercise
program.
• Measurements of the same part taken with two different measuring
devices.
Paired t-
t-test
When to use a paired t-test
• Use a paired t-test with a random sample of paired observations.
• The test also assumes that the paired difference come from a
normally distributed population.
• However, the test is robust to violations of this assumption, provided
the observations are collected randomly and the data are continuous,
unimodal, and reasonably symmetric.
Paired t-
t-test
Why use a paired t-test
A paired t-test answers questions such as:
• Does a new treatment result in a difference in the product?
• Do two different instruments provide similar measurements for the same
sample?
For example,
• Does treating lumber with certain chemicals increase its usable life?
(Assume that each sample piece of lumber is cut in half; one half is treated
with the chemicals while the other half is untreated.)
• Do two calipers differ in measuring parts? (Assume that several parts are
measured once with each caliper.)
Conducting a paired t-
t-test
• The consumer group wants to determine whether drivers can park
one car more quickly than the other.
• Because the data are paired (each individual parked both cars), use a
paired t-test to test the following hypotheses:
• : The mean difference between paired observations in the population is
zero.
• : The mean difference between paired observations in the population is
not zero.
Conducting a paired t-
t-test
Paired t
1. Choose Assistant > Hypothesis
Tests.
2. Under Compare two samples
with each other, choose
Paired t.
3. Complete the dialog box as
shown below.
4. Click OK.
50 Car_A
Car_B
40
30
20
Power is a function of the sample size and the standard deviation. To detect smaller differences, consider
increasing the sample size.
Interpreting your results
Summary Report
• The Summary Report contains the overall conclusions from the data
analysis and is designed to be easy to explain to Management.
• The mean parking times are 34.86 seconds for car A and 32.9 seconds
for car B.
• The mean difference is about 2 seconds.
• You can be 95% confident that the difference in parking times
between car A and car B is at least 0.17 seconds, and at most 3.76
seconds.
Individual Samples
Statistics Car_A Car_B
Mean 34.865 32.9
Distribution of the Differences Standard deviation 7.5916 7.2743
Where are the differences relative to zero?
0 Comments
• Test: You can conclude that the means differ at the 0.05
level of significance. The mean of the paired differences is
greater than zero.
• CI: Quantifies the uncertainty associated with estimating
the mean difference from sample data. You can be 95%
confident that the true mean difference is between 0.16988
and 3.7601.
• Distribution of Differences: Compare the location of the
differences to zero. Look for unusual differences before
interpreting the results of the test.
Boxplot
• One sample difference is identified as unusual.
• We want to investigate this difference using a boxplot and determine
whether it is statistically an outlier.
• Before doing these, store the pairwise differences in the worksheet.
Formulas
• Check Assign as a formula to assign the expression as a formula to the
column.
• This allows Minitab to recalculate the values automatically whenever
you add, remove, or change associated data.
Boxplot
1. Choose Calc > Calculator.
2. Complete the dialog box as
shown below.
3. Click OK.
4. Choose Graph > Boxplot.
5. Under One Y, choose Simple,
Click OK.
6. In Graph variables, enter Diff.
7. Click OK.
Diff
0
-5
-10
Null hypothesis All data values come from the same normal
population
Alternative hypothesis Smallest or largest data value is an outlier
Significance level α = 0.05
Grubbs' Test
-10 -5 0 5 10
Diff
Final considerations
Summary and conclusions
• On average, drivers took about 2.0 seconds longer to park car A than
car B. This difference, although small, is statistically significant.
• Given the confidence interval for the mean difference, is a 0.17
second difference, or even a 3.76 second difference, of any practical
importance?
• If slightly longer parking time increases driver frustration, then the
difference may be important.
• This difference may also be more important to drivers who parallel
park frequently.
Final considerations
Additional considerations
When using a paired t-test:
• Observations must be paired.
• The data must be continuous.
• The paired differences are from a normally distributed population.
Final considerations
Additional considerations
• The t-test procedure is robust to violations of the normality
assumption, provided that pairs of observations are collected
randomly and the data are continuous, unimodal, and reasonably
symmetric.
Final considerations
Additional considerations
Using paired observations eliminates the variability caused by
individuals.
For example,
• Driver 1 took 18.9 seconds to park car A and 18.2 seconds to park car B.
• In contrast, driver 18 took 43.4 and 37.3 seconds to park the same cars.
• Analyzing the differences for each driver eliminates the variability between
drivers from the calculations, increasing the power of the test.
Comparing Calipers
Exercise G: Comparing Calipers
Problem
• A company considers purchasing two different gages to measure ball
bearings: a caliper by EasyGage and a caliper by ToolIt.
• Compare the two brands to determine whether they yield different average
measurements.
• Use an -level of 0.05 for the test.
Comparing Calipers
Exercise G: Comparing Calipers
Instructions
1. Use File > Open Worksheet to open the data. Be sure to change the
Variable Names option to None.
2. Name the columns (Bearing, EasyGage, ToolIt).
3. Use the Assistant paired t-test to determine whether the measurements
given by the calipers are different.
4. Use Calculator to create a Diff column. Check the differences for
normality.
Comparing Calipers
Exercise G: Comparing Calipers
Data collection
• Analysts randomly sample 12 ball bearings from the process.
• A single operator measures each ball bearing with the EasyGage caliper
and with the ToolIt caliper.
Comparing Calipers
Exercise G: Comparing Calipers
Data set
• L01_Dataset_Calipers.xlsx
Variable Description
Bearing Identifies the bearing measured
EasyGage Bearing diameter measured by EasyGage caliper (cm)
ToolIt Bearing diameter measured by TooIt caliper (cm)