Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
June 7, 2017
Agenda
1 Session 15
Confidence Interval
2 References
Session 15 References
Confidence Interval
Z has a standard normal distribution
X
Z= .
/ n
Confidence Interval
Z has a standard normal distribution
X
Z= .
/ n
Confidence Interval
Z has a standard normal distribution
X
Z= .
/ n
Confidence Interval
P{L U} = 1
where 0 1.
Once we have selected the sample and computed l and u, the
resulting confidence interval for is l u.
l and u are called the lower- and upper-confidence limits,
respectively, and 1 is called the confidence coefficient.
Session 15 References
Confidence Interval
P{L U} = 1
where 0 1.
Once we have selected the sample and computed l and u, the
resulting confidence interval for is l u.
l and u are called the lower- and upper-confidence limits,
respectively, and 1 is called the confidence coefficient.
Session 15 References
Confidence Interval
P{L U} = 1
where 0 1.
Once we have selected the sample and computed l and u, the
resulting confidence interval for is l u.
l and u are called the lower- and upper-confidence limits,
respectively, and 1 is called the confidence coefficient.
Session 15 References
Confidence Interval
Because Z has a standard normal distribution, we may write
X
P{z/2 z/2 } = 1 .
/ n
Definition
If x is the sample mean of a random sample of size n from a
normal population with known variance 2 , a 100(1 )% CI on
is given by
x z/2 x + z/2
n n
where z/2 is the upper 100/2 percentage point of the standard
normal distribution.
Session 15 References
Example
Ten measurements of impact energy (J) on specimens of
A238 steel cut at 60o C are as follows: 64.1, 64.7, 64.5, 64.6,
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
Assume that impact energy is normally distributed with
= 1J.
We want to find a 95% CI for , the mean impact energy.
The required quantities are
z/2 = z0.025 = abs(norminv (0.025)) = 1.96, n = 10, = 1, and
x = 64.46. The resulting 95% CI is found as follows:
Session 15 References
Example
Ten measurements of impact energy (J) on specimens of
A238 steel cut at 60o C are as follows: 64.1, 64.7, 64.5, 64.6,
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
Assume that impact energy is normally distributed with
= 1J.
We want to find a 95% CI for , the mean impact energy.
The required quantities are
z/2 = z0.025 = abs(norminv (0.025)) = 1.96, n = 10, = 1, and
x = 64.46. The resulting 95% CI is found as follows:
Session 15 References
Example
Ten measurements of impact energy (J) on specimens of
A238 steel cut at 60o C are as follows: 64.1, 64.7, 64.5, 64.6,
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
Assume that impact energy is normally distributed with
= 1J.
We want to find a 95% CI for , the mean impact energy.
The required quantities are
z/2 = z0.025 = abs(norminv (0.025)) = 1.96, n = 10, = 1, and
x = 64.46. The resulting 95% CI is found as follows:
Session 15 References
Example
Ten measurements of impact energy (J) on specimens of
A238 steel cut at 60o C are as follows: 64.1, 64.7, 64.5, 64.6,
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
Assume that impact energy is normally distributed with
= 1J.
We want to find a 95% CI for , the mean impact energy.
The required quantities are
z/2 = z0.025 = abs(norminv (0.025)) = 1.96, n = 10, = 1, and
x = 64.46. The resulting 95% CI is found as follows:
Session 15 References
Example
Ten measurements of impact energy (J) on specimens of
A238 steel cut at 60o C are as follows: 64.1, 64.7, 64.5, 64.6,
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
Assume that impact energy is normally distributed with
= 1J.
We want to find a 95% CI for , the mean impact energy.
The required quantities are
z/2 = z0.025 = abs(norminv (0.025)) = 1.96, n = 10, = 1, and
x = 64.46. The resulting 95% CI is found as follows:
Session 15 References
Example
x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08
Example
x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08
Example
x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08
Example
x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08
Example
x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08
Example
The true value of is unknown and the statement
63.84 65.08 is either correct or incorrect.
The correct interpretation lies in the realization that a CI is a
random interval because in the probability statement defining
the end-points of the interval, L and U are random variables.
Consequently, the correct interpretation of a 100(1 )% CI
depends on the relative frequency view of probability.
Specifically, if an infinite number of random samples are
collected and a 100(1 )% confidence interval for is
computed from each sample, 100(1 )% of these intervals
will contain the true value of .
Lets see the following intervals. Do all of them contains ?
Session 15 References
Example
The true value of is unknown and the statement
63.84 65.08 is either correct or incorrect.
The correct interpretation lies in the realization that a CI is a
random interval because in the probability statement defining
the end-points of the interval, L and U are random variables.
Consequently, the correct interpretation of a 100(1 )% CI
depends on the relative frequency view of probability.
Specifically, if an infinite number of random samples are
collected and a 100(1 )% confidence interval for is
computed from each sample, 100(1 )% of these intervals
will contain the true value of .
Lets see the following intervals. Do all of them contains ?
Session 15 References
Example
The true value of is unknown and the statement
63.84 65.08 is either correct or incorrect.
The correct interpretation lies in the realization that a CI is a
random interval because in the probability statement defining
the end-points of the interval, L and U are random variables.
Consequently, the correct interpretation of a 100(1 )% CI
depends on the relative frequency view of probability.
Specifically, if an infinite number of random samples are
collected and a 100(1 )% confidence interval for is
computed from each sample, 100(1 )% of these intervals
will contain the true value of .
Lets see the following intervals. Do all of them contains ?
Session 15 References
Example
The true value of is unknown and the statement
63.84 65.08 is either correct or incorrect.
The correct interpretation lies in the realization that a CI is a
random interval because in the probability statement defining
the end-points of the interval, L and U are random variables.
Consequently, the correct interpretation of a 100(1 )% CI
depends on the relative frequency view of probability.
Specifically, if an infinite number of random samples are
collected and a 100(1 )% confidence interval for is
computed from each sample, 100(1 )% of these intervals
will contain the true value of .
Lets see the following intervals. Do all of them contains ?
Session 15 References
Example
The true value of is unknown and the statement
63.84 65.08 is either correct or incorrect.
The correct interpretation lies in the realization that a CI is a
random interval because in the probability statement defining
the end-points of the interval, L and U are random variables.
Consequently, the correct interpretation of a 100(1 )% CI
depends on the relative frequency view of probability.
Specifically, if an infinite number of random samples are
collected and a 100(1 )% confidence interval for is
computed from each sample, 100(1 )% of these intervals
will contain the true value of .
Lets see the following intervals. Do all of them contains ?
Session 15 References
Example
Example
Notice that one of the intervals fails to contain the true value
of .
If this were a 95% confidence interval, in the long run only 5%
of the intervals would fail to contain .
Now in practice, we obtain only one random sample and
calculate one confidence interval.
Since this interval either will or will not contain the true value
of , it is not reasonable to attach a probability level to this
specific event.
We dont know if the statement is true for this specific sample,
but the method used to obtain the interval [l, u] yields correct
statements 100(1 )% of the time.
Session 15 References
Example
Notice that one of the intervals fails to contain the true value
of .
If this were a 95% confidence interval, in the long run only 5%
of the intervals would fail to contain .
Now in practice, we obtain only one random sample and
calculate one confidence interval.
Since this interval either will or will not contain the true value
of , it is not reasonable to attach a probability level to this
specific event.
We dont know if the statement is true for this specific sample,
but the method used to obtain the interval [l, u] yields correct
statements 100(1 )% of the time.
Session 15 References
Example
Notice that one of the intervals fails to contain the true value
of .
If this were a 95% confidence interval, in the long run only 5%
of the intervals would fail to contain .
Now in practice, we obtain only one random sample and
calculate one confidence interval.
Since this interval either will or will not contain the true value
of , it is not reasonable to attach a probability level to this
specific event.
We dont know if the statement is true for this specific sample,
but the method used to obtain the interval [l, u] yields correct
statements 100(1 )% of the time.
Session 15 References
Example
Notice that one of the intervals fails to contain the true value
of .
If this were a 95% confidence interval, in the long run only 5%
of the intervals would fail to contain .
Now in practice, we obtain only one random sample and
calculate one confidence interval.
Since this interval either will or will not contain the true value
of , it is not reasonable to attach a probability level to this
specific event.
We dont know if the statement is true for this specific sample,
but the method used to obtain the interval [l, u] yields correct
statements 100(1 )% of the time.
Session 15 References
Example
Notice that one of the intervals fails to contain the true value
of .
If this were a 95% confidence interval, in the long run only 5%
of the intervals would fail to contain .
Now in practice, we obtain only one random sample and
calculate one confidence interval.
Since this interval either will or will not contain the true value
of , it is not reasonable to attach a probability level to this
specific event.
We dont know if the statement is true for this specific sample,
but the method used to obtain the interval [l, u] yields correct
statements 100(1 )% of the time.
Session 15 References
Example
What would have happened if we had chosen a higher level of
confidence, say, 99%?
At = 0.01 we find z/2 = z0.01/2 = z0.005
= abs(norminv (0.005)) = 2.58. While for = 0.05,
z0.025 = 1.96.
Thus, the length of the 95% confidence interval is
2(1.96/ n) = 3.92/ n
whereas the length of the 99% CI is
2(2.58/ n) = 5.16/ n
Thus, the 99% CI is longer than the 95% CI.
Generally, for a fixed sample size n and standard deviation ,
the higher the confidence level, the longer the resulting CI.
Session 15 References
Example
What would have happened if we had chosen a higher level of
confidence, say, 99%?
At = 0.01 we find z/2 = z0.01/2 = z0.005
= abs(norminv (0.005)) = 2.58. While for = 0.05,
z0.025 = 1.96.
Thus, the length of the 95% confidence interval is
2(1.96/ n) = 3.92/ n
whereas the length of the 99% CI is
2(2.58/ n) = 5.16/ n
Thus, the 99% CI is longer than the 95% CI.
Generally, for a fixed sample size n and standard deviation ,
the higher the confidence level, the longer the resulting CI.
Session 15 References
Example
What would have happened if we had chosen a higher level of
confidence, say, 99%?
At = 0.01 we find z/2 = z0.01/2 = z0.005
= abs(norminv (0.005)) = 2.58. While for = 0.05,
z0.025 = 1.96.
Thus, the length of the 95% confidence interval is
2(1.96/ n) = 3.92/ n
whereas the length of the 99% CI is
2(2.58/ n) = 5.16/ n
Thus, the 99% CI is longer than the 95% CI.
Generally, for a fixed sample size n and standard deviation ,
the higher the confidence level, the longer the resulting CI.
Session 15 References
Example
What would have happened if we had chosen a higher level of
confidence, say, 99%?
At = 0.01 we find z/2 = z0.01/2 = z0.005
= abs(norminv (0.005)) = 2.58. While for = 0.05,
z0.025 = 1.96.
Thus, the length of the 95% confidence interval is
2(1.96/ n) = 3.92/ n
whereas the length of the 99% CI is
2(2.58/ n) = 5.16/ n
Thus, the 99% CI is longer than the 95% CI.
Generally, for a fixed sample size n and standard deviation ,
the higher the confidence level, the longer the resulting CI.
Session 15 References
Example
What would have happened if we had chosen a higher level of
confidence, say, 99%?
At = 0.01 we find z/2 = z0.01/2 = z0.005
= abs(norminv (0.005)) = 2.58. While for = 0.05,
z0.025 = 1.96.
Thus, the length of the 95% confidence interval is
2(1.96/ n) = 3.92/ n
whereas the length of the 99% CI is
2(2.58/ n) = 5.16/ n
Thus, the 99% CI is longer than the 95% CI.
Generally, for a fixed sample size n and standard deviation ,
the higher the confidence level, the longer the resulting CI.
Session 15 References
Example
What would have happened if we had chosen a higher level of
confidence, say, 99%?
At = 0.01 we find z/2 = z0.01/2 = z0.005
= abs(norminv (0.005)) = 2.58. While for = 0.05,
z0.025 = 1.96.
Thus, the length of the 95% confidence interval is
2(1.96/ n) = 3.92/ n
whereas the length of the 99% CI is
2(2.58/ n) = 5.16/ n
Thus, the 99% CI is longer than the 95% CI.
Generally, for a fixed sample size n and standard deviation ,
the higher the confidence level, the longer the resulting CI.
Session 15 References
Recommendation
The length of a confidence interval is a measure of the
precision of estimation. We see that precision is inversely
related to the confidence level.
It is desirable to obtain a confidence interval that is short
enough for decision-making purposes and that also has
adequate confidence.
One way to achieve this is by choosing the sample size n to be
large enough to give a CI of specified length or precision with
prescribed confidence.
Session 15 References
Recommendation
The length of a confidence interval is a measure of the
precision of estimation. We see that precision is inversely
related to the confidence level.
It is desirable to obtain a confidence interval that is short
enough for decision-making purposes and that also has
adequate confidence.
One way to achieve this is by choosing the sample size n to be
large enough to give a CI of specified length or precision with
prescribed confidence.
Session 15 References
Recommendation
The length of a confidence interval is a measure of the
precision of estimation. We see that precision is inversely
related to the confidence level.
It is desirable to obtain a confidence interval that is short
enough for decision-making purposes and that also has
adequate confidence.
One way to achieve this is by choosing the sample size n to be
large enough to give a CI of specified length or precision with
prescribed confidence.
Session 15 References
n, 2E and
Notice the general relationship between sample size, desired length
of the confidence interval 2E, confidence level 100(1 ), and
standard deviation :
As the desired length of the interval 2E decreases, the
required sample size n increases for a fixed value of and
specified confidence.
As increases, the required sample size n increases for a fixed
desired length 2E and specified confidence
As the level of confidence increases, the required sample size n
increases for fixed desired length 2E and standard deviation .
Session 15 References
Definition
When n is large, the quantity
X
S/ n
Definition
When n is large, the quantity
X
S/ n
Definition
When n is large, the quantity
X
S/ n
Class Activity
1 For a normal population with known variance, answer the
following questions:
What is theconfidence level for the
interval
x 2.14/ n x + 2.14/ n?
What value of z/2 gives 98% confidence?
2 A confidence interval estimate is desired for the gain in a
circuit on a semiconductor device. Assume that gain is
normally distributed with standard deviation = 20.
Find a 95% CI for when n = 10 and x = 1000.
Find a 95% CI for when n = 25 and x = 1000.
Session 15 References
Class Activity
1 For a normal population with known variance, answer the
following questions:
What is theconfidence level for the
interval
x 2.14/ n x + 2.14/ n?
What value of z/2 gives 98% confidence?
2 A confidence interval estimate is desired for the gain in a
circuit on a semiconductor device. Assume that gain is
normally distributed with standard deviation = 20.
Find a 95% CI for when n = 10 and x = 1000.
Find a 95% CI for when n = 25 and x = 1000.
Session 15 References
Class Activity
1 For a normal population with known variance, answer the
following questions:
What is theconfidence level for the
interval
x 2.14/ n x + 2.14/ n?
What value of z/2 gives 98% confidence?
2 A confidence interval estimate is desired for the gain in a
circuit on a semiconductor device. Assume that gain is
normally distributed with standard deviation = 20.
Find a 95% CI for when n = 10 and x = 1000.
Find a 95% CI for when n = 25 and x = 1000.
Session 15 References
Class Activity
1 For a normal population with known variance, answer the
following questions:
What is theconfidence level for the
interval
x 2.14/ n x + 2.14/ n?
What value of z/2 gives 98% confidence?
2 A confidence interval estimate is desired for the gain in a
circuit on a semiconductor device. Assume that gain is
normally distributed with standard deviation = 20.
Find a 95% CI for when n = 10 and x = 1000.
Find a 95% CI for when n = 25 and x = 1000.
Session 15 References
Class Activity
1 For a normal population with known variance, answer the
following questions:
What is theconfidence level for the
interval
x 2.14/ n x + 2.14/ n?
What value of z/2 gives 98% confidence?
2 A confidence interval estimate is desired for the gain in a
circuit on a semiconductor device. Assume that gain is
normally distributed with standard deviation = 20.
Find a 95% CI for when n = 10 and x = 1000.
Find a 95% CI for when n = 25 and x = 1000.
Session 15 References
Class Activity
1 For a normal population with known variance, answer the
following questions:
What is theconfidence level for the
interval
x 2.14/ n x + 2.14/ n?
What value of z/2 gives 98% confidence?
2 A confidence interval estimate is desired for the gain in a
circuit on a semiconductor device. Assume that gain is
normally distributed with standard deviation = 20.
Find a 95% CI for when n = 10 and x = 1000.
Find a 95% CI for when n = 25 and x = 1000.
Session 15 References
Class Activity
3 Consider the previous problem. How large must n be if the
length of the 95% CI is to be E = 40?
4 Following are two confidence interval estimates of the mean
of the cycles to failure of an automotive door latch
mechanism (the test was conducted at an elevated stress level
to accelerate the failure). 3124.9 3215.7 and
3110.5 3230.1
What is the value of the sample mean cycles to failure?
The confidence level for one of these CIs is 95% and the
confidence level for the other is 99%. Both CIs are calculated
from the same sample data. Which is the 95% CI? Explain
why.
Session 15 References
Class Activity
3 Consider the previous problem. How large must n be if the
length of the 95% CI is to be E = 40?
4 Following are two confidence interval estimates of the mean
of the cycles to failure of an automotive door latch
mechanism (the test was conducted at an elevated stress level
to accelerate the failure). 3124.9 3215.7 and
3110.5 3230.1
What is the value of the sample mean cycles to failure?
The confidence level for one of these CIs is 95% and the
confidence level for the other is 99%. Both CIs are calculated
from the same sample data. Which is the 95% CI? Explain
why.
Session 15 References
Class Activity
3 Consider the previous problem. How large must n be if the
length of the 95% CI is to be E = 40?
4 Following are two confidence interval estimates of the mean
of the cycles to failure of an automotive door latch
mechanism (the test was conducted at an elevated stress level
to accelerate the failure). 3124.9 3215.7 and
3110.5 3230.1
What is the value of the sample mean cycles to failure?
The confidence level for one of these CIs is 95% and the
confidence level for the other is 99%. Both CIs are calculated
from the same sample data. Which is the 95% CI? Explain
why.
Session 15 References
Class Activity
3 Consider the previous problem. How large must n be if the
length of the 95% CI is to be E = 40?
4 Following are two confidence interval estimates of the mean
of the cycles to failure of an automotive door latch
mechanism (the test was conducted at an elevated stress level
to accelerate the failure). 3124.9 3215.7 and
3110.5 3230.1
What is the value of the sample mean cycles to failure?
The confidence level for one of these CIs is 95% and the
confidence level for the other is 99%. Both CIs are calculated
from the same sample data. Which is the 95% CI? Explain
why.
Session 15 References
Class Activity
5 An article of the Transactions of the American Fisheries
Society reports the results of a study to investigate the
mercury contamination in largemouth bass. A sample of fish
was selected from 53 Florida lakes and mercury concentration
(MC) in the muscle tissue was measured (ppm)
as:
Session 15 References
Class Activity
Class Activity
6 n = 100 random samples of water from a fresh water lake
were taken and the calcium concentration (milligrams per
liter) measured. A 95% CI on the mean calcium concentration
is 0.49 0.82.
Would a 99% CI calculated from the same sample data been
longer or shorter?
Consider the following statement: There is a 95% chance that
is between 0.49 and 0.82. Is this statement correct? Explain
your answer.
Session 15 References
Class Activity
6 n = 100 random samples of water from a fresh water lake
were taken and the calcium concentration (milligrams per
liter) measured. A 95% CI on the mean calcium concentration
is 0.49 0.82.
Would a 99% CI calculated from the same sample data been
longer or shorter?
Consider the following statement: There is a 95% chance that
is between 0.49 and 0.82. Is this statement correct? Explain
your answer.
Session 15 References
Class Activity
6 n = 100 random samples of water from a fresh water lake
were taken and the calcium concentration (milligrams per
liter) measured. A 95% CI on the mean calcium concentration
is 0.49 0.82.
Would a 99% CI calculated from the same sample data been
longer or shorter?
Consider the following statement: There is a 95% chance that
is between 0.49 and 0.82. Is this statement correct? Explain
your answer.
Session 15 References
References I