15

Session 15 References
Probability and Statistical Course.
Instructor: Dr.Ing.(c) Sergio A. Abreo C.
Escuela de Ingenieras Electrica, Electronica y de

Telecomunicaciones
Universidad Industrial de Santander
June 7, 2017
Connectivity and Signal Processing Research Group.

info@cps.uis.edu.co https://cpsuis.wordpress.com
Agenda
1 Session 15
Confidence Interval
2 References
CI on the mean of a normal distribution: variance known
Confidence Interval
Z has a standard normal distribution
X
Z= .
/ n
A confidence interval estimate for is an interval of the form

l u, where the end-points l and u are computed from
the sample data.
Because different samples will produce different values of l
and u, these end-points are values of random variables L and
U, respectively.
Suppose that we can determine values of L and U such that
the following probability statement is true:
Confidence Interval
X
Z= .
/ n

the sample data.
U, respectively.
Confidence Interval
X
Z= .
/ n

the sample data.
U, respectively.
Confidence Interval
P{L U} = 1
where 0 1.
Once we have selected the sample and computed l and u, the
resulting confidence interval for is l u.
l and u are called the lower- and upper-confidence limits,
respectively, and 1 is called the confidence coefficient.
Confidence Interval
P{L U} = 1
where 0 1.
Confidence Interval
P{L U} = 1
where 0 1.
Confidence Interval
Because Z has a standard normal distribution, we may write
X
P{z/2 z/2 } = 1 .
/ n
Now manipulating the quantities inside the brackets

P{X z/2 X + z/2 } = 1 .
n n
This leads to the following definition:

Definition
If x is the sample mean of a random sample of size n from a
normal population with known variance 2 , a 100(1 )% CI on
is given by

x z/2 x + z/2
n n
where z/2 is the upper 100/2 percentage point of the standard
normal distribution.
Example
Ten measurements of impact energy (J) on specimens of
A238 steel cut at 60o C are as follows: 64.1, 64.7, 64.5, 64.6,
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
Assume that impact energy is normally distributed with
= 1J.
We want to find a 95% CI for , the mean impact energy.
The required quantities are
z/2 = z0.025 = abs(norminv (0.025)) = 1.96, n = 10, = 1, and
x = 64.46. The resulting 95% CI is found as follows:
Example
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
= 1J.
Example
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
= 1J.
Example
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
= 1J.
Example
64.5, 64.3, 64.6, 64.8, 64.2, and 64.3.
= 1J.
Example

x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08
How does one interpret a confidence interval?

It is tempting to conclude that is within this interval with
probability 0.95.
However, with a little reflection, its easy to see that this
cannot be correct.
Example

x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08

probability 0.95.
cannot be correct.
Example

x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08

probability 0.95.
cannot be correct.
Example

x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08

probability 0.95.
cannot be correct.
Example

x z/2 x + z/2
n n
1 1
64.46 1.96 64.46 + 1.96
10 10
63.84 65.08

probability 0.95.
cannot be correct.
Example
The true value of is unknown and the statement
63.84 65.08 is either correct or incorrect.
The correct interpretation lies in the realization that a CI is a
random interval because in the probability statement defining
the end-points of the interval, L and U are random variables.
Consequently, the correct interpretation of a 100(1 )% CI
depends on the relative frequency view of probability.
Specifically, if an infinite number of random samples are
collected and a 100(1 )% confidence interval for is
computed from each sample, 100(1 )% of these intervals
will contain the true value of .
Lets see the following intervals. Do all of them contains ?
Example
Example
Example
Example
Example
Figure : Repeated construction of a confidence interval for . Taken

from [Montgomery and Runger, 2010]
Example
Notice that one of the intervals fails to contain the true value
of .
If this were a 95% confidence interval, in the long run only 5%
of the intervals would fail to contain .
Now in practice, we obtain only one random sample and
calculate one confidence interval.
Since this interval either will or will not contain the true value
of , it is not reasonable to attach a probability level to this
specific event.
We dont know if the statement is true for this specific sample,
but the method used to obtain the interval [l, u] yields correct
statements 100(1 )% of the time.
Example
of .
specific event.
Example
of .
specific event.
Example
of .
specific event.
Example
of .
specific event.
Example
What would have happened if we had chosen a higher level of
confidence, say, 99%?
At = 0.01 we find z/2 = z0.01/2 = z0.005
= abs(norminv (0.005)) = 2.58. While for = 0.05,
z0.025 = 1.96.
Thus, the length of the 95% confidence interval is

2(1.96/ n) = 3.92/ n
whereas the length of the 99% CI is

2(2.58/ n) = 5.16/ n
Thus, the 99% CI is longer than the 95% CI.
Generally, for a fixed sample size n and standard deviation ,
the higher the confidence level, the longer the resulting CI.
Example
At = 0.01 we find z/2 = z0.01/2 = z0.005
z0.025 = 1.96.

2(1.96/ n) = 3.92/ n

2(2.58/ n) = 5.16/ n
Example
At = 0.01 we find z/2 = z0.01/2 = z0.005
z0.025 = 1.96.

2(1.96/ n) = 3.92/ n

2(2.58/ n) = 5.16/ n
Example
At = 0.01 we find z/2 = z0.01/2 = z0.005
z0.025 = 1.96.

2(1.96/ n) = 3.92/ n

2(2.58/ n) = 5.16/ n
Example
At = 0.01 we find z/2 = z0.01/2 = z0.005
z0.025 = 1.96.

2(1.96/ n) = 3.92/ n

2(2.58/ n) = 5.16/ n
Example
At = 0.01 we find z/2 = z0.01/2 = z0.005
z0.025 = 1.96.

2(1.96/ n) = 3.92/ n

2(2.58/ n) = 5.16/ n
Recommendation
The length of a confidence interval is a measure of the
precision of estimation. We see that precision is inversely
related to the confidence level.
It is desirable to obtain a confidence interval that is short
enough for decision-making purposes and that also has
adequate confidence.
One way to achieve this is by choosing the sample size n to be
large enough to give a CI of specified length or precision with
prescribed confidence.
Recommendation
Recommendation

Choice of Sample Size

The precision of the confidence interval is 2z/2 / n
This means that in using x to estimate , the error

E = |x | is less than or equal to z/2 / n with confidence
100(1 ).
The appropriate sample size is found by choosing n such that

z/2 / n = E .
Figure : Error in estimating with x. Taken from

[Montgomery and Runger, 2010]

If x is used as an estimate of , we can be 100(1 )% confident
that the error |x | will not exceed a specified amount E when
the sample size is
z/2 2
n=( )
E
If the right-hand side of Equation is not an integer, it must be
rounded up. This will ensure that the level of confidence does not
fall below 100(1 )%. Notice that 2E is the length of the
resulting confidence interval.
n, 2E and
Notice the general relationship between sample size, desired length
of the confidence interval 2E, confidence level 100(1 ), and
standard deviation :
As the desired length of the interval 2E decreases, the
required sample size n increases for a fixed value of and
specified confidence.
As increases, the required sample size n increases for a fixed
desired length 2E and specified confidence
As the level of confidence increases, the required sample size n
increases for fixed desired length 2E and standard deviation .
A Large-Sample Confidence Interval for

We have assumed that the population distribution is normal
with unknown mean and known standard deviation .
Let X1 , X2 , .., Xn be a random sample from a population with
unknown mean and variance 2 .
Now if the sample size n is large, the central limit theorem
implies that X has approximately a normal distribution with
mean and variance 2 /n.

Therefore Z = (X )/(/ n) has approximately a standard
However, the standard deviation is unknown.
It turns out that when n is large, replacing by the sample
standard deviation S has little effect on the distribution of Z .










Definition
When n is large, the quantity
X

S/ n
has an approximate standard normal distribution. Consequently,

s s
x z/2 x + z/2
n n
is a large sample confidence interval for , with confidence level of

approximately 100(1 )%.
This equation holds regardless of the shape of the population
distribution.
Generally n should be at least 40 to use this result reliably.
Definition
X

S/ n

s s
x z/2 x + z/2
n n

distribution.
Definition
X

S/ n

s s
x z/2 x + z/2
n n

distribution.
CI on of a normal distribution: 2 known
Class Activity
1 For a normal population with known variance, answer the
following questions:
What is theconfidence level for the
interval
x 2.14/ n x + 2.14/ n?
What value of z/2 gives 98% confidence?
2 A confidence interval estimate is desired for the gain in a
circuit on a semiconductor device. Assume that gain is
normally distributed with standard deviation = 20.
Find a 95% CI for when n = 10 and x = 1000.
Class Activity
interval
x 2.14/ n x + 2.14/ n?
Class Activity
interval
x 2.14/ n x + 2.14/ n?
Class Activity
interval
x 2.14/ n x + 2.14/ n?
Class Activity
interval
x 2.14/ n x + 2.14/ n?
Class Activity
interval
x 2.14/ n x + 2.14/ n?
Class Activity
3 Consider the previous problem. How large must n be if the
length of the 95% CI is to be E = 40?
4 Following are two confidence interval estimates of the mean
of the cycles to failure of an automotive door latch
mechanism (the test was conducted at an elevated stress level
to accelerate the failure). 3124.9 3215.7 and
3110.5 3230.1
What is the value of the sample mean cycles to failure?
The confidence level for one of these CIs is 95% and the
confidence level for the other is 99%. Both CIs are calculated
from the same sample data. Which is the 95% CI? Explain
why.
Class Activity
3110.5 3230.1
why.
Class Activity
3110.5 3230.1
why.
Class Activity
3110.5 3230.1
why.
Class Activity
5 An article of the Transactions of the American Fisheries
Society reports the results of a study to investigate the
mercury contamination in largemouth bass. A sample of fish
was selected from 53 Florida lakes and mercury concentration
(MC) in the muscle tissue was measured (ppm)
as:
Class Activity
Figure : Taken from [Montgomery and Runger, 2010]
Plot the histogram.

Compute the 95% CI on .
Class Activity
6 n = 100 random samples of water from a fresh water lake
were taken and the calcium concentration (milligrams per
liter) measured. A 95% CI on the mean calcium concentration
is 0.49 0.82.
Would a 99% CI calculated from the same sample data been
longer or shorter?
Consider the following statement: There is a 95% chance that
is between 0.49 and 0.82. Is this statement correct? Explain
your answer.
Class Activity
is 0.49 0.82.
longer or shorter?
your answer.
Class Activity
is 0.49 0.82.
longer or shorter?
your answer.
References I
Montgomery, D. C. and Runger, G. C. (2010).

Applied statistics and probability for engineers.
John Wiley & Sons.

15

Caricato da

Informazioni sul documento

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

15

Caricato da

Copyright:

Formati disponibili

Session 15 References

Probability and Statistical Course.

Instructor: Dr.Ing.(c) Sergio A. Abreo C.

Escuela de Ingenieras Electrica, Electronica y de

Universidad Industrial de Santander

Connectivity and Signal Processing Research Group.

CI on the mean of a normal distribution: variance known

A confidence interval estimate for is an interval of the form

CI on the mean of a normal distribution: variance known

A confidence interval estimate for is an interval of the form

CI on the mean of a normal distribution: variance known

A confidence interval estimate for is an interval of the form

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

Now manipulating the quantities inside the brackets

This leads to the following definition:

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

How does one interpret a confidence interval?

CI on the mean of a normal distribution: variance known

How does one interpret a confidence interval?

CI on the mean of a normal distribution: variance known

How does one interpret a confidence interval?

CI on the mean of a normal distribution: variance known

How does one interpret a confidence interval?

CI on the mean of a normal distribution: variance known

How does one interpret a confidence interval?

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

Figure : Repeated construction of a confidence interval for . Taken

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

CI on the mean of a normal distribution: variance known

Figure : Error in estimating with x. Taken from

CI on the mean of a normal distribution: variance known

Choice of Sample Size

CI on the mean of a normal distribution: variance known

A Large-Sample Confidence Interval for

Choice of Sample Size

A Large-Sample Confidence Interval for

Choice of Sample Size