Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Distribution
Sampling
Statistics and Parameters
Reasons for Sampling
2.
3.
For given resources, the sample can broaden the scope of the study.
4. Because the research process is some times destructive, the sample can save
product.
5.
Types of Sampling
Random Sampling, Non-Random Sampling.
Random Sampling
42595
46503
58727
27958
18584
34914
30134
18845
A Population of 30 companies
Alaska Airlines
DuPont
Alcoa
Exxon Mobil
Ashland
General Dynamics
Bank of America
General Electric
BellSouth
General Mills
Petroleum
Chevron
Halliburton
Citigroup
IBM
Gamble
Clorox
Kellogg
Delta Airlines
Kmart
Disney
Lowes
04024
49618
86385
02304
Lucent
Mattel
Mead
Microsoft
Occidental
JCPenney
Procter &
Ryder
Sears
Time Warner
29880
51038
Systematic Sampling:
Convenient and relatively easy to administer
Population elements are an ordered sequence (at least, conceptually).
The first sample element is selected randomly from the first k population
elements.
Thereafter, sample elements are selected at a constant interval, k, from the
ordered sequence frame.
Example: Purchase orders for the previous fiscal year are serialized 1 to 10,000
(N=10,000). A sample of 50 (n=50 purchases orders is needed for an audit.
First sample element is randomly selected from the first 200 purchases orders .
Assume that 45th purchase order was selected. Subsequently 245, 445, 645,till 200 th
element.
Stratified Sampling
Stratified by Age
20 - 30 years old
(homogeneous within)
(alike)
30 - 40 years old
(homogeneous within)
(alike)
40 - 50 years old
(homogeneous within)
(alike)
Hetergeneous
(different)
between
Hetergeneous
(different)
between
Type of Community
Urban
Suburban
Rural
Cluster Sampling
In cluster sampling, we divide the population into groups or clusters and then
select a random sample of these clusters. We assume that these individual clusters
are the representative of the population as a whole.
55
59
63
64 68
69
70
N=8
Frequency
3
2
1
0
52.5
57.5
62.5
67.5
72.5
Suppose we take all possible samples of size n=2 from the population with replacement. The result is the
following pairs of data.
(54,54)
(55,54)
(59,54)
(63,54)
(54,55)
(`55,55)
(59,55)
(63,55)
(54,59)
(54,63)
(54,64)
(54,68)
(54,69)
(54,70)
(64,54)
(64,55)
(64,59)
(64,63)
(64,64)
(64,68)
(64,69)
(64,70)
(68,70)
(69,70)
(70,70)
54.5
56.5
58.5
59
61
61.5
62
54.5
55
57
59
59.5
61.5
62
62.5
56.5
57
59
61
61.5
63.5
64
64.5
58.5
59
61
63
63.5
65.5
66
66.5
59
59.5
61.5
63.5
64
66
66.5
67
60
61.5
63.5
65.5
66
68
68.5
69
61.5
62
64
66
66.5 68.5
69
69.5
62
62.5
64.5
66.5
67
69.5
70
69
Frequency
Histogram
20
10
0
Frequency
Bin
450
400
350
300
250
200
150
100
50
0
Means of 60 Samples (n = 2)
from an Exponential Distribution
F
r
e
q
u
e
n
c
y
9
8
7
6
5
4
3
2
1
0
0.00 0.25 0.50 0.75 1.00 1.25 1.50 1.75 2.00 2.25 2.50 2.75 3.00 3.25 3.50 3.75 4.00
Means of 60 Samples (n = 5)
from an Exponential Distribution
F
r
e
q
u
e
n
c
y
10
9
8
7
6
5
4
3
2
1
0
0.00 0.25 0.50 0.75 1.00 1.25 1.50 1.75 2.00 2.25 2.50 2.75 3.00 3.25 3.50 3.75 4.00
F
r
e
q
u
e
n
c
y
14
12
10
8
6
4
2
0
0.00
0.25
0.50
0.75
1.00
1.25
1.50
1.75
2.00
2.25
2.50
2.75
3.00
250
200
150
100
50
0
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
5.0
Means of 60 Samples (n = 2)
from a Uniform Distribution
F 10
r 9
e 8
q 7
u
6
e
n 5
c 4
y 3
2
1
0
1.00
1.25
1.50
1.75
2.00
2.25
2.50
2.75
3.00
3.25
3.50
3.75
4.00 4.25
Means of 60 Samples (n = 5)
from a Uniform Distribution
F 12
r
e 10
q
u 8
e
n 6
c
y 4
2
0
1.00
1.25
1.50
1.75
2.00
2.25
2.50
2.75
3.00
3.25
3.50
3.75
4.00
4.25
25
20
15
10
5
0
1.00
1.25
1.50
1.75
2.00
2.25
2.50
2.75
3.00
3.25
3.50
3.75
4.00
4.25
Uniform
Population
n=2
n=2
n=5
n=5
n = 30
n = 30
Normal
Population
n=2
n=2
n=5
n=5
n = 30
n = 30
x
x
where
N n
X
N 1
n
Exercise 7.21
According to Nielsen Media Research, the average number of hours of TV viewing
per household per week in the United States is 50.4 hours. Suppose the standard
deviation is 11.8 hours and a random sample of 42 is taken.
a.
What is the probability that the sample average is more than 52 hours?
b.
What is the probability that the sample average is less than 47.5 hours?
c. What is the probability that the sample average is less than 40 hours? If the
sample average actually is less than 40 hours, what would it mean in terms of the
Nielsen Media Research figures?
d. Suppose the population standard deviation is unknown. If 71% of all sample
means are greater than 49 hours and the population mean is still 50.4 hours , what
is the value of the population standard deviation?
If the research produces measurable data such as weight, distance, time and
income, the sample mean is often the statistics of choice. However, if research
results in countable items such as how many people in the sample have the flexible
work schedule, the sample proportion is the statistic of choice.
Sample proportion is computed by dividing the frequency with which a given
characteristics occurs in a sample by the number of items in the sample.
Sample
Proportion
x
n
Z formula for
sample
proportions for
n.p>5 and
n.q>5
p p
p.q
n
Exercise 7.27
According to survey by Accountemps, 48% of executives believe
that employees are most productive on Tuesdays. Suppose 200
executives are randomly surveyed
a.
What is the probability that fewer than 90 of the executives
believe employees are most productive on Tuesdays.
b. What is the probability that more than 100 executives believe
employees are most productive in Tuesdays.
c. What is the probability that more than 80 of the executives
believe employees are most productive on Tuesdays