Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
SAMPLING
Sampling
94
on food in India.
What should we ideally do?
Population:
Sample:
Parameter:
Statistic:
Sampling unit:
Sampling design:
95
Sampling
Sampling
96
Considerations in Sampling
Sampling
97
Statistical Studies
Sampling
98
studies
99
100
mixed, and then a neutral person blindly selects two slips from the hat this is the sampling design.
Sampling
R, SAS, SPSS
FREE
Software!
101
Sampling
102
103
is defined as follows:
Each element of the sample should come from the population.
The elements are selected independently of each other.
Sampling
104
Cluster Sampling
105
106
Examples:
1. When sampling for age distribution, households can be
considered as clusters. (But not for income distribution.)
2. Each PGPX syndicate is a cluster as far as last salary,
age etc. are concerned.
Sampling
107
Stratified Sampling
108
Sampling
109
Volunteer/Convenience Samples
110
volunteers
Come with selection bias either in part of the person
conducting survey, or because of self-selection of volunteers
Examples: Any internet survey, telephone survey etc.
Whenever we conduct an internet survey on whether people
support child marriage, the response rates are overwhelmingly
negative. However, the real picture is completely different.
Why?
Bias! What are the biases here?
Necessary Evil?
111
112
SAMPLING
DISTRIBUTIONS
Idea
113
sample statistics.
The characteristics of a population are summarized by the
population parameters.
Statistical inference boils down to estimating a population
parameter using analogous sample statistic.
Examples:
Sample mean estimates population mean
Sample variance (SD) estimates population variance (SD)
Sample proportion estimates population proportion
114
Sampling Distribution
115
Example
Sampling Distribution
116
Sampling Distribution
117
random variable.
118
it is given by
1 n
X Xi
n i1
1 n
2
SX
(
X
X
)
.
i
n 1 i 1
119
Simulation Example
120
121
Notice that the sample mean changes with every sample, and
population mean.
Variance of the sample mean: (where population variance is X2 )
For WR
X2
Var X
n
X
SD(X )=
.
n
For WOR
X2
Var(X)
n
SD(X)=
N-n
N-1
Nn
.
N 1
122
N-n
N-1
123
Example
Distribution of
Sampling Distribution
124
Sampling Distribution
125
Sampling Distribution
126
n X
is approximately N(0,1).
Or, equivalently:
127
Sum of samples:
i1
distributions
128
Example
129
Example
130
131
normal approximation
Moderate skew: need 30 or more
High skew: need 50 or more
Severe skew (Example: binomial with large n and small p
so that Poisson approximation holds): might need very
high sample size, in the range of several hundred or even
higher
Example
Binomial(n,p) with p = 0.01, and various n.
132
Sampling Distribution
133
CLT does not say that large samples become normal, it only
134
135
136
a) P( X 120) 1 P( X 120)
=1-NORM.DIST(120,118,SQRT(16.53),TRUE) = 0.3114.
b) P(110 X 130) P( X 130) P( X 110)
=NORM.DIST(130,118,SQRT(16.53),TRUE)
-NORM.DIST(110,118,SQRT(16.53),TRUE)
= 0.9739.
30
P
X
4000
c) i
P X 4000 / 30
i 1
= NORM.DIST(4000/30,118,SQRT(16.53),TRUE) = 0.9999.