Professional Documents
Culture Documents
Jae-Kwang Kim
Iowa State University
Spring, 2013
Introduction
1
Introduction
2
Example
3
Probability sampling
4
Survey Procedures
5
Survey Errors
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 2 / 25
Introduction
What is sampling?
Population
Finite Population
Innite Population
Sample : Subset of a population
Sampling : Make statistical inference about the population without
measuring the whole population
Sampling error : the error that results from taking a sample instead of
examining the whole population
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 3 / 25
Introduction
Why sampling ?
1
To reduce the cost
2
To save the time
3
Sometimes, to get a more accurate information about the population.
4
Sometimes, it is the only way of getting information about the target
population.
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 4 / 25
Introduction
How to do the sampling ?
Two types of sampling
Probability sampling
Non-probability sampling
Roughly speaking, a probability rule is assigned to obtain a sample in
probability sampling.
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 5 / 25
Example
1
Introduction
2
Example
3
Probability sampling
4
Survey Procedures
5
Survey Errors
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 6 / 25
Example
Lets look at an articial nite population (of size N = 4).
ID Size of farms yield
(Acres) (y)
1 4 1
2 6 3
3 6 5
4 20 15
Parameter of interest: Mean yield of the farms in the population
= (y
1
+y
2
+y
3
+y
4
)/4
The value of y is obtained only from the sampled units.
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 7 / 25
Example
Instead of observing N = 4 farms, we want to select a sample of size
n = 2.
6 possible samples
case sample ID sample mean sampling error
1 1, 2 2
2 1, 3 3
3 1, 4 8
4 2, 3 4
5 2, 4 9
6 3, 4 10
Each sample has a sampling error.
Two ways of selecting one of the six possible samples.
Nonprobability sampling : (using size of farms or etc.) select a sample
subjectively.
Probability sampling : select a sample by a probability rule.
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 8 / 25
Example
Probability sampling
Simple Random Sampling : Assign the same selection probability to
all possible samples
case sample ID sample sampling selection
mean ( y) error probability
1 1, 2 2 -4 1/6
2 1, 3 3 -3 1/6
3 1, 4 8 2 1/6
4 2, 3 4 -2 1/6
5 2, 4 9 3 1/6
6 3, 4 10 4 1/6
In this case, the sample mean( y) has a discrete probability
distribution.
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 9 / 25
Example
Probability mass function of y:
P
y
(y) =
_
1/6 if y {2, 3, 4, 8, 9, 10}
0 otherwise.
Unbiased
Variance
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 10 / 25
Example
Remark
No model assumption about y
i
in the example: totally dierent
framework !
Design-based approach: the reference distribution is the sampling
distribution generated by the repeated application of the given
sampling mechanism.
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 11 / 25
Probability sampling
1
Introduction
2
Example
3
Probability sampling
4
Survey Procedures
5
Survey Errors
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 12 / 25
Probability sampling
Denition & Notation
U = {1, 2, , N} : index set of nite population
A : subset of U, index set of the sample.
A: set of samples under consideration, sample support.
=
(y
i
; i A) : statistic
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 13 / 25
Probability sampling
1
Probability distribution of samples, or sample distribution: probability
mass function P () dened on A. That is, P () satises
1 P (A) [0, 1] , A A
2
AA
P (A) = 1.
2
(Induced) probability distribution of a statistic
1. Expectation : E(
) =
AA
P(A)
(A)
2. Variance : Var
_
_
=
AA
P(A)
_
(A) E(
)
_
2
3. Mean squared error :
MSE
_
_
=
AA
P(A)
_
(A)
_
2
= Var
_
_
+
_
E(
)
_
2
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 14 / 25
Probability sampling
Probability Sampling
Denition : For each element in the population, the probability that
the element is included in the sample is known and greater than 0.
Advantages
1
Exclude subjectivity of selecting samples.
2
Remove sampling bias (or selection bias)
What is sampling bias ? ( : true value,
: estimated value of )
(sampling) error of
=
=
_
E
_
__
+
_
E
_
_
_
= variation + bias
In nonprobability sampling, variation is 0 but there is a bias. In
probability sampling, there exists variation but bias is 0.
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 15 / 25
Probability sampling
Probability Sampling
Main theory
1
Law of Large Numbers :
converges to E(
1
What is it ?
2
How to reduce it ?
3
How to measure it ?
Kim (ISU) Ch. 1: Probability Sampling Spring, 2013 25 / 25