Professional Documents
Culture Documents
i=1
x
i
p(x
i
)
= x
1
p(x
1
) + x
2
p(x
2
) + x
3
p(x
3
) + x
4
p(x
4
)
= 4(0.1) + 5(0.2) + 6(0.3) + 8(0.4) = 6.4.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Populations and Samples 6/29
Another population parameter is the population variance,
denoted
2
.
The variance can be found using
2
= E(X
2
)
2
.
From the table we have
E(X
2
) =
4
i=1
x
2
i
p(x
i
)
= x
2
1
p(x
1
) + x
2
2
p(x
2
) + x
2
3
p(x
3
) + x
2
4
p(x
4
)
= 16(0.1) + 25(0.2) + 36(0.3) + 64(0.4) = 43.
Hence
2
= 43 (6.4)
2
= 2.04.
The standard deviation, , is therefore equal to 1.43; this is
a measure of the average deviation of X around its mean.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Populations and Samples 7/29
Important: population parameters are constants i.e. xed
numbers.
Unfortunately, they are also usually unknown.
For example, suppose we are interested in the annual
income of residents of a city.
It is too expensive and time consuming to interview all
residents so we could take a sample of, say, 100 or 1000
residents.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Statistical Inference 8/29
Suppose it were possible to nd the annual income of the
citys residents by conducting a survey of all residents in
week 45 of a given year, 2006.
There could be many millions of values for income (X) in
this population!
But we could use them to nd the mean: = 17,670, for
example (we could also compute
2
if we wanted to).
Furthermore, suppose we wish to do the same thing for
week 45 of 2011 (this week), but either we dont have the
time or we cant afford to survey the entire population to
get this information.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Statistical Inference 9/29
Instead we could take a sample of, say, 400 residents.
We might nd the sample mean to be
X = 19,110.
Ignoring price changes, based on our sample mean
X, can
we say that the mean annual income of residents in the
population, , has risen in the last ve years?
The mean appears to have risen (based on our sample)
but how condent can we be that it has risen in the
population?
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Statistical Inference 10/29
This is a typical example of statistical inference, which we
can summarise as follows:
We need to nd a level of income, k, such that:
if
X k then we conclude that has risen;
if
X < k then we conclude that is unchanged (or
hasnt risen).
A word on notation:
Greek letters e.g. and
2
, denote population
parameters;
Roman letters e.g.
X and v
2
, denote sample values.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Statistical Inference 11/29
One of the difculties with statistical inference is that
different samples yield different outcomes.
For example, our sample of 400 residents resulted in
X:
It has a jagged appearance because the class widths are
equal to 1000.
If we reduced the size of the class widths to 100 or 1 it
would become smoother.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 14/29
The previous histogram is the sampling distribution of the
mean.
It is the distribution of
X if it were possible to take a large
number of samples of size n (here n = 400) from the given
population.
In practice we typically only have access to a small number
of samples.
In Economics we usually have just one sample to work
with.
Fortunately we can use theoretical results to deduce the
sampling distribution for
X that would result from a large
number of random samples.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 15/29
Theorem
If random samples of size n are taken from an innitely sized
population with mean and variance
2
then the sample
distribution of the sample mean
X will have mean and variance
E(
X) = , V(
X) =
2
X
=
2
n
,
respectively.
The rst property suggests that
X should be a good
estimator of because E(
X) = .
For a xed sample size n, the larger the population
variance
2
the larger the variance of
X,
2
X
.
For a xed population variance
2
, the larger the sample
size n the smaller the variance of
X,
2
X
.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 16/29
Example (Thomas, Example 3.2) A long series of random
samples, all of size 12, is taken from a population of X
values with mean and variance
2
. For each sample, the
sample mean
X is calculated. The average value of all the
X
=
2
n
= 4;
given that n = 12 we can solve this to obtain
2
= 4 12 = 48.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 17/29
The Theorem tells us a lot about the sampling distribution
of
X, but nothing about its shape.
For example, is the distribution normal, uniform or
something else?
The following theorem is important for statistical inference.
Theorem (Central Limit Theorem)
If sufciently large samples are randomly drawn from a
population with mean and variance
2
, then the sampling
distribution of
X will be approximately normally distributed with
mean E(
X) = and variance
2
X
=
2
/n, regardless of the
shape of the distribution of the population.
This tells us that X can have any distribution but, provided
n is sufciently large,
X will be normally distributed.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 18/29
CLT Example - Underlying PDF
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 19/29
CLT Example - Sample Size 2
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 20/29
CLT Example - Sample Size 3
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 21/29
CLT Example - Sample Size 4
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 22/29
CLT Example - Sample Size 8
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 23/29
CLT Example - Sample Size 16
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 24/29
CLT Example - Sample Size 32
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Sampling Distribution of the Mean 25/29
The Central Limit Theorem (CLT) implies that we can write
X
approx. N
,
2
n
X
=
2
/n,
Z =
X
/
74 75
25/80
<
2
/n
<
76 75
25/80
2
/n
>
100 75
25/80
X = 75) = 0.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference
Summary 29/29
Summary
Populations and samples.
Statistical inference.
The sampling distribution of the mean.
Next week:
Estimation and testing of population parameters.
EC114 Introduction to Quantitative Economics 5. Basic Statistical Inference