Professional Documents
Culture Documents
Chapter 9
Chapter 10
Chapter 11
Chapter 12
1
Chapter 9
A statistic is a random variable describing a
characteristic of a random samples.
Sample mean
Sample variance
We use statistic values in inferential
statistics (make inference about population
characteristics from sample characteristics).
Statistics have distributions of their own.
The Central Limit Theorem
The distribution of the sample mean is normal if the
parent distribution is normal.
The distribution of the sample mean approaches the
normal distribution for sufficiently large samples
(n 30), even if the parent distribution is not normal.
The parameters of the sample distribution of the mean
are:
Mean:
Standard deviation: x x
x
x
n
Problem 1
Given a normal population whose mean is
50 and whose standard deviation is 5,
Find the probability that a random sample of 4
has a mean between 49 and 52
Answer:
49 50 52 50
P(49 x 52) P( Z )
5 4 5 4
P( .4 Z .8) .7881 .3446 .4435
-.4 .8
Problem 2
Find the probability that a random sample of 16
has a mean between 49 and 52.
Answer
49 50 52 50
P(49 x 52) P( Z )
5 16 5 16
P( .8 Z 1.6) .9332 .2119 .7213
Problem 2
The amount of time per day spent by adults
watching TV is normally distributed with =6 and
=1.5 hours.
What is the probability that a What is the probability that 5
randomly selected adult adults watch TV on the
watches TV for more than average 7 or more hours?
7 hours a day?
Answer: Answer:
76 7 6
P(X 7) P Z
P(X 7) P Z
1.5 1.5 5
P(Z .67) 1 .7486 .2514 P(Z 1.49) 1 .9319 .0681
Problem 2
Additional question
What is the probability that the total TV
watching time of the five adults sampled will
exceed 28 hours?
Answer:
5.6 6
P( X 28 / 5) P Z
1.5 5
Sampling distribution of the
sample proportion
In a sample of size n, if np > 5 and n(1-p) > 5,
then the sample proportion p^ = x/n is
approximately normally distributed with the
following parameters:
p(1 p)
p p and p , therefore,
n
p p
Z
p(1 p) n
Problem 3
A commercial of a household appliances
manufacturer claims that less than 5% of all
of its products require a service call in the
first year.
A survey of 400 households that recently
purchased the manufacturer products was
conducted to check the claim.
Problem 3
Assuming the manufacturer is right, what is the
probability that more than 10% of the surveyed
households require a service call within the first
year?
.10 .05
P(p .10) P Z P(Z 4.59) 0
.05(1 .05) 400
x za / 2 x za / 2
n n
-za/2 za/2
Problem 4
How many classes university students miss
each semester? A survey of 100 students
was conducted. (see Missed Classes)
Assuming the standard deviation of the
number of classes missed is 2.2, estimate
the mean number of classes missed per
student.
Use 99% confidence level.
Problem 4
Solution
= 10.21 2.575 = 10.21 .57
2 .2
x za / 2
n 100
Missed classes
1.28 1.91
Type II Error
Type II error occurs when H0 is erroneously not
rejected.
The probability of a type II error is called b.
b=P(Do not reject H0 when H1 is true)
To calculate b:
H1 specifies an actual parameter value (not a range of
values). Example: H0: = 100; H1: = 110
The critical value is expressed in original terms (not in
standard terms).
Problem 7a
What is the probability youll believe the
mean return in problem 7 is 10.2% while
actually its 10.6%, if the sample provided a
mean return of 10.5%?
Problem 7a
Solution
The two hypotheses are:
H0: = 10.2
H1: = 10.6
H0 is not rejected (we believe = 10.2) if the
sample mean is less than a critical value.
Therefore, the probability required is:
b = P(X < Xcr | = 10.6).
Problem 7a
The critical value is (recall, this problem was a case
of a right hand tail test, with 10% significance
level):
.8
X L 0 Z.10 10.2 1.28 10.40
n 26
b = P(X<10.4 when = 10.6) =
P(Z < (10.4-10.6)/[.8/(26)1/2]) = P(Z < -1.27) = .102
Chapter 12
Generally, the standard deviation is unknown the
same way the mean may be unknown.
When the standard deviation is unknown, we need
to change the test statistic from Z to t.
We shall test three population parameters:
Mean
Variance
Proportion
Testing the mean
(unknown variance)
Replace the statistic Z with t
X
t
s n
The original distribution must be normal (or at
least mound shaped).
Problem 8
A federal agency inspects packages to determine if
the contents is at least as great as that advertised.
A random sample of (i)5, (ii)50 containers whose
packaging states that the weight was 8.04 ounces
was drawn. (See Content).
From the sample results
Can we conclude that the average weight does not meet
the weight stated? (use a = .05).
Estimate the mean weight of all containers with 99%
confidence
What assumption must be met?
Problem 8
Solution
We hypothesize on the mean weight.
H0: = 8.04
H1: < 8.04
(i) n=5. For small samples let us solve manually
Assume the sample was: 8.07, 8.03, 7.99, 7.95, 7.94
The rejection region: t < -ta,n1 = -t.05,5-1 = -2.132
The tsample = ?
Mean = (8.07++7.94)/5 = 7.996 -2.132
Std. Dev.={[(8.07-7.996)2++(7.94- 7.996)2]/4}1/2 = 0.054
Problem 8
The t sample is calculated as follows:
X 7.996 8.04
t 1.32
s n 0.054 5
20
15
10
5
0
7.93 7.97 8.01 8.05 8.09 More
Using Excel
To obtain an exact value for t use the TINV
function:
Degrees of
=TINV(0.01,49) freedom
H0 : 2 20
H1 : 2 20 or 20 or 20
p p .8225 .8
Z 1.18
p (1 p ) n .8225(1 .8225) 400
Z.05 = 1.645
Since 1.18 < 1.645 the consumer group cannot confirm
the claim at 5% significance level.