Professional Documents
Culture Documents
9
Peter Adams/Corbis
Hypothesis Testing
Case The Golden Ratio
Study What do Euclids Elements, the Parthenon of ancient Greece, the Mona
Lisa, and the beadwork of the Shoshone tribe of Native Americans have in
common? An appreciation for the golden ratio. Suppose we have two quantities A and B,
with A B 0. Then, A/B is called the golden ratio if
A 1 B
______ A
5 __
A B
that is, if the ratio of the sum of the quantities to the larger quantity equals the ratio of the
larger to the smaller.
The golden ratio permeates ancient, medieval,
Renaissance, and modern art and architecture. For X
example, the Egyptians constructed their great
pyramids using the___ golden ratio. (Specifically, in
Figure 9.1, if A 5 XY is the ___
height from the top A
vertex to the base, and B 5 YZ is the distance from
the center of the base to the edge, then (A 1 B)/A 5
B
A/B.) Some mathematicians have said that the Y Z
golden ratio may be intrinsically pleasing to the
human species. Support for this conjecture would Figure 9.1
be especially strong if evidence was found for the use of the golden ratio in
non-Western artistic traditions. In the Case Study on pages 5457, we examine
whether the decorative beaded rectangles sewn by the Shoshone tribe of Native
Americans follow the golden ratio.
The Big Picture Where we are coming from, and where we are headed
H ypothesis testing is the most widely used method for statistical inference in the
world. Hypothesis testing forms the bedrock of the scientific method, and it
touches nearly every field of scientific endeavor, from biology and genetics, through
physics and chemistry, to sociology and psychology. It also forms the basis for
decision science, and thus for all business-oriented decision-making methods. In
this chapter we will discover what hypothesis testing is all about.
In Chapter 8, we learned about confidence interval estimation. But confidence
intervals represent only the first of a large family of topics in inferential statistics. In
fact, each of Chapters 813 covers a fresh and distinct facet of statistical inference. In
Chapters 7 and 8, we learned that, since the value of the target population parameter is
unknown, we need to estimate it using point and interval estimates provided by sample
data. However, because the target population parameter is unknown, people may dis-
agree about its value. In this chapter, we introduce the statistical method for resolving
conflicting claims about the value of a population parameter, hypothesis testing.
Researchers are interested in investigating many different types of questions, such
as the following:
An accountant may wish to examine whether evidence exists for corporate tax
fraud.
A Department of Homeland Security executive may want to test whether a
new surveillance method will uncover terrorist activity.
A sociologist may want to examine whether the mayors economic policy is
increasing poverty in the city.
Questions such as these can be tackled using statistical hypothesis testing, which is a process
for rendering a decision about the unknown value of a population parameter. We will learn
9.1 Introduction to Hypothesis Testing
how to make decisions about the value of a population mean, and we will examine differ-
ent types of errors that can be made. Among the tests we will consider are tests when
is known and when is unknown. Finally, we will learn how to conduct hypothesis tests
about the value of a population proportion when a large sample is available.
In Chapter 10, we will begin discussion of further topics in statistical inference
with a look at two-sample problems. We will continue to apply both confidence interval
estimation and hypothesis-testing methodology throughout the remaining chapters. n
The Hypotheses
The status quo hypothesis represents what has been tentatively
assumedabout the value of the parameter and is called the null
hypothesis, denoted as H0.
The alternative hypothesis, or research hypothesis, denoted as Ha,
represents an alternative claim about the value of the parameter.
The not-guilty hypothesis is considered the null hypothesis H0 because the jurors
must assume it is true until proven otherwise. The alternative hypothesis Ha, that
the defendant is guilty, must be demonstrated to be true, beyond a reasonable doubt.
How does a court of law determine whether the defendant is convicted or acquitted?
This judgment is based upon the evidence, the hard facts heard in court. Similarly, in
hypothesis testing, the researcher draws a conclusion based on the evidence provided
by the sample data.
In Sections 9.19.4, we will examine hypotheses for the unknown mean . The
null hypothesis will be a claim about certain values for , and the alternative hypoth-
esis will be a claim about other values for . We will begin by considering hypotheses
about the population mean for some fixed-value 0 (that is, 0 is just a specified
value for the unknown mean ). The hypotheses have one of the three possible forms
shown in Table9.1. The right-tailed test and the left-tailed test are called one-tailed
tests. In Section 9.3 we will find out why we use this terminology.
Table 9.1 The three possible forms for the hypotheses for a test for
The null hypothesis H0 states that the population mean is at most 7.2 parasites per
fish. The alternative hypothesis Ha states that the population mean number of parasites
is more than 7.2. Here, 0 5 7.2, which is the possible value of specified in the ex-
ample. Figure 9.2 shows that H0 and Ha cover all possible values of . n
Ha asserts that m is
H0 asserts that m is somewhere in this interval. somewhere in this interval.
0 5 7.2 10
m0
Figure 9.2 The hypotheses divide all possible values of between them.
9.1 Introduction to Hypothesis Testing
Recall that each hypothesis is a claim that the statement is true. We must find
evidence that is consistent with one of the two claims H0 or Ha. The first task in
hypothesis testing is to form hypotheses. To convert a word problem into two
hypotheses, look for certain key words that can be expressed mathematically.
Table 9.2 shows how to convert words typically found in word problems into
symbols.
Table 9.2 Key English words, with mathematical symbols and synonyms
Once you have identified the key words, use the associated mathematical sym-
bol to write two hypotheses. The following strategies can be used to write the
hypotheses.
Step 3 Find the value for 0 and write your hypotheses. The alternative hypoth-
esis Ha states that the mean gas mileage is less than some value 0. Less than what?
The 20.8 mpg from 2004. Write the two hypotheses with 0 5 20.8.
AU T I O Do not blindly apply this strategy without thinking about what you are doing.
!
C
N Rather, use the strategy to help formulate your own hypotheses. There is no substi-
tute for thinking through the problem!
Statistical Significance
A result is said to be statistically significant if it is unlikely to have occurred
due to chance.
Step 3 Find the value for 0 and write your hypotheses. Asking less than
what? (or decreased from what?), we see that 0 5 110, so the hypotheses are
_
blood pressure levels and calculate the sample mean xand sample standard deviation s.
Most likely, the mean of this sample of patients systolic blood pressure levels will not
be exactly equal to 110. Recall that the sample means vary about the population mean.
_
So, even if the null hypothesis is true, the mean of the sample xwill not, in most cases,
be exactly equal to 5 110.
_
Now, suppose that the sample mean blood pressure xis less than the hypothesized
population mean of 110. Is the difference due simply to chance variation, or is it
evidence of a side effect of the cholesterol medication? Lets consider some possible
_
values for x:
_ _
x 5 109: The difference between x
and 5 110 is only |109 2 110| 5 1.
Depending on the variability present in the sample, the researcher would prob-
ably conclude that this small difference is due to chance variation. Thus, the
_
researcher would not reject the null hypothesis, because the value of x
is not evi-
dence that the null hypothesis 110 is not true. The result is not statistically
significant.
_ _
x5 90: The difference between xand 5 110 is |90 2 110| 5 20. Depending
on the variability present in the sample, the researcher would probably con-
clude that this difference is so large that it is unlikely that it is due to chance
variation alone. Thus, the researcher would reject the null hypothesis (that
110) in favor of the alternative hypothesis (that 110). The result is statisti-
cally significant.
To summarize: in a hypothesis test, we compare the sample mean with the value
0 of the population mean used in the H0 hypothesis. If the difference is large, then
H0 is rejected. If the difference is not large, then H0 is not rejected. The question is,
Where do you draw the line? Just how large a difference is large enough? This
is exactly the issue we will face throughout the remainder of Chapter 9 (actually,
throughout the rest of the book).
Table 9.3 shows the four possible jury verdicts in terms of these hypotheses. The
possible verdicts are in the left-hand column, and the two hypotheses appear in the
headings of the middle and right-hand columns.
Chapter 9 Hypothesis Testing
Reality
H0 true: H0 false:
Jurys decision Defendant did not Defendant did
commit the crime commit the crime
Lets look at the two possible decisions the jury can make. It can find the defen-
dant guilty. In this case, the jury is rejecting what is stated in the null hypothesis H0,
so we say that the jury rejects H0. On the other hand, the jury can find the defendant
not guilty. In this case, the jury does not reject the null hypothesis H0. There are two
ways for the jury to render the correct decision. (Note that in Table 9.3 a hypothesis
and decision with the same color lead to correct decisions.)
Unfortunately, there are also two ways for the jury to render an incorrect deci-
sion. Judges, juries, and researchers are just people, and all can make mistakes. In
statistics, the two incorrect decisions are called Type I and Type II errors.
Note that in Table 9.3 a decision and hypothesis with different colors lead to
either a Type I or a Type II error. In the courtroom, errors can be made because of
prejudice, legal ineptitude, or some legal technicality. In statistics, errors in decisions
can be made because of the pervasive presence of chance and sampling variability.
D eveloping
Your Statistical Sense
A Decision Is Not Proof
It is important to understand that the decision to reject or not reject H0 does not
prove anything. The decision represents whether or not there is sufficient evidence
against the null hypothesis. This is our best judgment given the data available. You
cannot claim to have proven anything about the value of a population parameter
unless you elicit information from the entire population, which in most cases is not
possible. n
Section 9.1 Exercises
How can we make decisions about population parameters using the limited infor
mation available in a sample? The answer is that we base our decisions on probability.
_
When the difference between the sample mean x and the hypothesized population
mean m0 is large, then the null hypothesis is probably not correct. When the differ-
ence is small, then the data are probably consistent with the null hypothesis. But we
dont know for sure.
The probability of a Type I error is denoted as (alpha). We set the value of
to be some small constant, such as 0.01, 0.05, or 0.10, so that there is only a small
probability of rejecting a true null hypothesis. To say that 5 0.05 means that, if this
hypothesis test were repeated over and over again, the long-term probability of re-
jecting a true null hypothesis would be 5%. The level of significance of a hypothesis
test is another name for , the probability of rejecting H0 when H0 is true. A smaller
makes it harder to wrongfully reject H0 just by chance. If the consequences of mak-
ing a Type I error are serious, then the level of significance should be small, such as
5 0.01. If the consequences of making a Type I error are not so serious, then one
may choose a larger value for the level of significance, such as 5 0.05 or 5 0.10.
The probability of a Type II error is denoted as (beta). This is the probability
of not rejecting H0 when H0 is false, such as acquitting someone who is really guilty.
Making smaller inevitably makes larger (for a fixed sample size). Of course, our
goal is to simultaneously minimize both and . Unfortunately, the only way to do
this is to increase the sample size.
There are only two possible hypothesis-testing conclusions:
Reject H0, or
Do not reject H0.
Note: When we reject H0, we say
that the results are statistically
The material in this chapter is challenging but well worth the effort. You are
significant. If we do not reject learning critical thinking through statistics. What you are learning is a bedrock com-
H0, the results are not statistically ponent of the scientific method, which is the backbone of scientific research all over
significant. the planet.
1
Statistical hypothesis testing is a way of formalizing the H0 hypothesis. If the difference is large, then H0 is rejected.
decision-making process so that a decision can be rendered If the difference is not large, then H0 is not rejected.
about the unknown value of the parameter. The status quo
hypothesis that represents what has been tentatively assumed 3
When performing a hypothesis test, there are two ways
about the value of the parameter is called the null hypothesis of making a correct decision: to not reject H0 when H0 is
and is denoted as H0. The alternative hypothesis, or research true and to reject H0 when H0 is false. Also, there are two
hypothesis, denoted as Ha, represents an alternative conjecture types of error: a Type I error is to reject H0 when H0 is true,
about the value of the parameter. and a Type II error is to not reject H0 when H0 is false. The
probability of a Type I error is denoted as (alpha). The
In a hypothesis test, we compare the sample mean x_
2 probability of a Type II error is denoted as b (beta).
with the value 0 of the population mean used in the
Section 9.1Exercises
Clarifying the Concepts 3. What are some characteristics of the alternative hypothesis?
1. Explain in your own words why we need hypothesis 4. Explain what is meant by 0.
testing. 5. In the hypothesis test for the population mean, how many
2. What are some characteristics of the null hypothesis? forms of the hypotheses are there? Write out these forms.
10 Chapter 9 Hypothesis Testing
6. Say we are interested in testing whether the a hypothesis test and that in actuality the population mean
population mean is less than 100, and the sample we take number of years it takes to recoup their initial cost is
yields a mean of 90. Is this sufficient evidence that the two years.
population mean is less than 100? Explain clearly why or
why not. Applying the Concepts
7. In a criminal trial, what are the two possible decision For Exercises 22227, do the following.
errors? What do statisticians call these errors? a. Provide the null and alternative hypotheses.
8. Do hypothesis tests prove anything? Why or why not? b. Describe the two ways a correct decision could be
made.
Practicing the Techniques c. Describe what a Type I error would mean in the
For Exercises 916, provide the null and alternative context of the problem.
hypotheses. d. Describe what a Type II error would mean in the
9. Test whether is greater than 10. context of the problem.
10. Test whether is less than 100. 22. Shares Traded on the Stock Market. The Statistical
11. Test whether is different from 0. Abstract of the United States reports that the mean daily
12. Test whether is at least 40. number of shares traded on the New York Stock Exchange
13. Test whether equals 4.0. in 2005 was 1.602 billion. Based on a sample of this years
14. Test whether is at most 25. trading results, a financial analyst would like to test whether
15. Test whether has changed from 36. the mean number of shares traded will be larger than the
16. Test whether exceeds 24. 2005 level.
23. Traffic Light Cameras. The Ministry of Transportation
For Exercises 17221, do the following. in the province of Ontario reported in 2003 that the
a. Provide the null and alternative hypotheses. installation of cameras that take pictures at traffic lights has
b. Determine if a correct decision has been made. If an decreased the mean number of fatal and injury collisions to
error has been made, indicate which type of error. 339.1 per year. A hypothesis test was performed this year
17. Child Abuse. The U.S. Administration for Children to determine whether the population mean number of such
and Families reported that the national rate for child abuse collisions has changed since 2003.
referrals was 43.9 per 1000 children in 2005. A hypothesis 24. Price of Milk. The Bureau of Labor Statistics reports
test was carried out that tested whether the population mean that the mean price for a gallon of milk in 2005 was
referral rate had increased this year from the 2005 level. The $3.24. Suppose that we conduct a hypothesis test to
null hypothesis was not rejected. Suppose that, in actuality, investigate if the population mean price of milk this year
the population mean child abuse referral rate for this year is has increased.
45 per 1000 children. 25. Americans Height. Americans used to be on average
18. California Warming. Science Daily reported in 2007 the tallest people in the world. That is no longer the case,
that the mean temperature in California has increased from according to a study by Dr. Richard Steckel, professor of
1950 to 2000 by 2 degrees Fahrenheit (F).2 Suppose that economics and anthropology at The Ohio State University.
we perform a hypothesis test today to determine whether the The Norwegians and Dutch are now the tallest, at
population mean temperature increase exceeds two degrees, 178 centimeters, followed by the Swedes at 177, and then
and suppose that the actual population mean temperature the Americans, with a mean height of 175 centimeters
was three degrees. (approximately 5 feet 9 inches). According to Dr. Steckel,
19. Travel Costs. In 2007, a motorists guide reported that The average height of Americans has been pretty
travel costs had increased over the previous two years. much stagnant for 25 years.4 Suppose that we conduct
Suppose that this report was based on a hypothesis test a hypothesis test to investigate whether the population
and that in actuality the population mean travel costs were mean height of Americans this year has changed from
lower. 175 centimeters.
20. Eating Trends. According to the NPD Group, higher 26. Credit Score in Florida. According to CreditReport
gasoline prices are causing consumers to go out to eat less .com, the mean credit score in Florida in 2006 was 673.
and eat at home more.3 Suppose that this report found that Suppose that a hypothesis test was conducted to determine
the mean number of meals prepared and eaten at home is if the mean credit score in Florida has decreased since
less than 700 per year, and that in actuality the population that time.
mean number of such meals is 600. 27. Salary of College Grads. According to the U.S. Census
21. Hybrid Vehicles. A 2006 study by Edmunds.com Bureau, the mean salary of college graduates in 2002 was
showed that owners of hybrid vehicles can recoup their $52,200. Suppose that a hypothesis test was carried out to
increased initial cost through reduced fuel consumption in determine whether the population mean salary of college
at most three years. Suppose that this report was based on graduates has increased.
9.2 Z Test for the Population Mean m: p-Value Method 11
Figure 9.3 10 15 20
_
m 0 = 10 x = 15
CD price ($)
However, now suppose that you were conducting a hypothesis test about whether
the mean cost of a certain surgical procedure was less than $6000:
_
To account for variability, we use the sampling distribution of the sample mean x.
You recall from Chapters 7 and 8 that the sampling distribution of the sample mean
_
x, for large samples, is approximately normal with mean m_x 5 m and standard devia-
__
tion sx_ 5s/
n. Now, if we assume H0 is true, then the sampling distribution of the
sample mean will have a mean of m _5 m0, the value of m given in H0. Figure 9.5
_ x
shows the sampling distribution of xunder the assumption that the population mean
_
equals m0. Notice that the distribution of x is centered at m0, the value of m that is
claimed in H0.
Spread determined
by / n
Figure 9.5
The sampling distribution
_
of x
is centered at m0
if H0 is correct.
_ _
Centered at m0 Extreme value of x x
_
Note, however, that we do not actually observe this distribution of x . We observe
_
only a single value of x, the sample mean of the observed data set. We need to deter-
_
mine whether this value of xis unusual or extreme, according to this distribution. By
_
unusual or extreme, we mean a value of xlocated in one of the tails of the distribu-
tion, far from the hypothesized mean m0.
_
Suppose that our observed value xis indeed unusual or extreme. Then this indi-
cates a conflict between what we hypothesize to be true about m (that is, that m 5 m0)
and the actual sample data. In this case, we are faced with choosing between one of
the following two scenarios:
1. H0 is correct, the value of m0 is accurate, and our observation of this unusual
_
value of xis an amazingly unlikely event.
_
2. H0 is not correct, and the true value of m must be closer to x
.
D eveloping
Your Statistical Sense
The Data Prevail!
Since we are reluctant to base our scientific methodology on amazingly unlikely
events, we would therefore conclude that H0 is not correct. Recall that the null
_
hypothesis is simply a conjecture. On the other hand, the sample mean x represents
data that are directly observable and not conjectural. The modern scientific method
states that, in the face of a conflict between a conjecture and observed data, the data
prevail and we need to rethink our null hypothesis. This leads us to the following
statement of the essential idea about hypothesis testing for the mean. n
Specifically, the null hypothesis asserts that the mean cost of CDs is at most $10. Sup-
_
pose the population standard deviation is $9. Show that the observed value of x5 $15
is unusual if we assume that H0 is correct.
Solution
Assuming that H0 is correct, samples of size n 5 36 are approximately normally dis-
tributed, because n is at least 30. The ___ mean of the sampling distribution is $10, with
__
standard deviation s _x 5 s/ n5 9/
365 $1.50.
_
Figure 9.6 shows the sampling distribution of xif H0 is correct. Note that the ob-
_
served value of x
5 $15 is deep in the right-hand tail of the distribution, indicating that
_
this is an unusual value, assuming H0 is correct. Thus, the observed value x 5 $15 is
not consistent with H0: m 10. Since we must base our conclusion on the evidence we
uncovered, we reject the null hypothesis that m 10. n
5 10 15
CD price ($) $15 is unusual
if H0 is correct
However, as you may have guessed, this particular form of Z is very special, and we
give it a special name,. Zdata.
14 Chapter 9 Hypothesis Testing
This Zdata is an example of a test statistic, a statistic generated from a data set for
the purposes of testing a statistical hypothesis. We will meet several other test statistics
throughout the remainder of the text. This type of hypothesis test is called the Z test
because the test statistic Zdata comes from the standard normal Z distribution.
3 0 3 Zdata = 3.33
Figure 9.7
2 p-Values
The two main methods for carrying out a hypothesis test are
p-Value
_
The p-value is the probability of observing a sample statistic (such as xor
Zdata) at least as extreme as the statistic actually observed if we assume that
the null hypothesis is true. Roughly speaking, the p-value represents the
probability of observing the sample statistic if the null hypothesis is true.
Since the term p-value means probability value, its value must always lie
between 0 and 1.
There are three possible cases for the p-value, summarized in Table 9.4. Note
that the method for calculating the p-value depends on the form of the hypothesis
test.
Table 9.4 Finding the p-value depends on the form of the hypothesis test
Type of hypothesis test p-Value is tail area associated with Zdata
Right-tailed test
p-value
H0: m m0 versus Ha: m . m0
p-value 5 P(Z . Zdata)
Area to right of Zdata
0 Zdata
Left-tailed test
H0: m m0 versus Ha: m , m0 p-value
p-value 5 P(Z , Zdata)
Area to left of Zdata
Zdata 0
H0 : m 10 versus Ha: m 10
p-value
= P(Z > 3.33)
Area = 0.9996 = 1 0.9996
= 0.0004
0 Zdata = 3.33
Figure 9.8
as shown in Figure 9.8. The Z table tells us that the area to the left of Zdata 5 3.33 is
0.9996. Thus, using Case 2 from Table 6.7 in Section 6.4 (page 000), we find the area
to the right of Zdata 5 3.33 to be
Alternatively, we can use the TI-83/84 to help us calculate the p-value. Press 2nd
DISTR then 2: normalcdf(-1E99, 3.33) (Figure 9.9). The TI-83/84 provides the area
to the left of Z 5 3.33 as 0.9995657137 (Figure 9.10). Thus,
Figure 9.9 Figure 9.10
_
Unusual and extreme values of x , and therefore of Zdata, will have a small
_
p-value, while values of x
and Zdata nearer to the center of the distribution will have
a large p-value.
Assuming H0 is true:
_
Unusual and extreme values of x
and Zdata Small p-value
_
(close to 0)
Values of xand Zdata near center Large p-value
(greater than,
say, 0.15)
Keep in mind that the p-value is a probability. In fact, the term p-value actu-
ally means probability value. Since it is a probability, the p-value must always be
between 0 and 1. A small p-value indicates a conflict between your sample data and
the null hypothesis. Therefore, since the data are real and the hypothesis is a conjec-
ture, this small p-value will lead us to reject H0.
9.2 Z Test for the Population Mean m: p-Value Method 17
However, how small is small? We have already met another fellow in this
chapter, a type of probability called a, the probability of a Type I error. (Recall
that a Type I error is rejecting H0 when H0 is true.) The researcher chooses a to be
some small constant like 0.01, 0.05, or 0.10 and then compares the p-value with
a. A p-value is small if it is smaller than a. This leads us to the rejection rule
that tells us the conditions under which we may reject the null hypothesis. In
fact, this rejection rule can be applied to any type of p-value hypothesis test we
perform.
The rejection rule for performing a hypothesis test using the p-value method is
Reject H0 when the p-value is less than a.
Note that this rejection rule calls for a new role for a.
Level of Significance, a
The value of a represents the boundary between results that are statistically
significant (where we reject H0) and results that are not statistically significant
(where we do not reject H0). Thus, a is called the level of significance of the
hypothesis test.
where m represents the population mean cost of a CD. Since the conclusion for the
hypothesis test in Example 9.7 was to reject H0, the generic interpretation is: There is
evidence that [whatever Ha says]. Here, Ha states that m 10; that is, the population
mean cost of a CD is greater than $10. Hence, we interpret our conclusion as follows:
There is evidence that the population mean cost of a CD is greater than $10. n
Example 9.9 The Z test for the mean using the p-value method: left-tailed test
The technology Web site www.cnet.com publishes user reviews of computers, soft-
ware, and other electronic gadgetry. The mean user rating, on a scale of 110, for the
Dell XPS 410 desktop computer as of September 10, 2007, was 7.2. Assume that the
population standard deviation of user ratings is known to be s 5 0.9. A random sample
_
taken this year of n 5 81 user ratings for the Dell XPS 410 showed a mean of x5 7.05.
Using level of significance a 5 0.05, test whether the population mean user rating for
this computer has fallen since 2007.
Solution
The sample size n 5 81 is large, and the population standard deviation is known. We
may therefore perform the Z test for the mean.
Step 1 State the hypotheses and the rejection rule. The key words here are has
fallen, which means is less than. The only form of the test with the , sign in it is
the left-tailed test (from Table 9.1):
We find the value of m0 by asking, Is less than what? The answer is, Less than the
mean user rating from 2007, which was 7.2. Thus, m0 5 7.2, and our hypotheses are
where m refers to the population mean user rating for the Dell XPS 410 computer. We
will reject H0 if the p-value is less than a 5 0.05.
_
Step 2 Find Zdata. We have x 5 7.05, m0 5 7.2, n 5 81, and s 5 0.9. Thus, our test
statistic is
_ _
2 m0 ______
x
x2m
Zdata 5 ______
s_
5 0
7.05 2
5 _________ 7.2
20.15
5 ______
5 21.5
x s/
n 0.9/ 81 0.1
Step 3 Find the p-value. Our hypotheses represent a left-tailed test from Table
9.4. Thus,
p-value 5 P(Z , Zdata) 5 P(Z , 21.5)
This is a Case 1 problem from Table 6.7 (page 000). The Z table provides us with the
area to the left of Z 5 21.5 (Figure 9.11):
p-value =
P(Z < 1.5)
Zdata = 1.5 0
Figure 9.11
20 Chapter 9 Hypothesis Testing
Step 4 State the conclusion and interpretation. In Step 1, we said that we would
reject H0 if the p-value was less than our level of significance a 5 0.05. From Step3,
we have p-value 5 0.0668. Since 0.0668 is not , 0.05, we therefore do not reject H0.
_
The difference between x 5 7.05 and m0 5 7.2 is not statistically significant. The ge-
neric interpretation is There is insufficient evidence that [whatever Ha says]. Here,
Ha states that m , 7.2; that is, the population mean user rating for the Dell XPS 410
computer is less than 7.2. Hence, we interpret our conclusion as follows:
3837 3334 3554 3838 3625 2208 1745 2846 3166 3520 3380 3294
2576 3208 3521 3746 3523 2902 2635 3920 3690 3430 3480 3116
3428 3783 3345 3034 2184 3300 2383 3428 4162 3630 3406 3402
3500 3736 3370 2121 3150 3866 3542 3280
Health care officials are interested in whether there is evidence that recent prenatal
nutrition campaigns are working. Formerly, the mean birth weight of babies was
3200 grams. Assume that the population standard deviation s 5 528 grams. The
_
sample mean birth weight is x 5 3276 grams. Is there evidence that the population
mean birth weight of Brisbane babies now exceeds 3200 grams? Use technology to
perform the appropriate hypothesis test, with level of significance a 5 0.10.
25
20
Frequency
15
10
0
1500 2000 2500 3000 3500 4000
_
m0 x
= 3200 = 3276
Baby weights (grams)
Solution
Since the sample size n 5 44 is large and 5 528 is known, we may proceed with the
hypothesis test, using Case 2 (n $ 30).
Step 1 State the hypotheses and the rejection rule. The key word exceeds
means that we have a right-tailed test:
where m refers to the population mean birth weight of Brisbane babies. We will reject
H0 if the p-value is less than a 5 0.10.
Step 2 Find Zdata. We will use the instructions provided in the Step-by-Step Tech-
nology Guide at the end of this section (page 26). Figure 9.13 shows the TI-83/84
results from the Z test for m:
_
2 m0 ___________
x
Zdata 5 ______ 5 3276 2
3200
5 0.9547859245 0.9548
s/
n 528/44
Form of Ha:
Zdata
p-value
Sample mean x
Sample standard deviation s
Sample size n
Step 3 Find the p-value. We have a right-tailed test from Step 1, so that from
Table9.4 our p-value is
p-value 5 P(Z . Zdata) 5 P(Z . 0.9548)
And from Figure 9.13 we have
Step 4 State the conclusion and interpretation. This p-value of 0.1698 repre-
sents the probability of observing a sample mean birth weight at least as extreme as the
sample mean birth weight of 3276 grams that we actually observed if we assume the
null hypothesis is true. In Step 1 we said that we would reject H0 if the p-value is less
than our level of significance a 5 0.10. Since 0.1698 is not less than 0.10, we do not
_
5 3276 was not far away from
reject H0. This is as we expected from the fact that x
the hypothesized mean birth weight of m0 5 3200 grams. We interpret our conclusion
as follows: There is insufficient evidence that the population mean birth weight is
greater than 3200 grams. n
PPLET The P-Value applet allows you to experiment with various hypotheses, means, stan-
A
dard deviations, and sample sizes in order to see how changes in these values affect
the p-value.
Next, we will learn how to find the p-value manually. Notice that each of the for-
mulas from Table 9.4 for finding the p-value uses the standard normal methods for
finding probability shown in Table 6.7 (page 000).
Example 9.11 Hemoglobin levels in adult males undergoing cardiac surgery: two-tailed test
When the level of hemoglobin in the blood is too low, the person is anemic. Un-
usually high levels of hemoglobin are undesirable as well and can be associated
with dehydration. A study by the Harvard Medical School Cardiogenomics Group
recorded the hemoglobin levels, in grams per deciliter (g/dL) of blood, in patients
undergoing cardiac surgery.5 A random sample of 20 male cardiac patients yielded
a sample mean hemoglobin level of 12.35 g/dl. Assume that the population standard
9.2 Z Test for the Population Mean m: p-Value Method 23
deviation is 2.8 g/dl. Test whether the population mean hemoglobin level for males
undergoing cardiac surgery differs from 13.8 g/dl using the p-value method at level
of significance a 5 0.05.
100
95
90
80
70
Percentage
60
50
40
30
20
10
5
1
0 5 10 15 20
Hemoglobin levels for male cardiac patients (g/dl)
Solution
The normal probability plot shown in Figure 9.16 indicates an acceptable level of nor-
mality, allowing us to proceed with the hypothesis test.
Step 1 State the hypotheses and the rejection rule. Since either too much or too
little hemoglobin is bad, we have a two-tailed test here: H0 : m 5 13.8 versus Ha : m
13.8, where m is the population mean hemoglobin level in male cardiac patients. We
will reject H0 if the p-value is less than a 5 0.05.
_
Step 2 Find Zdata. We have a sample of n 5 20 patients, for which x
5 12.35. We
know that s 5 2.8. Plugging into the formula for Zdata, we get
_ _
2 m0 ______
x 2 m0 ___________
x
Zdata 5 ______
s_
5 5 12.35 2
13.8
22.32
x s/
n 2.8/
20
When calculating the p-value manually, round Zdata to two decimal places.
Step 3 Find the p-value. Because we have a two-tailed test, Table 9.4 tells us that
Thus, our p-value, which is the sum of these two tail areas as shown in Figure 9.17,
equals
Step 4 State the conclusion and interpretation. The p-value 0.0204 is smaller
than a 5 0.05. Therefore, our conclusion is to reject H0. The difference between
_
x512.35 g/dl and m0 5 13.8 g/dl is statistically significant. Using the generic inter-
pretation, we can say: There is evidence that the population mean hemoglobin level
in male cardiac patients is not equal to 13.8 g/dl. n
Table 9.5 Strength of evidence against the null hypothesis for various levels of p-value
D eveloping
Your Statistical Sense
The Role of the Level of Significance a
Suppose that in Example 9.11, our level of significance a was 0.01 rather than
0.05. Would this have changed anything? Certainly. Since our p-value of 0.0204
is not less than the new a 5 0.01, we would not reject H0. Think about that for a
moment. The data havent changed at all, but our conclusion is reversed simply by
changing a. What is a data analyst to make of a situation like this? There are two
alternatives.
1. Since we dont want the choice of a to dictate our conclusion, then perhaps we
should turn to a direct assessment of the strength of evidence against the null
hypothesis, as provided in Table 9.5. In this case, the p-value of about 0.0204
would offer solid evidence against the null hypothesis, regardless of the value
of a.
2. Obtain more data, perhaps through a call for further research. This option is
especially relevant for this example, where the sample size n 5 20 is rather
small. n
26 Chapter 9 Hypothesis Testing
We will use the weight data from Example 9.10 (page 20).
TI-83/84
If you have the data values: If you have the summary statistics:
Step 1 Enter the data into list L1. Step 1 Press STAT, highlight TESTS, and press ENTER.
Step 2 Press STAT, highlight TESTS, and press ENTER. Step 2 Press 1 (for Z-Test; see Figure 9.19).
Step 3 Press 1 (for Z-Test; see Figure 9.18). Step 3 For input (Inpt), highlight Stats and press ENTER
Step 4 For input (Inpt), highlight Data and press ENTER (Figure 9.20).
(Figure 9.19). a. For m0, enter the value of m0, 3200.
a. For m0, enter the value of m0, 3200. b. For s, enter the value of s, 528.
_
b. For s, enter the value of s, 528. c. For x, enter the sample mean 3276.
c. For List, press 2nd, then L1. d. For n, enter the sample size 44.
d. For Freq, enter 1. e. For m, select the form of Ha. Here we have a right-tailed
e. For m, select the form of Ha. Here we have a right-tailed test, so highlight . m0 and press ENTER.
test, so highlight . m0 and press ENTER. f. Highlight Calculate and press ENTER. The results are shown
f. Highlight Calculate and press ENTER. The results are shown in Figure 9.13 in Example 9.10.
in Figure 9.13 in Example 9.10.
1
The essential idea about hypothesis testing for the mean
_
based on the assumption that H0 is correct, we should reject
may be described as follows. When the observed value of x H0. Otherwise, there is insufficient evidence against H0,
_
is unusual or extreme in the sampling distribution of xthat is and we should not reject H0. All of the remaining steps and
Section 9.2 Exercises 27
Section 9.2Exercises
37. Stock Market. The Statistical Abstract of the United to recoup their additional initial cost through reduced fuel
States reports that the mean daily number of shares traded consumption. Suppose that a random sample of 9 hybrid
on the New York Stock Exchange in 2005 was 1.6 billion. cars showed a sample mean time of 2.1 years. Assume that
Let this value represent the hypothesized population mean, the population is normal with s 5 0.2. Test using level of
and assume that the population standard deviation equals significance a 5 0.01 whether the population mean time it
0.5 billion shares. Suppose that, in a random sample of takes owners of hybrid cars to recoup their initial cost is less
36 days from the present year, the mean daily number than three years.
of shares traded equals 1.5 billion. We are interested in 44. Americans Height. Americans used to be on average
testing, whether the population mean daily number of the tallest people in the world. That is no longer the case,
shares traded has increased since 2005, using level of according to a study by Dr. Richard Steckel, professor of
significance a 5 0.05. economics and anthropology at The Ohio State University.
38. Child Abuse. The U.S. Administration for Children The Norwegians and Dutch are now the tallest, at 178
and Families reported that the national rate for child abuse centimeters, followed by the Swedes at 177, and then
referrals was 43.9 per 1000 children in 2005. Suppose that the Americans, with a mean height of 175 centimeters
a random sample of 1000 children taken this year shows (approximately 5 feet 9 inches). According to Dr. Steckel,
47 child abuse referrals. Assume s 5 5. Test whether the The average height of Americans has been pretty much
population mean referral rate has increased this year from stagnant for 25 years.8 Suppose a random sample of 100
the 2005 level, using level of significance a 5 0.10. Americans taken this year shows a mean height of 174
39. California Warming. A 2007 report found that the centimeters, and we assume s 5 10 centimeters. Test using
mean temperature in California has increased from 1950 to level of significance a 5 0.01 whether the population
2000 by 2 degrees Fahrenheit (F). Suppose that a random mean height of Americans this year has changed from
sample of 36 California locations showed a mean increase 175 centimeters.
of 4F over 1950 levels. Assume s 5 0.5. Test whether the
population mean temperature increase in California is greater Sodium in Breakfast Cereal. Use the following information
than 2F, at level of significance a 5 0.05. for Exercises 4548. A random sample of 23 breakfast
40. Eating Trends. According to an NPD Group report cereals containing sodium had a mean sodium content
the mean number of meals prepared and eaten at home is per serving of 192.39 grams. Assume that the population
less than 700 per year. Suppose that a random sample of standard deviation equals 50 grams. We are interested in
100 households showed a sample mean number of meals whether the population mean sodium content per serving is
prepared and eaten at home of 650. Assume s 5 25. Test less than 210 grams.
whether the population mean number of such meals is less 45. a.The normal probability plot of the sodium content is
than 700, using level of significance a 5 0.10. shown in Figure 9.21. Should we proceed to apply the
41. DDT in Breast Milk. Researchers compared the amount Z test? Why or why not?
of DDT in the breast milk of 12 Hispanic women in the b. Test whether the population mean sodium content
Yakima Valley of Washington State with the amount of per serving is less than 210 grams, using level of
DDT in breast milk in the general U.S. population.6 They significance a 5 0.01.
measured the mean DDT level in the general population to
100
be 47.2 parts per billion (ppb) and the mean DDT level in the 95
12 Hispanic women to be 219.7 ppb. Assume s 5 36 and a 90
normally distributed population. Test whether the population 80
mean DDT level in the breast milk of Hispanic women in the 70
Percentage
a 5 0.05. 50 grams had been a typo, and the actual population standard
43. Hybrid Vehicles. A 2006 study by Edmunds.com deviation was smaller? How would this have affected the
examined the time it takes for owners of hybrid vehicles following?
Section 9.2 Exercises 29
a. The standard deviation of the sampling distribution Mean Family Size. Use the following information for
b. Zdata Exercises 5154. According to the Statistical Abstract
c. p-value of the United States, the mean family size in 2002 was
d. The conclusion 3.15persons, reflecting a slow decrease since 1980, when
H AT I F
W
the mean family size was 3.29 persons. Has this trend
47. What if our level of significance a equaled 0.05
?
A
51. We are interested in testing whether the population
using level of significance a 5 0.01. Have the data mean family size in America has decreased since 2002, using
changed? Why did your conclusion change? the p-value method and level of significance a 5 0.05. (Try
c. Suggest two alternatives for addressing the using the P-Value applet to help you solve this problem.)
contradiction between Exercise 45b and Exercise 47a. a. State the hypotheses and the rejection rule.
48. Assess the strength of the evidence against the null b. Find Zdata.
hypothesis. c. Find the p-value.
49. Womens Heart Rates. A random sample of 15 women d. State the conclusion and interpretation.
produced the normal probability plot in Figure 9.22 for 52. Refer to the previous exercise. Suppose that we used
their heart rates. The sample mean was 75.6 beats per level of significance a 5 0.10 rather than a 5 0.05. Redo
minute. Suppose the population standard deviation is (a)(d). Comment on any apparent contradictions. Suggest
known to be 9. two ways to resolve this dilemma.
100 53. Refer to Exercise 51, with our original hypothesis test
95 (with a 5 0.05).
90 a. What is the smallest p-value for which you will reject
80
H0?
70
b. What do you think of the assumption of 1 person for
Percentage
60
50 the value of s? How would this have been obtained,
40 if at all possible?
30 c. Which type of error is it possible that we are making,
20
a Type I error or a Type II error? Which type of error
10
5 are we certain we are not making?
1 d. Suppose a newspaper headline referring to the study
50 60 70 80 90 100 was Mean Family Size Decreasing. Is the headline
Womens heart rates (beats/min) supported or not supported by the data and the
Figure 9.22 hypothesis test?
H AT I F
W
b. Explain in your own words the difference between Online Shopping Privacy Protection. Use the P-Value
the hypotheses in (b) and (c). Also, explain how there applet and the following information for Exercises 5558. A
could be evidence that the population mean heart rate 2007 Carnegie Mellon University study reports that online
is less than 78 but not different from 78. shoppers were willing to pay an average of an extra 60 cents
c. Assess the strength of the evidence against the null on a $15 purchase in order to have better online privacy
hypothesis for the hypothesis tests in (b) and (c). protection.9 Assume that the population standard deviation
30 Chapter 9 Hypothesis Testing
is 20 cents. Suppose that a study conducted this year found a. Enter n 5 30. Click Show P to get the p-value. What
that the mean amount online shoppers were willing to pay is the p-value? Sketch the normal curve, showing the
_
for privacy was x5 65 cents. Does this represent evidence p-value.
that the mean amount online shoppers are willing to pay b. Would we reject the null hypothesis based on these
_
for privacy has increased from 60 cents? As it turns out, values of xand n?
_
the answer depends on the sample size. If this x
5 65 cents 56. Repeat Exercise 55 for n 5 40. Compare the p-value
comes from a large sample, then the evidence against H0 with that from Exercise 55.
will be stronger than if it comes from a small sample. Use 57. Repeat Exercise 55 for n 5 50. Compare the p-value
the P-Value applet to see this, using level of significance with those from Exercises 55 and 56.
_
a 5 0.05. 58. Note that x has not changed in Exercises 55257; only
_
55. Enter H0 : m # 60, Ha : m . 60, s 5 20, and x 5 65. the sample size has changed. Write a sentence to describe the
pattern you see in the p-values as the sample size increases.
2 Perform and interpret the Z test for the population mean using the critical-value
method.
3 Describe the relationship between the p-value method and the critical-value method.
4 Use confidence intervals for to perform two-tailed hypothesis tests about .
5 Calculate the probability of a Type II error, and compute the power of a hypothesis
test.
he critical region consists of the range of values of the test statistic Zdata
T
for which we reject the null hypothesis.
he noncritical region consists of the range of values of the test statistic
T
Zdata for which we do not reject the null hypothesis.
he value of Z that separates the critical region from the noncritical region
T
is called the critical value, Zcrit.
tem of San Diego reported in 2006 that active users of debit cards used them an average
of 11 times per month.10 Suppose that we are interested in testing whether debit card
usage has increased since 2006. For level of significance 5 0.01, find the critical
value Zcrit for this hypothesis test.
Solution
The key word here is increased, which leads (through our strategy) to the following
hypotheses:
H0: 11 versus Ha: 11
where is the population mean number of times consumers use debit cards per month.
Because our level of significance is chosen to be 0.01, the critical region will consist
of the values of Zdata that lie in the uppermost 1% of the Z distribution (see Figure 9.23).
_
These are the extreme values of Zdata. The extreme values of x that are associated with
the values of Z in the critical region will thus lead us to reject H0. The critical value
Zcrit is that value of Z which separates the uppermost 1% of the values of Zdata from the
remaining 99%.
We find the actual value of Zcrit using either technology or the standard normal table
(see Section 6.4, on page 000). We are looking for the value of the Z distribution that has
0.01 area to the right of it and thus has an area of 0.99 to the left of it. We look up the area
0.99 on the inside of the Z table and take the closest area we can find, which is 0.9901.
Then working the Z table backward, we find that Zcrit equals 2.33, as shown in Figure 9.24.
Any values of Zdata greater than Zcrit 2.33 will lead us to reject the null hypothesis. Any
values of Zdata less than Zcrit 2.33 will lead us to not reject the null hypothesis. n
Area = 0.99
a = 0.01 2.0 0.9772 0.9778 0.9783 0.9788 0.9793
2.1 0.9821 0.9826 0.9830 0.9834 0.9838
2.2 0.9861 0.9864 0.9868 0.9871 0.9875
0 Zcrit 2.3 0.9893 0.9896 0.9898 0.9901 0.9904
Noncritical region Critical region 2.4 0.9918 0.9920 0.9922 0.9925 0.9927
Figure 9.23 Zcrit separates the critical region from the
noncritical region. Figure 9.24 Finding Zcrit.
D eveloping
Your Statistical Sense
We Havent Seen the Sample Data Yet
Notice that so far we have not actually looked at any sample data for the debit card
example. Since we were not given the sample mean, sample size, or population
standard deviation, we cannot find Zdata and thus cannot form a conclusion. The
critical value Zcrit does not depend at all on the sample data. Zcrit depends only on the
value of our level of significance and the form of the hypothesis test. n
32 Chapter 9 Hypothesis Testing
Table 9.6 Table of critical values Zcrit for common values of the level of significance
Table 9.6 shows values of Zcrit for the most commonly used values of the level of
significance : 0.01, 0.05, and 0.10. In Example 9.13, the critical region consists of
the values of the test statistic Zdata that are greater than Zcrit because the alternative hy-
pothesis states that the population mean is greater than 0. This is why this form of the
test is called a right-tailed testbecause the critical region is in the right (upper) tail.
Similarly, when you perform a left-tailed test, H0: 0 versus Ha: 0,
the critical region will consist of the values of Zdata that are less than Zcrit because the
alternative hypothesis states that the population mean is less than 0. This is why this
form of the hypothesis test is called a left-tailed testbecause the critical region is
in the left (lower) tail (Figure 9.25).
Figure 9.25
Critical region for
a
a left-tailed test.
Zcrit 0
Critical region Noncritical region
Finally, for a two-tailed hypothesis test, H0: 0 versus Ha: 0, the alterna-
tive hypothesis states that the population mean is not equal to 0. This can happen in
either of two ways: (1) the population mean may be greater than 0, or (2) the population
mean may be less than 0. So the critical region must contain values of Zdata that are either
9.3 Z Test for the Population Mean : Critical-Value Method 33
extremely large (in the right tail) or extremely large negatively (in the left tail). The
critical region will therefore consist of the values of Zdata that are greater than Zcrit
and the values of Zdata that are less than 2Zcrit. This is why this form of the hypothe-
sis test is called a two-tailed testbecause the critical region is in both of the tails.
Figure 9.26 shows what the critical region looks like for a two-tailed test. n
Figure 9.26
Critical region for
a/2 a /2
a two-tailed test.
Zcrit 0 Zcrit
Critical region Noncritical region Critical region
where is the population mean hemoglobin level. Then, from Table 9.6, your critical
value Zcrit is 1.96. Because it is a two-tailed hypothesis test, we reject H0 for values of
Zdata that are either larger than 1.96 or smaller than 21.96. n
Table 9.7 summarizes the rejection rules for each form of the hypothesis test.
Notice that you are essentially comparing the test statistic Zdata with the critical value
Zcrit and basing your decision on their relative values.
Therefore, the conditions under which we may use the Z test critical-value method
are the same as the conditions for using the Z test p-value method. If the population
standard deviation is not known, then the Z test should not be applied.
Step 4 State the conclusion and the interpretation. If Zdata falls in the
critical region, then reject H0. Otherwise, do not reject H0. Interpret your
conclusion so that a nonspecialist can understand.
where is the population mean number of times people use their debit cards per month.
Step 2 Find Zcrit and state the rejection rule. We have a right-tailed test and level
of significance 0.01, which, from Table 9.6, tell us that Zcrit 2.33. Using Table 9.7,
because we have a right-tailed test, the rejection rule will be Reject H0 if Zdata is
greater then Zcrit; that is, Reject H0 if Zdata is greater than 2.33 (see Figure 9.27).
a = 0.01
0 Zcrit = 2.33
Critical region
Figure 9.27
9.3 Z Test for the Population Mean : Critical-Value Method 35
Step 3 Find _
Zdata. We have a sample of n 36 people who used their debit cards
11.5 times, and we assume 3. Also, 0 is the hypothesized value
an average of x
of , in this case 11. Using these values, we get
_ _
2 m
x 2
x m
Zdata 5 ______
0
5 ______
s_x
0
11.5 2 11.0
__________ 1.0
s/
n 3/36
Step 4 State the conclusion and interpretation. Our rejection rule states that we
will reject H0 if Zdata is greater than 2.33. Since Zdata equals 1.0, which is not greater than
2.33, the conclusion is to not reject H0 (Figure 9.28). Even though the sample mean
of 11.5 exceeds 11.0, it does not do so by a wide enough margin to dispel the reason-
able doubt that the difference between this sample mean and may have been due to
chance. We interpret our conclusion as follows: There is insufficient evidence that the
population mean monthly debit card use is greater than 11 times per month. n
a = 0.01
0 Zcrit = 2.33
Zdata = 1 Critical region
Figure 9.28
Probabilities p-value a
is compared with
p-Value
Method
36 Chapter 9 Hypothesis Testing
Since Zdata helps us to determine the p-value, these two values are related. Simi-
larly, since the level of significance helps to determine the value of Zcrit, these two
values are related. Moreover, just as we compare Zdata with the threshold Zcrit, we
compare the p-value statistic with the threshold to determine significance. Thus,
the two methods for carrying out hypothesis tests are equivalent and, in fact, are
quite thoroughly interwoven.
Figures 9.30a and 9.30b illustrate this equivalence for a right-tailed test. The
rejection rule for the p-value method is to reject H0 when the p-value is less than
. The rejection rule for the critical-value method is to reject H0 when Zdata Zcrit.
Note in Figures 9.30a and 9.30b how the p-value is determined by Zdata, and is
determined by Zcrit. In Figure 9.30a, when Zdata Zcrit, it must also happen that the
p-value is larger than . In both cases we do not reject H0. However, in Figure
9.30b when Zdata Zcrit, it also follows that the p-value is smaller than . In both
cases we reject H0. Thus, the p-value method and the critical-value method are
equivalent.
p-value
a
a p-value
for and is a plausible value for . Table 9.8 shows the confidence levels and associ-
ated levels of significance that will produce the equivalent inference.
Figure 9.32 Reject H0 for values of m0 that lie outside (165.53, 219.25).
Dynamic Graphics/Jupiterimages Test using level of significance a 5 0.01 whether the population mean so-
dium content per serving differs from these values: (a) 165, (b) 166, (c) 220.
Solution
We set up the three two-tailed hypothesis tests as follows:
a. H0 : m 5 165 vs. Ha : m 165
b. H0 : m 5 166 vs. Ha : m 166
c. H0 : m 5 220 vs. Ha : m 220
To perform each hypothesis test, simply observe where each value of m0 falls on
the number line shown in Figure 9.32. For example, in the first hypothesis test, the
hypothesized value m0 5 165 lies outside the interval (165.53, 219.25). Thus, we reject
H0. The three hypothesis tests are summarized here. n
Where m0 lies in
Value Form of hypothesis test, relation to 99% Conclusion of
of m0 with a 5 0.01 confidence interval hypothesis test
a. 165 H0 : m 5 165 vs. Ha : m 165 Outside Reject H0
b. 166 H0 : m 5 166 vs. Ha : m 166 Inside Do not reject H0
c. 220 H0 : m 5 220 vs. Ha : m 220 Outside Reject H0
38 Chapter 9 Hypothesis Testing
Let us illustrate the steps for calculating b, the probability of a Type II error, using
an example.
where m represents the population mean debit card usage per month. From Example 9.16
_
we have n 5 36, x
5 11.5, Zcrit 5 2.33, and s 5 3.
9.3 Z Test for the Population Mean : Critical-Value Method 39
b = 0.0475
_
x crit = 12.165 ma = 13
Figure 9.33
Thus, b 5 0.0475. This represents the probability of making a Type II error, that is,
of not rejecting the hypothesis that the population mean debit card usage is at most 11
times per month when in actuality it is 13 times per month. n
W
H AT I F
W
?
Figure 9.34
b = probability 1 b = power
of Type II error of the test
_
x crit = 12.165 ma = 12.5
A power curve plots the values for the power of the test versus the values of ma.
9.3 Z Test for the Population Mean : Critical-Value Method 41
11.0
(
12.165 2
P Z , ___________ 11
___
3/ 36
5 P(Z , 2.33) 5 0.9901
) 1 2 0.9901 5 0.0099
11.5
(12.165211.5
P Z , ____________
3/
___
36
5 P(Z , 1.33) 5 0.9082
) 1 2 0.9082 5 0.0918
12.0
(12.165 ___
P Z , ___________
3/
12
2
36
5 P(Z , 0.33) 5 0.6293
) 1 2 0.6293 5 0.3707
12.165
( 12.165 2
P Z , ______________
3/
12.165
___
36
5 P(Z , 0.00) 5 0.5
) 1 2 0.5 5 0.5
12.5
(
12.165 2___
P Z , ____________
3/
12.5
36
5 P(Z , 20.67) 5 0.2514
) 1 2 0.2514 5 0.7486
13.5
(12.165 2___
P Z , ____________
3/
13.5
36
5 P(Z , 22.67) 5 0.0038
) 1 2 0.0038 5 0.9962
b. Figure 9.35 represents a power curve, since it plots the values for the power of
the test on the vertical axis against the values of ma on the horizontal axis. Note
that, as ma moves farther away from the hypothesized mean m0 5 11, the power
of the test increases. This is because it is more likely that the null hypothesis will
be correctly rejected as the actual value of the mean ma gets farther away from
the hypothesized value m0. n
1
0.9
0.8
Power of the test: 1 a
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
11 11.5 12 12.5 13 13.5
Value of la
Figure 9.35
42 Chapter 9 Hypothesis Testing
1
The critical region consists of the range of values of the 3
The p-value method and the critical-value method are
test statistic Zdata for which we reject the null hypothesis. equivalent. In fact, Zdata, the p-value, the level of significance
The value of Z that separates the critical region from the , and Zcrit are all interrelated.
noncritical region is called the critical value Zcrit.
4
We can use a single 100(1 2 )% confidence interval for
The critical-value method of performing the Z test for the
2 m to help us perform any number of two-tailed hypothesis
mean is equivalent to the p-value method in that the same tests about m.
conclusion would be drawn using either method. In the
critical-value method, we compare one Z-value (Zdata) with 5
We can calculate , the probability of a Type II error, for
another Z-value (Zcrit). a given value of ma. The power of the hypothesis test is the
probability of rejecting the null hypothesis when the null
hypothesis is false. Power is calculated as power 5 1 2 .
Section 9.3Exercises
32. A 99% Z confidence interval for the population mean Health Care Premiums. Use the following information
is (45, 55). Using the equivalence between Z intervals and for Exercises 3840. According to the National Coalition
Z tests, test using level of significance 5 0.01 whether on Health Care, the mean annual premium for an employer
the population mean differs from each of the following health plan covering a family of four cost $11,500 in 2007. A
hypothesized values. random sample of 100 families of four taken this year showed
a. 0 b. 44 c. 50 d. 54 e. 56 a mean annual premium of $11,750. Assume 5 $1500.
38. Test whether the population mean annual premium has
Applying the Concepts increased, using level of significance 5 0.05.
H AT I F
W
For Exercises 3336, do the following. 39. What if the sample mean premium equaled some
?
a. State the hypotheses. value larger than $11,750, while everything else stayed the same?
b. Find Zcrit and the critical region. Also, draw a standard Explain how this change would affect the following, if at all.
normal curve showing the critical region. a. The hypotheses d. Zdata
c. Find Zdata. Also, draw a standard normal curve b. Zcrit e. The conclusion
showing Zcrit, the critical region, and Zdata. c. The critical region
d. State the conclusion and the interpretation. 40. Test whether the population mean annual premium has
33. Price of Milk. The Bureau of Labor Statistics reports increased, using level of significance 5 0.01. Comment.
that the mean price for a gallon of milk in 2005 was $3.24. 41. Accountants Salaries. According to the Wall Street
A random sample of 100 retail establishments this year Journal, the mean salary for accountants in Texas in 2007
provides a mean price of $3.40. Assume 5 $0.25. Perform was $50,529. A random sample of 16 Texas accountants this
a hypothesis test using level of significance 5 0.05 to year showed a mean salary of $52,000. We assume that the
investigate whether the population mean price of milk this population standard deviation equals $4000. The histogram
year has increased from the 2005 value. of the salary (in $1000s) is shown here. If it is appropriate to
34. Household Size. The U.S. Census Bureau reported apply the Z test, then do so, using the critical-value method
that the mean household size in 2002 was 2.58 persons. A and level of significance 5 0.05. If not, then explain
random sample of 900 households this year provides a mean clearly why not.
size of 2.5 persons. Assume 5 0.1. Conduct a hypothesis 5
test using level of significance 5 0.10 to determine
whether the population mean household size this year has 4
decreased from the 2002 level.
Frequency
3
35. Salaries of Assistant Professors. Suppose that the mean
salary for assistant professors in science in 2005 was $50,000. 2
A random sample of 81 assistant professors in science taken
this year shows a mean salary of $51,750. Assume 5 $1000. 1
Perform a hypothesis test using level of significance 5 0.01
0
to determine whether the population mean salary of assistant
49 50 51 52 53 54
professors in science has increased since 2005. Salary
36. Americans Height. A random sample of 400 Americans
yields a mean height of 176 centimeters. Assume 5 2.5 Salaries of 16 accountants.
centimeters. Conduct a hypothesis test to investigate whether 42. Honda Civic Gas Mileage. Cars.com reported in 2007
the population mean height of Americans this year has changed that the mean city gas mileage for the Honda Civic was
from 175 centimeters, using level of significance 5 0.05. 30 mpg. This year, a random sample of 20 Honda Civics
37. Cost of Education. The College Board reports that had a mean gas mileage of 36 mpg. Assume 5 5 mpg. A
the mean annual cost of education at a private four-year Minitab histogram of the data is shown here.
college was $22,218 for the 20062007 school year. Suppose 5
that a random sample of 49 private four-year colleges this
year gives a mean cost of $24,000 per year. Assume the 4
population standard deviation is $3000.
Frequency
a. Is it appropriate to apply the Z test? Explain clearly b. We have a sample mean that is greater than the mean
why or why not. in the null hypothesis of 5.9 cents. Isnt this enough
b. Test at level of significance 5 0.10 whether the by itself to reject the null hypothesis? Explain why or
population mean city gas mileage has increased since why not.
_ __
2007. c. What is the standard deviation of x , s/ n?
c. What if we now performed the same test on the same d. How many standard deviations above the mean is the
data but used 5 0.05 instead. Without carrying out 6.2 cents per mile? Do you think this is extreme?
H AT I F
the hypothesis test, state whether this would affect W
?
our conclusion. Why or why not?
Try to answer the following questions by thinking about the
43. Automobile Operation Cost. The Bureau of
relationship between the statistics rather than by redoing all
Transportation Statistics reports that in 2002 the mean cost
the calculations. Suppose that the 36 mpg is a typo. We are
of operating an automobile in the United States, including
not sure what the actual sample mean is, but it is less than
gas and oil, maintenance and tires, was 5.9 cents per mile.
36 mpg.
Suppose that a sample taken this year of 100 automobiles
a. How does this affect Zdata?
shows a mean operating cost of 6.2 cents per mile, and
b. How does this affect Zcrit?
assume that the population standard deviation is 1.5 cents per
c. How does this affect the conclusion?
mile. We are interested in whether the population mean cost
45. Automobile Operation Cost. Refer to Exercise 43.
has increased since 2002, using the critical-value method and
a. Construct the hypotheses.
level of significance 5 0.05.
b. Find the Z critical value and state the rejection rule.
a. Is it appropriate to apply the Z test? Why or why
c. Calculate the value of the test statistic Zdata.
not?
d. State the conclusion and the interpretation.
t (df = 1)
t (df = 2)
t (df = 10)
Z
Figure 9.36
Distribution of t
approaches distribution
of Z as sample size
increases.
5 4 3 2 1 0 1 2 3 4 5
_
Just as we did for the Z test, lets look at the sampling distribution of x, but this time,
_
when s is unknown. See Figure 9.37. The sampling distribution of xwhen s is unknown
has a t distribution with mean m. Since s is unknown, the estimate of the standard devia-
__
tion of the sampling distribution is s/ n. Just as for the Z tests, if we assume that H0 is
true, then this sampling distribution will have a mean of m0, the value of m given in H0.
Spread determined
Figure 9.37 by s/n
The sampling distribution
_
of x
when s is unknown
is centered at m0 if
H0 is correct.
_ _
Centered at m0 Extreme value of x : x
Recall from Section 8.2 that the t distribution may be used only if either of the
following conditions is satisfied:
ing statistic
_
2 m0
x
______
t 5
s/
n
has a t distribution with n 2 1 degrees of freedom (if either the population is normal
or the sample size is large). We call this t statistic tdata.
The test statistic used for the t test for the mean is
_
x
m0
tdata 5 ______
s/ n
_ _
Extreme values of x, that is, values of xthat are significantly far from the hypoth-
esized m, will translate into extreme values of tdata. In other words, just as with Zdata,
_
when xis far from m0, tdata will be far from 0.
46 Chapter 9 Hypothesis Testing
Just as with the Z test, the p-value method and the critical-value method are equiv-
alent for the same level of significance a, and they will give you the same conclu-
sion. Again, however, the p-value method is used more widely than the critical-value
method. Therefore, we will begin with the p-value method. The definition of p-values
for t tests is the same as that for the Z test for the mean. Unusual and extreme values
_ _
of x, and therefore of tdata, will have a small p-value, while values of xand tdata nearer
to the center of the distribution will have a large p-value.
Table 9.9 summarizes the definition of the p-value for t tests. Note that we will
not be finding these p-values manually but will either (a) use a computer or calculator
or (b) estimate them using the t table. Once again, based on the p-value, we assess the
strength of evidence against the null hypothesis, using Table 9.5 and keeping in mind
that, for certain domains, alternative interpretations would be appropriate.
p-value
Right-tailed test
P(t . tdata)
H0: m # m0
Area to the right of tdata
Ha: m . m0
0 tdata
100 Solution
95
Since we do not know the value of s, we cannot perform a Z test.
90
80
To use a t test, either the population must be normal or the sample
70 size must be large. Since n 5 14 is not a large sample size, we check
Percentage
Step 3 Find the p-value. Also from Figure 9.39, we have the p-value, which is
Note that we could not have found this p-value without a computer or calculator. Using
the t table, the p-value may only be estimated.
Step 4 State the conclusion and interpretation. Since the p-value 0.181 is not
less than a 5 0.10, we do not reject H0. There is insufficient evidence that the popu-
lation mean number of home runs per player is less than 16. Even though the sample
_
5 13.8571 is less than 16, the difference is not sufficiently great to con-
mean of x
clude beyond a reasonable doubt that the population mean m is less than 16 home
runs per player. n
Example 9.22 Performing the t test for m using technology: nurse-to-patient ratios
The United States is undergoing a nursing shortage. According to the American Asso-
ciation of Colleges of Nursing, the estimated shortage of registered nurses will increase
to 340,000 by the year 2020. The table below contains a random sample of ten highly
rated cancer care facilities, along with their nursing index (nurse-to-patient ratio), as
reported by U.S. News & World Report in 2007.12 Suppose that the population mean
nursing index in 2005 was 1.6 nurses per cancer patient, and that we are interested in
whether the population mean index has changed since then. Perform the appropriate
hypothesis test using level of significance a 5 0.05.
Hospital Index
Memorial Sloan Kettering Cancer Center 1.5
M. D. Anderson Cancer Center 2.0
Johns Hopkins Hospital 2.3
Mayo Clinic 2.8
Dana Farber Cancer Institute 0.8
Univ. of Washington Medical Center 2.2
Duke University Medical Center 1.8
Univ. of Chicago Hospitals 2.3
UCLA Medical Center 2.2
UC San Francisco Medical Center 2.3
Solution
100
95
Since the sample size is small, we check normality. The normal
90 probability plot (Figure 9.40) is not perfectly linear, but there are no
80 points outside the bounds, and it is difficult to determine normality
70
for such small sample sizes. We proceed to perform the t test using
Percentage
60
50
Case 1, with the caveat that the normality assumption could be bet-
40 ter supported and that more data would be helpful.
30
20 Step 1 State the hypotheses and the rejection rule. The key
10 words has changed means that we have a two-tailed test:
5
1 H0: m 5 1.6 versus Ha: m 1.6
0 1 2 3 4
Nursing index
where m represents the population mean nursing index. We will
Figure 9.40 Normal probability plot of nursing index. reject H0 if the p-value is less than a 5 0.05.
9.4 t Test for the Population Mean m 49
Step 2 Find tdata. We use the instructions supplied in the Step-by-Step Technology
Guide at the end of this section. Figure 9.41 shows the TI-83/84 results from the t test
for m.
Form of Ha:
tdata
p-value
Sample mean x
Sample standard deviation s
Sample size n
Using the statistics from Figure 9.41, we have the test statistic
_
2
x m
tdata 5 ______
0
2.02 2
1.6
5 ________________
5 2.417718103
p-value for a two-tailed test s/
n 0.549343042/10
is sum of two tail areas.
Step 3 Find the p-value. From Figures 9.41 and 9.42, we have
Example 9.23 The t test for m using the estimated p-value method
The table below lists the prices in dollars for a sample of 17 mathematics journals in
2005, as reported by the American Mathematical Society.
Suppose that the mean cost of all mathematics journals in 2001 was $400, and we are
interested in whether the population mean cost is increasing. Evaluate the normality
assumption, and perform the appropriate hypothesis test by estimating the p-value and
comparing it with level of significance a 5 0.05.
Solution
Though not perfect, the normal probability plot indicates an acceptable level of nor-
mality of the math journal prices. We may therefore proceed to apply the t test.
50 Chapter 9 Hypothesis Testing
100 Step 1 State the hypotheses and the rejection rule. The hypoth
95
90
eses are
80
70
H0: m # 400 versus Ha: m 400
Percentage
60
50
40 where m represents the population mean math journal price. We will
30
20
reject the null hypothesis if the p-value is less than a 5 0.05.
10 _
Step 2 Find tdata. Our sample statistics are x
5 $506.82, s 5 $230,
5
1 and n 5 17, so our test statistic is
400 200 0 200 400 600 800 1000 1200 1400
Price of mathematics journals
506.82 2
tdata 5 ____________400
1.91
Normal probability plot for price of math journals. 230/ 17
We see that our value of tdata 5 1.91 would fit between t 5 1.746 and t 5 2.120. There-
fore, the tail area associated with tdata 5 1.91 is between area 0.025 and area 0.05.
Therefore, since this is a right-tailed test, the estimated p-value also lies between 0.025
and 0.05.
Step 4 State the conclusion and interpretation. Since the p-value is less than a 5
0.05, we reject H0. There is evidence that the population mean price of math journals
is higher than $400. n
Step 4 State the conclusion and the interpretation. If tdata falls within the
critical region, then reject H0. Otherwise, do not reject H0. Interpret your
conclusion so that a nonspecialist can understand.
Table 9.11 contains the rejection rules and critical regions for the t test.
Table 9.11 Critical regions and rejection rules for various forms of the t test for m
Form of test Rejection rule Critical region
Right-tailed test
a
H0: m # m0 Reject H0 if tdata . tcrit
Ha: m . m0 tcrit
0
Noncritical Critical
region region
Left-tailed test a
H0: m $ m0 Reject H0 if tdata , 2tcrit
Ha: m , m0 tcrit 0
Critical Noncritical
region region
Example 9.24 Has the mean age at onset of anorexia nervosa been decreasing?
We are interested in testing, using level of significance a 5 0.05, whether the mean
age at onset of anorexia nervosa in young women has been decreasing. Assume that
Variable N Mean StDev
Patient Age 20 14.251 1.512 the previous mean age at onset was 15 years old. Data were gathered for a study
of the onset age for this disease.13 From these data, a random sample was taken of
100 20 young women who were admitted under this diagnosis to the
95 Toronto Hospital for Sick Children. The Minitab descriptive statis-
90
tics shown here indicate a sample mean age of 14.251 years and a
80
70 sample standard deviation of 1.512 years. If appropriate, perform
Percentage
60 the t test.
50
40
Solution
30 Since the sample size n 5 20 is not large, we need to verify normal-
20 ity. The normal probability plot of the ages at onset, shown here,
10
indicates that the ages in the sample are normally distributed. We
5
1
may proceed to perform the t test for the mean.
10 12 14 16 18 20
Patient age at onset of anorexia nervosa
Step 1 State the hypotheses. The key word decreasing guides
us to state our hypotheses as follows:
Normal probability plot for age at onset of anorexia
nervosa.
H0: m $ 15 versus Ha: m , 15
a = 0.05
tcrit = 1.729 0
Critical region Noncritical region
Figure 9.43 Critical region for left-tailed t test with a 5 0.05, df 5 19.
Step 3 Find tdata. _The sample data show that our sample of n 5 20 patients has a
mean age at onset of x 5 14.251 years with a standard deviation of s 5 1.512 years.
Also, m0 5 15, since this is the hypothesized value of m stated in H0. Therefore, our test
statistic is
_
2 m
x
tdata 5 ______
0 14.251 2
5 ___________15 22.2154
s/n 1.512/20
9.4 t Test for the Population Mean m 53
Step 4 State the conclusion and interpretation. The rejection rule from Step
2 says to reject H0 if tdata , 21.729. From Step 3, we have tdata 5 22.2154. Since
22.2154 is less than 21.729, our conclusion is to reject H0. If you prefer the graphical
approach, consider Figure 9.44, which shows where tdata falls in relation to the critical
region. Since tdata 5 22.2154 falls within the critical region, our conclusion is to reject
H0. There is evidence that the population mean age of onset has decreased from its
previous level of 15 years. n
a = 0.05
This quantity was used in Sections 9.2 and 9.3 to calculate the test statistic Zdata when
s is known. However, here in Section 9.4 we no longer assume that s is known. When
s is not known, we use the following quantity to estimate the standard deviation of the
sampling distribution of the sample mean.
_
Standard Error of the Sample Mean x
s
____
_5
s
x
n
Since the denominator of tdata equals the standard error s_x, tdata is expressed in terms of
standard errors. (The quantity in the denominator is called a scaling factor.) This leads
us to the following interpretation of the meaning of the test statistic tdata.
where m represents the population mean age at onset of anorexia nervosa in young
_
women. The sample data showed that n 5 20 patients had a mean age at onset of x5
14.251 years with a standard deviation of s 5 1.512 years.
__
a. Calculate the standard error of the mean, s_x5 s/ n.
s
s_x5 ___ 1.512 0.3381
5 _____
n
20
_
x2 m
b. From Example 9.24, we had tdata 5 ______
0
14.251 2 15
5 ___________ 22.2154
s/
n 1.512/20
Figure 9.45 illustrates the sampling distribution of the sample mean, when m0 5 15.
The hypothesized mean m0 5 15 is shown, along with m0 1s_x 5 1520.3381 5
14.6619 (one standard error below the mean) and m0 2 2s_x 5 15 2 2(0.3381) 5
_
14.3238 (two standard errors below the mean). The sample mean x 5 14.251 lies
somewhat more than two standard errors below the mean. Specifically, tdata 5 22.2154
_
indicates that the sample mean age at onset of anorexia nervosa x5 14.251 lies 2.2154
standard errors below the hypothesized mean age at onset m0 5 15. n
x = 14.251 lies
2.2154 standard
errors below m 0.
Figure 9.45
A
Suppose we have two quantities A and B, with A > B > 0. Then A/B is called
the golden ratio if
B
A 1 B
______ A
5 __
A B
Purestock
that is, if the ratio of the sum of the quantities to the larger quantity equals the
ratio of the larger to the smaller (see Figure 9.46).
A B
A+B
A + B is to A as A is to B
Euclid wrote about the golden ratio in his Elements, calling it the extreme and
mean ratio. The ratio of the width A and height B of the Parthenon, one of the most
Figure 9.46
famous temples in ancient Greece, equals the golden ratio (Figure 9.46). If you enclose
the face of Leonardo da Vincis Mona Lisa in a rectangle, the resulting ratio of the long
side to the short side follows the golden ratio (Figure 9.47). The golden ratio has a
A value of approximately 1.6180339887.
Now we will test whether there is evidence for the use of the golden ratio in the
artistic traditions of the Shoshone, a Native American tribe from the American West.
Perhaps the best-known Shoshone is Sacajawea, whose assistance was crucial to the
B
success of the Lewis and Clark expedition in the early nineteenth century. The Sho-
shone are superb artisans, with a long tradition of skilled beadwork.
Figure 9.48 shows a fine example of a nineteenth-century Shoshone beaded
dress.14 This dress belonged to Nahtoma, the daughter of Chief Washakie of the Eastern
Shoshone. Figure 9.49 shows a detail of the upper part of the dress, including five
beaded rectangles. It is intriguing to consider whether Shoshone beaded rectangles
such as these follow the golden ratio.
Alamy
Figure 9.47
Table 9.12 contains the ratios of lengths to widths of 18 beaded rectangles made
by Shoshone artisans.15 We will perform a hypothesis test to determine whether the
population mean ratio of Shoshone beaded rectangles equals the golden ratio of
1.6180339887.
100
95
90
80
70
Percentage
60
Figure 9.50 50
Normal probability 40
plot (n 5 18). 30
20
10
5
1
1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2.0
Shoshone beaded rectangle ratios (n = 18)
Solution
We use the TI-83/84 to perform this hypothesis test, using the instructions provided at
the end of this section.
Step 1 State the hypotheses and the rejection rule. Since we are interested in
whether the population mean length-to-width ratio of Shoshone beaded rectangles
equals the golden ratio of 1.6180339887, we perform a two-tailed test:
p-value for a two-tailed test Step 3 Find the p-value. From Figures 9.51 and 9.52, we have
is sum of two tail areas.
We will use the nurse-to-patient ratio data from Example 9.22 (page 48).
TI-83/84
If you have the data values: If you have the summary statistics:
Step 1 Enter the data into list L1. Step 1 Press STAT, highlight TESTS, and press ENTER.
Step 2 Press STAT, highlight TESTS, and press ENTER. Step 2 Press 2 (for T-Test; see Figure 9.53).
Step 3 Press 2 (for T-Test; see Figure 9.53). Step 3 For input (Inpt), highlight Stats and press ENTER
Step 4 For input (Inpt), highlight Data and press ENTER (Figure 9.55).
(Figure 9.54). a. For m0, enter the value of m0, 1.6.
a. For m0, enter the value of m0, 1.6. b. For Sx, enter the value of s, 0.549343042.
_
b. For List, press 2nd, then L1. c. For x
, enter the sample mean 2.02.
c. For Freq, enter 1. d. For n, enter the sample size 10.
d. For m, select the form of Ha. Here we have a two-tailed test, e. For m, select the form of Ha. Here we have a two-tailed test,
so highlight m0 and press ENTER. so highlight m0 and press ENTER.
e. Highlight Calculate and press ENTER. The results are shown f. Highlight Calculate and press ENTER. The results are shown
in Figure 9.41 in Example 9.22. in Figure 9.41 in Example 9.22.
EXCEL
WHFStat Macros Step 4 Select cells A1 to A10 as the Dataset Range.
Step 1 Enter the data into column A. (If you have only the (Alternatively, you may enter the summary statistics.)
summary statistics, go to Step 2.) Step 5 Select your Confidence level, which should be 1 a.
Step 2 Load the WHFStat Macros. Here, because a 5 0.05, we select 95%.
Step 3 Select Add-Ins . Macros . Testing a Mean . t Test Step 6 Enter the Null Hypothesis Value, m0 5 1.6, and click
Confidence Interval One Sample. OK.
(Continued)
58 Chapter 9 Hypothesis Testing
MINITAB
If you have the data values: If you have the summary statistics:
Step 1 Enter the data into column C1. Step 1 Click Stat . Basic Statistics . 1-Sample t.
Step 2 Click Stat . Basic Statistics . 1-Sample t. Step 2 Click Summarized Data,
Step 3 Click Samples in Columns and select C1. Step 3 Enter the Sample Size 10, the Sample Mean 2.02, and
Step 4 For Test Mean, enter 1.6. the Sample Standard Deviation 0.549343042.
Step 5 Click Options. Step 4 Click Options.
a. Choose your Confidence Level as 100(1 2 a). Our level of a. Choose your Confidence Level as 100(1 2 a). Our level of
significance a here is 0.05, so the confidence level is 95.0. significance a here is 0.05, so the confidence level is 95.0.
b. Select not Equal for the Alternative. b. Select not Equal for the two-tailed test.
Step 6 Click OK and click OK again. Step 5 Click OK and click OK again.
1
The test statistic used for the t test for the mean is
_
method. For the p-value method, we reject H0 if the p-value
2
x m is less than a.
tdata5 ______
__0
n
s/ 2
For the critical-value method, we compare the values of
with n 2 1 degrees of freedom. The t test may be used tdata and tcrit. If tdata falls in the critical region, we reject H0.
under either of the following conditions: (Case 1) the
population is normal, or (Case 2) the sample size is large 3 The value of tdata may be interpreted as the number of
(n 30). We perform hypothesis testing for the mean standard errors above or below the hypothesized mean m0
_
with the t test, using either the p-value or the critical-value that the sample mean xlies.
Section 9.4Exercises
Clarifying the Concepts 12. Describe what happens to the t critical value tcrit for the
1. What assumption is required for performing the Z test two-tailed tests in Exercises 911 as the level of significance
that is not required for the t test? a decreases.
2. What do we use to estimate the unknown population
For Exercises 1318, find the value of tdata.
standard deviation s? _
13. x5 20, s 5 4, n 5 36, m0 5 22
3. What are the two conditions for using the t test for the _
14. x5 2, s 5 1, n 5 49, m0 5 3
mean? _
15. x5 10, s 5 3, n 5 64, m0 5 11
4. What three elements are needed for finding the t critical _
16. x5 80, s 5 5, n 5 64, m0 5 82
value? _
17. x5 106, s 5 10, n 5 81, m0 5 102
_
Practicing the Techniques 18. x5 99, s 5 4, n 5 100, m0 5 95
For each of the following hypothesis tests in Exercises 57
and 911, find the critical value tcrit, and sketch the critical For Exercises 1924, estimate the p-value. Assume normality.
region under the normal curve. Assume normality. 19. H0: m # 100 vs. Ha: m 100, n 5 8, tdata 5 2.5
5. H0: m $ 40 vs. Ha: m , 40, n 5 12, a 5 0.10 20. H0: m # 100 vs. Ha: m 100, n 5 8, tdata 5 2.0
6. H0: m $ 40 vs. Ha: m , 40, n 5 12, a 5 0.05 21. H0: m 5 210 vs. Ha: m 10, n 5 18, tdata 5 2.0
7. H0: m $ 40 vs. Ha: m , 40, n 5 12, a 5 0.01 22. H0: m 5 210 vs. Ha: m 10, n 5 18, tdata 5 0
8. Describe what happens to the t critical value tcrit for the 23. H0: m $ 40 vs. Ha: m 40, n 5 12, tdata 5 22.5
left-tailed tests in Exercises 57 as the level of significance a 24. H0: m $ 40 vs. Ha: m 40, n 5 12, tdata 5 24.0
decreases. For Exercises 2530, assess the strength of the evidence
9. H0: m 5 210 vs. Ha: m 210, n 5 18, a 5 0.10 against the null hypothesis.
10. H0: m 5 210 vs. Ha: m 210, n 5 18, a 5 0.05 25. Exercise 19 26. Exercise 20
11. H0: m 5 210 vs. Ha: m 210, n 5 18, a 5 0.01 27. Exercise 21 28. Exercise 22
29. Exercise 23 30. Exercise 24
Section 9.4 Exercises 59
For Exercises 3134, do the following. 40. In Example 9.22, we have n 5 10, s 5 0.549343042,
a. State the hypotheses and the rejection rule using the m0 5 1.6, and tdata 2.42, where m represents the population
p-value method. mean nursing index.
b. Calculate the test statistic tdata. 41. In Example 9.23, we have n 5 17, s 5 230, m0 5 400,
c. Find the p-value. (Use technology or estimate the and tdata 1.91, where m represents the population mean
p-value.) math journal price.
d. State the conclusion and the interpretation. 42. In the Case Study on page 55, we have n 5 18, s 5
31. A random sample of size 9 from a normal population 0.1234600663, m0 5 1.6180339887, and tdata 21.18,
_
yields x
5 1 and s 5 0.5. Researchers are interested in where m represents the population mean length-to-width
finding whether the population mean differs from 0, using ratio of Shoshone beaded rectangles.
level of significance a 5 0.05.
32. A random sample of size 400 from a population with Applying the Concepts
an unknown distribution yields a sample mean of 230 and a Internet Response Times. Use the following information for
sample standard deviation of 5. Researchers are interested Exercises 4345. The Web site www.Internettrafficreport
in finding whether the population mean is greater than 200, .com monitors Internet traffic worldwide and reports on the
using level of significance a 5 0.05. response times of randomly selected servers.
33. A random sample of size 100 from a population with an 43. On July 13, 2004, the Web site reported the following
_
unknown distribution yields x5 27 and s 5 10. Researchers response times to Asia, in milliseconds:
are interested in finding whether the population mean is less 441 271 283 244 789 211 205
than 28, using level of significance a 5 0.05. 148 330 886 238 197 187
34. A random sample of size 16 from a normal population We are interested in testing whether the population mean
_
yields x
5 2.2 and s 5 0.3. Researchers are interested in response time is slower than 200 milliseconds, using a t test
finding whether the population mean differs from 2.0, using and level of significance a 5 0.05.
level of significance a 5 0.01. a. Does Case 2 apply?
b. Here is a boxplot of the data. Does Case 1 apply?
For Exercises 3538, do the following. (Hint: The boxplot is right-skewed and the normal
a. State the hypotheses. distribution is symmetric.)
b. Calculate the t critical value tcrit and state the rejection c. Can we proceed with the t test? Explain.
rule. Also, sketch the critical region.
c. Find the test statistic tdata.
d. State the conclusion and the interpretation.
35. A random sample of size 25 from a normal population 44. On July 13, 2004, the Web site reported the following
_
yields x
5 104 and s 5 10. Researchers are interested in response times to North America, in milliseconds:
finding whether the population mean exceeds 100, using 100 53 67 37 59 75 87
level of significance a 5 0.01. 88 78 70 58 71 77 69
36. A random sample of size 100 from a population with We are interested in testing whether the population mean
an unknown distribution yields a sample mean of 25 and a response time is slower than 60 milliseconds, using a t test
sample standard deviation of 5. Researchers are interested in and a 5 0.05.
finding whether the population mean is less than 24, using a. Does Case 2 apply?
level of significance a 5 0.05. b. Here is a normal probability plot of the data. Does
37. A random sample of size 36 from a population with an Case 1 apply?
_
unknown distribution yields x5 10 and s 5 3. Researchers c. Can we proceed with the t test? Explain.
are interested in finding whether the population mean differs
from 9, using level of significance a 5 0.10. 100
38. A random sample of size 16 from a normal population 95
_ 90
yields x
5 995 and s 5 15. Researchers are interested in
80
finding whether the population mean is less than 1000, using 70
Percentage
_
b. tdata (though still smaller than x
)? Otherwise, everything else
c. tcrit remains unchanged. Describe how this change would affect
d. The p value the following, if at all.
Section 9.4 Exercises 61
60
50
Marriage Age. Use the following information Exercises 5659.
40 The U.S. Census Bureau reported that the median age at first
30 marriage for females in 2003 was 25.3. Lets assume that the
20 mean age at first marriage among females in 2003 was 26 years
10 old. A random sample of 157 females this year yielded a
5
1
mean age at first marriage of 26.5 with a standard deviation of
1000 1500 2000 2500 3000 3500 4000 4 years. A histogram of the ages was decidedly not normal.
Tuition 56. Answer the following.
a. Is it appropriate to apply the t test for the mean? Why
Figure 9.57 Normal probability plot.
or why not?
Test of mu = 2272 vs not = 2272
b. Calculate the standard error of the mean.
Variable N Mean StDev SE Mean 95% CI T P
c. Compute tdata. Interpret tdata using the standard error.
tuition 10 2538.92 404.75 127.99 (2249.38, 2828.46) 2.09 0.067 d. Is there evidence that the population mean age at first
marriage for females has changed since 2003? Test
Figure 9.58 Minitab t test output.
using the critical-value method at level of significance
53. Analysts are interested in whether the population mean a 5 0.10.
tuition and fees this year have increased. 57. Test using the critical-value method at level of
a. Is it appropriate to apply the t test for the mean? Why significance a 5 0.10 whether the population mean age at
or why not? first marriage for females has increased since 2003.
b. What are the sample size, sample mean cost, and 58. Challenger Exercise.
sample standard deviation of the cost? a. Compare your conclusions in the previous two
c. Verify the value for the standard error of the mean. exercises. Note that we have concluded that there
d. Find tdata in the Minitab output. Interpet tdata using the is insufficient evidence that the population mean
standard error. marriage age has changed, but that there is evidence
e. It appears that the data analyst who produced the that the population mean marriage age has increased.
Minitab printout asked for the wrong hypothesis test. How can the mean marriage age have increased
How can we tell? without changing? Explain what is going on here, in
54. Refer to your work in the previous exercise. terms of either critical regions or p-values.
a. Test whether the population mean tuition and fees b. For which of the two hypothesis tests is the test
have increased using the p-value method and level of statistic closer to the critical value?
significance a 5 0.05. How can we use the p-value 59. Answer the following.
on the Minitab printout to find the p-value needed for a. Find or estimate the p-value for the test in Exercise 56.
this right-tailed hypothesis test? b. Assess the strength of the evidence against the null
b. Compare the conclusion from (a) with the conclusion hypothesis.
we would have gotten had we not noticed that the data c. Is the p-value closer to 0.05 or 0.10? Why?
analyst performed the wrong hypothesis test. What are d. Suppose a newspaper story referred to the study under
62 Chapter 9 Hypothesis Testing
the headline Mean Female Marriage Age Increasing. graphs for the total occupied housing units. What is
How would you respond? Is the headline supported or the sample mean? The sample standard deviation?
not supported by the data and the hypothesis test? Comment on the symmetry or skewness of the data set.
60. New York Towns. Work with the New York data set for c. Suppose we are using the data in this data set as a
the following. sample of the total occupied housing units of all
a. How many observations are in the data set? How the counties in the southwestern United States. Use
many variables? technology to test at level of significance a 5 0.05
b. Use technology to explore the variable tot_pop, whether the population mean total occupied housing
which lists the population for each of the towns and units for these counties differs from 40,000.
cities in New York with at least 1000 people. 62. Calories. Work with the Nutrition data set.
c. Suppose we are using the data in this data set as a a. How many observations are in the data set? How
sample of the population of all the towns and cities many variables?
in the northeastern United States with at least 1000 b. Use technology to explore the variable calories,
people. Use technology to test at level of significance which lists the calories in the sample of foods.
a 5 0.05 whether the population mean population of Generate numerical summary statistics and graphs
these towns differs from 50,000. for the number of calories. What is the sample mean
61. Texas Towns. Work with the Texas data set for the number of calories? The sample standard deviation?
following. c. Use technology to test at level of significance a 5 0.05
a. How many observations are in the data set? How whether the population mean number of calories per
many variables? food item differs from 300.
b. Use technology to explore the variable tot_occ, which d. Use technology to test at level of significance a 5
lists the total occupied housing units for each county 0.05 whether the population mean number of calories
in Texas. Generate numerical summary statistics and per food item differs from 320.
it has a sampling distribution. We learned that the mean of this distribution is p, and
the standard deviation of the sampling distribution (the measure of variability) is
________
p (1 2 p)
sp5 ________
n
We learned that this sampling distribution of p is approximately normal whenever
both of the following conditions are met: np $ 5 and n(1 2 p) $ 5. Just as with the
Z test for the mean, in the Z test for the proportion the null hypothesis will include
a certain hypothesized value for the unknown parameter, which we call p0. For
example, the hypotheses for the two-tailed test have the following form:
p (1 2 p )
p(1 2 p)
s5 ________
n 5 _________
0 0
n
p
Spread determined by
p0 (1 p0)
n
Figure 9.59
Sampling distribution of
p
, assuming H0 is true.
Centered at p0 p
Extreme value of p
the hypothesized mean m0. In this section, Zdata will represent the number of stan-
dard deviations (sp) the sample proportion p
lies above or below the hypothesized
proportion p0.
The test statistic used for the Z test for the proportion is
p
2 p0 ___________
p
2 p0
Zdata 5 ______
sp 5 _________
p (1 2 p0)
_________
0
n
where p
is the observed sample proportion of successes, p0 is the value of
p hypothesized in H0, and n is the sample size. Zdata represents the number
of standard deviations (sp) the sample proportion p
lies above or below the
hypothesized proportion p0. Extreme values of p will be associated with
extreme values of Zdata.
Just as for hypothesis tests for the mean, there are three possible forms of hy-
pothesis tests for the proportion, as shown in Table 9.13.
Table 9.13 The three possible forms for the hypotheses for a test for p
p (1 2 p )
_________
0 0
n
Step 3 Find the p-value. Either use technology to find the p-value, or
calculate it using the form in Table 9.4 that corresponds to your hypotheses.
Step 4 State the conclusion and the interpretation. If the p-value is less
than a, then reject H0. Otherwise do not reject H0. interpret your conclusion
so that a nonspecialist can understand.
Note that the p-value has precisely the same definition and behavior as in the
Z test for the mean. That is, the p-value is roughly a measure of how extreme your
value of Zdata is and takes values between 0 and 1, with small values indicating ex-
treme values of Zdata.
9.5 Z Test for the Population Proportion p 65
D eveloping
Your Statistical Sense
The Difference Between the p-Value and the Population Proportion p
Be careful to distinguish between the p-value and the population proportion p. The
latter represents the population proportion of successes for a binomial experiment and
is a population parameter. The p-value is the probability of observing a value of Zdata
at least as extreme as the Zdata actually observed. The p-value depends on the sample
data, but the population proportion p does not depend on the sample data. n
1824 had an accident rate of 12%, meaning that on average 12 out of every 100 young
drivers per year had an accident. A more recent study examined 1000 young drivers
aged 1824 and found that 134 had an accident this year. We are interested in whether
the population proportion of young drivers having accidents has increased since 2003,
using the p-value method with level of significance a 5 0.05.
Solution
First we check that both of our normality conditions are met. Since we are interested in
whether the proportion has increased from 12%, we have p0 5 0.12.
The normality conditions are met and we may proceed with the hypothesis test.
Step 1 State the hypotheses and the rejection rule. Our hypotheses are
H0 : p # 0.12 versus Ha : p . 0.12
where p represents the population proportion of young people aged 1824 who had an
accident. We reject the null hypothesis if the p-value is less than a 5 0.05.
66 Chapter 9 Hypothesis Testing
p (1 2 p )
(0.12)(0.88)
sp5 _________
0 n 0 5 __________
0.0103
1000
p0 ___________
p 2 p0
p 0.134 2 12
Zdata 5 _____
s
5 _________ 5 _____________
1.36
___________
p p (1 2 p ) (0.12)(0.88)
___________
_________
0 n 0
1000
Step 4 State the conclusion and the interpretation. Since the p-value is
not less than a 5 0.05, we do not reject H0. There is insufficient evidence that
the population proportion of young people aged 1824 who had an accident has
increased. n
D eveloping
Your Statistical Sense
What About When the p-Value Is Close to a?
The sampling distribution of p for Example 9.27 is shown in Figure 9.60. Note that
the sample proportion p 5 0.134 is not very extreme, but it is not safely snug in the
middle either. It is a close call whether to consider this sample proportion extreme,
since our p-value 0.0869 is not much larger than a 5 0.05. But who says that a
needs to be 0.05? If a were 0.10, then our conclusion would be reversed. Perhaps
we should simply report the strength of evidence against the null hypothesis using
Table 9.5. In this case, our p-value of 0.0869 provides mild evidence against the null
hypothesis that the population proportion of young people having an accident has
not increased. n
4000
3000
Frequency
2000
0
0.08 0.09 0.10 0.11 0.12 0.13 0.14 0.15 0.16 0.17
p = 0.134
W
H AT I F
W
?
hat If Scenario What If More Young Drivers Were Having Accidents?
To answer the following question, try to think about the relationships among the vari-
ous statistics rather than doing all the calculations over again. Consider Example 9.27,
where the level of significance a was 0.05. Suppose our sample proportion p of young
drivers having accidents was somewhat higher than 12%. Otherwise, everything else in
the example is the same. Describe how this change would affect the following.
a. sp
b. Zdata
c. The p-value
d. a
e. The conclusion
Solution ___________
a. Since sp5 p0 (1 2 p0)/n
depends only on p0 and n, and not on p , the stan-
dard deviation spis unaffected by an increase in p .
b. Since p is greater than p0, increasing p increases the difference p
2 p0, the
numerator of Zdata. Then, since sp(the denominator of Zdata) is unchanged, Zdata itself
increases.
c. Since Zdata increases, the p-value, which measures (in this case) the area to the
right of Zdata, decreases.
d. Since the level of significance a is not affected by sample characteristics, a
leaves a unaffected.
change in p
e. Previously, our p-value was 0.0869, which was not less than a 5 0.05, so
that we did not reject H0. From (c), we know that the p-value decreases, but we do not
know by how much. Therefore, the answer cannot be determined. It is possible that our
conclusion may change from nonrejection to rejection, but we are not sure. We would
have to specify the new value of p and recalculate.
The normality conditions are met and we may proceed with the hypothesis test.
Step 1 State the hypotheses and the rejection rule. From Example 9.26, our
hypotheses are
H0: p # 0.01 versus Ha: p . 0.01
where p represents the population proportion of American Internet users who are mar-
ried or in a long-term relationship and who met on a blind date or through a dating
service. We will reject H0 if p-value , 0.05.
68 Chapter 9 Hypothesis Testing
Step 2 Find Zdata. We use the instructions supplied in the Step-by-Step Technology
Guide at the end of this section. Figure 9.61 shows the TI-83/84 results from the Z test
for p, and Figure 9.62 shows the results from Minitab.
Form of Ha:
Zdata
p-value
Sample proportion p
Sample size n
p p (1 2 p ) (0.01)(0.99)
_________
0 0 __________
n
500
Step 3 Find the p-value. From Figures 9.61, 9.62, and 9.63, we have
p-value5 P(Z . 1.348399725) 5 0.0887649866
p-value =
P(Z > 1.3484399725)
0 Zdata = 1.3484399725
Figure 9.63
Step 4 State the conclusion and interpretation. Since p-value 0.08876 is not
less than our level of significance a 5 0.05, we do not reject H0. There is insufficient
evidence that the population proportion of American Internet users who are married or
in a long-term relationship and who met on a blind date or through a dating service has
increased since 2006. n
Table 9.14 Table of critical values Zcrit for common values of the level of significance a
Form of Hypothesis Test
Right-tailed Left-tailed Two-tailed
H0: p # p0 H0 : p $ p0 H0: p 5 p0
Level of significance a Ha: p . p0 Ha: p , p0 Ha: p p0
0.10 Zcrit 5 1.28 Zcrit 5 21.28 Zcrit 5 1.645
0.05 Zcrit 5 1.645 Zcrit 5 21.645 Zcrit 5 1.96
0.01 Zcrit 5 2.33 Zcrit 5 22.33 Zcrit 5 2.58
the proportion is exactly equivalent to the p-value method for the Z test for the
mean. Therefore, the normality conditions under which you can use the Z test
critical-value method are the same as the conditions for using the Z test p-value
method, namely, np0 $ 5 and n(1 2 p0) $ 5. In the critical-value method, we
compare one Z-value (Zdata) with another Z-value (Zcrit) rather than comparing the
p-value with a. You can use Table 9.14 to find the critical values Zcrit (this table is
the same as Table9.6 except that the ms have been changed to ps). To find the
rejection rules or the critical regions, you can use Table 9.15 (which is the same
as Table 9.7 except that the ms have been changed to ps).
The Z test for proportions is essentially the same as the Z test for means. They
differ only in that Zdata is, of course, calculated differently, and the hypotheses and
interpretations differ.
Step 1 State the hypotheses. Use one of the forms from Table 9.15. Clearly
describe the meaning of p.
Step 2 Find Zcrit and state the rejection rule. Use Tables 9.14 and 9.15.
Step 3 Find Zdata. Either use technology to find the value of Zdata or calculate
Zdata as follows:
p
p0 ___________p
2 p0
Zdata 5 ______
sp
5 _________
p (1 2 p )
_________
0 0
n
Step 4 State the conclusion and the interpretation. If Zdata falls in the critical
region, then reject H0. Otherwise, do not reject H0. Interpret the conclusion so
that a nonspecialist can understand.
70 Chapter 9 Hypothesis Testing
p (1 2 p )
(0.21)(0.79)
sp5 _________
0 n 0 5 __________
0.020365
400
Thus, our test statistic is
2 p0 ___________
p p0
p
Zdata 5 ______
s
5 0.195
5 _____________ 0.21
2
20.74
___________
p p (1 2 p ) (0.21)(0.79)
_________ __________
0
n
0
400
p = 0.195
Figure 9.64
9.5 Z Test for the Population Proportion p 71
Step 4 State the conclusion and the interpretation. Since Zdata 5 0.74 is not
greater than 1.645 and not less than 1.645, we do not reject H0. There is insufficient
evidence that the population proportion of Americans who smoke tobacco has changed
since 2006. n
Use the confidence interval to test, using level of significance a 5 0.05, whether the
population proportion differs from
a. 0.85
b. 0.90
c. 0.95
Solution
There is equivalence between a 100(1 2 a)% confidence interval for p and a two-tailed
test for p with level of significance a. Values of p0 that lie outside the confidence in-
terval lead to rejection of the null hypothesis, while values of p0 within the confidence
interval lead to not rejecting the null hypothesis. Figure 9.65 illustrates the 95% con-
fidence interval for p.
Lower Bound = 0.88 Upper Bound = 0.94
Figure 9.65 Reject H0 for values p0 that lie outside the interval (0.88, 0.94).
TI-83/84
Step 1 Press STAT, highlight TESTS, and press ENTER.
Step 2 Press 5 (for 1-PropZTest; see Figure 9.66).
Step 3 For p0, enter the value of p0, 0.01.
Step 4 For x, enter the number of successes, 8.
Step 5 For n, enter the number of trials 500.
Step 6 For prop, enter the form of Ha. Here we have a right-
tailed test, so highlight .p0 and press ENTER (see Figure 9.67).
Figure 9.66 Figure 9.67
Step 7 Highlight Calculate and press ENTER. The results are
shown in Figure 9.61 in Example 9.28.
EXCEL
WHFStat Macros Step 4 Enter the Number of successes 8.
Step 1 Enter the data into column A. (If you have only the Step 5 Enter the Sample size 500.
summary statistics, go to Step 2.) Step 6 Enter the Testing Proportion, p05 0.01.
Step 2 Load the WHFStat Macros. Step 7 Select your Confidence level, which should be 1 2 a.
Step 3 Select Add-Ins . Macros . Testing a Proportion . Here, because a 5 0.05, we select 95%.
One Sample. Step 8 Click OK.
MINITAB
If you have the summary statistics: a. Choose your Confidence Level as 100(1 a ). Our level of
Step 1 Click Stat . Basic Statistics . 1 Proportion. significance a here is 0.05, so the confidence level is 95.0.
Step 2 Click Summarized Data. b. Enter 0.01 for the Test Proportion.
Step 3 Enter the Number of trials 500 and the Number of c. Select Greater than for the Alternative.
Events 8. d. Check Use test and interval based on normal distribution.
Step 4 Click Options. Step 5 Click OK and click OK again. The results are shown in
Figure 9.62 in Example 9.28.
1
The essential idea about hypothesis testing for the Zdata represents the number of standard deviations (sp) the
proportion may be described as follows. When the sample sample proportion p lies above or below the hypothesized
is unusual or extreme in the sampling
proportion p proportion p0. Extreme values of p will be associated with
distribution of pthat is based on the assumption that H0 is extreme values of Zdata . The Z test for the proportion may
correct, we should reject H0. Otherwise, there is insufficient be performed using either the p-value method or the
evidence against H0, which leads us to the conclusion that we critical-value method. For the p-value method, we reject
should not reject H0. H0 if the p-value is less than a.
2
The test statistic used for the Z test for the proportion is
3 For the critical-value method, we compare the values of
2 p0 __________
p 2 p0
p Zdata and Zcrit. If Zdata falls in the critical region, we reject H0.
Zdata 5 ______
s
5
p p0(1 p0)
________
n
4
We can use a single 100(1 2 a)% confidence interval for
p to help us perform any number of two-tailed hypothesis
is the observed sample proportion of successes, p0 is
where p tests about p.
the value of p hypothesized in H0, and n is the sample size.
Section 9.5 Exercises 73
Section 9.5Exercises
Clarifying the Concepts 27. Test whether the population proportion differs from 0.5.
1. What is the difference between p and p? A random sample of size 900 yields 475 successes.
2. What is the standard deviation for the sampling 28. Test whether the population proportion exceeds 0.9. A
? What does it mean? (Hint: Use the concept
distribution of p random sample of size 1000 yields 925 successes.
.)
of variability of the statistic p
For Exercises 2931, do the following.
3. Explain the essential idea about hypothesis testing for the
a. Check the normality conditions.
proportion.
b. State the hypotheses.
4. Explain what p0 refers to.
c. Find Zcrit and the rejection rule.
5. What possible values can p0 take?
d. Calculate Zdata.
6. What is the difference between p and a p-value?
e. Compare Zcrit with Zdata. State the conclusion and the
interpretation.
Practicing the Techniques
29. Test whether the population proportion is less than
For the following samples, find sp, the standard deviation of
0.5. A random sample of size 225 yields 100 successes.
. Let p0 5 0.5.
the sampling distribution of p
Let a 5 0.05.
7. A sample of size 50 yields 20 successes.
30. Test whether the population proportion differs from 0.3.
8. A sample of size 50 yields 30 successes.
A random sample of size 100 yields 25 successes. Let a 5 0.01.
9. A sample of size 100 yields 70 successes.
31. Test whether the population proportion exceeds 0.6. A
10. A sample of size 100 yields 30 successes.
random sample of size 400 yields 260 successes. Let a 5 0.05.
11. A sample of size 1000 yields 900 successes.
12. A sample of size 1000 yields 100 successes.
Applying the Concepts
13. What kind of pattern do we observe in these results?
32. Baptists in America. A study reported in 2001 that
For Exercises 1417, find the value of the test statistic Zdata 17.2% of Americans identified themselves as Baptists.17
for a right-tailed test with p0 5 0.4. A survey of 500 randomly selected Americans this year
14. A sample of size 50 yields 20 successes. showed that 85 of them were Baptists. If appropriate,
15. A sample of size 50 yields 25 successes. test using the p-value method at level of significance a
16. A sample of size 50 yields 30 successes. 5 0.10 whether the population proportion of Americans
17. A sample of size 50 yields 35 successes. who are Baptists has changed since 2001.
18. What kind of pattern do we observe in the value of Zdata 33. Births to Unmarried Women. The National Center
for a right-tailed test as the number of successes becomes for Health Statistics reported: Childbearing by unmarried
more extreme? women increased to record levels for the Nation in 2005.18
In that year, 36.8% of all births were to unmarried women.
For Exercises 1923, find the value of the test statistic Zdata
Suppose that a random sample taken this year of 1000
for a two-tailed test with p0 5 0.5.
births showed 380 to unmarried women. If appropriate, test
19. A sample of size 80 yields 20 successes.
whether the population proportion has increased since 2005,
20. A sample of size 80 yields 30 successes.
using level of significance a 5 0.05.
21. A sample of size 80 yields 40 successes.
34. Twenty-Somethings. According to the U.S. Census
22. A sample of size 80 yields 50 successes.
Bureau, 7.1% of Americans living in 2004 were between
23. A sample of size 80 yields 60 successes.
the ages of 20 and 24. Suppose that a random sample
24. What kind of pattern do we observe in the value of Zdata
of 400 Americans taken this year yields 35 between
as the sample proportion approaches p0?
the ages of 20 and 24. If appropriate, test whether
For Exercises 2528, do the following. the population proportion of Americans aged 2024
a. Check the normality conditions. is different from the 2004 proportion. Use level of
b. State the hypotheses and the rejection rule for the significance a 5 0.01.
p-value method, using level of significance a 5 0.05. 35. Nonmedical Pain Reliever Use. The National
c. Find Zdata. Survey on Drug Use and Health reported that, from
d. Find the p-value. 2002 to 2005, 4.8% of persons aged 12 or older used
e. Compare the p-value with a 5 0.05. State the a prescription pain reliever nonmedically. 19 Suppose
conclusion and the interpretation. that a random sample of 900 persons aged 12 or older
25. Test whether the population proportion exceeds 0.4. A taken this year found 54 that had used a prescription
random sample of size 100 yields 44 successes. pain reliever nonmedically. If appropriate, test whether
26. Test whether the population proportion is less than 0.2. A the population proportion has increased, using level of
random sample of size 400 yields 75 successes. significance a 5 0.01.
74 Chapter 9 Hypothesis Testing
36. Ethnic Asians in California. A research report states d. Where in the sampling distribution would the value of
that, in 2005, 12.3% of California residents were of Asian lie? Near the tail? Near the center?
p
ethnicity.20 Suppose that this year, a random sample of 400 e. Is there evidence that the population proportion of
California residents yields 52 of Asian ethnicity. We are eighth-graders who used alcohol has changed? Test
interested in whether the population proportion of California using the p-value method at level of significance
residents of Asian ethnicity has risen since 2005. a 5 0.05.
a. Is it appropriate to perform the Z test for the 41. Hispanic Household Income. Refer to Exercise 39.
proportion? Why or why not? a. Construct the hypotheses.
b. What is sp? What does this number mean? b. Find the rejection rule.
c. How many standard deviations does p lie below p0? Is c. Calculate the value of the test statistic Zdata.
this extreme? Why or why not? d. Calculate the p-value.
d. Where in the sampling distribution would the value of e. Assess the strength of evidence against the null
lie? Near the tail? Near the center?
p hypothesis.
37. Affective Disorders Among Women. What do you f. Formulate and interpret your conclusion.
think is the most common nonobstetric (not related to 42. Eighth-Grade Alcohol Use. Refer to Exercise 40.
pregnancy) reason for hospitalization among 18- to 44- a. Evaluate the strength of evidence against the null
year-old American women? According to the U.S. Agency hypothesis.
for Healthcare Research and Quality (www.ahrq.gov), this b. Suppose that we decide to carry out the same Z test
is the category of affective disorders, such as depression. however, this time using the critical-value method.
In 2002, 7% of hospitalizations among 18- to 44-year-old Without actually performing the test, what would the
American women were for affective disorders. Suppose that conclusion be and why?
a random sample taken this year of 1000 hospitalizations of c. Suppose that a newspaper headline reported Alcohol
18- to 44-year-old women showed 80 admitted for affective Use Down Among Eighth-Graders. How would
disorders. We are interested in whether the population you respond? Does your hypothesis test support this
proportion of hospitalizations for affective disorders has headline?
H AT I F
changed since 2002. Test using the p-value method and level W
of significance a5 0.10.
What if we cut the sample size down to between 50 and 60
38. Ethnic Asians in California. Refer to Exercise 36. Is
eighth-graders, and we cut the sample number of successes
there evidence that the population proportion of California
as well, so that the sample proportion p stayed the same?
residents of Asian ethnicity has risen since 2005? Test using
Suppose that everything else stayed the same. Describe what
the p-value method at level of significance a 5 0.05.
would happen to the following, and why.
39. Hispanic Household Income. The U.S. Census Bureau
a. sp
reports that, in 2002, 15.3% of Hispanic families had household
b. Zdata
incomes of at least $75,000. We are interested in whether the
c. The p-value
population proportion has changed, using the critical-value
d. a
method and level of significance a 5 0.01. Suppose that
e. The conclusion
a random sample of 100 Hispanic families taken this year
reported 23 with household incomes of at least $75,000.
Children and Environmental Tobacco Smoke at Home.
a. Is it appropriate to perform the Z test for the
Use the following information for Exercises 4448.
proportion? Why or why not?
The Environmental Protection Agency reported in 2005
b. Where will the sampling distribution of p be
that 11% of children aged 6 and under were exposed to
centered? What will be the shape of the distribution?
environmental tobacco smoke (ETS) at home on a regular
c. What is sp? What does this number mean?
basis (at least four times per week).22 A random sample
d. How many standard deviations does p lie below p0? Is
taken this year of 100 children aged 6 and under showed
this extreme? Why or why not?
that 6% of these children had been exposed to ETS at home
e. Where in the sampling distribution would the value of
on a regular basis.
lie? Near the tail? Near the center?
p
44. Answer the following.
40. Eighth-Grade Alcohol Use. The National Institute on
a. Is it appropriate to perform the Z test for the
Alcohol Abuse and Alcoholism reported that 45.6% of eighth-
proportion? Why or why not?
graders had used alcohol.21 A random sample of 100 eighth-
b. What is sp? What does this number mean?
graders this year showed that 41 of them had used alcohol.
c. How many standard deviations does p lie below p0? Is
a. Is it appropriate to perform the Z test for the
this extreme? Why or why not?
proportion? Why or why not?
d. Where in the sampling distribution would the value of
b. What is sp? What does this number mean?
lie? Near the tail? Near the center?
p
c. How many standard deviations does p lie below p0? Is
45. Test using the critical-value method at level of
this extreme? Why or why not?
significance a 5 0.05 whether the population proportion
9.6 Chi-Square Test for the Population Standard Deviation s 75
?
b. Suppose that a newspaper headline reported Second-
hand Smoke Prevalence Down. How would you remains the same.
of successes are doubled, so that p
respond? Does your inference support this headline? Otherwise, everything else is the same as in the original
47. Refer to your work in Exercise 45. example. Describe how this change would affect the
a. Test using the p-value method at level of significance following.
a 5 0.10 whether the population proportion of a. sp
children aged 6 and under exposed to ETS at home b. Zdata
on a regular basis has decreased since 2005. c. The p-value
b. How do you explain the different conclusions you got d. a
in the two hypothesis tests above? Is the difference in e. The conclusion
H AT I F
W
your conclusions because you used the critical-value 50. Suppose that the hypothesized proportion p0 was
?
method in one test and the p-value method in the other? no longer 0.12. Instead, p0 takes some value between 0.12
If not, explain how you got different conclusions. and 0.134. Otherwise, everything else is the same as in the
c. Evaluate the strength of evidence against the null original example. Describe how this change would affect the
hypothesis. following.
H AT I F
a. sp
W
proportion p was decreased, but everything else stayed the b. Zdata
same? Describe what would happen to the following, and why. c. The p-value
a. sp d. a
e. The conclusion
b. Zdata
c. The p-value
3 Perform and interpret the x test for s using the critical-value method.
2
that wishes to ensure the safety of a particular new drug would perform statistical
tests to make sure that the drugs effect was consistent and did not vary widely from
patient to patient. The biostatisticians employed by the company would therefore
perform a hypothesis test to make sure that the population standard deviation s was
not too large. Pharmaceutical companies rank as among the largest employers of
statisticians in the world, since they are required to demonstrate that their drugs are
both safe and effective.
Recall the following from Section 8.4.
x2 (Chi-Square) Distribution
If we take a random sample of size n from a normal population with mean m
and standard deviation s, then the statistic
(n 2 1)s2
x2 5 ________
s2
follows a x2 distribution with n 2 1 degrees of freedom, where s2 represents
the sample variance.
Just like our hypothesis tests about the population mean m and the population pro-
portion p, there are three forms for the hypothesis test about the population standard
deviation s (Table 9.16). The notation s0 refers to the value of s to be tested.
Table 9.16 The three possible forms for the hypotheses for a test for s
Under the assumption that H0 is true, the x2 statistic takes the following form:
(n 21)s2
________
5
x2data
s20
For the hypothesis test about s, our test statistic is x2data. It is called x2data because the
values of n 2 1 and s2 come from the observed data. The test statistic x2data takes a
moderate value when the value of s2 is moderate assuming H0 is true, and x2data takes
an extreme value when the value of s2 is extreme assuming H0 is true. This leads us
to the following.
The Essential Idea About Hypothesis Testing for the Standard Deviation
When the observed value of x2data is unusual or extreme on the assumption
that H0 is true, we should reject H0. Otherwise, there is insufficient evidence
against H0, and we should not reject H0.
The p-value for the hypothesis test about s may be found using Table 9.17.
0 2
c data 0 c 2data
Example 9.31 x2 test for s using the p-value method and technology
Power plants around the country are retooling in order to consume biomass instead of
or in addition to coal. The following table contains a random sample of 10 such power
plants and the amount of biomass they consumed in 2006, in trillions of Btu (British
thermal units).23 Test whether the population standard deviation is greater than 2 trillion
Btu using level of significance a 5 0.05.
78 Chapter 9 Hypothesis Testing
Biomass consumed
Power plant Location (trillions of Btu)
Georgia Pacific Naheola Mill Choctaw, Alabama 13.4
Jefferson Smurfit Fernandina Beach Nassau, Florida 12.9
International Paper Augusta Mill Richmond, Georgia 17.8
Gaylord Container Bogalusa Washington, Louisiana 15.1
Escanaba Paper Company Delta, Michigan 19.5
Weyerhaeuser Plymouth NC Martin, North Carolina 18.6
International Paper Georgetown Georgetown, South 13.8
Mill Carolina
Bowater Newsprint Calhoun McMinn, Tennessee 10.6
Operation
Covington Facility Covington, Virginia 12.7
Mosinee Paper Marathon, Wisconsin 17.6
Alamy
Solution
The normal probability plot in Figure 9.68 indicates acceptable normality, allowing us
to proceed with the hypothesis test.
100 Step 1 State the hypotheses and the rejection rule. The phrase
95
90
greater than indicates that we have a right-tailed test. The ques-
80 tion Greater than what? tells us that s0 5 2, giving us
70
Percentage
60
H0: s # 2 versus Ha: s . 2
50
40
30 We reject H0 if the p-value is less than level of significance
20 a 5 0.05.
10
5
1 Step 2 Find x2data. The TI-83/84 descriptive statistics in Fig-
0 5 10 15 20 25 30 ure 9.69 tell us that the sample variance is
Biomass consumed (trillions of Btu)
Thus,
( n 2 1)s2 ___________________
(10 2 1)2.9903548662
x2data 5 ________
5
20.12
2
s0 22
Step 3 Find the p-value. For our right-tailed test, Table 9.17 tells us that
Figure 9.69
p-value 5 P(x2 . x2data) 5 p(x2 . 20.12)
p-value = That is, the p-value is the area to the right of x2data 5 20.12, as shown in Figure 9.70.
P( c 2 > 20.12) To find the p-value, we use the instructions provided in the Step-by-Step Technology
Guide provided at the end of this section. The TI-83/84 results shown in Figure 9.71a
tell us that p-value 5 P(x2 . 20.12) 5 0.0171861114.
0 2
c data = 20.12 The Excel and Minitab results in Figures 9.71b and 9.71c agree with this p-value.
(Excel and Minitab do not exactly match the TI-83/84 p-value because they round the
Figure 9.70 p-values to fewer decimal places.) Instead of providing the p-value directly, Minitab
gives the area to the left of x2data: P(X 20.12) 5 0.982814. We therefore need to
9.6 Chi-Square Test for the Population Standard Deviation s 79
Chi-Square with 9 DF
x P( X <= x )
20.12 0.982814
Figure 9.71a TI-83/84 Figure 9.71b Excel results. Figure 9.71c Minitab results.
results.
Step 4 State the conclusion and the interpretation. Since p-value 5 0.0171861114
is less than a 5 0.05, we reject H0. There is evidence that the population standard de-
viation is greater than 2 trillion Btu. n
(n 21)s2
x2data 5 ________
s20
The x2 critical values in the right-tailed, left-tailed, or two-tailed tests use the
following notations: x2a, x212a, x2a/2, and x212a/2 (see Table 9.18). In each case, the
subscript indicates the area to the right of the x2 critical value. Find these values
just as you did in Section 8.4, using either technology or Table E, Chi-Square (x2)
Distribution, in the Appendix (page 000).
80 Chapter 9 Hypothesis Testing
Table 9.18 Critical values and rejection rules for the x2 test for s
Reject H0 if Reject H0 if
2
c data 2
Reject H0 if < c 1a 2
c data 2
< c 1 Reject H0 if
a/2
2
c data > c a2 2
c data 2
> c a/2
0 c a2 0 c 21a 0 c2 c 2a /2
1a /2
Alabama 48
Arkansas 37
Iowa 33
Massachusetts 50
Minnesota 45
Oregon 63
South Carolina 66
Utah 52
Solution
The normal probability plot indicates acceptable normality.
100
95
90
80
70
Percentage
60
50
40
30
20
10
5
1
0 10 20 30 40 50 60 70 80 90 100
Children without health insurance (1000s)
Step 1 State the hypotheses. The phrase differs from indicates that we have a
two-tailed test. The value s0 5 10 answers the question Differs from what? (Note
that s0 is 10, and not 10,000, since the data are expressed in thousands.) Thus, we have
our hypotheses:
H0: s 5 10 versus Ha: s 10
where s represents the population standard deviation of number of children living in
low-income households without health insurance.
Step 2 Find the x2 critical values and state the rejection rule. We have n 5 8, so
degrees of freedom 5 n 2 1 5 7. Since a is given as 0.05, a/2 5 0.025 and 12a/2 5
0.975. Then, from the x2 table (Figure 9.72), we have x2a/2 5 x20.025 516.013, and x212a/2 5
x20.975 5 1.690.
We will reject H0 if x
2data
is either greater than x
2a/2
5 16.013 or less than x
12a/2
2
5 1.690.
Step 3 Find x2data. The TI-83/84 descriptive statistics in Figure 9.73 tell us that the
sample variance is
s2 5 11.411147432
Thus,
(n 2 1)s2
_________ (8 2 1)11.411147432
__________________
2data
x 5 5 9.115
s20 102
Step 4 State the conclusion and the interpretation. In Step 2 we said that we
would reject H0 if x
2data
was either greater than 16.013 or less than 1.690. Since x
2data
5
9.115 is neither greater than 16.013 nor less than 1.69, we do not reject H0. There is
Figure 9.73 insufficient evidence that the population standard deviation of the numbers of children
living in low-income households without health insurance differs from 10,000. n
(lower bound, upper bound), and are interested in two-tailed hypothesis tests using
a of the form:
H0: s 5 s0 versus Ha: s s0
We will not reject H0 for values of s0 that lie between the lower bound and upper bound
of the confidence interval. And we will reject H0 for values of s0 that lie outside this
interval.
Construct a 95% confidence interval for the population standard deviation s of sodium
content. Test using level of significance a 5 0.05 whether s differs from the following.
a. 80 mg
b. 40 mg
Solution
Figure 9.74 is an excerpt from Minitab output that shows the 95% confidence interval
for s. (See the Step-by-Step Technology Guide at the end of Section 8.4, page 000.)
a. For the hypothesis test H0: s 5 80 versus Ha: s 80, s05 80 lies between the
Figure 9.74 lower bound 44.53 and the upper bound 81.50 of the confidence interval, and
we therefore do not reject H0. There is insufficient evidence that the population
standard deviation of sodium content differs from 80 mg.
b. For the hypothesis test H0: s 5 40 versus Ha: s 40, s0 5 40 lies outside
the confidence interval, and we therefore reject H0. There is evidence that the
population standard deviation of sodium content differs from 40 mg. n
We will use the information from Example 9.31 (page 77). The steps for finding the x2
critical values are given in the Step-by-Step Technology Guide at the end of Section 8.4
(page 000).
TI-83/84
Step 1 Enter the data into List L1. in Figure 9.71a. (Remember that this e is inserted by pressing
Step 2 Press 2nd . DISTR, then x2 cdf(, and press ENTER. 2nd, followed by the comma key.)
Step 3 On the home screen, enter the value of x 2data, comma, Step 4 Press ENTER. The results for Example 9.31 are shown
1e99, comma, degrees of freedom, close parenthesis, as shown in Figure 9.71a.
Section 9.6 Exercises 83
EXCEL
Step 1 Select cell A1. Click the Insert Function icon fx. enter the degrees of freedom. Excel displays the p-value in the
Step 2 For Search for a Function, type chidist and click OK. cell in the dialog box, as shown in Figure 9.71b.
Step 3 For x, enter the value of x2data, and for Deg_freedom,
MINITAB
Step 1 Click on Calc . Probability Distributions . Step 4 Minitab displays the area to the left of x2data in the
Chi-Square. session window, as shown in Figure 9.71c. To find the
Step 2 Select Cumulative probability, and enter the p-value, subtract this area from 1.
Degrees of freedom.
Step 3 For Input constant, enter the value of x2data and click
OK.
1
The essential idea about hypothesis testing for the test is valid only if we have a random sample from a normal
standard deviation s may be described as follows. When population.
the observed value of x2data is unusual or extreme on the
assumption that H0 is true, we should reject H0. Otherwise, 4
If we have a 100(12a)% confidence interval for s, of
there is insufficient evidence against H0, and we should not the form (lower bound, upper bound), and are interested in a
reject H0. two-tailed hypothesis test, using a, of the form
2
We can perform hypothesis tests to determine whether the H0: s 5 s0 versus Ha: s s0
population standard deviation s is greater than, less than, or
not equal to a certain value s0. Under the assumption that H0 we will not reject H0 for values of s0 that lie between the
is true, the x2 statistic takes the following form: lower bound and upper bound of the confidence interval.
(n 2 1)s2 We will reject H0 for values of s0 that lie outside this
x2data 5 ________
interval.
s20
3 The hypothesis test about s may be performed using the
p-value method or the critical-value method. Either way, the
Section 9.6Exercises
Clarifying the Concepts 8. Test whether the population standard deviation is less
1. Think of one instance where an analyst would be than 5.
interested in performing a hypothesis test about the 9. Test whether the population standard deviation differs
population standard deviation s. from 3.
2. What is the difference between s and s0? 10. Test whether the population standard deviation is greater
3. Does it make sense to test whether s , 0? Explain. than 100.
4. What condition must be fulfilled for us to perform a
For Exercises 1116, a random sample is drawn from a
hypothesis test about s?
normal population. Calculate x2data.
5. Explain how we can use a confidence interval to
11. We are testing whether s . 1 and have a sample of size
determine significance.
n 5 21 with a sample variance of s2 5 3.
6. In the previous exercise, what must be the relationship
12. We are testing whether s , 5 and have a sample of size
between a and the confidence level?
n 5 11 with a sample variance of s2 5 25.
Practicing the Techniques 13. We are testing whether s 3 and have a sample of size
For Exercises 710, construct the hypotheses. n 5 16 with a standard deviation of s 5 2.5.
7. Test whether the population standard deviation is greater 14. We are testing whether s . 10 and have a sample of size
than 10. n 5 14 with a standard deviation of s 5 12.
84 Chapter 9 Hypothesis Testing
15. We are testing whether s , 20, using level of significance Applying the Concepts
a 5 0.10, and have a sample of size n 5 8 and a sample 35. DDT in Breast Milk. Researchers compared the
variance of s2 5 350. amount of DDT in the breast milk of a random sample
16. We are testing whether s 5 and have a sample of size of 12 Hispanic women in Yakima Valley in Washington
n 5 26 with a standard deviation of s 5 5. State with the amount of DDT in breast milk in the general
U.S. population.25 They measured the standard deviation
For Exercises 1720, a random sample is drawn from a of the amount of DDT in the general population to be
normal population. Do the following. 36.5 parts per billion (ppb). Assume that the population is
a. Draw a x2 distribution and indicate the location of normally distributed. We are interested in testing whether
x2data. the population standard deviation of DDT level in the breast
b. Find the p-value and indicate the p-value in your milk of Hispanic women in Yakima Valley is greater than
distribution in (a). that of the general population, using level of significance
c. Compare the p-value with level of significance a 5 0.01.
a 5 0.05. State the conclusion and interpretation. a. State the hypotheses and the rejection rule.
17. The data in Exercise 11 b. The sample variance is s2 5 119,025. Calculate x2data.
18. The data in Exercise 12 c. Calculate the p-value.
19. The data in Exercise 13 d. Compare the p-value with a. State your conclusion.
20. The data in Exercise 14 e. Interpret your conclusion.
21. The data in Exercise 15 36. Tree Rings. Does the growth of trees vary more when the
22. The data in Exercise 16 trees are young? The International Tree Ring Data Base
collected data on a particular 440-year-old Douglas fir tree.26
For Exercises 2328, a random sample is drawn from a The standard deviation of the annual ring growth in the trees
normal population. The values of x2data for these exercises are first 80 years of life was 0.8 millimeters (mm) per year.
found in Exercises 1116. Find the critical value or values. Assume that the population is normal. We are interested in
23. We are testing whether s . 1, using a 5 0.05, and have testing whether the population standard deviation of annual ring
a sample of size n 5 21 and a sample variance of s2 5 3. growth in the trees later years is less than 0.8 mm per year.
Find x2a. a. State the hypotheses.
24. We are testing whether s , 5, using a 5 0.05, and have b. Find the critical value x212a for level of significance
a sample of size n 5 11 and a sample variance of s2 5 25. a 5 0.05.
Find x212a. c. The sample variance for a random sample of size
25. We are testing whether s 3, using a 5 0.05, and have 100 taken from the trees later years is s2 5 0.3136.
a sample of size n 5 16 and a standard deviation of s 5 2.5. Calculate x2data.
Find x2a/2 and x212a/2. d. Compare x2data with x212a. State your conclusion.
26. We are testing whether s . 10, using a 5 0.01, and have e. Interpret your conclusion.
a sample of size n 5 14 and a standard deviation of s 5 12. 37. Union Membership. The following table contains the
Find x2a. total union membership (in 1000s) for 7 randomly selected
27. We are testing whether s , 20, using a 5 0.10, and have states in 2006.27 Assume that the distribution is normal. We
a sample of size n 5 8 and a sample variance of s2 5 350. are interested in whether the population standard deviation
Find x212a. of union membership s differs from 30,000, using level of
28. We are testing whether s 50, using a 5 0.05, and have significance a 5 0.05.
a sample of size n 5 26 with a standard deviation of s 5 5.
Find x2a/2, and x212a/2. Florida 397
Indiana 334
For Exercises 2934, a random sample is drawn from a Maryland 342
normal population. The values of x2data for these exercises Massachusetts 414
were found in Exercises 1116. The critical values were Minnesota 395
found in Exercises 2328. Do the following. Texas 476
a. State the rejection rule. Wisconsin 386
b. Compare x2data with the critical value or values. State
the conclusion and interpretation. a. State the hypotheses and the rejection rule.
29. The data in Exercise 23 b. The sample variance is s2 5 2245.67. Calculate x2data.
30. The data in Exercise 24 c. Calculate the p-value.
31. The data in Exercise 25 d. Compare the p-value with a. State your conclusion.
32. The data in Exercise 26 e. Interpret your conclusion.
33. The data in Exercise 27 38. Fourth-Grade Feet. Suppose a childrens shoe
34. The data in Exercise 28 manufacturer is interested in estimating the variability of
Chapter 9 Formulas and Vocabulary 85
fourth-graders feet. A random sample of 20 fourth-graders the bottom of nonverbal reasoning tests and quantitative
feet yielded the following foot lengths, in centimeters.28 The reasoning tests.29 Suppose that the standard deviation for
normality of the data was verified in Example 8.12 (page 000). girls scores is known to be 50 points for a particular test and
We are interested in whether the population standard that the population of all scores is normal. A random sample
deviation of foot lengths s is less than 1 centimeter. of 101 boys has a sample variance of 2600. Test whether the
population standard deviation for boys exceeds 50 points,
22.4 23.4 22.5 23.2 23.1 23.7 24.1 21.0 21.6 20.9 using level of significance a 5 0.05. Use either the p-value
25.5 22.8 24.1 25.0 24.0 21.7 22.0 22.7 24.7 23.5 method or the critical-value method.
40. Heart Rate Variability. A reduction in heart rate
a. State the hypotheses. variability is associated with elevated levels of stress,
b. Find the critical value x212a for level of significance since the body continues to pump adrenaline after high-
a 5 0.05. stress situations, even when at rest.30 Suppose the standard
c. The sample variance is s2 5 1.638. Calculate x2data. deviation of heartbeats in the general population is 20 beats
d. Compare x2data with x212a. State your conclusion. per minute, and that the population of heart rates is normal.
e. Interpret your conclusion. A random sample of 50 individuals leading high-stress
39. Does Score Variability Differ by Gender? Recently, lives has a sample variance of 200 beats per minute. Test
researchers have been examining the evidence for whether using level of significance a 5 0.05 whether the population
there is greater variability in boys scores than girls standard deviation for those leading high-stress lives is lower
scores on cognitive abilities tests. For example, one study than that in the general population.
found that boys were overrepresented at both the top and
b. Use the confidence interval to test at level of 28. Test whether the population proportion is below 0.2.
significance a 5 0.05 whether the population mean A random sample of size 900 yields 160 successes. Let
amount of coffee dispensed by the old coffee machine a 5 0.05.
in the lobby differs from the following amounts, in 29. Test whether the population proportion is not equal to
ounces. 0.4. A random sample of size 100 yields 55 successes. Let
i. 6.9 a 5 0.01.
ii. 7.5 30. DSL Internet Service. The U.S. Department of
iii. 6.7 Commerce reports that, in 2003, 41.6% of Internet users
iv. 7 preferred DSL as their method of service delivery.32 A
random sample of 1000 Internet users this year shows
Section 9.4 350 who preferred DSL. If appropriate, test whether the
For Exercises 1921, find the critical value tcrit and sketch the population proportion who prefer DSL has decreased, using
critical region. Assume normality. level of significance a 5 0.05.
19. H0 : m # 100, Ha: m . 100, n = 8, a 5 0.10
20. H0 : m # 100, Ha: m . 100, n = 8, a 5 0.05 Section 9.6
21. H0 : m # 100, Ha: m . 100, n = 8, a 5 0.01 For Exercises 31 and 32, do the following.
22. Describe what happens to the t critical value Zcrit for a. State the hypotheses and the p-value rejection rule for
right-tailed tests as a decreases. a 5 0.05.
23. A random sample of size 16 from a normal b. Find x2data.
population yields a sample mean of 10 and a sample c. Find the p-value. Also, draw a x2 distribution and
standard deviation of 3. Researchers are interested in indicate x2data and the p-value.
finding whether the population mean differs from 9, using d. State the conclusion and the interpretation.
level of significance a = 0.10. Test using the critical-value 31. We are testing whether s , 35 and have a random
method. sample of size 8 with a sample variance of 1200.
24. A random sample of size 144 from an unknown 32. We are testing whether s 50 and have a random
population yields a sample mean of 45 and a sample sample of size 26 with a standard deviation of s 5 45.
standard deviation of 10. Researchers are interested in
For Exercises 33 and 34, do the following.
finding whether the population mean differs from 45, using
a. State the hypotheses.
level of significance a 5 0.10. Test using the estimated
b. Find the x2 critical value or values, and state the
p-value method.
rejection rule.
c. Find x2data. Also, draw a x2 distribution and indicate
Section 9.5
x2data and the x2 critical value or values.
For Exercises 25 and 26, do the following.
d. State the conclusion and the interpretation.
a. Check the normality conditions.
33. We are testing whether s . 6 and have a random
b. State the hypotheses and the rejection rule for the
sample of size 20 with a standard deviation of s 5 9. Let
p-value method, using level of significance a = 0.05.
a 5 0.05.
c. Calculate Zdata.
34. We are testing whether s 10 and have a random
d. Calculate the p-value.
sample of size 26 with a sample variance of 90. Let
e. State the conclusion and the interpretation.
a 5 0.05.
25. Test whether the population proportion differs from 0.7.
35. Prisoner Deaths in State Custody. The following
A random sample of size 144 yields 110 successes.
table contains the numbers of prisoners who died in state
26. Test whether the population proportion is less than
custody in 2005 for a random sample of 5 states.33 Assume
0.25. A random sample of size 100 yields 25 successes.
normality. Using a 5 0.01 and the p-value method, test
For Exercises 2729, do the following. whether the population standard deviation of prisoners who
a. Check the normality conditions. died in state custody differs from 50.
b. State the hypotheses.
c. Find Zcrit and the rejection rule. New York 171
d. Calculate Zdata. Pennsylvania 149
e. State the conclusion and the interpretation. Michigan 140
27. Test whether the population proportion exceeds 0.8. Ohio 121
A random sample of size 1000 yields 830 successes. Let Georgia 122
a 5 0.10.
Chapter 9 Quiz 89
Chapter 9 Quiz
True or False the population mean fee charged on such transactions since
1. True or false: It is possible that both the null and 2002.
alternative hypotheses are correct at the same time. a. Construct the hypotheses.
2. True or false: The conclusion you draw from performing b. Find the Z critical value.
the critical-value method for the Z test is the same as the c. Find the rejection rule.
conclusion you draw from performing the p-value method d. Calculate the value of the test statistic Zdata.
for the Z test. e. Formulate and interpret your conclusion.
3. True or false: We do not need the estimated p-value 12. ATM Fees. Refer to Exercise 11. If we performed the
method if we have access to a computer or calculator. same hypothesis test with the same a 5 0.05, but this time
using the p-value method, what would be our conclusion?
Fill in the Blank How do you know this?
4. To reject H0 when H0 is true is a Type ___________ error. 13. ATM Fees. Refer to Exercise 11. Which type of error
_
5. An extreme value of xis associated with a ___________ is it possible that we are making, a Type I error or a Type II
p-value. error? Which type of error are we certain we are not making?
6. The rejection rule for performing a hypothesis test using 14. Alcohol-Related Fatal Car Accidents. The National
the p-value method is to reject H0 when the p-value is less Traffic Highway Safety Commission keeps statistics on
than ___________. the mean years of potential life lost in alcohol-related
fatal automobile accidents. For males in 2001, the mean
Short Answer years of life lost was 32. That is, on average, males
7. Under what conditions may we apply the Z test for the involved in fatal drinking-and-driving accidents had their
population proportion? lives cut short by 32 years. We are interested in whether
8. What does a small p-value indicate with respect to the the population mean years of life lost has changed since
null hypothesis? A large p-value? 2001, using a t test and level of significance a 5 0.10. A
9. Does the value of Zdata change when the form of the random sample of 36 alcohol-related fatal accidents had a
hypothesis test changes (for example, left-tailed instead of mean years of life lost of 33.8, with a standard deviation
right-tailed)? of 6 years.
a. Is it appropriate to apply the Z test? Why or why not?
Calculations and Interpretations b. Construct the hypotheses.
10. Prison Sentence Length. The Bureau of Justice c. Find the t critical value.
Statistics reports that, in 2000, the mean prison sentence d. Find the rejection rule.
length for felons convicted on drug offenses was 47 months. e. Calculate the value of the test statistic tdata.
Assume that the standard deviation of all such sentences is f. Formulate and interpret your conclusion.
1 year. We are interested in testing using the p-value method 15. Alcohol-Related Fatal Car Accidents. Refer to the
and level of significance a 5 0.10 whether the population previous exercise.
mean prison sentence length for such offenses has changed a. Estimate the p-value.
since 2000. A random sample taken this year of 100 felons b. Assess the strength of the evidence against the null
convicted on drug offenses yielded a sample mean sentence hypothesis.
length of 54 months. 16. Preterm Births. The U.S. National Center for Health
a. State the hypotheses and the p-value rejection rule. Statistics reports that, in 2005, the percentage of infants
b. Find Zdata. delivered at less than 37 weeks of gestation was 12.7%, up
c. Find the p-value. from 10.6% in 1990.34 Has this upward trend continued? We
d. State the conclusion and the interpretation. are interested in testing whether the population proportion of
e. Assess the strength of the evidence against the null preterm births has increased from 12.7%, using the p-value
hypothesis. method and level of significance a 5 0.05. A random sample
11. ATM Fees. Do you hate paying the extra fees imposed taken this year of 400 births contained 57 preterm births.
by banks when withdrawing funds from an automated teller a. Is it appropriate to perform the Z test for the
machine (ATM) not owned by your bank? The Federal proportion? Why or why not?
Reserve System reports that, in 2002, the mean such fee was b. Where will the sampling distribution for p be
$1.14. A random sample of 36 such transactions yielded a centered? What is the shape of the distribution?
mean of $1.07 in extra fees. Suppose the population standard c. What is the standard error? What does this number
deviation of such extra fees is $0.25. We are interested mean?
in testing, using the critical-value method and level of d. How many standard errors does p lie below p0? Is this
significance a 5 0.05, whether there has been a reduction in extreme? Why or why not?
90 Chapter 9 Hypothesis Testing
e. Where in the sampling distribution does the value of of significance a 5 0.10 and the critical-value method,
lie? Near the tail? Near the center?
p test whether the population standard deviation of net price
17. Preterm Births. Refer to the previous exercise. change is less than 25 cents.
a. Construct the hypotheses.
b. Find the rejection rule. Stock Closing price Net change
c. Calculate the value of the test statistic Zdata. Micron Technology, Inc. $10.74 21.05
d. Calculate the p-value. Ford Motor Company $ 8.43 20.14
e. Assess the strength of the evidence against the null Citigroup, Inc. $47.89 0.03
hypothesis. Advanced Micro Devices $13.23 0.03
f. Formulate and interpret your conclusion. EMC Corporation $21.13 20.24
18. Active Stocks. On October 3, 2007, the ten most traded Commerce Bancorp, Inc. $38.84 20.63
stocks on the New York Stock Exchange were those shown General Electric $41.55 20.57
in the following table, which gives their closing prices and Avaya, Inc. $16.95 20.07
net change in price, in dollars. Use only the net change Sprint Nextel Corporation $18.76 20.24
data for this analysis. Assume normality. Using for level iShares:Taiwan $17.18 20.18