Professional Documents
Culture Documents
Example 1: For each of the following, describe the Type I and Type II errors:
We can think of the filters decision as a hypothesis test. The null hypothesis is that the e-mail is a real
message and should go to your inbox. A higher point total provides evidence that the message may be
spam; when theres sufficient evidence, the filter rejects the null, classifying the message as junk. This
usually works pretty well, but, of course, sometimes the filter makes a mistake.
(a) When the filter allows spam to slip through into your inbox, which kind of error is that?
(b) Which kind of error is it when a real message gets classified as junk?
Type I; the email is not junk, but sent to junk mail anyway.
(c) Some filters allow the user (thats you) to adjust the cutoff. Suppose your filter has a default
cutoff of 50 points, but you reset it to 60. Is that analogous to choosing a higher or lower value of
for a hypothesis test? Explain.
This is similar to lowering the alpha level. It takes more evidence to reject the null and classify the
email as junk.
(d) What impact does this change in the cutoff value have on the chance of each type of error?
The probability of a Type I decreases (lower alpha) and the probability of a Type II increases.
Example 3: p.501 #18
Consider again the points-based spam filter described in Exercise 16. When the points assigned to
various components of an e-mail exceed the cutoff value youve set, the filter rejects its null hypothesis
(that the message is real) and diverts that e-mail to a junk mailbox.
The power of the test is the ability of the filter to detect spam. It is the probability that the test will
correctly send an email to the junk mailbox WHEN the email is indeed spam.
Lower the cutoff score (this would be analogous to increasing the alpha level).
If the cutoff score is lowered, the risk of a Type I error increases so a larger number of good emails
will end up in the junk mailbox.
Example 4: p.502 #22
Production managers on an assembly line must monitor the output to be sure that the level of defective
products remains small. They periodically inspect a random sample of the items produced. If they find
a significant increase in the proportion of items that must be rejected, they will halt the assembly
process until the problem can be identified and repaired.
The truth is the process is working fine but managers determine it is not.
The truth is the assembly process is producing defective items but managers determine it is working
fine.
(c) Which type of error would the factory owner consider more serious?
Type II; defects caught in the factory are generally cheaper to correct than defects found after the
point of sale.
Note: the power of the test in this scenario is the probability that the test will, in light of some true
alternative, correctly reject the null hypothesis. In context, given the factory is producing defective
items, the probability the test will correctly identify that.
Example 5: p.502 #26
Highway safety engineers test new road signs, hoping that increased reflectivity will make them more
visible to drivers. Volunteers drive through a test course with several of the new- and old-style signs
and rate which kind shows up the best.
The truth is reflective signs are not more visible but we decide they are.
The truth is reflective signs are more visible but we decide they are not.
With respect to some true alternative, the power of the test is the probability that the test will correctly
reject the null and determine that reflective signs are more visible.
(e) If the hypothesis is tested at the 1% level of significance instead of 5%, how will this affect the
power of the test?
If the alpha level decreases, so does the probability of a Type I error. Since the probability of a Type I
error is decreasing, the probability of a Type II error will increase. Since the probability associated
with the power of the test is the complement of a Type II error, the power will decrease as well.
(f) The engineers hoped to base their decision on the reactions of 50 drivers, but time and budget
constraints may force them to cut back to 20. How would this affect the power of the test?
Explain.
Larger sample sizes will result in more power. Decreasing the sample size will decrease the power of
the test.