Professional Documents
Culture Documents
Name: _____________________________
Period: _____________________________
The first thing we should probably tackle in our year long journey to understand the basics of
statistics is to define what it encompasses:
STATISTICS:
When trying to answer a question about some group using data, for example, What
proportion of Harker students own an iPhone? the easiest approach is to perform a
___________________.
Definition:
If we changed our question of interest to, What proportion of American high school students
own an iPhone? what would be difficult about this approach?
(1)
(2)
(3)
(4)
(II) As a result, a _____________ often times is inferior to taking a ___________________
Definition:
#4
THE KEY TO MAKING THIS DATA COLLECTION APPROACH WORK WELL IS THAT THE
SAMPLE NEEDS TO BE ______________________ OF THE POPULATION OF INTEREST.
Types of Sample Design
VOLUNTARY RESPONSE SAMPLE
CONVENIENCE SAMPLE
Do either of these two sampling designs accomplish our goal of having a _______________ sample?
As a matter of fact, the above methods of data collection often create a big problem with regard to
obtaining appropriate data:
Bias:
Example 1:
Example 2:
Suppose that you want to estimate the total amount of money spent by students on
textbooks each semester at UC Berkeley. You collect register receipts for
students as they leave the bookstore during lunch one day. Describe the source of
bias.
#7,9
OUR BEST APPROACH TO ENSURING THAT OUR SAMPLE IS ______________ AND DOES
NOT INTRODUCE BIAS IS TO TAKE A _____________ SAMPLE.
2
Example 3:
Mr. Thiele runs a lucrative Statistics tutoring business in the San Mateo area. In
order to provide high quality customer service, he wishes to interview a sample of
five clients in detail. In order to avoid bias he wishes to choose an SRS of size
five. Using Table D, line 130, determine which five clients should be selected
from this group:
Angela Anova
Cos Ashon
Lynnie Arregression
Miss Calculate
Dennis Decile
Stan Deviation
Norma Distribution
Martin Gale
Marge Innovera
Cara Lashon
Corey Lashon
Maxim M. Likelihood
Ester Mate
Jean Mean
Marian Median
Moe Ment
Chris Miss
Moe Mode
Dee Morgan
Mark Off
Kurt O'Sis
Penelope Probability
Sam Pull
Quincy Quartile
Sally Sample
Sammy Sample
Randy Sampling
Cheb E. Shev
Minnie Tabb
Percy N Tile
#12,16
STRATIFIED RANDOM SAMPLE
Example 4:
The financial aid office of a university wishes to estimate the average amount
of money that students spend on textbooks each term. They are considering
taking a stratified sample. For each of the following proposed stratification
schemes, determine whether it would be worthwhile to stratify the university
students in this manner.
(a) Strata corresponding to class standing (freshman, sophomore, junior,
senior, grad student).
(b)
(c)
Strata corresponding to the first letter of the last name: A-E, F-K, etc.
CLUSTER SAMPLE
Example 5:
#21,24
A hotel has 30 floors with 40 rooms per floor. The rooms on one side of the
hotel face the water, while rooms on the other side face a golf course. There is
an extra charge for the rooms with a water view. The hotel manager wants to
survey 120 guests who stayed at the hotel during a convention about their
overall satisfaction with the property.
(a) Explain why choosing a stratified random sample might be preferable to
an SRS in this case.
(b)
(c)
(d)
SYSTEMATIC SAMPLE
Example 6:
For each of the situations described below, state the kind of sampling
procedure that is used.
(a) All freshmen at a university are enrolled in one of 30 sections of a
freshman seminar course. To select a sample of freshmen at this
university, a researcher selects four sections of the freshmen seminar
course at random from the 30 sections and all students in the four
selected sections are included in the sample.
(b)
#26
(c)
(d)
(e)
BIAS
VARIABILITY
HIGH
LOW
HIGH
LOW
(2)
NONSAMPLING ERRORS
(1)
Nonresponse Bias
(2)
Measurement Bias
(3)
Response Bias
(4)
Wording Bias
Example 7:
Example 8:
The article Study Provides New Data on the Extent of Gambling by College
Athletes (Chronicle of Higher Education, 22 January 1999) reported that 72
percent of college football and basketball players had bet money at least once
since entering college. This conclusion was based on a study in which
copies of the survey were mailed to 3000 athletes at 182 Division I
institutions, 25 percent of whom responded. What types of bias might have
influenced the results of this study? Explain.
#27,31,34,35
(3)
Example 9:
CONFOUNDING VARIABLE:
Example 10:
Crime Finds the Never Married is the conclusion drawn in an article from
USA Today (29 June 2001). This conclusion is based on data from the Justice
Departments National Crime Victimization Survey, which estimated the
number of violent crimes per 1000 people, 12 years of age or older, to be 51
for the never married, 42 for the divorced or separated, 13 for married
individuals and 8 for the widowed. Does being single cause an increased risk
of violent crime? Describe a lurking variable that illustrates why it is
unreasonable to conclude that a change in marital status causes a change in
crime risk.
#46,47,50
Experimental Terminology
EXPLANATORY VARIABLE (FACTOR):
RESPONSE VARIABLE:
EXPERIMENTAL UNITS:
TREATMENT:
LEVELS:
Example 11: As we have already seen, nonresponse is a big issue in survey design. A
particular company was interested in reducing the rate of refusals in telephone
surveys. Most people who answer at all listen to the interviewers introductory
remarks and then decide whether to continue. To address this issue, a study made
telephone calls to randomly selected households to ask opinions about the next
election. In some calls, the interviewer gave her name, in others she identified the
university she was representing, and in still others she identified both herself and
the university. For each type of call, the interviewer either did or did not offer to
send a copy of the final survey results to the person interviewed. The company
then examined whether or not these differences in the introduction affected
whether the interview was completed.
(a)
(b)
(c)
(d)
#54,57,63
Experimental Design
GOAL:
(4)
(5)
(6)
9
Example 12: Will providing child-care for employees make a company more attractive to
women, even those who are unmarried? You are designing an experiment to
answer this question. You prepare recruiting material for two fictitious
companies, both in similar businesses in the same location. Company As
brochure does not mention child-care. There are two versions of Company Bs
material, identical except that one describes the companys on-site child-care
facility. Your subjects are 40 unmarried women who are college seniors seeking
employment. Each subject will read recruiting material for both companies and
choose the one she would prefer to work for. You will give each version of
Company Bs brochure to half the women. You expect that a higher percentage
of those who read the description that included child-care will choose Company
B.
(a) Describe an appropriate design for the experiment.
(b)
The names of the subjects appear below. Use Table D, beginning at line
131 to do the randomization required by your design. Indicate the subjects
who read the version that mentions child-care.
Abrams
Adamson
Afifi
Brown
Cansico
Chen
Cortez
Curzakis
Danielson
Durr
Edwards
Fluharty
Garcia
Gerson
Green
Gupta
Gutierrez
Howard
Hwang
Iselin
Janle
Kaplan
Kim
Lattimore
10
Lippman
Martinez
McNeill
Morse
Ng
Quinones
Rivera
Roberts
Rosen
Sugiwara
Thompson
Travers
Turing
Ullmann
Williams
Wong
CONTROL
(2)
RANDOMIZATION
(3)
REPLICATION
(4)
BLOCKING
Matched Pairs:
#67
11
Placebo Effect
Blinding
Single Blind:
Double Blind:
#69,71,73
Example 13:
(b)
(c)
12
Example 14:
To compare two levels of treatment with a new fertilizer, cherry tomatoes are
to be grown in each of eight test plots. Tall windows line the north side of the
hothouse and a breezy doorway is located on the east side.
Tall Windows
Breezy Doorway
(a)
What are the experimental units, explanatory and response variables, and
treatments?
(b)
What would blocking using the scheme above (one block is white, the
other is gray) accomplish?
(c)
Breezy Doorway
(d)
If a breezy doorway were added to the side of the hothouse opposite that
of the current breezy doorway, what type of blocking structure would
work well?
(e)
Describe the assignment of treatments to the test plots using the blocking
structure used in part (d).
13
Example 15:
In search of a mosquito repellent that is safer than the ones that are currently
on the market, scientists have developed a new compound that is rated as less
toxic than the current compound, thus making a repellent that contains this
new compound safer for human use. Scientists also believe that a repellent
containing the new compound will be more effective than the ones that
contain the current compound. To test the effectiveness of the new compound
versus that of the current compound, scientists have randomly selected 100
people from a state (probably Minnesota). Up to 100 bins, with an equal
number of mosquitoes in each bin, are available for use in the study. After a
compound is applied to a participants forearm, the participant will insert his
or her forearm into a bin for one minute, and the number of mosquito bites on
the arm at the end of that time will be determined. Suppose this study is to be
conducted using a matched-pairs design. Describe a randomization process.
#78,81,85,87
Scope of Inference
Researchers who conduct statistical studies often want to draw conclusions (i.e. make inferences)
that go beyond the data produced. So, what types of conclusions are appropriate? The answer
depends on the design of the study.
YES
TO GROUPS?
NO
YES
NO
Another issue that may prevent us from generalizing an experiments results is lack of ______________.
#103,104,108
14
Yet another issue occurs when experiments are impractical or unethical to complete. The criteria
for establishing causation when an experiment cannot be completed include:
(1)
(2)
(3)
(4)
(5)
Data Ethics
All studies involving humans should always adhere to the guidelines of:
(1)
(2)
(3)
#109,112,114,115
15
Simulation
Useful when actually carrying out an experiment is too ___________, _____________, or
_______________
Steps of a Simulation:
(1)
(2)
(3)
(4)
(5)
Example 16:
16