You are on page 1of 37

Statistics

Summary numbers, or indices, that result from an analysis of data (numbers) pg. 2 All the procedures and tools used to organize and interpret facts, events, and observations that can be expressed numerically pg. 2

Imaginary World
Snap

Crackle

Imaginary World #2
if
Snap

and or
then

Crackle

Pop

Imaginary World #3

Variables
n. an element, feature, or factor that is liable to vary or change adj. not consistent or having a fixed pattern; liable to change

The Goal of Statistics (and all of science)


Figure out how things vary together. If you succeed, you can answer the questions:
How, why, when, where, what

Q: Why is little Sally afraid of dogs? A: What are the variables, and how do they vary together?
(the presence or absence of dogs, Sallys emotional state, Sallys neural structure as it relates to these other events, Sallys history and how it relates to the above elements, etc.) Q: Why does my roommate drink my milk and leave the empty jug in the refrigerator?

Making Sense of the Variability


Break it down: Independent Variables: the variables we are manipulating (or using for categorization)
E.g., shock intensity

IV

Dependent Variables: the variables we are measuring


E.g., amplitude of scream

DV

Unaccounted For Variables


Variables that we do not know the status of Because we didnt bother to include them
E.g., eye color

Because we couldnt include them (even if we wanted to)


E.g., exact electro-chemical state of the nervous system before, during, and after the shock

Example 1
Octavian is interested in the effect of breathing exercises on time spent on task.
What is the independent variable?
Breathing exercises Time on task

What is the dependent variable?


Breathing exercises Time on task

Example 2
Adelpha wants to know if pronunciation can be improved if one practices speaking with marbles in ones mouth.
What is the independent variable?
Clarity of pronunciation Practice with marbles

What is the dependent variable?


Clarity of pronunciation Practice with marbles

Unaccounted For Variables: What To Do?


The influence of unaccounted for variables can be controlled for by designing your study in such a way that unaccounted for variables are not likely to influence one group or condition more than any other group or condition.

How is this accomplished?

Design
1. Random Selection

Random = each member of the population has an equal chance of being selected. If you stop here, you cant make causal statements.
Random = each selected participant has an equal chance of being exposed to each condition. This is the only way you can make causal statements. This is an experiment.

2. Random Assignment

Example
Romulus selects the first 50 students that walk into the Bernhard Center to participate in a study on WMU students. The students are asked if they are smokers or nonsmokers and then they fill out a short survey that is supposed to measure their general level of anxiety. Do we have random selection , random assignment , neither , or both ?

Whats wrong with it?


1. No random selection: not all students go to the Bernhard Center on a regular basis, and many might only go late in the afternoon. (selection bias) 2. No random assignment: students were already in the category of interest, smoker or nonsmoker, so the IV was not manipulated. (not an experiment)

So what?
1. No random selection: means that you cannot say anything about the target population (WMU students). 2. No random assignment: means that you cannot make causal statements about the IV on the DV (e.g., you cant say that smoking caused students to have a higher level of anxiety). 3. This experiment is a big failure, no matter how good the statistical analysis is.

Whats the Lesson?

Design Trumps Analysis


Whenever possible, a study should include both random selection and random assignment. If you cant randomly assign, your study might still be useful, but you cant make causal statements.

Food for Thought

Statistics and Variability


Variability Randomness Randomness is an Illusion

Complexity
Variability is brought about by COMPLEXITY, not randomness. Human behavior is very complex (i.e., there are many relevant variables), but because it is orderly, we can predict and influence behavior. Statistics makes this prediction and influence much easier.

Population vs. Sample


Population
Every person or object in a group (past, present, future, often infinite)

Sample
Part of the population

Random simple stratified

Biased

Summary Measure = Parameter Ex: = pop. mean 2 = pop. variance = pop. standard deviation

Summary Measure = Statistic _ Ex: X = sample mean s2 = sample variance s = sample standard deviation

Notice: Greek symbols are used for parameters, Roman symbols for statistics

Why Sample?
Population

Inference

Sampling

Sample

We want to make inferences about the population.

Inference
Definition: a conclusion reached on the basis of evidence [in our case, sample data] and reasoning [in our case, statistical analysis]

Why not take data on the population?


The population might be huge.
If the population is U.S. adults over age 65 you would need to take data on about 40 million people (also see below).

Part of the population might be inaccessible.


A study on infants born this year should apply to infants born next year. A study on heroin addicts should apply to people who were addicted to heroin but died of an overdose.

But if you can get population data, you should.


If I am interested in how many hours students in my class spend studying for quizzes, I can (and should) take population data.

INFER

Inferential vs. Descriptive Statistics


Inferential Statistics: using statistical methods to make inferences about the population given sample data Ex: How likely is it that my depression intervention results (from my sample) will generalize to the population? Descriptive Statistics: using statistical methods to describe a set of data Ex: What is the mean depression score on the Hamilton Rating Scale for clients I am working with?
DESCRIBE

Sampling Error
Sampling error: the error caused by observing a sample instead of the population.
Represented by the formula:
statistic parameter = sampling error Ex:

The ONLY way to avoid sampling error entirely is to take data on the entire population What can we do to reduce sample error?
1. Random Sampling 2. Large Samples 3. Multiple Samples

4. Stratified Random Sampling


1. Decide which characteristic of the population you will use for stratification. (Ex: sex) 2. Randomly sample from each strata. 3. The number drawn from each strata should be proportional to the number in each strata in the population.
Ex: Total pig population (16 pigs): You think the color of the pig might be an important variable, but you can only afford to do the study with half of the pigs. How many black and how many pink pigs would you include in your study?
Thats right! Six pink pigs and two black pigs.

Try This
Population: People age 16-65 living in the U. A. E. U. A. E. Male: 3,500,000 What should your Stratified sample look like if your total sample size is 100?

Female: 2,000,000

Heres how I wrote the number of females as I solved it: a fraction of the total population, and put my unknown over my sample size on the other side. Then I solved for X.

So, your sample should look like this: Male: 64 Female: 36

With and Without Replacement


Sampling with replacement means that after each person/object is selected, it is returned to the population so that it could be selected again.
This is done so that the probability of an individual being selected remains constant.

Sampling without replacement means that after each person/object is selected, it is not returned to the population and, therefore, cannot be selected again.
In most behavioral studies, this is the only way to sample. Why is that?

Measurement

Definition: the assignment of numbers or labels to objects or events Labels? How is that measurement?

1. Nominal Scales
Naming of objects or events Naming measurement

Naming allows us to categorize and deal with individuals or groups separately Labeling is required to get frequency data, which is type of measurement

Nominal Scales
If we took the last seven digits of each of your phone numbers and averaged them, what could we learn? NOTHING. Be careful, just because its a number doesnt mean its not a name. What if we wrote down the area code of each student in class and found the number of times each area code occurs (frequency)? This would work! Lets try it.

2. Ordinal Scales
Naming objects or events and putting them in order Ranking or rank ordering Example: advisory system

Ordinal Scales
4 3 2 1

Ordinal scales give us limited data.

Interval Scales
Naming of objects or events and putting them in order using units with equal intervals The distance between 1 and 2 should be exactly the same as the distance between 2 and 3 or 3 and 4.

Common example: temperature in degrees Fahrenheit or Celcius

Interval Scales
In psychology: whether or not Likert scales are interval or ordinal scales is an issue of debate. Why do you think this might be?

Note: We will treat them as interval scales in this course, but only for convenience.

Ratio Scales
Naming of objects or events and putting them in order using units with equal intervals on a scale with a true zero Ask yourself, does zero really mean nothing?

Common example: temperature in degrees Kelvin

Homework
Prepare for Quiz #2 over Chapters 1 & 2. Read Chapter 3 and be prepared to answer basic questions. Complete Homework assignment #1 and prepare to hand it in at the beginning of the next class.

Check WebCT and/or the course website for the homework assignment and other study materials.

You might also like