You are on page 1of 46

1.1.

1 State that error bars are a graphical representation of the


variability of data.
1.1.2 Calculate the mean and standard deviation of a set of values.
1.1.3 State that the term standard deviation is used to summarize
the spread of values around the mean, and that 68% of the
values fall within one standard deviation of the mean.
1.1.4 Explain how the standard deviation is useful for comparing
the means and the spread of data between two or more
samples.
1.1.5 Deduce the significance of the difference between two sets of
data using calculated values for t and the appropriate tables.
1.1.6 Explain that the existence of a correlation does not establish
that there is a causal relationship between two variables.
STATISTICS!!!
The science of data
What is data?
Information, in the form of facts or
figures obtained from
experiments or surveys, used as a
basis for making calculations or
drawing conclusions
Encarta dictionary
Statistics in Science

Data can be collected about a
population (surveys)

Data can be collected about a
process (experimentation)



2 types of Data

Qualitative
Quantitative



Qualitative Data
Information that relates to characteristics or
description (observable qualities)
Information is often grouped by descriptive
category
Examples
Species of plant
Type of insect
Shades of color
Rank of flavor in taste testing
Remember: qualitative data can be scored and
evaluated numerically

Qualitative data, manipulated
numerically
Survey results, teens and need for environmental action
Quantitative data
Quantitative measured using a
naturally occurring numerical scale
Examples
Chemical concentration
Temperature
Length
Weightetc.


Quantitation
Measurements are often displayed
graphically
Quantitation = Measurement
In data collection for Biology, data must be
measured carefully, using laboratory equipment
(ex. Timers, metersticks, pH meters, balances , pipettes, etc)
The limits of the equipment used add some
uncertainty to the data collected. All equipment
has a certain magnitude of uncertainty. For
example, is a ruler that is mass-produced a good
measure of 1 cm? 1mm? 0.1mm?

For quantitative testing, you must indicate
the level of uncertainty of the tool that you
are using for measurement!!
How to determine uncertainty?
As a rule-of-thumb, if not specified, use +/-
1/2 of the smallest measurement unit
Looking at Data
How accurate is the data? (How close are
the data to the real results?) This is also
considered as BIAS

How precise is the data? (All test systems
have some uncertainty, due to limits of
measurement) Estimation of the limits of
the experimental uncertainty is essential.
Comparing Averages
Once the 2 averages are
calculated for each set of data,
the average values can be plotted
together on a graph, to visualize
the relationship between the 2
Drawing error bars
The simplest way to draw an error bar is
to use the mean as the central point, and
to use the distance of the measurement
that is furthest from the average as the
endpoints of the data bar
Average
value
Value farthest
from average
Calculated
distance
What do error bars suggest?
If the bars show extensive overlap, it is
likely that there is not a significant
difference between those values
Quick Review 3 measures of
Central Tendency
mode: value that appears most frequently
median: When all data are listed from
least to greatest, the value at which half
of the observations are greater, and half
are lesser.
The most commonly used measure of
central tendency is the mean, or
arithmetic average (sum of data points
divided by the number of points)
How can leaf lengths be displayed
graphically?
Simply measure the lengths of each and plot how many
are of each length
If smoothed, the histogram data
assumes this shape
This Shape?
Is a classic bell-shaped curve, AKA
Gaussian Distribution Curve, AKA a Normal
Distribution curve.

Essentially it means that in all studies with
an adequate number of data points (>30)
a significant number of results tend to be
near the mean. Fewer results are found
farther from the mean
The standard deviation is a statistic that
tells you how tightly all the various
examples are clustered around the mean
in a set of data
Error bars can represent either the range
of the data or the Standard Deviation
Standard Deviation

The SD is a more sophisticated indicator
of the precision of a set of a given number
of measurements
The standard deviation is like an average
deviation of measurement values from the
mean. In large studies, the standard deviation
is used to draw error bars, instead of the
maximum deviation.



SD shows dispersion from mean
Low SD = data points close to the mean
High SD = data points spread out over
large range of values

Mathematically, the SD of a population is
the square root of its variance.
Variance
Measure of how far a set of numbers are
spread out from each other.
It is a parameter describing the actual
probability of an observed population or
the theoretical probability distribution.
Sample data estimation variances are
called the sample variance.
the variance is a measure of the amount of
variation of the values of that variable taking
into consideration all values, probabilities, or
weightings (not just extremes used for range.)
Degrees of Freedom
The number of independent pieces of
information available to estimate another
piece of information.
Or rather
The number of independent observations
in a sample of data available to estimate a
parameter of that population.
A typical standard distribution curve
According to this curve:
One standard deviation away from the
mean in either direction on the horizontal
axis (the red area on the preceding graph)
accounts for somewhere around 68
percent of the data in this group.
Two standard deviations away from the
mean (the red and green areas) account
for roughly 95 percent of the data.

Three Standard Deviations?
three standard deviations (the red, green
and blue areas) account for about 99
percent of the data
-3sd -2sd +/-1sd 2sd +3sd


How is Standard Deviation
calculated?
With this formula!
AGHHH!
DO I NEED TO
KNOW THIS FOR
THE TEST?????
Not the formula!
This can be calculated on a scientific calculator

OR. In Microsoft Excel, type the following code into the
cell where you want the Standard Deviation result, using
the "unbiased," or "n-1" method: =STDEV(A1:A30)
(substitute the cell name of the first value in your
dataset for A1, and the cell name of the last value for
A30.)

You DO need to know the concept!
standard deviation is a statistic that tells
how tightly all the various data points are
clustered around the mean in a set of data.
When the data points are tightly bunched
together and the bell-shaped curve is steep, the
standard deviation is small.(precise results,
smaller sd)
When the data points are spread apart and the
bell curve is relatively flat, a large standard
deviation value suggests less precise results
23, 92, 46, 55, 63, 94, 77, 38, 84, 26

21, 53, 28, 29, 22, 12, 78, 33, 222, 12


THE END




For today.

You might also like