Professional Documents
Culture Documents
Chapter 2 the mean and standard deviation (sd) provide enough data to describe a population and
the amount of variability among members of the population.
When the value of the variable is more likely than not to fall below or above the mean,
one should report the median and values of at least two other percentiles.
After collecting data and plotting the distribution we look for parameters of the
distribution:
Mean:
Measures of variability:
To determine the variation of data about the mean, you have the positive side an
negative side of the mean (above and below the mean). Squaring that gives you a
positive number. Variance is the square of the deviation around the mean.
Normal Distribution
The normal distribution describes most populations, with 68% of data falls within 1 sd of
mean and 95% within 2 sd of mean. The height of the normal distribution curve at any
given value of X is
Median
The middle value when observations are listed in order. 50% of data falls below
median. It is the (n+1)/2 observation when observations are listed in order.
Other percentiles
25th percentile - lowest quarter of observations (n+1)/4 observation.
Another other percentile - (n+1)/(100/p) observation
Calculating the percentile points of a population is a good way to see how close to a
normal distribution it is. If the percentiles usually have high numbers starting them off,
you know that it may be skewed.
For a normal distribution, the percentile points are as shown:
Standard Deviation
Note that the sigma or s with the small subscript of x with a bar over refers to standard
error! It does not refer to standard deviation. Standard deviation refers to the range of
deviation around the mean of a population or within a sample. The standard error
describes the range within which one can say that their mean is 95% accurate of the
entire population from which a sample was drawn.
By calculating what happens when we collect many samples from a population and
calculate their means and standard deviations, we come up with the Central Limit
Theorem: