Professional Documents
Culture Documents
consider the values of all the scores, it is not as sensitive a measure of central
tendency but it is
advantageous when there are extreme scores present in the data.
Let's expand the example. Let's say there were more people in the class and the exam
scores were as follows: one 34, three 35s, six 42s, seven 47s, five 51s, four 53s, one 57,
and two 100s.
The average is 50% but everyone is looking at each other's scores, and it does not
seem to make sense. Out of the 29 scores, 17 people scored below the mean, and 5
more people scored just slightly above the mean. It seems that the middle should
be lower. Is 50% really describing the set of scores? The median in this case, 47, is a
better representation of the central tendency of these data. You can keep going with
this illustration. Change the 57 to another 100 and see what happens to the mean
while the median remains at 47.
Measures of Variability:
While it is very useful to know where the center of the data is, we are also often interested in
the distribution or the spread of the scores around that center. In the example above using
exam scores, the spread of scores showed an interesting pattern. The majority of the class
scored between 34 and 57 and two students scored a perfect 100%. Remember that in this
case the average was 50%, and the median was 47%. Can you imagine that there could be a
wide range of distributions that would still result in a mean of 50%? There could have been
two zeros and two 100s, which would have given a mean of 50%, or every student could have
scored 50%, or there could have been four 45s and four 55s, etc. Can you see that the
"center" of the data describes a quality that is different than the spread or dispersion of the
scores? How can we describe variability?
The Range: The range is a very simple way of describing the spread of data. It is simply the
highest minus the lowest score. You can see that in the examples given in the paragraph above,
the ranges would be very different: 100-34=66; 100-0=100; 50-50=0; 55-45=10 This tells us
something about the spread of scores but only relies on two scores, the highest and lowest.
Standard Deviation: The standard deviation is a calculation that uses every score to
describe the variability. In this calculation, every score is subtracted from the mean to get
its distance from the middle of the data. There are two aspects of the standard deviation
that are important: the calculation and the interpretation. The calculation is not hard once
you understand some symbols used in statistics. The interpretation requires that you
understand the concept of the normal curve.
Interpreting Measures of Central Tendency and Variability:
In order to interpret measures of central tendency and variability, you need to understand
the normal curve. Many of you could probably draw a normal curve, but could you describe
what it represents or identify any of its major characteristics? A normal curve is a
frequency distribution of a particular variable. When enough people are included in this
frequency distribution, the curve that results becomes more and more "normal." Normal
means that the curve is bell-shaped and symmetric, with the majority of scores clustering
at the center and fewer scores at either end.
Refer to the Normal Curve lecture and see examples of how these curves can vary. Skewness
refers to whether the distribution is shifted up or down with a long tail. This means that some
people scored at one extreme while the majority of people's scores clustered around a point at
the opposite end. Kurtosis describes another characteristic of the curve which is how peaked or
flat the curve is.
Before we talk about the importance of the normal curve, let's make sure we all know what
we are looking at. The normal curve is a graph of a particular variable. Let's say that it is
the body weight of a group of people. The x-axis (horizontal axis) shows the range of scores
with M being the mean. The curve represents the number of people who scored each score
so the y-axis (vertical axis) is the number of scores. Instead of a general picture of a normal
curve, let me show you one using real data. The graph below shows the body weight of 240
men measured as part of a cardiovascular health study. You can see that this is a bar graph
that shows how many people scored each small range (each bar) of body weights. Can you
imagine a normal curve drawn over this graph? If it were skewed, in which direction would
it be?
Kurtosis: description of the vertical characteristic of the curve showing the data
distribution, for example, whether the curve is more peaked or flatter than the normal
curve
Variability: the degree of difference between each individual score and the central
tendency score Variance: the square of the standard deviation