Professional Documents
Culture Documents
DESCRIBING
S E T S O F DATA
A stem-and-leaf
display is often called a
stemplot, but the
latter term often refers
to another chart type
Stem and leaf display
44 46 47 49 63 64 66 68 68 72 72 75 76 81 84 88 106
4 |4679
5 |
6 |3 4 6 8 8
7 |2 2 5 6
8 |1 4 8
9 |
10 |6
Key 6 |3=63
Leaf unit =1.0
Steam unit=10.0
Histograms
• A histogram is a graphical representation of the distribution
of data. It is an estimate of the probability distribution of a
continuous variable and was first introduced by Karl
Pearson.
• A histogram is a representation of tabulated frequencies,
shown as adjacent rectangles, erected over discrete
intervals (bins), with an area equal to the frequency of the
observations in the interval.
• The height of a rectangle is also equal to the frequency
density of the interval, i.e., the frequency divided by the
width of the interval.
• The total area of the histogram is equal to the number of
data.
Histograms
• First described
by Karl Pearson
• Purpose :To
roughly assess
the probability
distribution of a
given variable
by depicting
the frequencies
of observations
occurring in
certain ranges
of values
SUMMATION NOTATION
• We denote the measurements of a quantitative
data set as follows:
• xI, x2, x3,. ... x, where xl is the first measurement in
the data set, x2 is the second measurement in the
data set, x3 is the third measurement in the data
set,. .., and x, is the nth (and last) measurement in
the data set. Thus, if we have five measurements
in a set of data, we will write x1, x2, x3, x4, x5 to
represent the measurements.
• If the actual numbers are 5, 3, 8, 5, and 4, we
have x1= 5, x2 = 3, x3 = 8, x4 = 5, and x5 = 4.
• Most of the formulas we use require a
summation of numbers. For example, one
sum we'll need to obtain is the sum of all the
measurements in the data set, or x1 + x2 + x3 +
. ..a + xn
• To shorten the notation, we use the symbol 2
for the n summation.
• That is
Verbally translate