Professional Documents
Culture Documents
Ex. Students from a small college were asked how Histograms Histograms
Histograms: Discrete Data many charge cards that they carry. x is the variable
Continuous Data:
representing the number of cards and the results are Credit card results:
below.
x Rel. Freq. Relative Frequency Equal Class Widths
Determine the frequency and relative Frequency 0 0.08 0.4
x #people Rel. Freq
Distribution Determine the frequency and relative
frequency for each value of x. Then 1 0.28 0.3
0 12 0.08
2 0.38
xi
frequency for each class. Then mark
mark possible x values on a horizontal 1 42 0.28
0.2
3 0.16 the class boundaries on a horizontal
scale. Above each value, draw a 2 57 0.38 0.1
4 0.06 0
measurement axis. Above each class
rectangle whose height is the relative 3 24 0.16
4 9 0.06
5 0.03 0 1 2 3 4 5 6 interval, draw a rectangle whose height
frequency of that value. 6 0.01 Number of Cards
5 4 0.03 is the relative frequency.
6 2 0.01
Histograms (Continuous Data): Histogram Shapes The Mean
Unequal Widths
After determining frequencies and relative
1.3 The average (mean) of the n numbers
frequencies, calculate the height of each x1, x2 ,..., xn is x where
rectangle using: Measures n
xi
symmetric unimodal bimodal
x + x + ... + xn
rectangle height =
relative frequency of the class x= 1 2 = i =1
class width of Location n n
The resulting heights are called densities
Population mean: µ
and the vertical scale is the density scale. positively skewed negatively skewed
( xi − x )
2
data points the median is the average of symmetric
Variability s2 = =
S xx
the middle two. n −1 n −1
Boxplots
Outliers
lower fourth upper fourth
Any observation farther than 1.5fs from
the closest fourth is an outlier. An
outlier is extreme if it is more than 3fs
from the nearest fourth, and it is mild
otherwise. extreme mild median
outliers outliers