Professional Documents
Culture Documents
VISUALIZING DISTRIBUTIONS
The overall pattern of a distribution of measurements generally described by
3 components
1.CENTER --- describes a point near the middle of a distribution that serve as
balance point for the distribution
2. SPREAD-- describe s how the data points spread out around a center.
3. SHAPE -- describes the basic pattern of the plotted data along with
any notable departures from that pattern. The most basic
shapes can be described using the idea of symmetry
SYMMETRIC DISTRIBUTION half of the distribution is
approximately a mirror image of the other half
RIGHT-SKEWED DISTRIBUTION- the distribution has a longer right tail
than the left tail. More lower values than with higher values.
LEFT-SKEWED DISTRIBUTION- the distribution has a longer left tail
than the right tail.More higher values than with lower values.
A boxplot splits the data set into quartiles. The body of the
boxplot consists of a "box" (hence, the name), which goes
from the first quartile (Q1) to the third quartile (Q3).
Within the box, a vertical line is drawn at the Q2, the median
of the data set.
Two horizontal lines, called whiskers, extend from the front
and back of the box. The front whisker goes from Q1 to the
smallest non-outlier in the data set, and the back whisker goes
from Q3 to the largest non-outlier.