
Descriptive Statistics

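Definition: Sample Mean

If the n observations in a sample are denoted by x_1, x_2, …, x_n, the sample mean is

$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$$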

Population Mean

For a finite population with N measurements x_1, x_2, …, x_N, the mean is

$$\mu = \frac{1}{N}\sum_{i=1}^{N} x_i$$

The sample mean is a reasonable estimate of the population mean.
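As a minimal sketch of the two definitions, the snippet below computes a population mean and a sample mean with the same arithmetic. The eight measurements and the choice of the first four as the sample are hypothetical, chosen only for illustration.

```python
def mean(values):
    """Arithmetic mean: the sum of the values divided by how many there are."""
    if not values:
        raise ValueError("need at least one value")
    return sum(values) / len(values)

# Hypothetical finite population of N = 8 measurements (illustrative only).
population = [12.6, 12.9, 13.4, 12.3, 13.6, 13.5, 12.6, 13.1]
mu = mean(population)      # population mean: uses all N values

# Hypothetical sample of n = 4 values drawn from that population.
sample = population[:4]
x_bar = mean(sample)       # sample mean: an estimate of mu

print(f"population mean = {mu:.2f}, sample mean = {x_bar:.2f}")
```

The formula is identical in both cases; only the set of values it is applied to differs.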


Population Variance

When the population is finite and consists of N values, we may define the population variance as

$$\sigma^2 = \frac{1}{N}\sum_{i=1}^{N} (x_i - \mu)^2$$

where μ is the population mean.

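Definition: Sample Variance

If x_1, x_2, …, x_n is a sample of n observations, the sample variance is

$$s^2 = \frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})^2$$

where \bar{x} is the sample mean.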

The sample variance is a reasonable estimate of the population variance.
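The difference between the two variances can be sketched as follows, assuming the usual conventions: divide by N when the values are the whole population, and by n − 1 when they are a sample. The data are hypothetical.

```python
def population_variance(values):
    """Average squared deviation from the mean, dividing by N."""
    n = len(values)
    mu = sum(values) / n
    return sum((x - mu) ** 2 for x in values) / n

def sample_variance(values):
    """Sum of squared deviations from the sample mean, dividing by n - 1."""
    n = len(values)
    if n < 2:
        raise ValueError("sample variance needs at least two observations")
    x_bar = sum(values) / n
    return sum((x - x_bar) ** 2 for x in values) / (n - 1)

# Hypothetical measurements (illustrative only).
data = [12.6, 12.9, 13.4, 12.3, 13.6, 13.5, 12.6, 13.1]

print(f"treated as a population: {population_variance(data):.4f}")
print(f"treated as a sample:     {sample_variance(data):.4f}")
```

The same eight numbers give a slightly larger value under the n − 1 divisor, which is the expected behavior.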


Definition

Stem-and-Leaf Diagrams


Figure: Stem-and-leaf diagram. Stem: tens digits. Leaf: ones digits.
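A stem-and-leaf diagram with tens digits as stems and ones digits as leaves can be sketched in a few lines. The function name and the data below are hypothetical, and the sketch assumes non-negative integer observations.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group sorted values by stem (tens digits), storing the ones digit as the leaf."""
    rows = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)   # e.g. 118 -> stem 11, leaf 8
        rows[stem].append(leaf)
    return dict(rows)

# Hypothetical integer observations (illustrative only).
data = [105, 97, 110, 121, 118, 99, 101, 115, 123, 107]

for stem, leaves in stem_and_leaf(data).items():
    print(f"{stem:>3} | {''.join(str(leaf) for leaf in leaves)}")
```

Stems with no observations are simply omitted in this sketch; a fuller version would print them as empty rows so that gaps in the data stay visible.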

Data Features
The median is a measure of central tendency that divides the data into two equal parts, half below the median and half above. If the number of observations is even, the median is halfway between the two central values.
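The odd and even cases of the median can be sketched as below. The function name is hypothetical; Python's standard library offers `statistics.median` with the same behavior.

```python
def median(values):
    """Middle value of the sorted data; average of the two central values if n is even."""
    s = sorted(values)
    n = len(s)
    if n == 0:
        raise ValueError("need at least one value")
    mid = n // 2
    if n % 2 == 1:
        return s[mid]                   # odd n: the single central value
    return (s[mid - 1] + s[mid]) / 2    # even n: halfway between the two central values

print(median([3, 1, 2]))     # odd number of observations
print(median([4, 1, 3, 2]))  # even number of observations
```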

For an ordered set of data:

The first or lower quartile, q1, is a value that has approximately one-fourth (25%) of the observations below it and approximately 75% of the observations above.
The second quartile, q2, has approximately one-half (50%) of the observations below its value. The second quartile is exactly equal to the median.
The third or upper quartile, q3, has approximately three-fourths (75%) of the observations below its value.
As in the case of the median, the quartiles may not be unique.
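The remark that quartiles may not be unique can be seen by computing them under two interpolation conventions. This sketch uses `statistics.quantiles` from the Python standard library (3.8 or later); the data are hypothetical, and neither convention is claimed to be the one these slides use.

```python
from statistics import median, quantiles

# Hypothetical observations (illustrative only).
data = [1, 2, 3, 4, 5, 6, 7, 8]

# Two common conventions for interpolating between order statistics.
q_exclusive = quantiles(data, n=4, method="exclusive")
q_inclusive = quantiles(data, n=4, method="inclusive")

print("exclusive:", q_exclusive)
print("inclusive:", q_inclusive)
print("median:   ", median(data))
```

Both conventions agree on the second quartile, which equals the median, but they give different values for q1 and q3.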
In general, the 100kth percentile is a data value such that approximately 100k% of the observations are at or below this value and approximately 100(1 − k)% of them are above it.
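One simple way to pick such a value is the nearest-rank rule: take the smallest observation that has at least a fraction k of the data at or below it. This is only one of several conventions and is an assumption here, as are the function name and the data.

```python
import math

def percentile_nearest_rank(values, k):
    """100k-th percentile by nearest rank, for 0 < k <= 1."""
    if not 0 < k <= 1:
        raise ValueError("k must satisfy 0 < k <= 1")
    s = sorted(values)
    rank = math.ceil(k * len(s))   # 1-based position in the sorted data
    return s[rank - 1]

# Hypothetical data: the integers 1 through 20 (illustrative only).
data = list(range(1, 21))

p25 = percentile_nearest_rank(data, 0.25)
share_at_or_below = sum(x <= p25 for x in data) / len(data)
print(p25, share_at_or_below)
```

With 20 observations, the 25th percentile under this rule is the 5th smallest value, and exactly 25% of the data lie at or below it.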


Frequency Distributions and Histograms

Figure: Histogram of compressive strength for 80 aluminum-lithium alloy specimens.
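The frequency distribution behind a histogram is a count of observations per equal-width bin. The sketch below uses hypothetical integer data and bin settings; it does not reproduce the 80 compressive strength values.

```python
def frequency_distribution(values, lower, width, n_bins):
    """Count values in the bins [lower + j*width, lower + (j+1)*width), j = 0..n_bins-1."""
    counts = [0] * n_bins
    for x in values:
        j = (x - lower) // width   # index of the bin containing x
        if 0 <= j < n_bins:        # values outside the bin range are ignored
            counts[j] += 1
    return counts

# Hypothetical integer observations and bin layout (illustrative only).
data = [72, 85, 91, 98, 104, 110, 115, 119, 123, 131]
counts = frequency_distribution(data, lower=70, width=20, n_bins=4)

for j, c in enumerate(counts):
    lo = 70 + 20 * j
    print(f"[{lo:>3}, {lo + 20:>3}) | {'*' * c}")
```

Each bin includes its lower edge and excludes its upper edge, so a value such as 110 falls in the bin that starts at 110.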


Histograms for symmetric and skewed distributions.


6-4 Box Plots


The box plot is a graphical display that simultaneously describes several important features of a data set, such as center, spread, departure from symmetry, and identification of observations that lie unusually far from the bulk of the data.

Labeled elements in the figure: whisker, outlier, extreme outlier.

Figure 6-13 Description of a box plot.
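The quantities a box plot displays can be sketched numerically. The thresholds below are an assumption, since the slide text does not state them: whiskers reach the most extreme points within 1.5 interquartile ranges (IQR) of the box, points between 1.5 and 3 IQR from the box are outliers, and points beyond 3 IQR are extreme outliers. The data and function name are hypothetical.

```python
from statistics import quantiles

def box_plot_summary(values):
    """Quartiles, whisker ends, outliers and extreme outliers (assumed 1.5/3 IQR rule)."""
    s = sorted(values)
    q1, q2, q3 = quantiles(s, n=4)              # default 'exclusive' convention
    iqr = q3 - q1
    lo_in, hi_in = q1 - 1.5 * iqr, q3 + 1.5 * iqr   # inner fences
    lo_out, hi_out = q1 - 3 * iqr, q3 + 3 * iqr     # outer fences
    within = [x for x in s if lo_in <= x <= hi_in]
    return {
        "quartiles": (q1, q2, q3),
        "whiskers": (within[0], within[-1]),
        "outliers": [x for x in s if lo_out <= x < lo_in or hi_in < x <= hi_out],
        "extreme_outliers": [x for x in s if x < lo_out or x > hi_out],
    }

# Hypothetical observations (illustrative only).
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 22, 40]
summary = box_plot_summary(data)
for name, value in summary.items():
    print(f"{name}: {value}")
```

Here the box spans 3 to 9, the whiskers stop at 1 and 9, 22 is flagged as an outlier, and 40 as an extreme outlier.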


A Probability Plot

Suppose now that for percentages 100(i − .5)/n, for i = 1, …, n, the percentiles are determined for a specified population distribution whose plausibility is being investigated. If the sample actually conforms to the specified distribution, the sample percentiles (the ordered sample observations) should be close to the corresponding population distribution percentiles.

Each of the pairs

([100(i − .5)/n]th percentile of the distribution, ith smallest sample observation)

for i = 1, …, n, can be plotted as a point on a two-dimensional coordinate system.

Similarly, a plot of the n pairs

([100(i − .5)/n]th z percentile, ith smallest observation)

on a two-dimensional coordinate system is called a normal probability plot. If the sample observations are in fact drawn from a normal distribution with mean value μ and standard deviation σ, the points should fall close to a straight line with slope σ and intercept μ.
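The construction can be sketched as follows: pair each ordered observation with the matching standard normal percentile, then fit a least-squares line through the points. The helper names are hypothetical, and the "ideal" data are generated to lie exactly on the line x = 10 + 2z, so the fit should recover slope 2 (the standard deviation) and intercept 10 (the mean). Real samples would only approximate a line.

```python
from statistics import NormalDist

def normal_plot_pairs(values):
    """Pairs (z percentile at (i - .5)/n, ith smallest observation), i = 1..n."""
    s = sorted(values)
    n = len(s)
    std_normal = NormalDist()   # standard normal: mean 0, standard deviation 1
    return [(std_normal.inv_cdf((i - 0.5) / n), x) for i, x in enumerate(s, start=1)]

def fit_line(pairs):
    """Least-squares slope and intercept of x on z."""
    n = len(pairs)
    z_bar = sum(z for z, _ in pairs) / n
    x_bar = sum(x for _, x in pairs) / n
    s_zx = sum((z - z_bar) * (x - x_bar) for z, x in pairs)
    s_zz = sum((z - z_bar) ** 2 for z, _ in pairs)
    slope = s_zx / s_zz
    return slope, x_bar - slope * z_bar

# Hypothetical "ideal" sample lying exactly on x = 10 + 2z (illustrative only).
n = 20
ideal = [10 + 2 * NormalDist().inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]

slope, intercept = fit_line(normal_plot_pairs(ideal))
print(f"slope = {slope:.3f}, intercept = {intercept:.3f}")
```

Plotting the pairs themselves, for example with a scatter plot, gives the normal probability plot; the fitted line is only a numerical check on straightness.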


Example 30
The sample consisting of n = 20 observations on dielectric breakdown voltage of a piece of epoxy resin appeared in the article .. The values of (i − .5)/n for which z percentiles are needed are (1 − .5)/20 = .025, (2 − .5)/20 = .075, …, and .975.
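The arithmetic for n = 20 can be checked with a short sketch that lists the values (i − .5)/n and the corresponding z percentiles. The breakdown voltage observations themselves are not reproduced here, so only the horizontal coordinates of the plot are computed.

```python
from statistics import NormalDist

n = 20
# Values (i - .5)/n for i = 1, ..., n: .025, .075, ..., .975
probs = [(i - 0.5) / n for i in range(1, n + 1)]

# Matching standard normal (z) percentiles.
z_percentiles = [NormalDist().inv_cdf(p) for p in probs]

for p, z in zip(probs, z_percentiles):
    print(f"{p:.3f}  {z:+.3f}")
```

The z percentiles are symmetric about zero, running from about −1.96 at .025 to about +1.96 at .975.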


Figure 4.35 shows the resulting normal probability plot. The pattern in the plot is quite straight, indicating it is plausible that the population distribution of dielectric breakdown voltage is normal.

Figure 4.35 Normal probability plot for the dielectric breakdown voltage sample
