
CHAPTER 1

DESCRIPTIVE STATISTICS
WEEK 1

L2 - Graphical display of Data

Learning Objectives:
At the end of the lesson, students should be able to:

Construct and interpret pictorial and tabular display of data

Pictorial & Tabular Methods


1. Stem-and-Leaf Displays:
How to construct a stem-and-leaf display:
1. Divide each data value into two parts:
- the leading digit(s) become the stem,
and the remaining digit(s) become the leaf.
2. List the stem values in a vertical column.
3. Record the leaf for each observation beside its stem.
4. Write the units for stems and leaves on the display.

Stem & Leaf Display

Results of a mathematics exam for a class of 50 students:
35 49 30 40 45 42 28 45 52 76
56 56 39 57 56 41 64 37 65 90
63 72 43 83 73 26 59 76 68 34
37 17 73 52 26 66 45 64 84 57
92 56 51 91 41 16 29 60 64 56

Place the numbers in order from smallest to largest:
16 17 26 26 28 29 30 34 35 37
37 39 40 41 41 42 43 45 45 45
49 51 52 52 56 56 56 56 56 57
57 59 60 63 64 64 64 65 66 68
72 73 73 76 76 83 84 90 91 92

Stem-and-Leaf Display

Stem | Leaf
1 | 67
2 | 6689
3 | 045779
4 | 011235559
5 | 12266666779
6 | 03444568
7 | 23366
8 | 34
9 | 012

Stem: tens digit
Leaf: ones digit
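
The construction steps can be sketched in a few lines of Python. This is only an illustration under simple assumptions: the function name `stem_and_leaf` is hypothetical, and it handles non-negative integers with the tens digit (or digits) as the stem and the ones digit as the leaf, as in the exam data above.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Group non-negative integers by stem (tens) and collect leaves (ones)."""
    groups = defaultdict(list)
    for v in sorted(values):          # sorting first keeps the leaves ordered
        stem, leaf = divmod(v, 10)    # e.g. 56 -> stem 5, leaf 6
        groups[stem].append(str(leaf))
    return {stem: "".join(leaves) for stem, leaves in sorted(groups.items())}


# Exam results of the 50-student class, as listed in the notes.
scores = [
    35, 49, 30, 40, 45, 42, 28, 45, 52, 76,
    56, 56, 39, 57, 56, 41, 64, 37, 65, 90,
    63, 72, 43, 83, 73, 26, 59, 76, 68, 34,
    37, 17, 73, 52, 26, 66, 45, 64, 84, 57,
    92, 56, 51, 91, 41, 16, 29, 60, 64, 56,
]

display = stem_and_leaf(scores)
for stem, leaves in display.items():
    print(f"{stem} | {leaves}")
```

Each printed row is one stem followed by its leaves, so the length of a row shows how many observations fall in that range of ten marks.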

2. Histogram:
A bar graph representing the frequency distribution of a
quantitative variable. Histograms are used to summarize large data sets.
Age    Freq.   Rel. Freq.
18     20      0.20
19     24      0.24
20     26      0.26
21     18      0.18
22      5      0.05
23      3      0.03
24      2      0.02
25      2      0.02
Sum   100      1.00
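
As a rough illustration of how the relative-frequency column is obtained, the sketch below divides each frequency by the total. The frequencies for ages 22 to 25 are an assumption here: they are inferred from the listed relative frequencies and the total of 100. The text bars are only a stand-in for a real histogram.

```python
ages = [18, 19, 20, 21, 22, 23, 24, 25]
# Frequencies for 22..25 are inferred from the relative frequencies (assumption).
freqs = [20, 24, 26, 18, 5, 3, 2, 2]

total = sum(freqs)
rel_freqs = [f / total for f in freqs]   # relative frequency = frequency / total

print("Age  Freq.  Rel. Freq.")
for age, f, r in zip(ages, freqs, rel_freqs):
    bar = "#" * f                        # crude text bar, one mark per student
    print(f"{age:>3}  {f:>5}  {r:>10.2f}  {bar}")
print(f"Sum  {total:>5}  {sum(rel_freqs):>10.2f}")
```

The relative frequencies always add up to 1, which is a quick check that no class has been missed.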

[Figure: Histogram of the ages of 100 students; vertical axis: relative frequency]

3. Box plot:
A graphical display that simultaneously describes several
important features of a data set:
center
spread
departure from symmetry
identification of outliers
A box plot displays the median, the first quartile, and the third
quartile on a rectangular box, aligned either horizontally or
vertically.
It is sometimes called a box-and-whisker plot.

HOW TO CONSTRUCT A BOX PLOT

Numerical Summary : Sample Median


The median of a sample depends on whether the number of terms in the
sample is even or odd.
If the number of terms is odd, then the median is the value of the
term in the middle.
If the number of terms is even, then the median is the average of
the two terms in the middle

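Arrange the observations x_1, ..., x_n in increasing order:

\[ x_{(1)} \le x_{(2)} \le \cdots \le x_{(n)} \]

Use the following rule:

\[
\tilde{x} =
\begin{cases}
\frac{1}{2}\left[ x_{(n/2)} + x_{(n/2+1)} \right] & \text{if } n \text{ is even,} \\
x_{((n+1)/2)} & \text{if } n \text{ is odd.}
\end{cases}
\]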

Example 1: Find the median of the following observations:
0.3 7.8 4.6 3.7 9.2 12.1 -5 -2.5 10.8

Arrange the observations in increasing order (n = 9):

-5 -2.5 0.3 3.7 4.6 7.8 9.2 10.8 12.1

Since n = 9 is odd, the median is the middle term:
Median = x((n+1)/2) = x(5) = 4.6

Example 2: Find the median of the following observations:
2.8 5.2 -2.3 2.6 3.6 1.4 6.9 4.3 8.4 2.8

Rearrange the observations in increasing order (n = 10):

-2.3 1.4 2.6 2.8 2.8 3.6 4.3 5.2 6.9 8.4

Since n = 10 is even, the median is the average of the two middle terms:
Median = (x(5) + x(6))/2 = (2.8 + 3.6)/2 = 3.2
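
The median rule and the two examples above can be checked with a short Python sketch. The function name `sample_median` is illustrative, not part of the course material; it sorts the sample and applies the even/odd rule directly.

```python
def sample_median(observations):
    """Median by the even/odd rule applied to the sorted sample."""
    x = sorted(observations)
    n = len(x)
    if n == 0:
        raise ValueError("the median of an empty sample is undefined")
    mid = n // 2
    if n % 2 == 1:
        return x[mid]                   # odd n: term (n+1)/2 in 1-based counting
    return (x[mid - 1] + x[mid]) / 2    # even n: average of terms n/2 and n/2 + 1


example_1 = [0.3, 7.8, 4.6, 3.7, 9.2, 12.1, -5, -2.5, 10.8]      # n = 9 (odd)
example_2 = [2.8, 5.2, -2.3, 2.6, 3.6, 1.4, 6.9, 4.3, 8.4, 2.8]  # n = 10 (even)

print(sample_median(example_1))
print(round(sample_median(example_2), 4))
```

Python lists are indexed from 0, so the 1-based position k in the rule corresponds to `x[k - 1]` in the code.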

Percentile:
A measure of position: the percentiles divide an ordered
data set into 100 parts.
nth percentile:

At least n% of the data lie at or below the nth percentile,
and at most (100 - n)% of the data lie above the nth
percentile.
90th percentile:
At least 90% of the data lie at or below the 90th
percentile, and at most 10% of the data lie above the
90th percentile.

LQ (Q1) is the 25th percentile
Median (Q2) is the 50th percentile
UQ (Q3) is the 75th percentile
25th percentile = Q1:
At least 25% of the data lie at or below the 25th
percentile, and at most 75% of the data lie above the
25th percentile.

LQ (Q1) and UQ (Q3) are defined as follows:

Step 1. Arrange the values in increasing order.
Step 2. Q1 is the value in position 0.25(n+1);
Q3 is the value in position 0.75(n+1).
Step 3. If the positions are not integers, Q1 and Q3
are found by interpolation, using adjacent
values.
IQR = Q3 - Q1

Example 1: (values are arranged in increasing order)

-5 -2.5 0.4 3.7 4.6 7.8 9.2 10.8 12.1 13.5 14

n = 11, 0.25(n+1) = 0.25(12) = 3;
0.75(n+1) = 0.75(12) = 9

Q1 = x(3) = 0.4,
Q3 = x(9) = 12.1,
and IQR = 12.1 - 0.4 = 11.7

Example 2: (values are arranged in increasing order)

-5 -4 2 6 6.5 7.8 9.2 10.8 12.5 14.5 15 16.4

n = 12,
0.25(n+1) = 0.25(13) = 3.25;
0.75(n+1) = 0.75(13) = 9.75

Q1 = x(3) + 0.25(x(4) - x(3)) = 2 + 0.25(6 - 2) = 2 + 0.25(4) = 3

Q3 = x(9) + 0.75(x(10) - x(9)) = 12.5 + 0.75(14.5 - 12.5) = 14

Example 3: (values are arranged in increasing order)

2 5 9 9.8 10.2 10.8 12.5 14 16.4 18.7

n = 10,
0.25(n+1) = 0.25(11) = 2.75;
0.75(n+1) = 0.75(11) = 8.25

Q1 = x(2) + 0.75(x(3) - x(2)) = 5 + 0.75(9 - 5) = 5 + 0.75(4) = 8

Q3 = x(8) + 0.25(x(9) - x(8)) = 14 + 0.25(16.4 - 14) = 14.6
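
The position rule with interpolation can be sketched in Python as follows. The names `value_at` and `quartiles` are hypothetical helpers written for this illustration. They implement the 0.25(n+1) and 0.75(n+1) positions used in these notes, which can give different results from the default quartile methods in statistical software.

```python
def value_at(sorted_x, pos):
    """Value at a 1-based, possibly fractional, position (linear interpolation)."""
    k = int(pos)        # position of the lower neighbour
    frac = pos - k      # fraction of the way towards the upper neighbour
    lower = sorted_x[k - 1]
    if frac == 0:
        return lower
    return lower + frac * (sorted_x[k] - lower)


def quartiles(observations):
    """Return (Q1, Q3, IQR) using positions 0.25(n+1) and 0.75(n+1)."""
    x = sorted(observations)
    n = len(x)
    if n < 3:
        raise ValueError("the (n+1) position rule needs at least 3 observations")
    q1 = value_at(x, 0.25 * (n + 1))
    q3 = value_at(x, 0.75 * (n + 1))
    return q1, q3, q3 - q1


# Example 1 (n = 11): positions 3 and 9 are integers, so no interpolation.
ex1 = [-5, -2.5, 0.4, 3.7, 4.6, 7.8, 9.2, 10.8, 12.1, 13.5, 14]
# Example 2 (n = 12): positions 3.25 and 9.75 require interpolation.
ex2 = [-5, -4, 2, 6, 6.5, 7.8, 9.2, 10.8, 12.5, 14.5, 15, 16.4]

for data in (ex1, ex2):
    q1, q3, iqr = quartiles(data)
    print(f"Q1 = {q1:.4g}, Q3 = {q3:.4g}, IQR = {iqr:.4g}")
```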

Example 2
The cold start ignition times of an automobile
engine, obtained for a test vehicle, are as follows:
1.75 1.92 2.62 2.35 3.09 3.15 2.53 1.91
a) Calculate the sample median, the quartiles and the IQR
b) Construct a box plot of the data.
Solution:
Rank the n = 8 measurements from smallest to largest:
1.75 1.91 1.92 2.35 2.53 2.62 3.09 3.15

Sample median: since n is even,

Median = (1/2)(x(n/2) + x(n/2+1)) = (1/2)(x(4) + x(5)) = (1/2)(2.35 + 2.53) = 2.44

Lower quartile:

Q1 = x(0.25(n+1)) = x(0.25(8+1)) = x(2.25)

Q1 = x(2) + 0.25(x(3) - x(2)) = 1.91 + 0.25(1.92 - 1.91) = 1.9125

Upper quartile:

Q3 = x(0.75(n+1)) = x(0.75(8+1)) = x(6.75)

Q3 = x(6) + 0.75(x(7) - x(6)) = 2.62 + 0.75(3.09 - 2.62) = 2.9725

IQR:

IQR = Q3 - Q1 = 2.9725 - 1.9125 = 1.06

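b) Construct a box plot of the data.

[Figure: box plot of the ignition times, annotated with the values below]
Q3 + 1.5 IQR = 2.9725 + 1.59 = 4.5625
Max: 3.15
Q3 = 2.9725
Q2 = 2.44
Q1 = 1.9125
Min: 1.75
Q1 - 1.5 IQR = 1.9125 - 1.59 = 0.3225
IQR = 1.06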

Example 3
Suppose that the ages of thirty UTP students who live in Village 2 are as follows:
18, 20, 21, 26, 24, 19, 25, 20, 22, 21,
19, 24, 25, 28, 24, 20, 26, 20, 35, 17,
18, 24, 20, 21, 22, 27, 25, 28, 27, 24.
a) Calculate the sample median, the quartiles and the IQR
b) Construct a box plot of the data.

Solution:
Step 1: Place the numbers in order from smallest to largest:
17, 18, 18, 19, 19, 20, 20, 20, 20, 20,
21, 21, 21, 22, 22, 24, 24, 24, 24, 24,
25, 25, 25, 26, 26, 27, 27, 28, 28, 35.
Step 2: Find the median, Q2, the lower quartile, Q1, and the upper quartile, Q3:

The median, Q2 = (X15 + X16)/2 = (22 + 24)/2 = 23

The position of Q1 = 0.25(n+1) = 0.25(31) = 7.75

Q1 = X7 + 0.75(X8 - X7) = 20 + 0.75(20 - 20) = 20

The position of Q3 = 0.75(n+1) = 0.75(31) = 23.25

Q3 = X23 + 0.25(X24 - X23) = 25 + 0.25(26 - 25) = 25.25

Step 3: The IQR = Q3 - Q1 = 25.25 - 20 = 5.25, and 1.5 IQR = 1.5(5.25) = 7.875

Q1 - 1.5 IQR = 20 - 7.875 = 12.125
Q3 + 1.5 IQR = 25.25 + 7.875 = 33.125

Step 4: Draw the box plot:

[Figure: box plot of the ages. The box runs from Q1 = 20 to Q3 = 25.25 (IQR = 5.25) with the median Q2 = 23 marked inside it; the whiskers extend to 17 and 28; the value 35 is plotted separately (o) as an outlier.]

The value 35 lies above Q3 + 1.5 IQR = 33.125, so it is an outlier.
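
The quantities needed to draw a box plot can be gathered in one place with a sketch like the one below. It is an illustration only: `value_at` and `box_plot_summary` are hypothetical names, the quartiles follow the (n+1) position rule of these notes, and values more than 1.5 IQR outside the box are treated as outliers, as in the figure above.

```python
def value_at(sorted_x, pos):
    """Value at a 1-based, possibly fractional, position (linear interpolation)."""
    k = int(pos)
    frac = pos - k
    lower = sorted_x[k - 1]
    return lower if frac == 0 else lower + frac * (sorted_x[k] - lower)


def box_plot_summary(observations):
    """Quartiles, 1.5 IQR limits, whisker ends and outliers for a box plot."""
    x = sorted(observations)
    n = len(x)
    if n < 3:
        raise ValueError("the (n+1) position rule needs at least 3 observations")
    q1 = value_at(x, 0.25 * (n + 1))
    q2 = value_at(x, 0.50 * (n + 1))    # same value as the sample median rule
    q3 = value_at(x, 0.75 * (n + 1))
    iqr = q3 - q1
    lower_limit = q1 - 1.5 * iqr
    upper_limit = q3 + 1.5 * iqr
    inside = [v for v in x if lower_limit <= v <= upper_limit]
    return {
        "q1": q1, "q2": q2, "q3": q3, "iqr": iqr,
        "lower_limit": lower_limit, "upper_limit": upper_limit,
        "whiskers": (inside[0], inside[-1]),   # most extreme non-outlying values
        "outliers": [v for v in x if v < lower_limit or v > upper_limit],
    }


# Ages of the thirty students, in the order given in Example 3.
ages = [18, 20, 21, 26, 24, 19, 25, 20, 22, 21,
        19, 24, 25, 28, 24, 20, 26, 20, 35, 17,
        18, 24, 20, 21, 22, 27, 25, 28, 27, 24]
# Cold start ignition times from Example 2.
times = [1.75, 1.92, 2.62, 2.35, 3.09, 3.15, 2.53, 1.91]

summary = box_plot_summary(ages)
for name, value in summary.items():
    print(f"{name}: {value}")
```

The whiskers stop at the most extreme observations that are still within the 1.5 IQR limits, which is why the upper whisker for the ages ends at 28 rather than at 35.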

QUIZ 1
Consider the following data:
82 79 53 91 87 98 93 80

a) Calculate the sample mean and sample variance


b) Calculate the sample median, the quartiles and the IQR
c) Construct a complete box plot of the data
d) Identify the possible outlier(s) if any
