Professional Documents
Culture Documents
DESCRIPTIVE STATISTICS
PROBABILITY
INFERENTIAL STATISTICS
FUNDAMENTAL ELEMENTS OF
STATISTICS
Fundamental elements of
statistics
1.Data and its collection
2.Variable
3. Population
4. Sample
1. Data and types of data
https://www.ncsu.edu/labwrite/res/gh/gh-tables.html
Parts of Previous table
• Title
• Table number
• Headings & Subheadings
• Table Body
• Table Spanner
• Dividers
• Table Notes
https://www.ncsu.edu/labwrite/res/gh/gh-tables.html
Types of Tables
•Statistical Tables
•Numerical Tables
GOOD
Practice
Versus
BAD
practice
FIGURES
Figures
Edward R. Tufte
The main themes of a graph
2. Scatter
6 Pie chart plot
Graphs that
can
compare
groups 3. Box
5 Line plot plots
4 Stem
and leaf
plot
1. Bar chart
6 6
5
Frequency
4.3
4 4
3 3
0
Category 1 Category 2 Category 3 Category 4
6. Pie chart
http://www.lofoya.com/Data-Interpretation/Pie-Charts/intro
http://www.lofoya.com/Data-Interpretation/Pie-Charts/intro
Pie chart: Explanation
http://www.lofoya.com/Data-Interpretation/Pie-Charts/intro
Pie chart: In a nutshell
http://www.lofoya.com/Data-Interpretation/Pie-Charts/intro
MEASUREMENT OF CENTRAL
TENDENCY AND DISPERSION
CENTRAL TRENDENCY
Central tendency
8 16 30 10
9 0 16
10 0 0
5
0
4 5 6 7 8 9 10
• If we draw the frequency polygon of the two frequency
distributions, we obtain
Inspection of these frequency polygons shows that they
have exactly the same shape. It is their position relative
to the horizontal axis (X-axis) which distinguishes them.
Mean
Mean
Copyright 2004 David J. Lilja
Geometric Mean
G X 1 X 2 ... X n
n (Where Xi > 0)
45 32 37 46 39 36 41 48 36
9
G X X .... X
f1 f2 fk
n
1 2 k
• Each value of X thus has to be multiplied by itself f
times, and the whole procedure becomes quite a
formidable task!
In terms of logarithms, the formula becomes
f log X
n
f log X
G anti log
n
• Obviously, the above formula is much easier to
handle.
151.37%
• Now, if we utilize this particular value to obtain the
individual turnover figures, we find that:
151.37% of £2,000 = £3,027
151.37% of £3,027 = £4,583
151.37% of £4,583 = £6,937
151.37% of £6,937 = £10,500
• So that the turnover figure of 1962 is exactly the same as
what we had in the original data.
Interpretation
• EXAMPLE:
• Suppose that the marks of eight students in a
particular test are as follows:
• 2, 7, 9, 5, 8, 9, 10, 9
• Obviously, the most common mark is 9.
• In other words,
Mode = 9.
THE MODE IN CASE OF A DISCRETE
FREQUENCY DISTRIBUTION:
It is evident that the third class is the modal class. The mode lies
somewhere between 35.95 and 38.95.
In order to apply the formula for the mode, we note that fm = 14,
f1 = 4 and f2 = 8.
Hence we obtain:
Mode by considering the graphical representation.
For the example of EPA Mileage Ratings, the
histogram was as shown below:
The frequency polygon of the same
distribution
Median
Number of Classes 20
15
10
0
– 9 1 4 19 – 24 – 29 – 34 – 39 – 44 - 49 – 54
4 0 – –
1 15 20 25 30 35 40 44 50
Number of Pupils
EXAMPLES
35 35
30 30
25 25
20 20
15 15
10 10
5 5
0 0
X
X0 Xm
Range
RANGE
• First quartile:
What are l, n, h, f and c?
h n f
Q1 l c
f 4
• Second quartile (i.e. median):
2n
c l n 2 c
h h
Q2 l
f 4 f
25% 25% 25% 25%
• Third quartile: ~ X
• Q1 Q2 = X Q3
h 3n
Q3 l c
f 4 It is clear from the formula of the second quartile that the
second quartile is the same as the median.
Quartiles, Deciles and Percentiles
h n
P1 l c
f 100
• The formulae for the subsequent percentiles are
h 2n h 3n
P2 l c P3 l c
f 100 f 100
• And so on…
A simple example for estimating quartiles
• For example, consider the marks of the 100 students below, which
have been ordered from the lowest to the highest scores.
Order Score Order Score Order Score Order Score Order Score
1st 35 21st 42 41st 53 61st 64 81st 74
2nd 37 22nd 42 42nd 53 62nd 64 82nd 74
3rd 37 23rd 44 43rd 54 63rd 65 83rd 74
4th 38 24th 44 44th 55 64th 66 84th 75
5th 39 25th 45 45th 55 65th 67 85th 75
6th 39 26th 45 46th 56 66th 67 86th 76
7th 39 27th 45 47th 57 67th 67 87th 77
A simple example for estimating quartiles
Order Score Order Score Order Score Order Score Order Score
8th 39 28th 45 48th 57 68th 67 88th 77
9th 39 29th 47 49th 58 69th 68 89th 79
10th 40 30th 48 50th 58 70th 69 90th 80
11th 40 31st 49 51st 59 71st 69 91st 81
12th 40 32nd 49 52nd 60 72nd 69 92nd 81
13th 40 33rd 49 53rd 61 73rd 70 93rd 81
14th 40 34th 49 54th 62 74th 70 94th 81
15th 40 35th 51 55th 62 75th 71 95th 81
16th 41 36th 51 56th 62 76th 71 96th 81
17th 41 37th 51 57th 63 77th 71 97th 83
18th 42 38th 51 58th 63 78th 72 98th 84
19th 42 39th 52 59th 64 79th 74 99th 84
20th 42 40th 52 60th 64 80th 74 100th 85
A simple example for estimating quartiles
• Quartile (Q1 The first) lies between the 25th and 26th student's
marks,
• The second quartile (Q2) between the 50th and 51st student's
marks, and
• The third quartile (Q3) between the 75th and 76th student's
marks.
First quartile (Q1) = (45 + 45) ÷ 2 = 45
Second quartile (Q2) = (58 + 59) ÷ 2 = 58.5
Third quartile (Q3) = (71 + 71) ÷ 2 = 71
Reflection form the example
X
Q1 Q3
Inter-quartile Range
Quartile Deviation
(Semi Inter-quartile Range)
Coefficient of Dispersion (COD)
COD
1
2 Range
Mid Range
Xm X0
2 Xm X0
Xm X0 Xm X0
2
Coefficient of Dispersion (COD)