Professional Documents
Culture Documents
What is Statistics?
• “Statistics is a way to get information from
Statistics data”
Data Information
Inference
Statistic
Parameter
What can we infer about a Population’s Parameters
based on a Sample’s Statistics?
Key Statistical Concepts…
• Population
— a population is the group of all items of interest to a
statistics practitioner.
— frequently very large; sometimes infinite.
E.g. All 5 million voters in the city
• Sample
— A sample is a set of data drawn from the
population.
— Potentially very large, but less than the population.
E.g. a sample of 765 voters exit polled on election day.
Key Statistical Concepts…
• Parameter
— A descriptive measure of a population.
• Statistic
— A descriptive measure of a sample.
Key Statistical Concepts…
Population Sample
Subset
Statistic
Parameter
• Populations have Parameters,
• Samples have Statistics.
1.15
MEASURES OF
CENTRAL TENDENCY
Measures of Central Tendency
• Is a single value that summarizes a set of
data. It is a value where the set of data
tends to center. The measures of central
tendency to be discussed are the mean,
the median and the mode.
Measures of Central Location…
The arithmetic mean, a.k.a. average, shortened
to mean, is the most popular & useful
measure of central location.
It is computed by simply adding up all the
observations and dividing by the total number
of observations:
Sum of the observations
Mean =
Number of observations
Weighted Mean is a special case of the
arithmetic mean. It is computed by
multiplying each value by an appropriate
weight, add these products, and then
divide the result by the sum of the
weights.
Formula in finding the weighted Mean
XW
XW
W
Where
X is the individual scores
W is the weight of each score
Arithmetic Mean…
Sample Mean
Population Mean
Statistics is a pattern language…
Population Sample
Size N n
Mean
MEDIAN
• Is the middlemost value in an ordered array of
data. It is the value of the observation that
divides the data set into two equal parts when
the data are arranged in increasing or decreasing
order.
• Unlike the mean, median is not affected by every
value in the data set especially by the extreme
values. The median is dependent on the position
of the data in a distribution the reason why it is
also called a position measure.
Formula for Ungrouped Data
Median(Md) X m
where Xm is the middle most value of the data
when n is odd
~ X m1 X m2
Median(Md) X
2
where Xm1 and Xm2 are the two middle most
values when n is of the data when n is even
For odd or even number of
observation
Median X n 1
2 2
D2 X 2 N P23 X 23 N 77 P55 X 55 N 45
8
100 100 100 100
10 10
Grouped Data
N
cfb Where:
Q1 Lb 4 i
f Lb is the lower boundary of the Q1,
D3, and P45
3N N is the total frequency
cf b
D3 Lb 10 i <cfb is the less than cumulative
f frequency of the class just before the
Q1, D3 and P45 class
45N
cf b
f is the frequency of the nth quantile
P45 Lb 100 i class
f
i is the class size
Use N if the data represent a population and n for a sample
Sample Interpretation
25% of the data is less than or equal to Q1
50% of the data is less than or equal to Q2,D5,P50
75% of the data is less than or equal to Q3 or P75
10% of the data is less than or equal to D1 or P10
30% of the data is less than or equal to D3 or P30
32% of the data is less than or equal to P32
87% of the data is less than or equal to P87
Example
Your handsome lecturer in Statistics gave a
test to all grade 11 students under
section A. The students finished the test
in 35 minutes. This is the 2.5th decile of
the allotted time. What does this mean?
Answer
• This means that 25% of the learners finished
the test.
Example
• Sir C is a teacher in one science high
school in Metro Manila. His salary is
in the 7th decile. Should the teacher
be glad about his salary or not?
Answer
• 70% of the employees in the said school
receive a salary that is less than or equal
to his salary and 30% of the employees
receive a salary that is greater than his
salary. Then, the teacher should be
pleased with the salary.
Example
• 1. Below are the grades of ten selected students
in a morning class. Determine Q1, D3, and the
80th percentile.
66, 90, 93, 76, 73, 87, 76, 96, 85, 69
• Q1 = ?
• D3 =?
• P80 =?
Example
• Calculate Q1, D7,P65 of the Mathematics test
scores of 50 students.
Class Boundary Frequency Lower Less Than
Boundaries cumulative
frequency
46-50 4 45.5 50
41-45 8 40.5 46
36-40 11 35.5 38
31-35 9 30.5 27
26-30 12 25.5 18
21-25 6 20.5 6
Measures of Dispersion
(Variability)
• Measures the extent to which data are
dispersed or spread out.
Q3 Q1
QD
2
Variance and Standard Deviation
Of a set of data with N observations are special
forms of average deviation from the mean
which is affected by all individual values of the
items in any given distribution. They measure
the average scatter around the mean.
Variance…
population mean
population size
sample mean
Sample Variance