Professional Documents
Culture Documents
Page1
10.2
mx = m
o The standard deviation of the sampling distribution of the sample means is
called the standard error of the mean,
sx =
s
n
Page2
Example 10.1 : Assuming that the mean weight of all the 10,000 students in HELP is
50 kg with a standard deviation of 5 kg.
Samples of 30 students are selected. For each sample the mean(sample mean)
represented by x and the sample variance represented by s2 are determined.
A frequency distribution of the sample mean is drawn and it is known as the
sampling distribution if the sample mean. This sampling distribution of sample
mean must have a mean and a standard deviation. The mean of the sampling
distribution of the mean is written as m x and it must be equal to the population
mean. i.e m x = m = 50kg
The standard deviation of the sampling distribution of the sample mean written
as s x
sx =
10.3
which
is
also
known
as
standard
error
the
mean
and
5
= 0.91kg
30
When all possible samples of a specific size are selected with replacement from a
population, the distribution of the sample means for a variable has two important
properties.
First property of Distribution of the sample means.
The mean of the sample means is equal to the population mean ie
mx = m
Second Property:
The standard deviation of the sample means is called the standard error of the mean,
Where
sx =
10.4
sx
Page3
o The central limit theorem can be used to answer questions about sample
means in the same manner that the normal distribution can be used to answer
questions about individual values. The only difference is that a new formula
must be used for the z-value. It is:
Z=
x - mx
sx
x-m
Eg 10.2: The time required by workers to complete an assembly job has a mean of
50 minutes and a standard deviation of 8 minutes. To spot check the workers
progress on a particular day, their supervisor intends to record the time of 60
workers to complete one assembly job a piece.
i.
What is the probability that a randomly selected worker will take more
than a mean of 58 minutes to assemble the job.
ii
What is the probability that the sample mean will be more than 52 minutes?
iii. What is the probability that the sample mean will be between 49 and 52
minutes?
i.
Required area
x
m =50
z=0
58
z=1
58 - 50
)
8
Page4
ii.
Required area
Let
m x = m = 50
52
Z= 0
1.94
x-m
p( x > 52) = p z >
52 - 50 52 - 50
p z >
= 8 ( 60) = p
8
60
(Z>1.94)
i.
49 - 50
52 - 50
P (49 min . < x < 52 min .) = P
<Z<
8
8
60
60
Example 10.3: Suppose the mean selling price of a gallon of gasoline in the
United States is $1.30. Further, assume the distribution is positively skewed,
with a standard deviation of $0.28.
What is the probability of selecting a sample of 35 gasoline stations with sample
mean selling price of more than $1.25?
o Solution:
1.25 - 1.3
p( x > 1.25) = P Z >
> p( Z > -1.06)
0
.
28
35
= 0.5+0.3554 = 0.8554
72 - 60
) = P ( X > 1.5)
8
70 - 60
55 - 60
P(55 < X < 70) = P
<z<
8
8
= P( - 0.63< z < 1.25)
= 0.3944+0.2357 = 0.6301
iii.
61 - 60
63 - 60
P(61 < x < 64) = P
<z<
8
8
50
50
58 - 60
62 - 60
P(58 < x < 62) = P
<Z<
8
8
50
50
Page5
What is the probability that the time taken to travel to the campus by a randomly
selected student is between 25 minutes and 40 minutes?
What is the probability that the time taken to travel to the campus by a randomly
selected student is between 32 minutes and 38 minutes?
Is random samples of 25 students were selected, find the mean and standard error
of the resultant sampling distribution of the sample mean.
What is the probability that the sample mean of a random sample of 25 students is
more than 31 minutes?
What is the probability that the sample mean of a random sample of 25 students is
between 28 minutes and 31.5 minutes?
Exercise 2 : Assume that the waiting time to check in by airline customers has a mean
time of 12 minutes and a standard deviation of 4 minutes.
a. What is the probability that the waiting time of a randomly selected student
i. is less than 16 minutes?
ii. is between 6 minutes and 14 minutes?
b. What is the probability that the mean weight of a random sample of 50 students
i. is between 11 minutes and 13 minutes?
ii. is more than 11.5 minutes?
7.6
Sampling Distribution of Sample Proportion
Consider samples of size n drawn from a population. For each sample, the proportion of
^
s^ =
p
pq
n
Page6
p- p
pq
n
Eg.10.5 The Laurier brand has a market share of 30%. In a survey, 1000 consumers
were asked which brand they prefer. What is the probability that more than 32% of the
respondents say they prefer Laurier brand?
Solution: Since np(300) and nq(700) are both more than 5, can use Z.
^
( p- p)
P ( p > 0.32) = P ( Z >
)
pq
n
(0.32 - 0.3)
= P(Z >
= P( Z > 1.38) = 1- 0.9162 = 0.0838
(0.3)(0.7)
1000
^
Exercise 3:
Assume that 60% of HELPs students are female. Sample of 100 students are selected.
i.
ii.
iii.
Find the mean and standard error of proportion of the resultant sampling
distribution of sample proportion
What is the probability that sample proportion of female students in a random
sample of 100 students is more than 65%?
What is the probability that the sample proportion is between 0.27 and 0.31?
Exercise 4
Assume that 40 % of a hyper-market customers are not local residents. What is the
probability that in a random sample of 50 students,
i.
ii.
Page7
Page8
x=
12 + 4 + 7 + 14 + 10)
= 9.4
5
o Thus the point estimate for the hourly mean number of defectives,
Use
x as an estimate of m
m , is 9.4.
Page9
Interval Estimates
An interval estimator draws inferences about a population by estimating the value
of an unknown population parameter, using an interval that is likely to include the
value of the population parameter based on a sample.
8.3
Formula
for
Confidence
Interval
estimate
for
the
population
(m ) = X Z a
2 n
Example 11.3 A certain medication is known to increase the pulse rate of its user. The
standard deviation of the pulse rate is known to be 5 beats per minute. A sample of 30
users had on an average pulse rate of 104 beats per minute.
i. Find the 99% confidence interval of the true population mean.
Solution : CI
( m ) = X Za
At 99 % confidence level,
Page10
2 n
Za = Z 0.01 = Z 0.005
2
m = 104 2.58
5
= 104 2.4
30
= 101.6 to 106.4
Hence one can be 99% confident that the mean pulse rate of all users of this medication is
between 101.6 to 106.4 beats per minute, based on a sample of 30 users.
ii. Estimate the 90% confidence interval for the true pulse rate for all the users.
At 90% Confidence level,
Z 0.1 = 1.645
2
m = 104 1.645
5
= 104 1.51 = 102.49 to 105.51
30
Therefore the 90% Confidence interval for the true pulse rate for all the users is 102.49
and 105.51
iii. Estimate the 95% confidence interval for the true pulse rate for all the users.
At 95% Confidence level,
Z 0.05 = 1.96
2
m = 104 1.96
5
= 104 1.79 = 102.21 to 105.79
30
Therefore the 95% Confidence interval for the true pulse rate for all the users is 102.21
and 105.79
Exercise 1
In order to estimate the mean distance travelled daily by students to the HELP campus, a
random sample of 30 students were selected and the mean distance was found to be 20
km. Assuming that the population standard deviation was 2 km.
i.
Find the 90% confidence interval for the mean time travelled daily by all the
students to HELP campus.
Find the 95% confidence interval for the mean time travelled daily by all the
students to HELP campus.
iii.
Find the 99% confidence interval for the mean time travelled daily by all the
students to HELP campus.
iv.
What must be the minimum sample size so that the sampling error at the 95%
confidence level in part (ii) do not exceed 0.5 km?
11.4 Formula for Confidence Interval estimate for the mean( population standard
deviation is unknown)
CI.
(m ) = X ta
, n -1
2
n
Page11
Page12
size. That is, d.f. = n-1. For some statistical tests the degrees of freedom, are not
equal to n-1.
Example 11.4 If the mean of 5 values is 10, then 4 of the 5 values are free to vary.
But once 4 values are selected, the fifth value must be a specific number to get a
sum of 50, since 50/5 = 10. Hence the degrees of freedom is 5-1=4 and this value
tells the researcher which curve to use.
Example 11.5:: Ten randomly selected automobiles were stopped, and the tread
depth of the right front tire was measured. The mean was 0.32 inch, and the
standard deviation was 0.08 inch. Find the 95% confidence interval of the mean
depth. Assume that the variable is approximately normally distributed.
o Solution:
m = x ta
=
, n -1
s
0.08
= 0.32 t 0.05 ,9
2
n
10
0.32 2.262
0.08
= 0.32 0.06
10
= 0.26 to 0.38
The 95% confidence interval estimate of the population mean depth is 0.26and
0.38.
Exercises.
Find t 0.1
,
,14
2
t0.1
, 50
t 0. 5
, t 0. 1
,
2
510 gm,
493 gm
488 gm,
498 gm.
508 gm,
Estimate the 90% and 95% confidence intervals for the true mean weight of all
the chocolate bars. State any assumption made
2
x = 4996 and x = 2496566
x=
x 4996
=
= 499.6 gm
n
10
( x)
x2 -
n
s=
n -1
49962
2496566 10
Page13
= 7.92 gm
m = x ta
=
= 1.833
s
7.92
= 499.6 t0.1 ,9
2
n
10
, n -1
499.6 1.833
7.92
= 499.6 4.6
10
= 495 gm to 504.2 gm
Therefore we are 90% confident that the mean weight of all chocolate bars weigh
between 495 gm and 504.2 gm
At 95% confidence level, t 0.05
,9
2
m = x ta
, n -1
499.6 2.262
= 2.262
s
7.92
= 499.6 t0.01 ,9
2
n
10
7.92
= 499.6 5.57
10
= 494.03 gm to 505.17 gm
Therefore we are 95% confident that the mean weight of all chocolate bars weigh
between 494.03 gm and 505.17 gm
Assumption: We have to assume that the population distribution of weights of
chocolate bars from which the sample was drawn is normally distributed.
Exercise 2
In order to estimate the mean distance travelled daily by students to the HELP campus, a
random sample of 30 students were selected and the mean distance was found to be 20
km and the standard deviation was 2 km.
Page14
Find the 90% confidence interval for the mean time travelled daily by all the
students to HELP campus.Ans : (19.38 km, 20.62 km)
ii.
Find the 95% confidence interval for the mean time travelled daily by all the
students to HELP campus. Ans : (19.25 km, 20.75 km)
ii. Find the 99% confidence interval for the mean time travelled daily by all the
students to HELP campus. Ans: (18.99 km, 21,01 km)
CI(p) =
p Za
pq
n
^
n p5
and
nq 5
Z 0.1
= t 0. 1
,
2
= 1.645 (1.65)
^ ^
CI(p) = p Za
= 0.12
pq
n
1.65
(0.12)(0.88)
= 0.12
500
The 90% C.I. estimate of the true proportion of men is 0.096, 0.144(9.6% ,
14.4%)11.7 Factors of Confidence Interval
The factors that determine the width of a confidence interval are:
1.
Page15
2.
3.
If all other quantities remain unchanged, an increase in the value of the level of
confidence will lead to an increase in table value,Z and it will lead to a wider
interval. If all other quantities remain unchanged, an increase in sample size will
lead to a narrow interval.
11.8 Sample Size needed for an Interval
Estimate of the Population mean.
In the formula for CI of the population mean,
s
Za
2 n
( m ) = X Za
2 n
E
As n becomes larger and larger, sampling error becomes smaller and smaller.
E=
s
Za
2 n
n=
s E
=
n
Za s
2
Therefore
Za s
n= 2
E
Example 11.8: The college president asks the statistics teacher to estimate the
average age of the students at their college. How large a sample is necessary?
The teacher decides the estimate should be accurate within 1 year and be 99%
confident. From a previous study, the standard deviation of the ages is known
to be 3 years.
Page16
Solution:
Since = 0.01 (or 1 - 0.99),
Z0.01 / 2 ( t 0.01
Za s
n= 2
E
) = 2.576(2.58), and E = 1,
(2.58)(3)
=
= 59.9
1
Find the 95% confidence interval for the true mean time that students were late
for lecture classes.
Ans :
0.5
s
= 10 1.96
2 n
20
= 10 minutes 0.22 minutes
(m ) = X Z a
ii. What must be the minimum sample size so that the sampling error in part (i) at
the 95% confidence level do not exceed 0.1 minutes?
Z .s
(1.96)(0.5)
n=
=
= 96.04
0.1
E
Ans :
CI,( p ) =
p Za
pq
n
Page17
^ ^
Za
pq
n
or B(Selvanathan).
^ ^
Za
E=
pq
n
E
=
Za
^ ^
pq
n
^ ^
pq
E
=
Z
n
a
2
^ ^
Za
2
n = pq
E
Solution
Since a
E=0.02,
p = 0.4
^ ^
= 1.96
and
Z 0.05
2
n = pq
E
q = 0.6
2
1.96
= (0.4)(0.6)
= 2304.96
0
.
02
m = x z0.1
= 20 1.645
3
= 20 0.9
30
ii
Estimate the mean number of times of hang-out per month for all the youths at the
95% confidence level.
m = x z0.05
s
2
3
= 20 1.07
30
= 20 1.96
iii.
What must be the minimum sample size so that the sampling error in part(ii) at the
95% confidence level do not exceed 0.5 times?
Z .s 1.96(3)
n
138.29
E 0.5
2
Find the 95% confidence interval for the true mean number of hours that college
students spent surfing the internet per day.
m = x z0.05
s
2
= 3 1.96
0.25
= 3 0.07
50
Page18
What must be the minimum sample size so that the sampling error in part (i) do not
exceed 3 minutes at the 95% confidence level?
Z .s 1.96(0.25)
n
96.04
E 0.05
2
Estimate the 90% confidence interval for the true mean time of eat-out per month
for all Malaysian.
ii. Estimate the 95% confidence interval for the true mean time of eat-out per month
for all Malaysian.
Ex. 6 In order to estimate the mean expenditure per car service , a random sample
cars were selected and their service expenditures are shown below.
RM250, RM220, RM180, RM200, RM210
RM 280, RM230, RM220, RM190, RM220
Also given :
i. Estimate the 90% confidence interval for the true mean expenditure per car
service for all the cars.
ii. Estimate the 95% confidence interval for the true mean expenditure per car
service for all the cars.
iii. Estimate the 99% confidence interval for the true mean expenditure per car
service for all the cars.
Ex.7 A sports scientist would like to estimate the proportion of school children spending
their time on sports in the evening. In a random sample of 100 school children, 40
of them spent their time on sports in the evening.
Page19
Estimate the 95% confidence interval for the true proportion of all school
children spending their time on sports in the evening.
ii. What must be the sample size so that the sampling error in part (i) do not
exceed 0.05 at the 95% confidence level?(Use the above sample information).
Ex 8. A lecturer would like to estimate the proportion of students who do not continue
lecture classes after a break. In a random sample of 50 students in a lecture class, 15
students did not come back after the break.
i. Estimate the 90% confidence interval for the true proportion of students who do
not come back to the tutorial class after the break.
ii. Using the above sample information, what must be the minimum sample size so
that the sampling in part(i) do not exceed 7%?
Chapter Thirteen: Testing of Hypothesis (One Sample Tests)
Statistical inference is concerned with how we draw conclusions from sample data about
the larger population from which the sample is selected.
In the previous chapter we discussed one branch of inference, namely estimation theory.
Another branch of inference is hypotheses testing theory.
Researchers are interested in answering many types of questions. For example, an
educator might wish to see whether a new teaching technique is better than a traditional
one. Automobile manufacturers are interested in determining whether seat belts will
reduce the severity of injuries caused by accidents.
These types of questions can be addressed
through statistical hypothesis testing, which is a decision-making process for evaluating
claims about a population.
In hypothesis testing, the researcher must define the population under study, state the
particular hypotheses that will be investigated, give the significance level, select a sample
Page20
Page21
from the population, collect the data, perform the calculations required for the statistical
test, and reach a conclusion.
13.1
Meaning of Hypothesis
o A Hypothesis is a statement about the value of a population parameter
developed for the purpose of testing.
13.2
Types of Hypotheses
: m = 42000
ii. H o
iii.
against
H1 : m 42000
Null Hypothesis
Alternate Hypothesis
H o : m = $42000
H1 : m $42000
= Indicates:
Equal to
Is exactly the same as
Indicates
Not equal to
Is different from
is $42000 against
m is not equal to
$42000
Is the same as
H o : m $42000
Page22
H1 : m > $42000
$42000 against m is
is no more than
H o : m $42,000
H o : m < $42,000
Indicates
< Indicates
equal to $42000 or it is
Less than
At Least
Has decreased
is more than or
Example 13.2 A company is offering weight reduction program and weight increase
program.
i.
The company claims that the weight will change after taking the program.
Ans: Ho :
H1 :
m = 60kg
ii. The company claims that the mean weight will be less after taking the program.
Ans: Ho :
m = 60kg
, Reject the companys claim that mean weight will be less than
60kg.
H1 :
m < 60kg , Accept the claim that mean weight will be less than 60kg..
iii. The company claims that the mean weight will increase after taking the program
Ans: Ho :
m = 60kg
than 60kg
H1 :
m > 60kg , Accept the claim that mean weight will increase to more than 60kg
Can you accepted the claim that the proportion of smokers has changed from 30%
Page23
0.3
ii. Can you accept the claim that the proportion of smokers has been reduced?
Ans : Ho : p = 0.3, reject the claim that p <0.3
H1 :
iii. Can you accept the claim the proportion of smokers has increased?
Ans : Ho : p = 0.3, reject the claim that p > 0.3
H1 :
13.3
When formulating the null and alternate hypotheses, the nature, or purpose of the test must be
taken into account.
The purpose of the test can guide us towards the appropriate testing procedure.
m =$42000
m > $42000
The test is a right tail since the alternate hypothesis indicated mean is more than $42000. So we
use more than sign in the alternate hypothesis.
Draw a graph of a right tail test at 0.05 level of significance showing the acceptance and
rejection or critical region and the critical value.
Page24
Left-tail Test : A test is one-tailed to the left when the alternate hypothesis,H1, states a left
direction, such as
Ho: The mean income of professors is $42000,
m =$42000
m < $42000
The test is a left tail since the alternate hypothesis indicated mean is less than $42000. So we
use less than sign in the alternate hypothesis.
Draw a graph of a left tail test at 0.05 level of significance showing the acceptance and
rejection or critical region and the critical value.
m =$42000
m $42000
Draw a graph of a two tail test at 0.05 level of significance showing the acceptance and
rejection or critical region and the critical value.
13.3.3 Errors in Hypothesis testing
In the judicial system, guilty persons sometimes go free, and innocent persons are
sometimes convicted.
Page25
denote as H0, that the person before the bar is innocent. This means that there exists
an alternative hypothesis, H1, that the defendant is guilty.
The jury examines the evidence to determine whether the prosecution has
demonstrated that this evidence is inconsistent with the basic hypotheses, H0 of
innocent. If the jurors decide the evidence is inconsistent with H0, they reject that
hypotheses and therefore accept its alternative, H1, that the defendant is guilty.
If we analyse the situation that results when the jury makes its decision, we find
that four possibilities exist. The first two possibilities pertain to the case in which
the basic hypothesis H0 is true, and the second to the case in which the basic
hypotheses H0 is false.
1. The defendant is innocent (H0 is true), and the jury finds that he is innocent (accept H0);
hence the correct decision has been made.
2. The defendant is innocent (H0 is true), and the jury finds him guilty (reject H0); hence
an error has been made.
3. The defendant is guilty (H0 is false), and the jury finds that he is guilty (reject H0);
hence the correct decision has been made.
4. The defendant is guilty (H0 is false), and the jury finds him is innocent (accept H0);
hence an error has been made.
In case (1) and (3), the jury reaches the correct decision; in case (2) and (4), it
makes an error. Let us consider these errors in terms of conventional statistical
terminology.
Type I Error: When we reject a null hypothesis, there is a chance that we made
a mistake i.e. we have rejected a true statement. Rejecting a true null hypothesis
is referred to as a Type I error and the maximum probability of making such an
error is the level of significance represented by
Type II Error: On the other hand we can also make the mistake of failing to
reject a false null hypothesis which we referred as a Type II error and the
maximum probability of making a type II error is represented by
b.
It may be noted that in our legal system a Type I error is considered to be more
serious than type II error as we feel that it is worse to convict an innocent
person than to let a guilty one go free.
Had we made Ho the hypothesis that the defendant is guilty, the meaning of type
I and Type II error would have been reversed.
13.4
13.5
In hypothesis testing, the researcher must define the population under study, state the
particular hypotheses that will be investigated, give the significance level, select a sample
from the population, collect the data, perform the calculations required for the statistical
test, and reach a conclusion.
Below is the summary of the six steps of hypothesis testing.
Step 1: State null and alternate hypotheses.
Page26
Page27
Step 6 : Decision
Do not reject Ho
13.5.1
The null hypothesis asserts that a population is equal to, no more than or no less than
some exact value, and it in evaluated in the face of numerical evidence. An appropriate
alternate hypothesis covers other possible values for the alternate parameter.
13.5.2 : Selecting the test statistic
The test statistic will be either Z or t, corresponding to the normal and the t distribution
respectively.
An important consideration in tests involving a sample mean is whether the population
standard deviation is known
The Z test will be used for hypothesis tests involving a sample mean if the
population standard deviation is known.
The t test will be used for hypothesis tests involving a sample mean if the
population standard deviation is unknown.
The Z test will be used for hypothesis tests involving a sample proportion with the
conditions of both np and nq are more than or equal to 5.
Page28
Z=
x-m
o Example 13.4: It is claimed that the average college student reads less than the
general public. The national average is 29.4 hours per week, with a standard
deviation of 2 hours. A sample of 30 college students has a mean of 27 hours.
Is there enough evidence to support the claim at = 0.01?
o Solution:
Step 1 H0: = 29.4 hrs, Reject the claim that college students read less than national
average
H1: < 29.4 hrs, Accept the claim that college students read less than national
average
Step 2: Test statistic:
Z=
x-m
since s is known
A p-value is the probability, assuming that the null hypothesis is true, of finding
a value of the test statistic at least as extreme as the computed value for the test.
Page29
Example 13.5: The mean marks obtained by the students used to be 45 with a
standard deviation of 4.36 marks. A new system was introduced and it was
claimed that the new system will produce different grade of students. After a few
months of introducing the new system a random sample of 50 students selected
showed that the mean marks obtained was 46.54. Can we accept the claim at 5%
level of significance? Use p-value met
Solution:
H0: = 45 (the claim is not accepted)
H1: 45 (the claim is accepted)
Step 2: Test statistic:
Z=
x-m
Z > 1.96
Page30
Z=
x-m
Pop SD is Known
Z=
( 46.5 - 45)
= 2.498 2.50
4.36
50
Can we accept the claim that the mean age of all customers is more than 23.5 years
at the 5% significance level?
Page31
Can we accept the claim that the mean age of all customers is less than 26.5 years at
the 5% significance level.
x-m
s
n
t=
test, standard six students should average 84.3. If 25 randomly selected standard
six students from a certain city averaged 87.8, with a standard deviation of 8.6,
test the null hypothesis = 84.3 for that city against the alternative hypothesis >
84.3, using = 0.05.
o Solution:
H0: = 84.3, mean has not increased
H1: > 84.3, mean has increased
Step 2: Test statistic:
t=
x-m
s
n
87.8 - 84.3
= 2.04
8.6
25
Example 13.8 A high school counselor believes the mean number of dropouts at her
college is 21 years. She reviews a sample of 17 dropouts and records the ages as shown
below:
12
18
24
16
21
20
18
19
22
25
16
18
19
19
20
23
19
Page32
x = 19.35 , s = 3.16
x=
By calculation :
x 329
=
= 19.35
n
17
3292
6527 17 = 3.16
s=
16
H o : m = 21 Do not refute counselors hypothesis
H1 : m 21 Refute counselors hypothesis
Test Statistics :
t=
x-m
s
n
(19.35 - 21)
= -2.15
3.16
17
= x t0.01
= 19.35
s
3.16
= 19.35 2.92
,16
2 n
17
2.238
= 17.11 to 21.5887
Therefore we are 99% confident that the mean number of dropouts is between 17 to 22
Page33
Page34
Since 21 is within the 99% CI of 17 to 22, do not reject Ho and this confirm our
Can we accept the claim that the mean age of all customers is not 26.5 years at the
5% significance level?
ii.
Can we accept the claim that the mean age of all customers is more than 23.5 years
at the 5% significance level?
iii.
Can we accept the claim that the mean age of all customers is less than 26.5 years at
the 5% significance level.
Exercise 3
An educationist claims that the students performance will be better with private tuition.
A random sample of 10 students with private tuition selected were given a test and the
marks scored are shown below:
40
45
58
Also given:
59
60
x = 650
63
and
70
80
85 90
2
x = 44664
ii. Can we accept the educationists claim that the mean mark of students will be more
than 55 at the 0.05 level of significance?
13.9 Tests Concerning Population Proportion
o A proportion is the fraction or percentage that indicates the part of the
population or sample having a particular trait of interest.
Eg. If 2000 out of 10,000 students in HELP are female, the population
proportion of female students, represented by
p=
2000
= 0.2
10000
Page35
Eg. If 10 out of 25 students in a sample are female, then the sample proportion
^
p=
10
= 0.4
25
p
^
Numberofsuccesse sin thesample x
p=
=
NumberInTheSampled
n
o The sample proportion is denoted by
Z=
p- p
pq
n
where q = 1 p
Sample proportion,
p=
35
= 0.35
100
Step1:
H0: p = 0.4, Reject the claim that less than 40% of students are hardworking
H1: p < 0.4, Accept the claim that less than40% of students are hardworking
^
Z=
p- p
Pg 2 of formula sheet
pq
n
Z=
0.35 - 0.4
= -1.02
(0.4)(0.6)
100
p-value = 0.1539
Step 6:Conclusion: -1.02 > -1.645;
Do not reject Ho
Step1:
H0: p = 0.5, Reject the claim that less than 50% of students were late for tutorial
classes
H1: p < 0.5, Accept the claim that less than 50% of students are late for tutorial
classes.
^
Z=
p- p
, Pg 6 of formula sheet
pq
n
Page36
Z=
0.4 - 0.5
= -1.41
(0.5)(0.5)
50
p-value = 0.0793
Step 6:
Conclusion: -1.41 > -1.645;
Do Not Reject H0
There is insufficient evidence to accept the claim that less than 50% of students
were late for tutorial classes at the 5% level of significance.
ii. Will your decision of part(i) change at the 10% level of significance?
Answer: p-value(0.0793) < 0.1, Reject Ho
Therefore decision has changed to accepting the claim that less than 50% students
were late for tutorial classes.
Example 13.11
It is claimed that more than 40% of HELP students do not do homework.
A random sample of 300 HELP students selected showed that 135 do not do home work.
i.
ii.
iii.
ii.
Step 1 : Ho : p =0.4, Reject the claim that >40% of students do not do homework
H1 : p > 0.4 Accept the claim that homework
^
p- p
Z=
pq
n
Z=
p- p
0.45 - 0.4
=
= 1.77
pq
(0.4)(0.6)
n
300
Page37
Page38
iii
Exercise 4
A marketing manager claims that his product X has more than 30% of the market share
In a random sample of 200 consumers 72 indicated they preferred product X.
i.
Can we accept the marketing managers claim at the 5%(0.05) level of significance?
ii.
iii.
Exercise 5
We wish to test the claim that less than 40% of students in HELP will attend tutorial
classes. In a random sample of 100 students, 35 students attended tutorial classes.
i.
ii
iii.
Exercise 6.
It is claimed that more than 35% of students in HELP made use of transportation
provided. To test the claim, a random sample of 250 students was selected. It was
found that 100 of the students in the sample made use of the transportation provided.
i.
ii.