Professional Documents
Culture Documents
Definition:
Statistical Hypothesis is an assumption made about some
parameters, i.e. about statistical measure of a population.
It is not every hypothesis is a statistical hypothesis
Non-statistical hypotheses are:
Mars is inhabited by human being
Mr.A is the candidate who would be the best president for the
Republic of Indonesia
Mt.Merapi is guarded by mbah Jugo
Neither of these statements is an assumption about a parameter. We
can change by a slight in wording of a statement to become in the
following statistical hypothesis:
Mr.A is the candidate most favored for the presidency of the
Republic of Indonesia
06_Test hypotheses
06_Test hypotheses
100. 12 . 12 25 5.
z
59.5 50 9.5
1.90,
5
5
y
Therefore, if an honest coin is
tossed 100 times, the probability
of obtaining 60 or more heads is
0.0287 ( = 2,87%), certainly not a
high probability
06_Test hypotheses
60 or more
Heads
P = 0.0287
m = 50
X = 59.5
Z=0
z = 1.90
z
4
By definition:
In testing a statistical hypothesis, the hypothesis is declared to be
true if the calculated probability exceeds a given value , called the
significance level; and it is considered false if the calculated
probability is less than
If the calculated probability is less than , indicating that the
hypothesis is false, the result is termed significant
Convention:
A result significant if the calculated probability is less than = 0.05
Highly significant if the calculated probability is less than = 0.01
06_Test hypotheses
In the above sample, since the result 0.0287 is less than 0.05, the result
is significant (although not highly significant), and consequently on the
basis of 5% level of significance ( = 0.05) we reject the hypothesis that
the coin is honest.
Why did we calculate the probability of obtaining 60 or more heads and
not the probability of obtaining a deviation of 10 or more from the
expected number of heads (i.e. 60 or more or 40 or fewer head)?
The answer is:
There are two probabilities to calculate depends on what we wish to
consider as the alternative to the hypothesis Ho: p = . If we consider
the alternative hypothesis, denoted by H1 to be p > , then the
appropriate probability to calculate is that of obtaining 60 or more
heads, since, by considering p>1/2 as the alternative hypothesis, we
have indicated that we dont wish to reject the hypothesis if fewer than
50 heads are obtained
If, on the other hand, we consider the alternative hypothesis to be p
, then the appropriate probability to calculate is that of obtaining 60 or
more or 40 or fewer heads.
06_Test hypotheses
06_Test hypotheses
y
Reject
hypothesis
Reject
hypothesis
Accept
hypothesis
Z= -1.96
Z= 1.96
y
Reject
hypothesis
Accept
hypothesis
Z= 1.64
06_Test hypotheses
Example:
From the Mendelian theory, we know that crosses of peas should give
yellow and green peas in the ratio 3:1 . In a particular experiment, 179
yellow and 49 green peas are obtained. Is this a significant deviation from
the theory on the basis of the 5 % level of significance?
Solution:
Probability of obtaining yellow peas is p = , total sample 228, then p = .
228 = 171
Hypothesis Ho: p = ; q = ; n = 228 ; m = . 228 = 171
178.5 171
1.15
6.54
06_Test hypotheses
Conclusion
Hypothesis Stated is
True
Sample
indicates
False
Reject
hypothesis
Type 1 error
Correct
conclusion
Accept
hypothesis
Correct
conclusion
Type 2 error
By definition
If it happens that the hypothesis being tested is actually true, and if from the
sample we reach the conclusion that it is false, we say that a type 1 error has
been committed.
If it happens that the hypothesis being tested is actually false, and if from the
sample we reach the conclusion that it is true, we say that a type 2 error has
been committed.
06_Test hypotheses
10
Example:
The mean breaking strength of a certain type of cord has been establish from
considerable experience at 18.3 kg with a standard deviation of 1.2 kg. A new
machine is purchased to manufacture this type of cord. A sample of 100
pieces obtained from the new machine shows a mean breaking strength of
17.0 kg. Would you say that this ample is inferior on the basis of 1 % level of
significance?
Solution:
We will calculate the probability of obtaining from a population with a mean of
18.3 and standard deviation of 1.2, a sample of 100 pieces with a mean of 17.0 or
less. If the probability turn out to be less than 1 %, we reject the hypothesis,
otherwise, we accept it.
Hypothesis:
Ho:
m = 18.3 = 1.2
1 .2
1 .2
0.12 ,
10
100
X = 17.0
n = 100
10.83
0.12
0.13
Since |z| = 10.83 is larger than any value of z listed in statistical table, the
corresponding tail area is much less than 0.01 and, consequently, the result is
highly significant
06_Test hypotheses
11
06_Test hypotheses
12
Solution:
Here we wish to test the statistical hypothesis that the means of two
populations (distance traveled per gallon of gasoline for makes A and B,
denoted by X and Y, respectively) are identical, or Ho : mx = my . The alternative
hypothesis H1 is mx < my
Hypothesis : Ho : mx = my or mx my = 0
Alternative hypothesis
X = 17,
H1: mx < my
X
2.5
,
n1
50
x y x y
2
y
n2
3.0
,
70
2.5 2 3.0 2
50
70
6.25 9.00
50
70
06_Test hypotheses
13
Then:
z
X Y m
my
x y
17.0 18.6 0
0.50
1.60
3.20
0.50
By definition:
The hypothesis being tested is also called the null- hypothesis
06_Test hypotheses
14
11
x
1.10
n 10
06_Test hypotheses
15
Therefore, 95% of the sample means lie between the value of X obtained by solving
for X the two equations.
X m
1.96
1.10
The value of 1.96 is obtained from statistical table, as the value of z
corresponding to an area of (0.95) = 0.475. The solution are :
X 1 m 2.16
and
X 2 m 2.16
However, since in the illustration we are given X and wish to determine m, we must
solve the above equations for m, which gives
and
These values, m1 and m2 constitute the lower and upper 95% confidence limits for the
mean of the population. Frequently they are written more briefly as
m = 40 2.16
06_Test hypotheses
16
X 2.16
X 40
X 2.16
06_Test hypotheses
17
Example:
A sample of 70 variates has a mean of 65 with standard deviation of 4.2. Find
the 98% confidence limits for the mean of the population.
Solution
Here we are given X 65,
Assuming s
s 4.2,
n 70.
we find:
4.2
4.2
0.50
n
70 8.37
X m
2.33
0.50
Therefore:
X m 1.16
and
m X 1.16 65 1.16
Which are the 98% confidence limits for the mean of the population
06_Test hypotheses
18
Solution: From results of the survey sample we will assume that the percentage of
voter in the population favoring candidate A is 43%. We are interested in determining the
size of the sample so that with a probability of 0.95 the percentage in the sample does not
differ by more than 3% from that of the population
Then we have:
p 0.43,
q 0.57,
m 0.43n
0.03n
1.96
0.495 n
or
1.96( 0.495)
3.23
0.03
or
06_Test hypotheses
19
06_Test hypotheses
20
06_Test hypotheses
21