Professional Documents
Culture Documents
Large Sample
Z test for mean
Test of significance for single Mean
n
standard deviation and n is the sample size.
Test of significance for difference of mean
x1 x 2
s 21 s 22
n1 n2
, where x1 is the first sample mean, x2 is the second sample mean, n1 is the
first sample size, n2 is the second sample size, s 21 is the first sample variance and s 22 is the
second sample variance.
Confidence Limits
are called 95% confidence limits for the mean of the population
n
limits for the mean of the population corresponding to the given sample.
pP
PQ
n
p1 p 2
1
1
PQ
n1 n 2
n1 p1 n2 p 2
n1 n2
Small Sample
t -Test of significance for single Mean
where x the sample mean, is is the population mean, s is the sample standard
t
,
s
n 1
deviation and n is the sample size.
If the mean and standard deviation are not given, then the following formulae are used to
calculate
x
n
, s
x x
Degrees of freedom is n - 1
Confidence Limits
Let x be the sample mean and n be the sample size. Let s be the sample standard deviation.
s
n 1
n 1
x1 x 2
1
1
s
n1 n 2
, where x1 is the first sample mean, x2 is the second sample mean, n1 is the
n1 s1 n2 s 2
.
n1 n2 2
2
Degrees of freedom is n1 + n2 - 2
F test
i.e, F
S1
S2
if S1 S 2
2
(OR) F
S2
S1
if S 2 S1
2
If the sample variances s12 and s22 are given, then the following formula can be used to
calculate S12 and S22 :
S12
n1 s12
n2 s 22
2
S
, 2
n2 1
n1 1
If the sample variances s12 and s22 are not given and the set of observations for both
samples are given then the following formula can be used to calculate S12 and S22
2
1
x x
2
2
2
y y
, S
, where n1 is the first sample size, n2 is the second
n1 1
n2 1
sample size, x is the first sample mean and y is the second sample mean.
2 test
2
O E 2
E
Total
a
c
a+c
b
d
b+d
Total a+b c+d a+b+c+d = N
Expected Frequencies
E(a) =
E(b) =
a c a b
N
b d a b
N
E(c) =
a c c d
Total
a+c
N
E(d) =
c d b d
b+d
N
Total
a+b
c+d
a+b+c+d = N
Problems
1. A company manufacturing electric light bulbs claims that the average life of its
bulbs is 1600 hours. The average life and standard deviation of a random sample
of 100 such bulbs were 1570 hours and 120 hours respectively. Test the claim of
the company at 5% level of significance.
Solution:
Null Hypothesis H0 : 1600 . There is no significant difference between sample mean
and population mean
Alternative Hypothesis H1 : 1600 . There is a significant difference between sample
mean and population mean.
The statistic test is Z
n
1570 1600
2.5
120
100
z 2.5
n
1850 1800
Z
3.54
100
50
Calculated value z = 3.54
Tabulated value of z at 5% level of significance for a one tail test is 2.33
Calculated value >Tabulated value, H0 is rejected.
The difference is significant and so we support the claim of the manufacturer.
3. Measurements of the weights of a random sample of 200 ball bearings made by a
certain machine during one week showed a mean of 0.824N and a standard
deviation of 0.042N. Find the 95% and 99% confidence limits for the mean weight
of all the ball bearings.
Solution:
x = 0.824
n = 200,
x 1.96
n
s = 0.042
0.042
0.824 1.96
0.824 0.0058
n
200
n
x 2.58
0.042
0.824 2.58
0.824 0.0077
n
200
n1 400
n2 400
x 250
y 220
s1 40
s2 55
s2 s2
40 2 55 2
s 1 2
3.4
400 400
n1 n2
xy
250 220
Z
8.82
2
2
3.4
s 1 s2
n1 n2
Calculated value z = 8.82
Tabulated value of z at 1% level of significance for a two tailed test is 2.56
Calculated value >Tabulated value, H0 is rejected.
The difference in the weekly food expenditure is significantly different.
5. A random sample of 500 pineapples was taken from a large consignment and 65
were found to be bad. Test whether the proportion of bad ones is not significantly
different from 0.1 at 1% level of significance
Solution:
Null Hypothesis H0: P = 0.1There is no significant difference between sample and
population proportion.
Alternative Hypothesis H1: P 0.1 There is a significant difference between sample and
population proportion.
pP
PQ
n
65
p=
0.13
500
Q= 1 P = 1 0.1= 0.9
P = 0.1
0.13 0.1
(0.1)(0.9)
500
2.238
540
0.54
1000
pP
PQ
n
0.54 0.5
(0.5)(0.5)
1000
2.532
P = 0.5 ,
p=
Q= 1 P = 1 0.5 = 0.5
55
0.55
100
pP
PQ
n
0.55 0.5
(0.5)(0.5)
900
Calculated value z = 3
Tabulated value of z at 5% level of significance for a one tail test is 2.33
Calculated value >Tabulated value, H0 is rejected.
The Democratic candidate is having more chances to win the President Post.
8. In a random sample of 1000 persons from town A, 400 are found to be consumers
of wheat. In a sample of 800 from town B, 400 are found to be consumers of
wheat. Do these data reveal a significant difference between town A and town B
so far as the proportion of wheat consumers is concerned?
Solution:
H0: Two towns do not differ much as far as the proportion of wheat consumption. P1 = P2
H1: P1 P2
The Statistic test is Z
p1 p2
1 1
PQ
n1 n2
400
400
0 .5
p1=
= 0.4
p2=
800
1000
n p n p 1000(0.4) 800(0.5)
P 1 1 2 2
0.444
n1 n2
1000 800
Q= 1 P = 1 0.444 = 0.556
0.4 0.5
1
1
(0.444)(0.556)
1000 800
0.1
4.17
0.024
deviation is 0.003 inch. Test the hypothesis that the machine is in proper working
order, using 5% and 1% level of significance.
Solution:
H0: = 0.050
H1: 0.050 (two tailed test)
s 0.003
0.050
x 0.053 0.050
3.00
The statistic test is t
s
0.003
n 10
x 0.053
n 1
10 1
s 5.7
180
x 169.5 180
3.68
The statistic test is t
s
5 .7
n5
x 169.5
n 1
5 1
11. Ten individuals are chosen at random from a normal population and their heights
are found to be 63,63,66,67,68,69,70,70,71,71 inches. Test the hypothesis that the
mean height is greater than 66 inches at 5% level of significance
Solution:
H0: = 66
H1: > 66 (one tailed test)
x 678 67.8
n
10
63
(x2
63
66
x x
67
68
69
70
70
- 0.8
0.64
0.2
0.04
1.2
2.2
2.2
1.44 4.84 4.84
71
71
Total
3.2
10.24
3.2
10.24 81.6
81.6
8.16
10
s = 2.857
x 67.8 66
1.89
The statistic test is t
s
2.857
n 1
x 94 11.75
n
y 73 10.43
n
x
9
11
13
11
15
9
12
14
94
(x- 2.75
- 0.75
1.25
- 0.75
3.25
- 2.75
0.25
2.25
7.56
0.56
1.56
0.56
10.56
7.56
0.06
5.06
33.48
y
10
12
10
14
9
8
10
(y- 0.43
1.57
- 0.43
3.47
- 1.43
- 2.43
- 0.43
73
x x y y
s = 2.086
The statistic test is t
0.185
2.465
0.185
12.041
2.045
5.905
0.185
23.011
n1 n2 2
33.48 23.011
4.35
872
xy
1
1
s
n1 n2
11.75 10.43
1 1
2.086
8 7
1.22
n1 16
n2 14
s1 10
s2
s2 8
2
s = 9.44
The statistic test
x 107
y 112
n1 s1 n2 s 2
1610 148
2496
89.143
n1 n2 2
16 14 2
28
2
xy
1
1
s
n1 n2
107 112
1
1
9.44
16 14
1.45
Degree of freedom = n1 n2 2 = 16 + 14 2 = 28
At 5% LOS:
Tabulated value of t at 5% level of significance with 28 degree of freedom for a two tail
test is 2.05
Calculated value Tabulated value, H0 is accepted
There is no significant difference in the IQ level of the two groups.
At 1% LOS:
Tabulated value of t at 1% level of significance with 28 degree of freedom is 2.76
Calculated value < Tabulated value, H0 is accepted.
There is no significant difference in the IQ level of the two groups.
14. A random sample of 10 parts from machine A has a sample standard deviation of
0.014 and another sample of 15 parts from machine B has a sample standard
deviation of 0.08. Test the hypothesis that the samples are from a population with
same variance.
Solution:
H0: 12 22
H1: 12 22
n1 10
n2 15
s1 0.014
S12
ns
10 0.014
0.0002
n1 1
10 1
S 22
n 2 s 22 15 0.082
0.006
n2 1
15 1
s2 0.08
2
1 1
S 22
0.006
30
2
0.0002
S1
Calculated value F = 30
Tabulated Value of F at 5% level of significant with (14, 9) degrees of freedom is 3.03
Calculated value Tabulated value, H0 is rejected
There is a significant difference in the variances of two populations.
15. Two random samples drawn from two normal populations are
Sample I
Sample II
20 16
27 33
26 27 23 22 18
42 35 32 34 38
24 25 19
28 41 43 30 37
Obtain the estimates of the variances of the population and test whether the two
populations have the same variance.
Solution:
H0: 12 22
H1: 12 22
x 220 22
y 420 35
x
20
16
26
27
23
22
18
24
25
19
10
12
(x-2
-6
4
5
1
0
-4
2
3
-3
4
36
16
25
1
0
16
4
9
9
y
27
33
42
35
32
34
38
28
41
43
30
37
120
n1 10
S1
x x
n1 1
y y
120
13.33
9
314
28.54
11
S2
64
4
49
0
9
1
9
49
36
64
25
4
314
n2 12
(y-8
-2
7
0
-3
-1
3
-7
6
8
-5
2
n2 1
S 22 28.54
2.14
S12 13.33
Solution:
H0: 12 22
H1: 12 22
n1 8
S1
2
n2 10
2
x x
n1 1
y y
x x
S2
n2 1
84.4
y y
102.6
84.4
12.057
7
102.6
11.4
9
S12 12.057
F 2
1.057
11.4
S2
Calculated value F = 1.057
Tabulated Value of F at 5% level of significance with (7, 9) degrees of freedom is 3.29
Calculated value Tabulated value, H0 is accepted
There is no significant difference between variances.
17. The mean life of a sample of 9 bulbs was observed to be 1309 hrs with standard
deviation 420 hrs. A second sample of 16 bulbs chosen from a different batch
showed a mean life of 1205 hrs with a standard deviation 390 hrs. Test at 5% level
whether both the samples come from the same normal population.
Solution:
Both t-test and F-test has to be done to check whether they have come from the same
population. Fist F-test is done and the followed by t-test.
F-test:
H0: 12 22
H1: 12 22
n1 9
n2 16
S12
s1 420
s2 390
x 1309
y 1205
2
n1 s12
9 420
198450
n1 1
9 1
n2 s 22 16 3902
S
162240
n2 1
16 1
2
2
S12 198450
1.223
S 22 162240
t-test:
H0: 1 2
H1: 1 2
2
2
n1 s1 n2 s 2
9420 16390
4021200
s
174834.7826
n1 n2 2
9 16 2
23
2
s = 418.13
The statistic test
xy
1
1
s
n1 n2
1309 1205
418.13
1 1
9 16
104
0.596
174.22
OE
30
25
18
10
22
15
20
20
20
20
20
20
10
5
-2
-10
2
-5
Calculated
100
25
4
100
4
25
5.00
1.25
0.20
5.00
0.20
1.25
12.90
= 12.90
Degree of freedom = n 1 = 6 1 = 5
Calculated value of
at 5% level of significance with 5 degree of freedom is 11.07
Tabulated value = 11.07
1
Type M 300 75
4
2
Type MN 300 150
4
1
Type N 300 75
4
Observed frequencies:
30
300 90
100
45
Type MN
300 135
100
25
Type N
300 75
100
Type M
OE
Type
observed
Expected
90
75
15
225
MN
135
150
- 15
225
1.5
75
75
Total
Calculated
4.5
= 4.5
Degree of freedom = n 1 = 3 1 = 2
Calculated value of
at 5% level of significance with 2 degree of freedom is 5.99
Tabulated value = 5.99
Calculated value < Tabulated value, H0 is accepted
There is no significant difference between the theoretical ratio and observed ratio.
20. A certain drug was administered to 456 males, out of a total 720 in a certain
locality, to test its efficacy against typhoid. To incidence of typhoid is shown
below. Find out the effectiveness of the drug against the disease. (The table value
of
for 1 degree of freedom at 5% level of significance is 3.84)
Infection No Infection
144
312
192
72
Total
456
264
336
720
384
Solution:
Null Hypothesis H0: The drug is independent.
Alternative Hypothesis H1: The drug is not independent
The expected frequencies are
456
243
213
264
123
141
336
384
OE
144
192
312
72
213
123
243
141
- 69
69
69
- 69
Calculated
4761
4761
4761
4761
720
22.35
38.71
19.59
33.77
114.42
= 114.42
21. A brand Manager is concerned that her brands share may be unevenly distributed
throughout the country. In a survey in which the country was divided into four
geographical regions, a random sampling of 100 consumers in each region was
surveyed, with the following results:
Purchased the
brand
Did not purchase
Using
NE
40
NW
55
60
45
Region
SE
SW
45
50
55
50
TOTAL
190
210
test, find out if the brand is unevenly distributed throughout the country.
Solution:
H0: There is no significant difference between the observed and expected frequencies
H1: There is a significant difference between the observed and expected frequencies
The expected frequencies are :
NE
NW
Purchased
the brand
190 100
47
400
190 100
48
400
Region
SE
190 100
47
400
Did not
purchase
210 100
53
400
210 100
52
400
210 100
53
400
210 100
52
400
210
100
100
100
100
400
OE
40
55
45
50
60
45
55
50
47
48
47
48
53
52
53
52
-7
7
-2
2
7
-7
2
-2
Calculated
49
49
4
4
49
49
4
4
SW
190 100
48
400
TOTAL
190
1.04
1.02
0.085
0.083
0.924
0.942
0.075
0.076
4.245
= 4.245