Professional Documents
Culture Documents
and
Analysis of Variance (ANOVA)
for
DOE
What is it?
DOE is a systematic approach to investigation of a system or process. A series of
structured tests are designed in which planned changes are made to the input
variables of a process or system. The effects of these changes on a pre-defined
output are then assessed.
The experimental design is the manner in which the researcher randomly assigns
the treatment to the experimental units. We have considered four experimental
designs:completely randomized design, randomized complete block, latin square
and factorial experiments.
DOE
Why is it important?
DOE is important as a formal way of maximizing information gained while
minimizing resources required. It has more to offer than 'one change at a time'
experimental methods, because it allows a judgment on the significance to the
output of input variables acting alone, as well input variables acting in
combination with one another.
DOE
ANOVA
Definition
ANOVA stands for analysis of variance, a method by which the
source of variability is identified. This method is widely used in
industry to help identify the source of potential problems in the
production process and identify whether variation in measured
output values is due to variability between various manufacturing
processes, or within them. By varying the factors in a
predetermined pattern and analyzing the output, one can use
statistical techniques to make an accurate assessment as to the
cause of variation in a manufacturing process.
Types of DOE
Completely Randomized Design
Randomized Block Design
Latin Square Design
Factorial Design
80
70
65
90
90
60
50
89
96
55
58
85
85
85
55
95
70
90
40
80
Solution.
The model of this problems is:
Yij = + Tj + eij
where is the overall mean; Yij, the ith observation
under the jth treatment of the factor, training organization;
Tj, the effect of the jth treatment of the factor, training
organization; eij, the random error associated with the ith
observation under the jth treatment of the factor training
organization,
Null hypothesis,
Ho : T1 = T2 = T3 = T4
Yj
A
80
B
70
C
65
D
90
90
96
60
55
50
58
89
85
85
70
421
85
90
360
55
40
268
95
80
439
We have
i 1
j 1
..
Yij 1488
.1
Then
5 4
Y ..2
2
SS total Y ij
20
i 1 j 1
(80) 2 (70) 2 (65) 2 (90) 2 (60) 2 .... (40) 2 (80) 2 (1488)
20
5368.8
4 Y 2j Y 2
..
ss treatments
j 1 5 20
(421) 2 (360) 2 (268) 2 (439) 2 (1488) 2
5
3570
SS error SS total SS treatments
5368.8 3570
1798.8
The results of this problem are summarized Randomized Design (Example4.2)
Degrees of Sum of
freedom squares
Between
treatments
3570.0
Within
treatments
16
1798.8
Total
19
5368.8
Mean sum of
squares (MSS)
F ratio
In Table 4.10, the value of the calculated F ratio is 10.58. The value of F
ratio from the statistical table for a significance level of 0.05 and degrees
of freedom (3,16) is 3.24. Then
Fcalculated (10.58) >Fa = 0.05 and df = (3.14) (3.24)
Team
L1
L2
L3
L4
L5
T1
40
30
55
25
35
T2
45
60
10
22
33
T3
38
55
40
55
28
T4
30
27
32
56
17
T5
45
34
20
34
37
0.05.
Solution. In this problem, 'team leader' is considered to be the
factor and 'team' is considered to be the block. The model of
Team
L1
L2
L3
L4
L5
Yi.
T1
40
30
55
25
35
185
T2
45
60
10
22
33
170
T3
38
55
40
55
28
216
T4
30
27
32
56
17
162
T5
45
34
20
34
37
170
Y.j
198
206
157
192
150
903
For the design, the following calculations are done based on the Table 4.17:
5
5
Y .. Y ij 903
i 1 j 1
Y .1 198, Y .2 206, Y .3 157, Y .4 192and Y .5 150
Y .1 185, Y .2 170, Y .3 216, Y .4 162and Y .5 170
Now ,w e get
2
5
5
2 Y ..
SStotal Y ij
25
i 1 j 1
( 40) 2 (30) 2 (55) 2 ( 25) 2 (35) 2 ( 45) 2 .... (34) 2 (37) 2
4118.64
5 Y .2j Y 2
..
SSteam leader
25
i 1 5
(198) 2 ( 206) 2 (157) 2 (192) 2 (150) 2 (903) 2
5
25
514.24
5 Y2 Y2
i. ..
SSteam
25
i 1 5
(185) 2 (170) 2 ( 216) 2 (162) 2 (170) 2 (903) 2
5
25
368.64
SStotal SStotal SStreaments SSblocks
4118.64 514.24 368.64
3235.76
The results of this example are summarized in Table 4.18.
(903) 2
25
Degrees. of
Sum of
Mean sum of
F ratio
variation
freedom
squares
squares (MSS)
514.24
514.24/4 = 128.56
128.56/202.24 = 0.636
368.64
368.64/4 = 92.16
92.16/202.24 = 0.456
Error
16
3235.76
3235.76/16 = 202.24
Total
24
4118.64
Between
treatments
Between
blocks
Supplier
Machine
S1
S2
S3
S4
S5
M1
M2
M3
M4
M5
Table 4.24 Hourly Production Volume as per Latin Square Design (Example 4.6)
Supplier
Machine
S1
S2
S3
S4
S5
M1
12 (A)
8 (B)
9 (C)
15 (D)
18 (E)
M2
14 (B)
7 (C)
13 (D)
10 (E)
12 (A)
M3
7 (C)
15 (D)
12 (E)
8 (A)
6 (B)
M4
16 (D)
17 (E)
13 (A)
11 (B)
12 (C)
M5
9 (E)
14 (A)
8 (B)
10 (C)
8 (D)
Table 4.25 Hourly Production Volume in Latin Square Design {Example 4.6)
Supplier
Machine
S1
S2
S3
S4
S5
Yi..
M1
12 (A)
8 (B)
9 (C)
15 (D)
18 (E)
62
M2
14 (8)
7 (C)
13 (D)
10 (E)
12 (A)
56
M3
7 (C)
15 (D)
12 (E)
8 (A)
6 (B)
48
M4
16 (D)
17 (E)
13 (A)
11 (B)
12 (C)
69
M5
9 (E)
14 (A)
8 (B)
10 (C)
8 (D)
49
Y..k
58
61
55
54
56
284
Y
Y
Y
...
Y
i
284
ijk
1..
..1
Then, w e get
SS
total
Y
i
2
ijk
25
2
...
Y
5
SS
machine
(62)
i 1
1..
2
...
25
(284)
25
62.96
Y
5
SS
supplier
(58)
6.16
k 1
2
..k
2
...
25
(284)
25
(284)
25
Y.1.
12
15
18
12
14
13
10
15
12
13
11
12
16
17
14
10
59
47
45
67
66
We get
.1.
Then,
5
SSoperator
i 1
Y ...
5
25
. j.
25
85.76
SS
error
Source of
variation
Degrees
of
freedom
Sum of
squares
Mean sum of
squares (MSS)
F ratio
Between
operators
85.76
85.76/4 = 21.44
21.44/9.74 = 2.20
Between
machines
62.96
62.96/4 = 15.74
15.74/9.74 = 1.62
Between suppliers
6.i6
6.16/4 = 1.54
1.54/9.74 = 0.16
12
116.88
24
271.76
Error
Total
116.88/12 = 9.74
In Table 4.27, the value of the calculated F ratio for 'operator' is 2.2, whereas its table value
with the significance level of 0.05 and degrees of freedom (4,12) is 3.26. The calculated F ratio
for 'machine' is 1.62, whereas its table value with the significance level of 0.05 and degrees of
freedom (4,12) is 3.26. The calculated F ratio for 'supplier' is 0.16, whereas its table value with
the significance level of 0.05 and degrees of freedom (4,12) is 3.26.
Component-treatment (operator): For this component,
Fcalculated [2.2] < Fa=0.05 and dj=(4,12) [3.26]
Hence, the null hypothesis, Ho should be accepted.
Inference: This means that there is no significant difference between operators in terms
of hourly production volume.
Component-rows (machines): For this component,
Fcalculated [1.62] < Fa=0.05 and dj=(4,12) [3.26]
Hence, the null hypothesis, (Ho) should be accepted.
Inference: This means that there is no significant difference between machines in terms
of hourly production volume.
Component-columns (supplier): For this component,
Fcalculated [0.16] < Fa = 0.05 and 4f= (4,12) [3.26]
Hence, the null hypothesis, (Ho) should be accepted.
Inference: This means there is no significant difference between suppliers in terms
of hourly production volume.
Solution.
(a) The ANOVA results has already been discussed earlier (refer to
Table 4.10).
(b) Since the null hypothesis is rejected, for further interpretation on the
difference between different treatment means, Duncan's multiple
range test is performed as follows:
Step-1:
53.6
72.0
84.2
87.8
.1
.2
.3
Step-2
.4
SE
Y
.j
MSS
n
error
112.43
4.74
5
Step-3: Obtain 3 (Le. a-I) significant ranges from the Duncan's table of
multiple ranges for j = 4 and df = 16 and with a = 0.05 as shown
in Table 4.28.
Table 4.28 Significant Ranges
j
Significant range
3.00
3.15
3.23
LSR
Actual difference
between means
Least significant
range
3 and 4
34.2
15.31
Significant
2 and 4
15.8
14.93
Significant
1 and 4
3.6
14.22
Not significant
3 and 1
30.6
14.93
Significant
2 and 1
12.2
14.22
Not significant
3 and 2
18.4
14.22
Significant
Remark
FACTORIAL DESIGN
If the number of factors are more than one, then there is a
need for a generalized design of experiment. Such design
is called as factorial design. Analysis of many of the real
life situations will require this type of design.
The concept of factorial design is demonstrated with a
problem involving as in Example 4.8.
Example 4.8 The sales manager of Krishna Engineering
Company wants to study effect of sales regions as well as
salesmen on quarterly sales revenue. So, he has designed
a factorial experiment involving two factors as shown in
Table 4.31, with two replications for each experimental
combination:
East
65
30
50
20
70
35
65
40
30
60
20
50
35
55
25
40
West
A1
B1
B2
B3
B4
65
30
50
20
70
35
65
40
30
60
20
50
35
55
25
40
200
180
160
150
375
Y.j.
315
690
This problem has two factors, namely sales region (A) and salesman (B).
The number of treatments for the factor A is 2 (A1 and A2) and that of the
factor B is 4 (B1, B2, B3 and B4). The number of replications for each of the
eight experimental combinations (A1B1, A1B2, A1B3, A1B4, A2B1, A2B2, A2B3
and A2B4) is 2.
Ho: A1 = A2 = A3 =... = Aa
Alternate hypothesis,
Ho: B1 = B2 = B3 =... = Bb
Alternate hypothesis,
factor B.
Hypothesis with respect to interaction component, AB:
Null hypothesis,
Alternate hypothesis,
This is,
SStotal SSA SSB SSAB SSerror
for theexample problem,
SStotal SSregion SSsalesman SSregion x salesman SSerror
The generalized shortcutformulas to compute the sum of squares of different
componentsof the model are as follows :
2
a b
n
Y ...
2
SStotal Y ijk
N
i 1 j 1k 1
a 2
2
Y
SSrows i.. Y ...
N
i 1 bn
b
2
Y2
SScolumns .i. Y ...
j 1 an N
a b Y2
2
SSsubtotals ij. Y ...
i 1 j 1 n
N
SSinteraction SSsubtotals SSrows SScolumns
SSerror SStotal SSrows SScolumns SSinteraction
where Y... is the sum of Yij overall values of i, j and k; Y.j ., the sum of Yij k overall values of i
and k for a given j; Yi.., the sum of Yij k overall vaues of j and k for a given i; Y..k,
the sum of Yij k overall values of i and j for a given value of k; and N is the total numner of
observations in the experiments (abn 2 x 4 x 2 16).
YY ...
Y ijk 690
i 1 j 1k 1
Y1.. 375, Y1.. 375,
Y.1. 200, Y.1. 180, Y.3. 160, and Y.4. 150,
Then,
2
4
2
2
2
SStotal Y ijk Y ...
i 1 j 1k 1
i 1 16
(6902
2
2
2
2
2
2
2
2
(65) (30) (50) (20) (70) (35) ....(25) ( 40)
16
4093.75
2
2 Y2
i.. Y ...
SSA
16
i 1 8
(375) 2 (315) 2 (690) 2
8
16
225
4 Y2
2
SSB . j. Y ...
16
i 1 4
(200) 2 (180) 2 (160) 2 (150) 2 (690) 2
4
16
368.75
Yi
A 1B 2
A 1B 3
A 1B 4
A 2B 1
A 2B 2
A 2B 3
A 2B 4
65
30
50
20
30
60
20
50
70
35
65
40
35
55
25
40
115
60
65
115
45
90
135
65
Then, we get
2
2 4 Y ij
Y ..2
SSsubtotals
i 1 j 1 2 16
(690) 2
(135) 2 (65) 2 (115) 2 (60) 2 (65) 2 (115) 2 (45) 2 (90) 2
2
16
3668.75
SSAB SSsubtotals SSA SSB
3668.75 225 368.75
3075
SSerror SStotals SSA SSB SSAB
4093.75 225 368.75 3075
425
The generatlised results and results applied to the example problem of
this design are summarized in Table 4.34 and Table 4.35, respectively.
Degrees
of
freedom
Sum of
squares
a-1
SSrows
b-1
SScolumns
Between
Rows x
(a-1)(b-1)
SSrows x columns
Error
ab(n -1)
SSerror
Total
abn-1
SStotal
columns
Mean sum of
squares
(MSS)
F ratio
SSrows
MSSrows
a-1
MSSerror
SScolumns
MSScolumns
b -1
MSSerror
SSrows x columns
MSSrows x columns
(a -1)(b - 1)
M SSerror
SSerror
ab(n-1)
Degrees of
freedom
Sum of
squares
Mean sum of
squares (MSS)
F ratio
SSA
225.00
225.00/1=225.00
225.00/53.13 =4.24
SSB
368.75
368.75/3=122.92
122.92/53.13 =2.31
1025.00/53.13 =
SSAB
3075.00
SSerror
425.00
Total
15
4093.75
3075/3 = 1025.00
19.29
425/8 = 53.13
In Table 4.35, the value of the calculated F ratio for the factor A is 4.24,
whereas its table value with the significance level of 0.05 and degrees of
freedom (I, 8) is 5.32. The calculated f ratio for the factor B is 2.31,
whereas its table value with the significance level of 0.05 and degrees of
freedom (3, 8) is 4.07. The calculated f, ratio for the interaction AB is
19.29, whereas its table value with the significance level of 0.05 and
degrees of freedom (3, 8) is 4.07.
Component-factor A: For this component,
Fcalculated[4.24] < Fa=O.O5 and df=(1,8) [5.32]
Hence, the null hypothesis (Ho) should be accepted.
Inference: This means that there is no significant difference in terms
of quarterly sales between different sales regions.
Component-factor B: For this component,
Fcalculated[4.24] < Fa=O.O5 and df=(1,8) [5.32]
Hence, the null hypothesis (Ho) should be accepted.
Example 4.9 The quality control department of a fabric finishing plant is studying
the effect of several factors on the dyeing of cotton-synthetic cloth used to
manufacture men's shirts. Two operators, three cycle times and two
temperatures were selected and two small specimens of the cloth were
dyed under each combination of experimental conditions. The finished cloth
was compared to a standard and a numerical score was assigned. The results
are shown in Table 4.36. Analyze the data and draw conclusions at the
significance level of 0.05.
Table 4.36 Numerical Scores of Specimens of Cloth
Temperature
2500
3000
Operator-1
Operator-2
Operator-1
Operator-2
23
27
24
38
25
26
28
35
36
34
37
34
36
39
35
36
28
35
26
36
27
34
25
34
40
Cycle time
50
60
Solution : The data in Table 4.36 is presented in Table 4.37 along with row totals and ; column
totals. This problem has three factors: cycle time (A), temperature (B) and operator (C).
Let a be he number of treatments for the factor A (a=3); b, the number of treatments 1 for the
factor B, (b=2); c be the number of treatments for the factor C (c=2); and n be the number of
replications under each experimental combination (n = 2).
Table 4.37 Numerical Scores of Specimens of Cloth
Temperature (B)
2500
3000
1
2
1
2
40
50
60
23
27
24
38
25
26
28
35
36
34
37
34
36
39
35
36
28
35
26
36
27
34
25
34
370
388
Y.j..
Y..k.
350
408
Yi
226
287
245
That is,
SS total SS A SS B SS AB SS C SS AC SS BC SS ABC SS error
The generalized shortcutformulas to compute the sum
of squares of different componentsof the model are as follows :
a
2
SS totali1 j1k1 l1Y ijkl
a Y2
i ...
SS A i1 bcn
SS
2
Y ....
N
2
Y ....
N
b Y .2j .. Y 2
....
B
N
j 1 acn
2
2
b Y ij
.. Y ....
cn
N
SS AB(subtotals) i1 j1
c Y2
Y2
.. k . ....
C
abn
N
k 1
2
c Y2
i . k . Y ....
bn
N
SS AC(subtotals) i1k1
SS BC(subtotals) j1k1
2
2
c Y ijk
. Y ....
N
SS ABCsubtotals i1 j1k1 an
SS ABC SS ABC(subtotals) SS A SS B SS AB SS C SS AC SS BC
SS error SS total SS A SS B SS AB SS C SS AC SS BC SS ABC
Where N is the total number of obervations in the experiments (abcn 3x2x2x2 24)
For theexample under consideration, w e have
3
2
2
2
Y ....
Y ijkl 758
i 1 j 1 k 1l 1
Y 1... 226, Y 2... 287, Y 3... 245,
Y .1.. 370, Y .2.. 388
Y ..1. 350, Y ..2. 408
Therefore,
2
3
2
2
2
2
SStotal
Y ....
Y ijkl
24
i 1 j 1 k 1l 1
2
23 27 24 38 25 26 ....... 25 34
609.834
2
3 Y2
i... Y ....
SS A
24
i 1 8
2262 2872 2452
8
243.584
2
3 Y2
Y ....
.
i
..
SS B
24
j 1 12
12
24
13.5
2
2 Y2
..k . Y ....
SS C
24
k 1 12
3502 4082 7582
12
24
140.166
758
24
758
24
23
101
25
36
50
145
36
60
28
27
124
3000
27
24
26
28
34
37
39
35
35
26
34
25
125
38
35
142
34
36
121
36
34
We also have,
SS
2
2
3 2 Y ij
.. Y ....
AB(subtotals)
24
i 1 j 1 4
2
2
2
2
2
125 1452 142 124 121
758
4
24
317.834
SSAB SSAB(subtotals) SSA SSB
101
23
100
25
Cycle time (A)
50
36
144
36
60
28
27
106
27
24
26
28
34
37
39
35
35
26
34
25
126
38
35
143
34
36
139
36
34
We have,
SS
3 2 Y2
Y2
i .k . ....
AC(subtotals)
24
i 1 j 1 4
100
2
2
2
2
126 (144) 2 143 106 139
4
758
24
464.334
SSAC SSAC(subtotals) SSA SSC
464.334 243.584 140.166
80.584
Computationof SSBC : To compute the sum of squares of the interaction BC,
Table 4.37 is rearranged, as shown in Table 4.40
Table 4.40 Rearranged Data to Compute Sum of Squares of BC
Temperature (B)
2500
Operator(C )
Y.jk.
3000
Operator(C )
23
27
24
38
25
26
28
35
36
34
37
34
36
39
35
36
28
35
26
36
27
34
25
34
175
195
175
213
Then,
3
2
2 Y2
.ik . Y ....
6
24
SS BC(subtotals) i1 j1
175
2
2
195 (175) 2 213
6
758
24
167.166
SSBC SSBC(subtotals) SSB SSC
167.166 13.5 140.166
13.5
Computationof SSABC : To compute the sum of squares of the interaction ABC,
Table 4.37 is rearranged, as shownin Table 4.41
23
48
25
Cycle time (A)
50
36
28
27
53
26
72
36
60
27
34
35
34
24
52
28
73
39
55
Operator-1
37
26
25
38
73
35
72
35
69
Operator-2
34
70
36
51
36
34
70
Then,
SS
2
2
3 2 2 Y ijk
. Y ....
BC(subtota ls)
24
i 1 j 1k 1 2
2
2
2 73 2 72 2 73 2 .... 51 2 70 2
48
53
(
52
)
2
574.834
SSABC SSABC(subtotals) SSA SSB SSC SSAB SSAC SSBC
574.834 243.584 13.5 140.166 60.75 80.584 13.5
22.75
SSerror SStotal (SSA SSB SSAB SSC SSAC SSBC SSABC
609.834 243.584 13.5 60.75 140.166 80.584 13.5 22.75
35
758
24
65
70
20
40
30
35
50
40
Operator (A)
Solution. Let, the factor, operator be A and the another factor machine
be B. here, all the factors have been assumed as fixed factors. Numbers
of levels for each of the factors, n=2. Number of replications in each
treatment combination, k=2. significance level, a=0.05.
The model of this problem is as shown below:
Yijk = + Ai+ Bj + ABij + eijk
The definitions of the components of this model are as already explained
in this section. The data in Table 4.44 are reproduced in table 4.45 with
necessary notations and the totals of different treatment
combinations(1,a,b,ab)
Table 4.45 Data with Necessary Information
Machine (B)
1
2
1
65
70
135
20
40
60
30
35
65
50
40
ab
90
Operator (A)
Then,
Sum of squares of A
(ContrastA) 2
4k
135 65 60 90
(1 a a ab) 2
4 x2
200
Sum of squares of B
(ContrastB) 2
135 65 60 90
4k
(1 a b ab) 2
4 x2
312.5
Sum of squares of AB
(ContrastAB) 2
4k
(1 a b ab) 2 135 65 60 90
4 x2
8
1250
2
2 Y ...
Total sum of squares Y ijk
k 22
(350) 2
17350
8
2037.5
Degrees of
Freedom
Sum of
squares
Mean sum of
squares (MSS)
F Ratio
SSA
200.0
200/1 = 200
2.910
SSB
312.5
312.5/1 = 321.5
4.676
SSAB
1250.0
1250/1 = 1250
18.182
SSerror
275.0
275/4 = 68.75
Total
2037.5