You are on page 1of 11

J Intell Manuf (2011) 22:585–595

DOI 10.1007/s10845-009-0321-7

Classification knowledge discovery in mold tooling test using


decision tree algorithm
Duen-Yian Yeh · Ching-Hsue Cheng ·
Shih-Chuan Hsiao

Received: 28 April 2009 / Accepted: 21 September 2009 / Published online: 7 October 2009
© Springer Science+Business Media, LLC 2009

Abstract The scale of Taiwan’s mold industry was ranked results showed the decision tree algorithm has an excellent
the sixth in the world. But, under the global competitive pres- discriminatory power of classification and is able to provide
sure, Taiwan has lost its competitive advantage gradually. clear and simple reference rules for decisions.
The only chance of Taiwan’s mold industry lies in improv-
ing the competitive abilities in product research, develop- Keywords Mold tooling test · Knowledge discovery ·
ment and design. In mold manufacturing cycle, mold tooling Data mining · Decision tree · Mold industry
test plays a very important role at accelerating the speed of
production. An experienced engineer can minimize the error
rate of mold tooling test according to his rich experiences Introduction
in parameter adjustment. However, this experience is mostly
implicit without theoretical basis and its knowledge is diffi- In Taiwan, the mold industry is known as the “mother of
cult to be transmitted. Benefiting from the well development industry,” which is the foundation of mass production of
of data mining technologies, this study aimed at construct- products. Molds are the best tools for product standardiza-
ing an intelligent classification knowledge discovery system tion, mass production and cost cutting of either simplest com-
for mold tooling test based on decision tree algorithm, so modities or highly precise electronic parts. Figure 1 shows
as to explore and accumulate the experimental knowledge the import and export values statistics of the mold industry
for the use of Taiwan’s mold industry. This study took the in Taiwan from 1998 to 2007. The annual gross values in the
only high-alloy steel manufacturer in Taiwan for case study, recent 4 years were among NT$55–60 billions, which ranked
and performed system validation with 66 record data. The Taiwan the sixth in the world. In detail, the import and export
results showed the accuracy rates of prediction of training values distribution between Taiwan and various countries in
data and testing data are 97.6 and 86.9%, respectively. In 2007 is shown in Table 1. As seen, Mainland China was the
addition, this study explored two classification knowledge largest market to Taiwan, which accounts for 30.99% of the
rules and proposed concrete proposals for tooling test param- export value and 44.29% of the import value (Taiwan Mold
eter adjustment. Moreover, this study provided two ways, and Die Industry Association 2009).
rule verification and effectiveness comparison of four mining In the application market, Taiwan gives priority to molds
algorithms, to conduct model verification. The experimental for electronic communication products. The characteristics
of electronic communication industry include shorter product
D.-Y. Yeh life cycle, shorter time to Market, and more product types but
Department of Information Management, Transworld Institute
of Technology, Yunlin, Taiwan
small quantity. The electronic communication industry needs
e-mail: yeh@tit.edu.tw to improve the design capacities of products and also expe-
dite the development speed of products, in order to respond
C.-H. Cheng (B) · S.-C. Hsiao to severe challenges. In fact, in the cycle of product devel-
Department of Information Management, National Yunlin University
of Science & Technology, 123 University Road, Section 3, Douliou,
opment, the design and manufacturing capacities of molds
Yunlin, Taiwan are key elements. In terms of economy and technology, the
e-mail: chcheng@mis4k.mis.yuntech.edu.tw relative advantages of Taiwan include fast pre-production,

123
586 J Intell Manuf (2011) 22:585–595

may influence the molding quality of products, including at


least five items as follows (Agrawal et al. 1987).
1. High dependence on professional persons: if a product
in the period of mold tooling test has any defect, it is
required to have the engineers, designers and makers to
discuss potential problems together, and then modify the
mold for proper adjustment.
Fig. 1 Statistics of import and export values of the mold industry in 2. High development cost: in general, only one mold is pre-
Taiwan pared. If multiple modifications are required, the unit cost
and time cost of mold development will be high.
3. Likely to be influenced by human factors: since the mold
development should be carried out by designers and tool-
mature technology in highly precise and complex molds, the
ing test engineers, it must be influenced by human factors,
operation model of “Made in Mainland China and Designed
such as personal emotions, duty changes, and centripetal
& Marketing in Taiwan,” and low wage in Mainland China
force (Lu and Khim 2001).
for cost control. While the relative disadvantages are lack
4. Difficult accumulation of experience: the mold tooling
of technical manpower and most of mold firms are small-
test engineer plays a very important role in the course of
sized companies. In addition, the fund resources are likely
development. An experienced mold tooling test engineer
to be limited and downstream manufacturers move produc-
may minimize the error rate after the injection molding
tion areas abroad, especially to the Mainland China. As a
according to his rich mold knowledge and experience in
result, the labor force highly depends on the Mainland China.
the adjustment of tooling test parameters. However, most
Generally speaking, most of Taiwan’s mold manufacturers
of the experiences are implicit knowledge and without
are small-sized, and always have difficulties in raising work-
theoretical basis, and are difficult to transit.
ing capital. If without governmental support and preferential
5. Lack of knowledge rule classification: the accumulated
measures for taxation, mold manufacturers would hard to
data in the existing database is enormous and lacks the
compete in the international market.
application function of reasoning. As a result, it is diffi-
The mold industry is a high customization industry, cus-
cult to obtain information for parameter adjustment, nei-
tomer usually demands for a unique mold according to the
ther to provide proper suggestions for engineers (Huang
specifications of product. Therefore, each mold has its own
and Lin 2008).
process and specific knowledge is required. The delivery
time, quality and cost of each mold would be determined Therefore, how to classify and save mold tooling test engi-
during its design and initial stage of tooling test (Lou et al. neers’ precious knowledge rules, and let relevant knowledge
2004). In the process of mold tooling test, many variables rules develop and grow with time based on mold design and

Table 1 Import and export


distributions between Taiwan Export (unit: NT1000 dollars) Import (unit: NT1000 dollars)
and various countries in 2007
Country Export value Percentage Ranking Country Import value Percentage Ranking

China 5,759,615 30.99 1 China 2,157,622 44.29 1


USA 1,724,547 9.28 2 Japan 1,616,512 33.18 2
Thailand 1,421,660 7.65 3 Korea 294,163 6.04 3
India 1,217,099 6.55 4 USA 237,231 4.87 4
HK 963,414 5.18 5 Holland 97,079 1.99 5
Japan 932,549 5.02 6 Canada 78,243 1.61 6
Indonesia 841,800 4.53 7 Singapore 70,161 1.44 7
Malaysia 832,227 4.48 8 Germany 57,209 1.17 8
Vietnam 761,690 4.10 9 Malaysia 47,704 0.98 9
Mexico 376,963 2.03 10 Thailand 29,443 0.60 10
Others 3,755,087 20.20 Others 185,899 3.82
Total 18,586,651 100 Total 4,871,266 100

123
J Intell Manuf (2011) 22:585–595 587

tooling test experiences, so as to improve the competitive sion tree, back-propagation neural network, combination of
power of Taiwan’s mold industry, would be an urgent, seri- decision tree and neural network and discriminate analysis.
ous and important topic for Taiwan’s mold industry. According to the results, the decision tree algorithm has
With the fast progress of information technology, data an excellent discriminatory power of classification and is
mining was developed and mainly applied to obtain valuable able to provide a clear and simple rule as the reference for
information from enormous database, and conduct in-depth decisions.
analysis of data based on statistics and artificial intelligence
technologies to find out hidden knowledge (Nicholson 2006). Data mining technologies
Many data mining tools have been developed in the mar-
ket, which are convenient for processing enormous data and Data mining can be used to obtain significant and active
have a wide range of application (Fayyad et al. 1996; Chiu et models and rules hidden in mass data by using full auto-
al. 2002), such as semiconductor manufacturing data feature matic or semiautomatic algorithms, and combine relevant
extraction (Li et al. 2006), knowledge mining (Rokach et al. knowledge and experiences as reference for event decisions,
2008), sales forecasting (Chang et al. 2006, 2008), flow time thus, creating profits for enterprises (Wang 2007a; Rokach
prediction (Chang and Liao 2006), etc. As one of data mining and Maimon 2006; Quinlan 1987). As a function of data
algorithms, decision tree induction is a famous method for mining, the classification method classifies and defines the
extracting knowledge rules in classification problems. The attributes (eigenvalues) of objects to set up the class, the
advantage of decision tree algorithm is that the results are skills adopted include decision tree, memory-based reason-
more comprehensive and easy to interpret and have a well ing (Wang 2007a; Chen et al. 1996; Quinlan 1993). This
organized knowledge structure. Its applications, such as on study adopted the decision tree algorithm as the data min-
agricultural land grading (Zhao et al. 2009), machining oper- ing approach according to the characteristics of develop-
ations selection (Deb et al. 2006) and fault pattern analysis ment processes of mold tooling test (Wang et al. 2005). The
(Taetragool and Achalakul 2009), have obtained favorable decision tree algorithms and other relevant technologies are
harvests (Wang 2007b). described as follows.
Considering the successful applications to various fields
(Balic et al. 2006), decision tree algorithm may be applied to
Decision tree algorithm
the knowledge rule classification in mold tooling test devel-
opment, and its favorable assistance should be expected. This
(1) Original technology
study aimed to develop an intelligent classification knowl-
The decision tree algorithm (Quinlan 1986) can be
edge discovery system to conduct parameter research during
regarded as a separation method with goal. Firstly, an
the mold tooling test development and found out valuable
optimal attribute is identified as the root node; all data
knowledge through historical data. Through the feedback of
take this root node as the class condition, classifying
valuable knowledge and accumulated experiences, this sys-
data in the same branch namely having the same par-
tem enables engineers to improve the accuracy of mold tool-
ticularity. Another optimal attribute is identified as the
ing test design, and to shorten the development period and
class condition, and classification is made to form the
reduce material loss.
sub-tree of the decision tree. The same sorting process is
This study adopted the decision tree algorithm, and took
repeated until all data in a branch are of the same type,
the only high-alloy steel manufacturer among the top 500
and the sorting is finished. The keys of the construction
enterprises in Taiwan as the case study, and performed sys-
include:
tem validation on 66 mold tooling test record data of injection
molding machine. The experimental results showed that the a. Spend much more time on learning about data, to
accuracy rates of predictions of training data and test data know the definition and application of each variable.
are 97.6 and 86.9%, respectively. In addition, this system b. Interview domain experts to acquaint domain
has explored two classification knowledge rules at the deci- knowledge of data sources, and identify rule impor-
sion tree learning stage, and proposed concrete proposals tance and the relationship inside variables or that
for the adjustment of mold tooling test parameters. More- between variables.
over, for the sake of closing to actual situation, this study c. Broaden the range of conversion of consecutive numer-
provided two ways, rule verification and effectiveness com- ical target variables to increase the performance of
parison of mining algorithms, to conduct model verifica- decision tree algorithm.
tion. The rule verification was confirmed by the approval d. Add in derived variables and catch important data rel-
of experienced experts. On the validation of the decision evancy to increase the reliability of calculation result.
tree algorithm availability, this study compared the effec- e. Select a proper value for lost data to make it signifi-
tiveness of four classification techniques, which are deci- cant when integrated into a rule.

123
588 J Intell Manuf (2011) 22:585–595

f. Pick out input variables correlated to output variables. (1) Input Layer: signals of network are inputted through it,
the quantity of processing units is determined according
The algorithm family includes classical algorithms such
to problems.
as ID3 (Maher and Clair 1993), C4.5 (Breiman et al.
(2) Hidden Layer: for showing the mutual influence
1984), CART (Kass 1980), CHAID (Quinlan 1993) and
between input processing units, there is no standard
a number of modern systems. Focusing on obtaining the
method for deciding the quantity of processing units.
preliminary pattern of decision tree, this study adopted
Usually, more processing units in the hidden layer
the ID3 algorithm.
indicate slower convergence, but there are smaller error
(2) ID3 algorithm
values. However, when the quantity exceeds a certain
ID3 (Iterative Dichotomiser 3) algorithm is developed
number, it is hardly helpful to reducing errors, only the
from the concept learning system (Maher and Clair 1993).
execution time increases. On the other hand, if the quan-
It shall separate the data into three groups: structure of the
tity of processing units is too small, it is not able to reflect
records, training data and testing data. The calculation
the interaction effect between variables.
process is to use the information gain as the criterion for
(3) Output Layer: it represents the output variables of net-
deciding branches, and select the optimal attribute as the
work, the processing units depend on problems, and the
root node to construct the decision tree (Quinlan 1996).
final signal of network is outputted through it. It uses
(3) Selection of important attribute
non-linear transfer function, while the most used non-
When constructing the decision tree, reducing the dis-
linear transfer function of the Back-propagation Neural
order of newly generated sub-nodes after the test can
Network is Sigmoid Function.
obtain shallow depth of the decision tree; this is the basic
spirit of selecting mold tooling test important attribute.
The methods for selecting important attribute condition Comparison between decision tree and neural network
value can be divided into two methods, one is the intu- algorithms
ition method, and the other one is the information gain
method (Quinlan 1979). The decision tree and neural classification techniques have
their own advantages in the application area of data mining.
a. Intuition method: find out an attribute through experi- Table 2 shows the comparison between their advantages and
ences and intuition to maximize the difference between disadvantages (Kijsirikul and Chongkasemwongse 2001).
tested subsets. This method is applicable to very small
training set only.
Combined application of decision tree and neural network
b. Information gain method: the information gain is
algorithms
defined as “pretest information” reduces “informa-
tion after test”. Smaller difference of information
It is expected to be able to increase the accuracy rate of neural
indicates less disorder. Therefore, the attribute of
network algorithm effectively by taking the classes mined by
minimal difference after test can be selected. This
decision tree algorithm as the input data of neural classifica-
method is mainly applied to the decision tree ID3
tion model (Liu et al. 2004).
algorithm.

Intelligent mold tooling test classification knowledge


Neural network algorithm discovery system

The neural network is a computing system, including soft- System architecture and analytical method
ware and hardware. It uses a lot of simple and connected
artificial neurons to simulate biological neural network and The system architecture is divided into five blocks, which are
export calculation results to the external environment or other interview experts, investigate important process parameters,
artificial neurons (Wong et al. 1997). In a neural network, mine mold process knowledge, design and develop knowl-
many artificial neurons are connected to compose different edge-based system and analyze mining results, as shown in
network models or Network Paradigms. Among which the Fig. 2.
Back-Propagation Neural Network (BPNN) is most widely
applied. The BPNN is a multi-layer feedforward network (1) Interview experts: combine three scholars and seven
architecture, which can be divided into output layer, input mold experts’ opinions about influential evaluation
layer and hidden layer processing units. The functions of all parameters required for constructing the mold tooling
layers are described as follows: test knowledge discovery system.

123
J Intell Manuf (2011) 22:585–595 589

Table 2 Comparisons between


decision tree and neural network Algorithm Decision tree Neural network
algorithms Item

Assumption Independence between the classes None


Advantages 1. Discovering easy-understanding 1. Multiple functions to handle extensive
rules problems
2. Suitable for rule-oriented field 2. Capable to handle complicated field
and produce good connections
3. Easy to classification 3. Easy to treat continuous and category
variables
4. Easy to treat continuous and 4. Easy to obtain software
category variables
Disadvantages 1. So many classes as to induce faults 1. [0, 1]-Input
2. Complex calculation and training 2. Incapable of explaining results
3. Incapable of treating not-right-angle 3. Possible to produce answers with bad
area convergence

Intelligent mold tooling test classification rules for designers and engineers. This system uses the
knowledge discovery system
production data management (PDM) system and takes
Interview experts
Java program language object-oriented as the develop-
ment tool.
Investigate important process parameters (5) Analyze mining results, and make specific feedback of
analytic results.
Decision tree algorithm
Mine mold process knowledge
Neural network algorithm

Design and develop knowledge-based system PDM system Investigating important parameters of mold processes

Analyze mining results Results comparison The attribute parameter is determined based on two features
(Turng et al. 2002). One is the data diversification of mold
processes, the other one is the experiment data consideration:
Fig. 2 Architecture of the intelligent classification knowledge discov-
ery system the injection molding of each circle, from the cylinder tem-
perature to the injection pressure, totally 26 parameters shall
be recorded. The selection of data attribute parameter is per-
formed by using the information gain of ID3 algorithm, and
(2) Investigate important process parameters of mold: then the expert interview using a semiformal questionnaire.
with reference to domestic and foreign literatures and The mold processes contain the following ten important oper-
experts’ opinions through interviews, sort out the evalu- ating parameters: injection temperature, mold temperature,
ation parameters of mold tooling test process. injection speed, injection pressure, injection time, injection
(3) Mine mold process knowledge: the mold tooling test pro- velocity, packing time, packing switch, cooling time and
cess is very complicated and is an instable loop program. ingredients melt index (MI) value.
The decision tree algorithm and the neural network algo-
rithm are selected as the data mining method. These two
Mining mold process knowledge
methods are combined for application and comparison,
to expect a preferable mining outcome.
This section will describe the knowledge discovery steps for
(4) Design and develop knowledge-based system: this sys-
mold processes according to the data mining steps (Wang
tem uses information technology to support the mold
2007a).
process to set up an electronic knowledge base. Its four
primary objectives are experience accumulation, data (1) Stipulate data mining target
sharing, knowledge sharing and computerized operation, It aims to find out the implicit meanings of variable data
and it provides accumulated mold tooling test in the mold processes, conclude the reasons for failure,
experiences, modification experiences, experiences in and propose a directory guideline to adjust the direction
setting of different parameters and implicit knowledge and range of parameter setting, so as to reduce mate-

123
590 J Intell Manuf (2011) 22:585–595

rial loss in the processes and reduce the times of mold Table 6 Classified orders of all nodes on each decision tree
tooling test. Order Parameter name Information gain
(2) Extract attribute parameter
Table 3 shows the ten important parameters of mold pro- 1 Mold temperature 0.43
cesses after extraction using ID3 algorithm and expert 2 Injection pressure 0.26
interview. 2 Injection velocity 0.26
(3) Determine target type 3 Injection temperature 0.24
Six variables for identifying failure of mold processes 4 Packing time 0.21
are defined according to frequent badness of finished 4 Packing switch 0.21
products, as shown in Table 4. 5 Cooling time 0.18
(4) Data normalization 6 Injection speed 0.10
It defines and separates values of parameters for conver- 7 Injection time 0.04
sions. The following three steps are used for the con- 8 Ingredients MI value 0.00
version of data, and the amplitude of change in each
parameter value is set, as shown in Table 5.
a. Set the first odd number (Ex: 00001) as the optimal Table 7 Parameter settings of decision tree
initial parameter, which acts as the basis of amplitude Parameter name Parameter setting
modulation setting.
Data collection period April 2007 to March 2008
b. Take the mean value of multi-segment data, for exam-
Data mining algorithm Decision tree
ple, mold temperature has six data such as N1, T1,
Input data 43 Training data
Maximum severity of root node 100
Minimum record number of root node 3
Table 3 Ten important parameters of mold processes after extraction
Maximum depth of the decision tree 5
Item no. Parameter name Item no. Parameter name
Class label Molding result
Output column 1
1 Injection temperature 6 Injection velocity
Molding result OK or NG
2 Mold temperature 7 Packing time
3 Injection speed 8 Packing switch
4 Injection pressure 9 Cooling time
5 Injection time 10 Ingredients MI value
T2, T3, T4 and T5, sum up N1–T5 and calculate the
mean temperature.
Table 4 Six variables for identifying failure of mold processes c. Eliminate unwanted parameters, import parameters
wanted only.
Item no. Variable name
(5) Set up data mining model
1 Streak
All original data are converted into variables of each
2 Stain
adjustment range through data conversion, and the clas-
3 Burn marks sified order of all nodes on each decision tree is set
4 Edges according to the information gain results of ID3 algo-
5 Unfilled parts rithm, expert opinions, and changes in all variables. The
6 Warped parts order is shown in Table 6.
(6) Result view and verification
Finally, as for the complete decision tree constructed in
Table 5 Amplitude settings of change in each parameter value data mining model, the results are validated and viewed
Category Positive range (%) Category Negative range (%) through the following two steps.

+A 0–5 −A 0 ∼ −5 a. Obtain optimal path: make decision tree pruning to


+B 5–10 −B −5 ∼ −10 procure the optimal path.
+C 10–15 −C −10 ∼ −15 b. Verify effective path: execute pruning function to elim-
+D 15–20 −D −15 ∼ −20 inate all zero ineffective nodes and data of
+E 20–∞ −E −20 ∼ −∞ which the sum of ‘goodness’ and ‘badness’ numbers
is less than ten.

123
J Intell Manuf (2011) 22:585–595 591

Results and data analysis (2) Model formation


a. The preliminary pattern
Case company description
Nine parameters were taken as the input variables of
decision tree, parameter settings are shown in Table 7.
The case company was the only high-alloy steel manufac-
The preliminary result is shown in Fig. 3.
turer among top 500 enterprises in Taiwan. In recent years,
b. The pruned pattern
this company had a steady growth in its sales volume, from
The decision tree was pruned through the informa-
NT$3 billion in 2005 to NT$5 billion in 2007. It had 66 mold
tion gain, so as to be effective anticipation knowledge
tooling test record data of the injection molding machine, and
rules of data mining. The result is shown in Fig. 4.
the data collection time was from April 2007 to March 2008,
11 months in total. As results, two classification rules explored by decision
tree are listed as follows:
Generation of rule
A. Knowledge rule I: if mold temperature = +A (0–
5%), then class label = ‘OK’. The result is: 27 ‘OK’
(1) Data set preparation
and 1 ‘NG’, yield rate = 96.4%.
The ratio of injection molding records was set at 2:1
Content of rule: when the mold temperature is incre-
as the training data and testing data sets. The results
ased by 0–5%, the optimal yield rate can be achieved.
are:
B. Knowledge rule II: if mold temperature = −D
a. Training stage: take 43 injection molding data, (−15 ∼ −20%) and injection speed = +A (0–5%),
including 35 ‘OK’ data and 8 ‘NG’ data, to find out then class label = ‘OK’. The result is: 3 ‘OK’ and 0
the pattern of data. ‘NG’, yield rate = 100%.
b. Testing stage: take 23 injection molding data, Content of rule: when the mold temperature is reduced
including 19 ‘OK’ data and 4 ‘NG’ data, to test the by −15∼−20% and the injection temperature is increased
pattern constructed by the training data. by 0–5%, the optimal yield rate can be achieved.

Ingredients
MI Value

Injection
Temperature

Injection
Packing Time
Time

Cooling
Time

Packing
Time

Injection
Time

Injection
Velocity

Fig. 3 The preliminary pattern of decision tree

123
592 J Intell Manuf (2011) 22:585–595

Mold Temperature

Injection Temperature

Injection Spee
Speedd

Fig. 4 The final decision tree pruned through the information gain

Using the effective rules obtained above, the model of (1) Rule verification
decision tree was to be verified by testing data. The In order to check whether the explored rule was effective
experimental results are presented as follows. The coin- for mold tooling test personnel in adjusting parameters
cidence matrix for predicted classes is shown in of forming injection machine, this study used a set of
Table 8. bad injection parameter data to verify the rules, and the
process is as follows.

A. In training stage, the accuracy rate of prediction is 97.6%, a. Procure an example case of molding defective during
and the error ratio is 2.4%. tooling test injection molding for case of improve-
B. In testing stage, the accuracy rate of prediction is 86.9%, ment suggestion (see Table 9).
and the error ratio is 13.1%. In Table 9, the cause of molding defective is surplus
burr. The mold tooling test personnel would adjust the
parameters of injection molding machine and repeat-
Model verification edly tests according to expert experience and intui-
tion, until defective condition is improved.
For the sake of closing to actual situation, this study provided b. Make adjustment suggestions based on explored
two ways, rule verification and effectiveness comparison of knowledge rules
mining algorithms, to conduct model verification. The fol- Although the molding yield rate could be 96.4% by
lowing shows the details. adjusting the mold temperature, molding defective

123
J Intell Manuf (2011) 22:585–595 593

Table 8 Coincidence matrix for predicted classes


I. Hit ratio (training data) = samples discriminated
Predicted class OK NG Total
correctly /training data sample number
Real class
II. Hit ratio (testing data) = samples discriminated
Training data correctly /testing data sample number
OK 35 0 35
The random sampling was taken ten times. Table 11 is the
NG 1 7 8
overview of the results of classification prediction.
Total 36 7 43
Testing data
OK 18 1 19
(a) Comparison of accuracy rate of training data: the accu-
NG 2 2 4
racy rate of training data of discriminate analysis model
Total 20 3 23
79.8% is lowest, and other three classification models
are at the same level which is 98.4%.
(b) Comparison of accuracy rate of testing data: the pre-
problem could not be completely improved only by diction model of combination of decision tree and neu-
adjusting the mold temperature parameter. According ral network is best, the accuracy rate of testing data
to the explored knowledge rules I and II, this study is 91.4%; secondly is the decision tree 90.5%, and the
suggested the parameter adjustments as: back-propagation neural network 90.0%, finally is the
discriminate analysis 72.6%.
(a) Reduce mold temperature to 58–62.5◦ C
(b) Increase injection speed to 20–21 (cm/s)
As verified by the experienced experts, the above The results of accuracy rate of four mining algorithms for
suggestions are effective for adjusting injection certain degree agrees with those of Lim et al. (1997) and
parameters in the process of mold tooling test. Liu et al. (2004). As for the classification model adopted by
(2) Effectiveness comparison of mining algorithms the practical system in this study, the decision tree is known as
This study compared the effectiveness of four classifi- having excellent classification discriminating ability accord-
cation techniques, which are decision tree, back-propa- ing to the aforesaid comparative result, and it is able to pro-
gation neural network, combination of decision tree and vide clear and simple rules as the guideline for decisions.
neural network and discriminate analysis.
Application of knowledge base system
a. Back-propagation neural network
This study used the back-propagation neural net-
The intelligent mold tooling test classification knowledge
work model of Weka software for construction of
discovery system developed by this study has the following
single classifier based on 43 training data, and eval-
two major functions.
uated the classification effectiveness based on 23
testing data. The settings of parameter values are
shown in Table 10. (1) Provide service platform for mold development knowl-
b. Combination of decision tree and neural network edge
Three nodes were selected from the decision tree In the present stage, this system contains more than 400
experiment as input variables of the neural network, knowledge documents, ten knowledge communities, a
including “mold temperature”, “injection temper- list of more than 40 experts, and two effective knowl-
ature” and “injection speed”. The next processes edge rules explored through “decision tree” technology,
followed the back-propagation neural network. provided for companies of individual cases, two client
c. Discriminate analysis The discriminate analysis is system factories and 120 partners as well as a knowl-
to figure out the function when the attribution group edge service platform for sharing, saving, search and
of group data point is known, which is most likely application.
to discriminate different group data points, and this (2) Provide simulation function
function is called discriminate function. The This system can provide a simulation for mold tooling
attribution group classification of new data points test engineers prior to practical adjustment of injection
can be carried out based on this function. The molding parameters, so as to avoid cost and time wastes
measurement method is as follows: resulted from failures of finished product injection.

123
594 J Intell Manuf (2011) 22:585–595

Table 9 A example case of molding defective


Mold no. Product name Record time Class label Defective cause Injection temperature Mold temperature Injection speed

WCL181501 2007/10/07 NG Burr 187 72.5 20


2005001 Injection time Injection velocity Injection pressure Packing time Packing switch Cooling time Ingredients MI value
1 45 47.5 1.75 6 21 30

Table 10 Parameter settings of back-propagation neural network validation of the system, this study took the only high-alloy
Parameter name Parameter setting steel manufacturer among the top 500 enterprises in Taiwan
as the case study, and classified 43 data of 66 mold tooling
test record data of injection molding machine as training data
Data collection period April 2007 to March 2008 and 23 of which as testing data. The data mining technique
Data mining algorithm Back-propagation neural network adopted was the decision tree algorithm. The experimental
Input data 43 Training data results showed that, the prediction accuracy rate of training
Module Training model data is 97.6%, and the prediction accuracy rate of testing data
Learning rate 0.3 is 86.9%. In addition, two classification rules of decision tree
Momentum factor 0.2 learning method were explored, and concrete proposals for
Maximum learning cycle 500 parameter adjustment were proposed according to these two
Number of neurons in hidden 6 (Processing units in input layer knowledge rules.
layer + processing units in output Moreover, this study provided two ways, rule verifica-
layer)/2
Input column 10
tion and effectiveness comparison of four mining algorithms,
Output column 1
to conduct model verification. The four classification algo-
rithms were decision tree, back-propagation neural network,
Class label Molding results
combination of decision tree, and discriminate analysis. The
Molding results OK or NG
experimental results indicated that, the decision tree algo-
rithm has an excellent classification discriminating ability,
and it can provide clear and simple rules as the guideline for
Table 11 Experimental results of four mining algorithms
decisions.
Data mining algorithm Data type Mean accuracy
rate (%)

Decision tree Training data 98.4


Testing data 90.5
References
Back-propagation neural Training data 98.4
Agrawal, A. R., Pandelidis, I. O., & Pecht, M. (1987). Injection-
network Testing data 90.0 molding process control—A review. Polymer Engineering and
Combination of decision tree Training data 98.4 Science, 27(18), 1345–1357.
and neural network Testing data 91.4 Balic, J., Kovacic, M., & Vaupotic, B. (2006). Intelligent programming
of CNC turning operations using genetic algorithm. Journal of
Discriminate analysis Training data 79.8 Intelligent Manufacturing, 17(3), 331–340.
Testing data 72.6 Breiman, L., Friedman, J., Olshen, R., & Stone, C. (1984). Classifica-
tion and regression trees. Belmont CA: Wadsworth International
Group.
Chang, P.-C., Liu, C.-H., & Lai, R. K. (2008). A fuzzy case-based
reasoning model for sales forecasting in print circuit board
Conclusions industries. Expert Systems with Applications, 34(3), 2049–2058.
Chang, P.-C., Liu, C.-H., & Wang, Y.-W. (2006). A hybrid model by
This study aimed to develop an intelligent mold tooling test clustering and evolving fuzzy rules for sale forecasting in printed
classification knowledge discovery system based on data circuit board industry. Decision Support Systems, 42(3), 1254–
1269.
mining techniques, so as to seek for mold tooling test exper- Chang, P.-C., & Liao, T. W. (2006). Combing SOM and fuzzy rule
imental knowledge which is difficult to transit during mold base for flow time prediction in semiconductor manufacturing
development, so that it can be saved, fed back and accu- factory. Applied Soft Computing, 6(2), 198–206.
mulated to the knowledge base system after converted into Chen, M.-S., Han, J., & Yu, P.-S. (1996). Data mining: An overview
from a database perspective. IEEE Transactions on Knowledge
effective knowledge for adjusting parameters. It is also user- and Data Engineering, 8(6), 866–883.
friendly to mold designers and tooling test engineers, so as to Chiu, C.-C., Lee, T.-S., Chou, Y.-C., & Lu, C.-J. (2002). Data
reduce error cost and time during mold development. In the mining by using the integration of neural network and dis-

123
J Intell Manuf (2011) 22:585–595 595

criminate analysis. Journal of the Chinese Institute of Industrial Quinlan, J. R. (1986). Induction of decision tree. Machine Learn-
Engineers, 19(2), 9–22. ing, 1, 81–106.
Deb, S., Ghosh, K., & Paul, S. (2006). A neural network based meth- Quinlan, J. R. (1987). Simplifying decision trees. International Journal
odology for machining operations selection in computer-aided of Man-Mach Studies, 27, 221–234.
process planning for rotationally symmetrical parts. Journal of Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. San
Intelligent Manufacturing, 17(5), 557–569. Francisco: Morgan- Kaufmann.
Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). The KDD pro- Quinlan, J. R. (1996). Improved use of continuous attributes in
cess for extracting useful knowledge from volumes of data. Com- C4.5. Journal of Artificial Intelligence Research, 4, 77–90.
munications of the ACM, 39, 27–34. Rokach, L., & Maimon, O. (2006). Data Mining for improving the qual-
Huang, M.-S., & Lin, T.-Y. (2008). An innovative regression-model ity of manufacturing: A feature set decomposition approach. Jour-
based searching method for setting the robust injection molding nal of Intelligent Manufacturing, 17(3), 285–299.
parameters. Journal of Materials Processing Technology, 198(1– Rokach, L., Romano, R., & Maimon, O. (2008). Mining manufactur-
3), 436–444. ing databases to discover the effect of operation sequence on the
Kass, G. V. (1980). An exploratory technique for investigating product quality. Journal of Intelligent Manufacturing, 19(3), 313–
large quantities of categorical data. Applied Statistics, 29(2), 325.
119–127. Taetragool, U., & Achalakul, T. (2009). Applying decision tree in
Kijsirikul, B., & Chongkasemwongse, K. (2001). Decision tree prun- fault pattern analysis for HGA manufacturing. International con-
ing using backpropagation neural networks. Proceedings of IEEE ference on complex, intelligent and software intensive systems
international conference on neural networks (Vol. 3, pp. 1876– (pp. 83–89). March 16–19, Fukuoka, Japan.
1880). July, Washington, DC, USA. Taiwan Mold & Die Industry Association. (2009). http://www.tmdia.
Li, T.-S., Huang, C.-L., & Wu, Z.-Y. (2006). Data mining using org.tw/en/export_import_output.asp.
genetic programming for construction of a semiconductor man- Turng, L. S., Peic, M., & Bradley, D. K. (2002). Process simula-
ufacturing yield rate prediction system. Journal of Intelligent tion and optimization for injection molding-experimental ver-
Manufacturing, 17(3), 355–361. ifications and field applications. Journal of Injection Molding
Lim, T.-S., Loh, W.-Y., & Shih, Y.-S. (1997). An empirical com- Technology, 6(2), 143–156.
parison of decision trees and other classification methods. UW Wang, K. (2007a). Applying data mining to manufacturing: The
Madison: TR979, Department of Statistics. nature and implications. Journal of Intelligent Manufactur-
Liu, X., Bowyer, K. W., & Hall, L. O. (2004). Decision trees work ing, 18(4), 487–495.
better than feed-forward back-propagation neural nets for a spe- Wang, Y. (2007b). Combining data mining and game theory in
cific class of problems. IEEE International Conference on Systems manufacturing strategy analysis. Journal of Intelligent Manufac-
Man and Cybernetics, 6, 5969–5974. 10–13 Oct. turing, 18(4), 505–511.
Lou, Z., Jiang, H., & Ruan, X. (2004). Development of an inte- Wang, K.-J., Chen, J., & Lin, Y.-S. (2005). A hybrid decision tree
grated knowledge-based system for mold-base design. Journal and neural network model for knowledge discovery of dispatch-
of Materials Processing Technology, 150, 194–199. ing rules selection in semiconductor testing industry. Production
Lu, X., & Khim, L. S. (2001). A statistical experimental study Planning and Control, 16(7), 665–680.
of the injection molding of optical lenses. Journal of Materials Wong, B.-K., Bodnovich, T. A., & Selvi, Y. (1997). Neural network
Processing Technology, 113, 189–195. applications in business: A review and analysis of the literature
Maher, P. E., & Clair, D. S. (1993). Uncertain reasoning in an ID3 (1988-95). Decision Support Systems, 19, 301–320.
machine learning framework. Proceedings of the 2nd IEEE inter- Zhao, L., Zheng, X., Yan, H., Wang, S., & Zhang, K. (2009). Construc-
national conference on fuzzy systems, FUZZ-IEEE’93, Vol. 1, tion and application of the decision tree model for agricultural land
pp. 7–12. grading based on MATLAB. Second international workshop on
Nicholson, S. (2006). The basis for bibliomining: Frameworks for knowledge discovery and data mining (pp. 155–158). January
bringing together usage-based data mining and bibliometrics 23–25, Moscow.
through data warehousing in digital library services. Information
Processing and Management, 42(3), 785–804.
Quinlan, J. R. (1979). Discovering rules by induction from large
collections of examples. Expert Systems in the Micro-Electronic
Age (pp. 168–201). Edinburgh: Edinburgh University Press.

123

You might also like