You are on page 1of 4

Nama : Yulita Ariani

Kelas : 5ELB
NIM : 061540342246

9.8 Summary

In this chapter, we considered knowledge engineering and data mining. First we discussed
what kind of problems can be addressed with intelligent systems and introduced six main
phases of the knowledge engineering process. Then we studied typical applications of
intelligent systems, including diagnosis, classifi-cation, decision support, pattern recognition
and prediction. Finally, we examined an application of decision trees in data mining.

The most important lessons learned in this chapter are:

. Knowledge engineering is the process of building intelligent knowledge-based systems.


There are six main steps: assess the problem; acquire data and knowledge; develop a
prototype system; develop a complete system; evaluate and revise the system; and integrate
and maintain the system.

. Intelligent systems are typically used for diagnosis, selection, prediction, classification,
clustering, optimisation and control. The choice of a tool for building an intelligent system
is influenced by the problem type, availability of data and expertise, and the form and
content of the required solution.

. Understanding the problems domain is critical for building an intelligent system.


Developing a prototype system helps us to test how well we under-stand the problem and
to make sure that the problem-solving strategy, the tool selected for building a system, and
the techniques for representing acquired data and knowledge are adequate to the task.

. Intelligent systems, unlike conventional computer programs, are designed to solve


problems that quite often do not have clearly defined right and wrong solutions.
Therefore, the system is normally evaluated with test cases selected by the user.

. Diagnostic and troubleshooting problems are very attractive candidates for expert systems.
Diagnostic expert systems are easy to develop because most diagnostic problems have a
finite list of possible solutions, involve a limited amount of well-formalised knowledge,
and usually take a human expert a short time to solve.

. Solving real-world classification problems often involves inexact and incom-plete data.
Expert systems are capable of dealing with such data by managing incrementally acquired
evidence as well as information with different degrees of belief.

. Fuzzy systems are well suited for modelling human decision-making. Import-ant decisions
are often based on human intuition, common sense and experience, rather than on the
availability and precision of data. Fuzzy technology provides us with a means of coping
with the soft criteria and fuzzy data. Although decision-support fuzzy systems may
include dozens, even hundreds, of rules, they can be developed, tested and implemented
relatively quickly.
KNOWLEDGE ENGINEERING AND DATA MINING

. Neuralnetworks represent a class of general-purpose tools that are success-fully applied to


prediction, classification and clustering problems. They are used in such areas as speech and
character recognition, medical diagnosis, process control and robotics, identifying radar
targets, predicting foreign exchange rates and detecting fraudulent transactions. The areas of
neural network applications are expanding very rapidly.

. Datamining is the extraction of knowledge from data. It can also be defined as the exploration
and analysis of large quantities of data in order to discover meaningful patterns and rules. The
ultimate goal of data mining is to dis-cover knowledge.

. Although data mining is still largely a new, evolving field, it has already found numerous
applications. In direct marketing, data mining is used for targeting people who are most likely
to buy certain products and services. In trend analysis, it is used to identify trends in the
marketplace by, for example, modelling the stock market. In fraud detection, data mining is
used to identify insurance claims, cellular phone calls and credit card purchases that are most
likely to be fraudulent.

. The most popular tool for data mining is a decision tree a tool that describes a data set by a
tree-like structure. Decision trees are particularly good at solving classification problems. The
main advantage of the decision-tree approach to data mining is that it visualises the solution; it
is easy to follow any path through the tree. The trees ability to produce clear sets of rules
makes it particularly attractive for business professionals.

Questions for review

What is knowledge engineering? Describe the main steps in knowledge engineering.


Why is choosing the right tool for the job the most critical part of building an intelligent
system?

What are the stages in the knowledge acquisition process? Why is knowledge
acquisition often called a bottleneck of the process of knowledge engineering? How
can the acquired data affect our choice of the system building tool?

What is a prototype? What is a test case? How do we test an intelligent system?


What should we do if we have made a bad choice of system-building tool?

Why is adopting new intelligent technologies becoming problem-driven, rather than


curiosity-driven, as it often was in the past?

What makes diagnosis and troubleshooting problems so attractive for expert system
technology? What is a phone call rule?

How do we choose a tool to develop an expert system? What are the advantages of
expert system shells? How do we choose an expert system shell for building an
intelligent system?
Nama : Yulita Ariani
Kelas : 5ELB
NIM : 061540342246

REFERENCES 363

Why are fuzzy systems particularly well suited for modelling human decision-making? Why
does fuzzy technology have great potential in such areas as business and finance?

What is the basis for the popularity of neural networks? What are the most
successful areas of neural network applications? Explain why and give examples.

Why do we need to massage data before using them in a neural network model? How
do we massage the data? Give examples of massaging continuous and discrete data.
What is 1 of N coding?

What is data mining? What is the difference between a query tool and data mining?
What are data mining tools? How is data mining applied in practice? Give examples.

What is a decision tree? What are dependent variables and predictors? What is the
Gini coefficient? How does a decision tree select predictors?

What are advantages and limitations of the decision-tree approach to data mining?
Why are decision trees particularly attractive to business professionals?

References

Adriaans, P. and Zantinge, D. (1996). Data Mining. Addison-Wesley, Harlow. Applegate, D.,
Bixby, R., Chvatal, V. and Cook, W. (2001). TSP cuts which do not
conform to the template paradigm, Computational Combinatorial Optimization, M. Junger and
D. Naddef, eds, Springer-Verlag, Berlin, pp. 261304.
Berry, M. and Linoff, G. (1997). Data Mining Techniques: For Marketing, Sales, and Customer
Support. John Wiley, New York.
Berry, M. and Linoff, G. (2000). Mastering Data Mining. John Wiley, New York. Breiman, L.,
Friedman, J.H., Olshen, R.A. and Stone, C.J. (1984). Classification and
Regression Trees. Wadsworth, Belmont, CA.
Cabena, P., Hadjinian, P., Stadler, R., Verhees, J. and Zanasi, A. (1998). Discovering Data Mining:
From Concept to Implementation. Prentice Hall, Upper Saddle River, NJ.
Dantzig, G., Fulkerson, R. and Johnson, S. (1954). Solution of a large-scale traveling-salesman
problem, Operations Research, 2, 393410.
Davis, L. (1985). Applying adaptive algorithms to epistatic domains, Proceedings of the 9th
International Joint Conference on Artificial Intelligence, A. Joshi, ed., Morgan Kaufmann, Los
Angeles, CA, pp. 162164.
Davis, R. and King, J. (1977). An overview of production systems, Machine Intelligence, 8, 300
322.
Durkin, J. (1994). Expert Systems Design and Development. Prentice Hall, Englewood Cliffs, NJ.

Firebaugh, M. (1988). Artificial Intelligence: A Knowledge-Based Approach. Boyd & Fraser,


Boston.
Fisher, R.A. (1950). Contributions to Mathematical Statistics. John Wiley, New York.
Flood, M.M. (1955). The traveling salesman problem, Operations Research, 4, 6175.
Goldberg, D.E. (1989). Genetic Algorithms in Search Optimization and Machine Learning.
Addison-Wesley, Boston.
Goldberg, D.E. and Lingle, R. (1985). Alleles, loci and the traveling salesman problem, Proceedings
of the 1st International Conference on Genetic Algorithms, J.J. Grefenstette, ed., Lawrence
Erlbaum Associates, Pittsburgh, PA, pp. 154159.
KNOWLEDGE ENGINEERING AND DATA MINING

Groth, R. (1998). Data Mining: A Hands-On Approach for Business Professionals (Data Warehousing Institute Series).
Prentice Hall, Upper Saddle River, NJ.
Gu llich, H.-P. (1996). Fuzzy logic decision support system for credit risk evaluation, EUFIT Fourth European Congress
on Intelligent Techniques and Soft Computing, pp. 22192223.

Haupt, R.L. and Haupt, S.E. (1998). Practical Genetic Algorithms. John Wiley, New York. Haykin, S. (1999). Neural
Networks: A Comprehensive Foundation, 2nd edn. Prentice
Hall, Englewood Cliffs, NJ.
Hopfield, J.J. and Tank, D.W. (1985). Neural computation of decisions in optimiza-tion problems, Biological Cybernetics,
52, 141152.
Kurgan, L.A., Cios, K.J., Tadeusiewicz, R., Ogiela, M. and Goodenday, L. (2001). Knowledge discovery approach to
automated cardiac SPECT diagnosis, Artificial Intelligence in Medicine, 23(2), 149169.
Laarhoven, P.J.M. and Aarts, E.H.L. (1987). Simulated Annealing: The Theory and Application. Kluwer Academic
Publishers, Norwell, MA.
Lawler, E.L., Lenstra, J.K., Rinnooy Kan, A.H.G. and Shmoys, D.B. (1985). The Traveling Salesman Problem. John Wiley,
Chichester.
LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W. and Jackel, L.D. (1990). Handwritten digit
recognition with a back-propagation network, Advances in Neural Information Processing Systems, D.S. Touretzky, ed.,
Morgan Kaufmann, San Mateo, CA, vol. 2, pp. 396404.
Martin, O., Otto, S.W. and Felten, E.W. (1991). Large-step Markov chains for the traveling salesman problem, Complex
Systems, 5(3), 299326.
Michalewicz, Z. (1996). Genetic Algorithms Data Structures Evolutionary Programs, 3rd edn. Springer-Verlag, New
York.
Michie, D. (1982). The state of the art in machine learning, Introductory Readings in Expert Systems, Gordon and Breach,
New York, pp. 209229.
Potvin, J.V. (1996). Genetic algorithms for the traveling salesman problem, Annals of Operations Research, 63, 339370.
Principe, J.C., Euliano, N.R. and Lefebvre, W.C. (2000). Neural and Adaptive Systems: Fundamentals Through
Simulations. John Wiley, New York.
Richards, R. (2002). Application of multiple artificial intelligence techniques for an aircraft carrier landing decision support
tool, Proceedings of the IEEE International Conference on Fuzzy Systems, FUZZ-IEEE02, Honolulu, Hawaii.
Russell, S.J. and Norvig, P. (2002). Artificial Intelligence: A Modern Approach, 2nd edn. Prentice Hall, Englewood Cliffs,
NJ.
Simon, R. (1987). The morning after, Forbes, October 19, pp. 164168.
Tschoke, S., Lubling, R. and Monien, B. (1995). Solving the traveling salesman problem with a distributed branch-and-
bound algorithm on a 1024 processor network, Proceedings of the 9th IEEE International Parallel Processing
Symposium, Santa Barbara, CA, pp. 182189.

Von Altrock, C. (1997). Fuzzy Logic and NeuroFuzzy Applications in Business and Finance. Prentice Hall, Upper Saddle
River, NJ.
Waterman, D.A. (1986). A Guide to Expert Systems. Addison-Wesley, Reading, MA. Widrow, B. and Stearns, S.D. (1985).
Adaptive Signal Processing. Prentice Hall, Engle-
wood Cliffs, NJ.
Zurada, J.M. (1992). Introduction to Artificial Neural Systems. West Publishing Company, St Paul, MN.

You might also like