Article information:
To cite this document:
Sang Jun Lee, Keng Siau (2001), "A review of data mining techniques", Industrial Management & Data Systems, Vol. 101 Iss 1
pp. 41-46
Permanent link to this document:
http://dx.doi.org/10.1108/02635570110365989
References: this document contains references to 17 other documents.
Keywords
Data mining, Artificial intelligence, Algorithms, Decision trees
Abstract
Introduction
The technologies for generating and
collecting data have been advancing rapidly.
At the current stage, lack of data is no longer
a problem; the inability to generate useful
information from data is! The explosive
growth in data and databases has created the
need to develop new technologies and tools to
process data into useful information and
knowledge intelligently and automatically.
Data mining (DM), therefore, has become a
research area with increasing importance
(Weiss and Indurkhya, 1998; Technology
Forecast, 1997; Fayyad et al., 1996; Piatetsky-Shapiro and Frawley, 1991).
DM is the search for valuable information
in large volumes of data (Weiss and
Indurkhya, 1998). It is the process of
nontrivial extraction of implicit, previously
unknown and potentially useful information
such as knowledge rules, constraints, and
regularities from data stored in repositories
using pattern recognition technologies as
well as statistical and mathematical
techniques (Technology Forecast, 1997;
Piatetsky-Shapiro and Frawley, 1991). Many
companies have recognized DM as an
important technique that will have an impact
on their performance.
DM is an active research area and research
is ongoing to bring statistical analysis and
artificial intelligence (AI) techniques
together to address the issues.
Valuable DM results
Classifying DM techniques
Many DM techniques and systems have been
developed and designed. These techniques
can be classified based on the database, the
knowledge to be discovered, and the
techniques to be utilized. In this section, we
review one of the classification schemes
proposed by Chen et al. (1996).
Major DM techniques
In this section, we review and discuss the
major DM techniques.
Statistics
Genetic algorithm
Visualization
Conclusion
Having the right information at the right
time is crucial for making the right decision.
The problem of collecting data, which used to
be a major concern for most organizations, is
almost resolved. In the new millennium,
organizations will be competing in
generating information from data and not in
collecting data. Industry surveys indicated
that over 80 percent of Fortune 500 companies
believed that data mining would be a critical
factor for business success by the year 2000
(Baker and Baker, 1998). Obviously, DM will
be one of the main competitive focuses of
organizations. Although progress is
continuously being made in the DM field,
many issues remain to be resolved and much
research has to be done.
References