Professional Documents
Culture Documents
Discovery
Content Overview • Technologies used in Data Mining
Introduction • Goals of Data Mining and
Warehouse with a database Knowledge Discovery
What is Data-Warehousing?
Warehousing Functions Compendium
Architecture Of Data Warehouse
What is Data Mining?
Warehousing and Mining
Data Mining as a part of Knowledge
1
2
Bibliography excellence. Information technology (IT) tools
that are oriented towards knowledge processing
can provide the edge that organizations need to
survive and thrive in the current era of fierce
competition. The increasing competitive
Introduction
pressures and the desire to leverage information
“Knowledge [no more Information] is not only
technology techniques have led many
power, but also has significant competitive
organizations to explore the benefits of new
advantage”
emerging technology - "Data Warehousing and
Organizations have lately realized that
Data Mining". What is needed today is not just
just processing transactions and/or information’s
the latest and updated to the nano-second
faster and more efficiently, no longer provide
information, but the cross-functional
them with a competitive advantage vis-à-vis
information that can help decisions making
their competitors for achieving business
activity as "on-line" process.
2
3
One thing that remains constant , especially in that supports the decision-making process and
corporate world , is “ Change” provides businesses the ability to access and
analyze data to increase an organization's
These days, change is occurring at
competitive advantage. Datawarehousing is a
an ever-increasing rate. A key challenge is
process, not an off-the-shelf solution you buy,
implementing an information infrastructure that
but hardware--database and tools integrated into
allows your company to rapidly respond to
an evolving information infrastructure--that
change. One solution to this challenge is the
changes with the dynamics of the business.
datawarehouse. Datawarehousing is an
information infrastructure based on detail data
What is Data-Warehousing?
The data warehouse makes an attempt to figure Data in a warehouse is not updates or
out "what we need" before we know we need it. changed in any way, but is only loaded
What it actually is? and accessed later on
This data is taken from various, perhaps In general a database is not a data
incompatible, sources and stored in a warehouse unless it has the following two
uniform format features:
3
4
• It allows several different
applications to make use of the same
information.
Information Sources always include the The Data Warehouse itself is the bridge
core operational systems, which form the between the operational systems and the
backbone of day-to-day activities. It is decision support tools. It holds a copy of
these systems, which have traditionally much of the operational system data in a
provided management information to logical structure, which is more
support decision-making. conducive to analysis. The Data
Warehouse, which will be refreshed in
Decision Support Tools are used to
scheduled bursts from operational
analyze the information stored in the
systems and from relevant external data
warehouse, typically to identify trends
sources, provides a single, consistent
and new business opportunities.
view of corporate data, leaving
operational systems unaffected.
4
5
Data Warehouse Architecture relational, or multidimensional.
While choosing a DBMS it must be
Each implementation of a data
kept in view that the database
warehouse is different in its detailed design (as
management system should be
shown in figure below), but all are characterised
powerful enough to handle huge
by a handful of the following key components:
amount of data running up to
terabytes.
• A data model to define the
warehouse contents. • A front end for Decision Support
System (DSS) for reporting and for
• A carefully designed warehouse
structured and unstructured analysis.
database, whether hierarchical,
Transformed Data
Data Sources
Extracted
1 Information
Assimilated Information
2 Data
Selected
Warehouse
Data
Data Mining and Data Warehousing transactions. To make data mining more
efficient, the data warehouse should have an
The goal of a data warehouse is to
aggregated or summarized collection of data
support decision making with data. Data mining
.Data mining helps in extracting meaningful
can be used in conjunction with a data
new patterns that cannot be found necessarily by
warehouse to help with certain types of
merely querying or processing data or metadata
decisions. Data mining can be applied to
in the data warehouse.
operational databases with individual
6
7
data mining. The knowledge discovery process Transformation: Create general
comprises five phases: representation and tables of metadata
be relational or multi-dimensional in
nature)
8
9
together with online transaction processing and presentation techniques, like hypertext mark up
data mining, allows the management to provide language (HTML), Open Database Connectivity
better customer service, create greater customer (ODBC) etc. the database mining (Data & Text)
loyalty and activity, focus customer acquisition operation has gained wide spread recognition as
and retention of the most profitable customer, a viable tool for business intelligence gathering.
increase revenue, reduce operating cost; Advances in the document mining technology
provides tools that facilitate sounder decision (database mining of free form text/data, in
making; improves worker/management contrast to the “classical” approach to data
knowledge and productivity; spares the mining of fixed length records) are making the
operational database from ad-hoc queries with data mining technology more powerful. Last but
the resulting performance degradation and never the least, the Internet has emerged as the
clears the legacy database system, while moving largest data warehouse of unstructured and free
the corporate system architecture forward. With form data. The new technologies are geared
the incorporation of new data delivery and towards mining this great data warehouse.
Bibliography