Professional Documents
Culture Documents
BY
T.V.Nagaraju K.V.Praneeth
( III/IV B. Tech- CSE ) ( III/IV B. Tech- CSE)
1 Extract and load the data: Data extraction involves extracting the data from source
systems and makes it available to the data warehouse where as data load takes
extracted data and loads it into the data warehouse.
Clean and transform data: It performs the consistency checks on the loaded data, and
then structures it for query performance and for minimizing the operational costs.
2 Back up and archive data: The data is being backed up regularly and also older data
is removed from the system in a format that allows it to be quickly restored if required.
3 Query management: It manages the queries and speeds them up by directing queries
to the most effective data source and also monitor the actual query profiles.
Clustering: It is the method by which like records are grouped together. Usually this is
done to give the end user a high level view of what is going on in the database. There are
mainly two types.
Hierarchical and Non-Hierarchical Clustering: The hierarchy of clusters is usually
viewed as a tree where the smallest clusters merge together to create the next highest
level of clusters and so on.
Hierararchy of clusters elongated clusters
2. Next Generations Techniques: They represent techniques such as Trees, Networks
and Rules that have only been widely used since the early 1980’s.
Neural Networks: Neural networks consist of a number of neurons that are
interconnected--often in complex ways--and then organized into layers. Neurons are very
simple processing units that compute a linear combination of a number of inputs and then
perform a simple mathematical process on the result to produce an output.
• Classes: Stored data is used to locate data in predetermined groups. For example,
a restaurant chain could mine customer purchase data to determine when
customers visit and what they typically order. This information could be used to
increase traffic by having daily specials.
• Sequential patterns: Data is mined to anticipate behavior patterns and trends. For
example, an outdoor equipment retailer could predict the likelihood of a hiking
shoes.
BIBILIOGRAPHY
1. www.kdnuggets.com
2. www.ultragem.com
3. info.gte.com/kdd/
4. www.google.com
5. Data Base Management Systems by RaghuRamaKrishnan
6. Data Mining Techniques.