BIF 515
Neeru Redhu
CCS HAU
Data mining: finding hidden information in a database
Also called exploratory data analysis, data-driven discovery, and deductive learning
Predictive: Classification, Regression, Time series analysis, Prediction
Descriptive: Clustering, Summarization, Association rules, Sequence discovery
Predictive Model
o Makes predictions about values of data using known results found from the data
Descriptive Model
o Identifies patterns and relationships in data
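The contrast between the two model types can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `hours` and `scores` values are made-up toy data, `fit_line` is a hypothetical helper, and least-squares regression stands in for predictive modeling in general.

```python
from statistics import mean

# Hypothetical toy data (illustrative only): hours studied -> exam score.
hours = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = [52.0, 54.0, 56.0, 58.0, 60.0]


def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


# Predictive: use known results to estimate a value not yet observed.
slope, intercept = fit_line(hours, scores)
predicted = slope * 6.0 + intercept

# Descriptive: summarize what is already in the data, with no prediction.
summary = {"mean_score": mean(scores), "min": min(scores), "max": max(scores)}
```

Here `predicted` is an estimate for an unseen input (6 hours), while `summary` only describes the observed scores.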
Time                Contribution
Late 1700s          Bayes' theorem of probability
Early 1900s         Regression analysis
Early 1920s         Maximum likelihood estimate
Early 1940s-1950s   Neural networks, nearest neighbor, perceptron, jackknife estimator
1960s               Machine learning (ML) started, decision trees, clustering, relational data model
1970s               SMART IR systems, genetic algorithms, K-means clustering
1980s               Kohonen self-organizing maps
1990s               Association rules, data warehousing, Online Analytical Processing (OLAP)
Data Mining Issues
Human Interaction
Overfitting
Outliers
Interpretation of results
Visualization of results
Large Datasets
High Dimensionality
Multimedia data
Missing data
Irrelevant data
Noisy data
Integration
Application
Implementation issues
Scalability
Real world data
Update
Ease of use
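Two of the issues listed above, missing data and outliers, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `readings` values are invented, median imputation is just one of several possible ways to fill gaps, and the cutoff of 5 times the median absolute deviation (MAD) is an arbitrary choice, not a standard.

```python
from statistics import median

# Hypothetical sensor readings (illustrative only); None marks a missing value.
readings = [10.0, 11.0, None, 9.0, 10.0, 95.0, 11.0]

present = [r for r in readings if r is not None]
med = median(present)

# Missing data: fill each gap with the median of the observed values.
filled = [med if r is None else r for r in readings]

# Outliers: flag values far from the median, measured in units of the
# median absolute deviation (MAD). The factor 5 is an assumed threshold.
mad = median([abs(r - med) for r in present])
outliers = [r for r in filled if abs(r - med) > 5 * mad]
```

The median is used instead of the mean because a single extreme value, such as 95.0 here, would pull the mean strongly while leaving the median almost unchanged.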
END
Questions?