Professional Documents
Culture Documents
WEKA
Waikato Environment for Knowledge Analysis
(WEKA)
Developed by the Department of Computer Science,
University of Waikato, New Zealand
Machine learning/data mining software written in
Java (distributed under the GNU Public License)
Used for research, education, and applications
http://www.cs.waikato.ac.nz/ml/weka/
Weka Interfaces
Explorer
Preprocessing, attribute selection, learning,
visualization
Knowledge Flow
Visual design of KDD process
Experimenter
testing and evaluating machine learning
algorithms
Command-line
Data Formats
Uses flat text files to describe the data
Can work with a wide variety of data files including
its own .arff format and C4.5 file formats
Data can be imported from a file in various formats:
ARFF, CSV, C4.5 etc.
ARFF (Attribute Relation File Format)
@relation person
Explorer:
BayesNet, NaiveBayes
ID3, J48
OneR, Conjunctive Rule
Linear Regression,
RBFNetwork,
Multilayer Perceptron
Lazy
KStar, IBk
Miscellaneous- VFI
Clusterers:
OPTICS
DBScan
SimpleKMeans
Cobweb
Associations:
Apriori
Predictive Apriori
Filtered Associator
Attribute Selection:
Attribute Evaluators
CfsSubsetEval
ClassifierSubsetEval
GainRatioAttributeEval
InfoGainAttributeEval
Search Method
Best First
Exhaustive Search
Genetic Search
Rank Search
Knowledge Flow Interface:
Data-flow inspired interface to WEKA
process data in batches or incrementally
process multiple batches or streams in parallel (each
separate flow executes in its own thread)
chain filters together
visualize performance of incremental classifiers
during processing
Experimenter Interface:
Enables the user to create, run, modify, and analyse
experiments in a more convenient manner
Modes of Operation
Simple
Advanced
Local / Remote Experiments are supported
References:
Witten, I.H. and Frank, E. (2005) Data Mining:
Practical machine learning tools and techniques. 2nd
edition Morgan Kaufmann, San Francisco
Weka Knowledge Flow Tutorial, Mark Hall Peter
Reutemann
http://www.inf.fhdortmund.de/personen/professoren/engels/dm/praktik
um/WEKA-KnowledgeFlowTutorial-3-5-7.pdf
WEKA Manual for Version 3-6-2 - Remco R.
Bouckaert, Eibe Frank et.al, January 11, 2010