Professional Documents
Culture Documents
\
H[HUFLVH
H
You will learn how to use Clementine in a 'learning by doing' approach. We will use the loan approval data file
discussed during the lecture1. The same file has been used for learning how to work with DataSculptor (data pre-
processing) and SNNS (develop neural nets)2. Now, you will develop the same application but this time within a
self-contained software environment which allows for comparing different tools available.
This handout is based on Clementine documentation. You will be referred to the respective chapters for further
reading and more details. You may also want to refer to the online help system available in Clementine.
Palettes
representing processing
operations
1
see lecture notes Project Life Cycle for ANN Applications
2
see laboratory assignment 1
How to use SNNS
Before you attempt to develop a solution to the loan approval problem you should spend some time to familiarise
with the Clementine data mining system. A good starting point is a tutorial example ([Guide], p. 51-77) . A copy
will be available during the laboratory exercises.
3
DOS-command% kclem +clementine_current
2
How to use SNNS
Data Pre-Processing
In order to compare both approaches (DataSculpor + SNNS vs. Clementine) pre-processing of raw data is done
along the ideas suggested in the DataSculptor Tutorial4.
To save you time a stream for pre-processing the loan data file is available for this lab exercise. Copy all files
necessary into your project.
Have a look at the various nodes and compare them with objects used in DataSculptor to achieve the same
results. To inspect a node right-click it and select Edit… . Selecting Annotate… displays hints regarding
settings used within the node. You may add comments of your own.
Executing a node (right-click and select Execute… ) will start processing all nodes from the beginning
downstream up to the selected node. You may try that by executing the check samples node.
We will only use supervised neural networks to model the loan approval data set. Therefore, a Train Net node
has been added to the pre-processing stream.
4
just as a reminder: You have got a copy of that tutorial during a previous lab exercise.
3
How to use SNNS
Right-clicking this node and selecting Edit… gives access to a menu used to set up various network parameters
(layout, learning rate, control over training process, etc.).
For details refer to [Guide] p. 162f, [Ref] p. 65f
Note the differences to SNNS which provides a quite different approach to setting up and training neural nets. In
Clementine, focus is on modelling whereas in SNNS architectures and learning algorithms can be controlled in
more detail.
4
How to use SNNS
In order to check out a model it is taken from the Generated Models palette and included in the stream. Nodes
downstream a model under investigation can be used to calculate performance indicators.
Executing the prediction x risk node will calculate a misclassification matrix for a
model plugged into the steam.
T
hat concludes our survey of functions Clementine provides to support the whole project life cycle of a
neural network application development. You have seen that Clementine comprises pre-processing,
model building and post-processing alike in a self-contained environment.
Clementine Documentation
Please note that you are using version 6.02 Therefore, changes to the manual should be considered.
[Ref] Clementine Reference Manual Version 5
[Guide] Clementine User Guide Version 5