Professional Documents
Culture Documents
The sound is the best way for a submarine to access the sur- 2. OUTLIER DETECTION
rounding environment to detect and identify possible threats
[ 11. The submarine will use its passive sonar system to ana- Both detection methods developed are based on trigger-
lyze sounds produced by several sources in the ocean. The ing the classifier for non-classified patterns, called outliers,
sonar operator is responsible for the identification of the re- which fall outside the data distribution used in the training
ceived signal and will make use of his ability to isolate and phase. An outlier can be generated due to measurement er-
identify features that will lead to the classification of the rors, noise, or the arriving of a new class of contact. The
interesting source. Automatic methods can be applied to decision on whether the outliers belong to an "unknown"
accelerate the decision task, and to reduce the work of the class or even to a known class but with a different machin-
sonar operator. The use of classifiers based on neural net- ery configuration is the responsibility of the sonar operator.
works seems to be a good choice, due to their ability to deal He will probably use the full information available, includ-
with high dimensional input data spaces, which is typically ing that coming from other sensors (acoustic or not).
the case for passive sonar applications. One important aspect to be considered is how often these
When dealing with the situation where a different noise outliers appear in a given time interval. If the frequency of
arrives on the sensor of the sonar system, the operator will the outliers is low, they might be due to noise or errors in
probably react assigning an "unknown" label to that con- the whole data acquisition and processing chains. On the
tact until a proper identification can be provided. For an other hand, if the frequency is high, the decision for new
automatic identification system, it would be interesting if class detection or for considering outliers as a subclass of a
besides classifying contacts, the system could also provide known class may increase.
a confidence level to the classification. A low confidence Once a given number of samples of such "unknown" class
Partially supported by IPqM, FUJB, FAPERJ (Rio de Janeiro), and has been detected, it is important to update the classifica-
CNPq (Brasilia), Brazil tion system, so that further appearances of patterns from the
111 - 106
Fig. 3. Classification efficiency for the three known classes
Fig. 2. A matrix of histograms for the network output for and the fraction of patterns of the "unknown" class detected
four ship classes. See text. as outliers. as a function of the decision threshold.
responds to the output of the first neuron for the patterns eraged and the fraction of patterns from the fourth class de-
that belong to the first class. In this histogram, the distri- tected as outliers was monitored as a function of the deci-
butions concentrate near +I, as expected. Considering only sion threshold. Tests confirmed that the number of detected
the 3x3 submatrix of histograms (columns 1 to 3, rows 1 outliers, for a given threshold, increases when the number
to 3), similar behavior is observed for active nodes, when of network responses used in the averaging increases too,
data from their corresponding classes feed the neural net- up to the limit imposed by process stationarity condition in
work. The histograms which are not in the main diagonal the wide sense. On the other hand, the classification effi-
represent distributions for inactive nodes with peaks close ciency for the three known classes suffers from small fluctu-
to 0, as expected. Thus, such 3 x 3 submatrix points out the ations, with respect to the single response procedure, when
high quality of the classifier performance for the testing set, averaged response is considered. Figure 4 shows results for
when data are projected onto the four discriminating com- averaging 10 network responses. For a minimum classifi-
ponents and the classifier is responding for classes that did cation efficiency of 90% on known classes, the detection
participate in the training phase. threshold is set to 0.7 and -85% of the patterns from the
The fourth column of histograms in Figure 2 corresponds fourth class are detected as outliers.
to the network response for patterns belonging to the fourth
class (unseen in the training phase). The distributions in 3.2. Clustering Neural Network
these histograms tend to spread along the dynamic range of
the classifier and do not show evidence of an active response The training phase is developed through a loop of two steps.
of any output node. If a decision threshold is defined, so that In the first, a subnetwork having ART topology is trained for
only output values exceeding the threshold can be accepted, each known class. The vigilance radii of neurons belong-
most patterns from the fourth class will be outliers and will ing to a subnetwork are equal and initially fixed, computed
not be assigned to any known class. from the peaking value of the distribution of Euclidean dis-
Figure 3 shows the classification efficiency of the neu- tances among training patterns from the corresponding ship
ral classifier for the testing set, which comprises the three class. In this way, training classes are first encapsulated
classes used in the training phase, and the fraction of pat- with a combination of neurons having fixed but class de-
terns from the fourth class detected as outliers, for decision pendable vigilance radii. In the second step, each subnet-
thresholds ranging from 0 to 1. For a minimum classifica- work is fed with training patterns from classes other than
tion efficiency of 90% for the three classes used for train- that one represented by the specific subnetwork under test,
ing, a decision threshold of 0.65 is selected and about 50% in order to identify possible invaders to class encapsulation.
of the patterns from the fourth class are detected as outliers. For instance, the subnetwork representing the first class is
The outlier detection efficiency will increase to about 70% if tested with training patterns from both the second and third
we allow the minimum classification efficiency for the three classes. If class invasion is detected, the vigilance radius
known classes to decrease to 80%. of the neuron responsible for the wrong capture of the in-
Since the input patterns that feed the neural network were vader is reduced to a value that keeps the invader out of
computed from samples from different time windows of the the radius of such neuron and thus allowing to recover from
received signal, the corresponding network responses might wrong class encapsulation condition. When invasion does
be combined to improve the efficiency of a classifier [8]. not occur any longer, the first training step of the loop is
Thus, a number of consecutive network responses were av- performed once more. Now, if a pattern lies out of its class
I11 - 107
sive sonar system. Preprocessed spectra from contiguous
time windows were projected onto four neural discriminat-
ing components, allowing compact and efficient neural net-
works to,be designed and making online operation feasible.
The neural detection capability was enlarged by developing
a methodology to allow neural classifiers to absorb the ar-
riving of a new class in a practical operation.
When comparing both neural processing approaches de-
veloped for new class detection, it can be seen that the multi-
layer perceptron network achieves better performance, iden-
tifying correctly 87% of the input patterns for classes known
beforehand and up to 85% of the new arriving class, when
experimental data from four clasijes are used.
The clustering network, on the other hand, will include
the newly detected class more easily than the multilayer
Fig. 4. Classification efficiency for the known classes and neural network. Based on these facts, we are testing a hy-
fraction of patterns of the ”unknown” class detected as out- brid approach where a multilayer neural network is respon-
liers, as a function of the decision threshold, considering an sible for the classification of the known classes and for the
average of 10 network responses. detection of outliers. When a new class is detected, its pat-
tems are fed into the clustering neural network that will try
to classify them using data from previously detected classes.
encapsulation, a new neuron has to be added for that subnet- If none of those classes corresponds to the newly detected
work and its radius is defined as the distance of the nearest class, a new class is included in the network. Experimental
neuron with respect to that pattern. The training phase pro- results are on the way.
ceeds with such loop of steps and the stopping criterium is
defined by the classification performance for the testing set 5. REFERElNCES
[9], which is evaluated by grouping together the trained sub-
networks and making all neurons to compete for each input Robert J. Urick, Principles of underwater sound,
pattern. McGraw-Hill, 1983.
When using three ship classes in the training phase, the
modified ART neural network achieved a classification ef- M.D. Richard and R.P. Lippmann, “Neural network
ficiency of 87%. When patterns from the fourth class were classifiers estimate Bayesian a posteriori probabilities,”
fed into the input nodes of this classifier, -45% of them Neural Computation, vol. 3, pp. 461-483, 1991.
were found to be out all class encapsulations and could be G.A. Carpenter and S . Grossberg, “Art2: Self-
detected as outliers. The averaging of consecutive network organization of stable category recognition codes for
responses was not efficient with the clustering procedure. analog input patterns,” Ap,plied Optics, vol. 26, pp.
If the operator decides that the detected outliers belong 49194930,1987.
to a new class, then these patterns will be used to update
the neural network. Two approaches may be followed here, K.S.Shammughan and A.M. Breipohl, Random Sig-
depending on the need of online operation. If this is the nals: Detection, Estimation, and Data Analysis, John
case, the training of the new subclass follows a similar pro- Wiley & Sons, 1988.
cedure used to train the classes known beforehand. The out- William Soares-Filho, Jose Manoel de Seixas, and
lier patterns are used to train the new subnetwork and then Luiz Pereira CalBba, “Principal component analysis fix
this trained subnetwork is integrated to the complete classi- classifying passive sonar signals,” in Int. Symp. on Cir-
fier. To avoid conflict between the encapsulation of the new cuit and Systems, Sidney, 200 1.
class and the previous encapsulations, the radii of the neu-
rons of the new subclass are reduced, eliminating any inva- A.V. Oppenheim and R.W. Schafer, Digital signal pro-
sion to the already defined encapsulations. Following such cessing, Prentice Hall, 1975.
approach, more than 63% of the patterns from the fourth
class were correctly identified with a minimum decrease in Luiz Pereira Caloba, F. Pereira, and Jose Manoel
the classification efficiency for the other three classes (from de Seixas, “Neural discriminating analysis for a second-
87% to 85%). level trigger system,” in Int. Conf: on Computing in
High Energy Physics, Rio de: Janeiro, Brazil, 1995.
I11 - 108