You are on page 1of 4

ENLARGING NEURAL CLASS DETECTION CAPACITY IN PASSIVE SONAR SYSTEMS

FK Soares-Filho'$2,J. M. Seixas2, L. l? Caldba2


IPqM - Brazilian Navy Research Institute
R. Ipiru, 2 - Rio de Janeiro 21931-090, Brazil

Signal Processing Laboratory - LPS


COPPEBEAJFRJ - Federal University of Rio de Janeiro
CP 68504, Rio de Janeiro 21945-970, Brazil
e-mail: [william,seixas,caloba]@lps.ufrJ.br

ABSTRACT level could be used to suggest the presence of "unknown"


contacts, bringing the attention of the operator to them.
A neural discriminating analysis is used for classifying pas- In this work, we first develop a method to detect "un-
sive sonar signals. Preprocessed information from the am- known" arriving patterns using a decision threshold for the
plitude spectra of the noise radiated from ships is projected output of a multilayer neural classifier. The confidence level
onto only a few principal discriminating components for used is the a posteriori probability estimated by the output
feeding the input nodes of the neural classifier. Envisag- of the neural network, trained using backpropagation [2].
ing practical applications, in which new incoming classes This method is then tested to detect "unknown" classes, us-
not known by the time of the training phase have to be de- ing real data from ships. A second method uses a clustering
tected in the production phase, a method is provided using neural network similar to the ART topology [3], but with
the identification of outliers to trigger the arriving of a new supervised training. The advantage of this method is that
class. Using experimental data, it is shown that up to 85% it can be easily used to update the classification system to
of patterns from an untrained class can be identified, with- include the new detected class.
out significant decrease on the efficiency of the classifer for The next section presents both detection methods for "un-
classes known beforehand. known" patterns. Section 3 shows experimental results on
new class detection and some conclusions are derived in
1. INTRODUCTION Section 4.

The sound is the best way for a submarine to access the sur- 2. OUTLIER DETECTION
rounding environment to detect and identify possible threats
[ 11. The submarine will use its passive sonar system to ana- Both detection methods developed are based on trigger-
lyze sounds produced by several sources in the ocean. The ing the classifier for non-classified patterns, called outliers,
sonar operator is responsible for the identification of the re- which fall outside the data distribution used in the training
ceived signal and will make use of his ability to isolate and phase. An outlier can be generated due to measurement er-
identify features that will lead to the classification of the rors, noise, or the arriving of a new class of contact. The
interesting source. Automatic methods can be applied to decision on whether the outliers belong to an "unknown"
accelerate the decision task, and to reduce the work of the class or even to a known class but with a different machin-
sonar operator. The use of classifiers based on neural net- ery configuration is the responsibility of the sonar operator.
works seems to be a good choice, due to their ability to deal He will probably use the full information available, includ-
with high dimensional input data spaces, which is typically ing that coming from other sensors (acoustic or not).
the case for passive sonar applications. One important aspect to be considered is how often these
When dealing with the situation where a different noise outliers appear in a given time interval. If the frequency of
arrives on the sensor of the sonar system, the operator will the outliers is low, they might be due to noise or errors in
probably react assigning an "unknown" label to that con- the whole data acquisition and processing chains. On the
tact until a proper identification can be provided. For an other hand, if the frequency is high, the decision for new
automatic identification system, it would be interesting if class detection or for considering outliers as a subclass of a
besides classifying contacts, the system could also provide known class may increase.
a confidence level to the classification. A low confidence Once a given number of samples of such "unknown" class
Partially supported by IPqM, FUJB, FAPERJ (Rio de Janeiro), and has been detected, it is important to update the classifica-
CNPq (Brasilia), Brazil tion system, so that further appearances of patterns from the

0-7803-7448-7/02/$17.00 02002 IEEE -


111 105
7350 Hz 0-2871Hz
same class can be promptly recognised. The absorption of
the new information probably will be made some time later
by an offline training procedure, maybe using backprop-
agation on a multilayer feedforward network. Immediate SPECTRA
class absorption can be made by using clustering methods
to group these outlier patterns together, without having to Fig. 1. 'Block diagram of the: preprocessing performed on
modify the main classifier itself. the incoming signal

2.1. Multilayer Neural Network


22050 Hz. Consecutive blocks of this signal were trans-
When a multilayer neural network is trained using the back- formed to the frequency domain to obtain amplitude spectra
propagation learning algorithm to identify one of M classes, in the frequency range from IIC to 2.87 kHz [5]. This fre-
its output nodes estimate the a posteriori probability of a quency range was chosen because it contains most of the
given incoming pattern to belong to any of the classes as- frequency lines related to the machines inside the ship and
signed to the network output nodes [2]. These estimated these lines are known to be quite relevant for the classifica-
a posteriori probabilities could be used to produce a confi- tion task [ 13.
dence level for the class identification task. An appropriate Figure 1 shows a diagram of the preprocessing used to
decision rule for declaring a given class to be the winner obtain the spectra. First, the signal was filtered and dec-
in the discrimination problem under study is the Bayes rule imated to reduce the sampling frequency to 7.35 kH>!(re-
[4], generated from the estimate of the a posteriori probabil- duction by a factor of 3). Then, consecutive blocks of 1024
ity of each output node. This rule is known as MAP (max- points, without overlapping, were passed through a Hanning
imum a posteriori) and gives non-ambiguous decisions (if window and transformed to the frequency domain by using
non-finite resolution is considered for computations). the fast Fourier transform (FFT) [6]. For each spectrum
If a decision threshold is defined, so that only net- obtained, 400 frequency bins, corresponding to an alias-
work outputs above such threshold are qualified as win- free frequency range from 0 to 2.87 kHz,were retained for
ners, classes will be encapsulated in a region with a con- the subsequent processing. Each spectrum corresponds to a
trolled a posteriori probability. If, for a given pattern, none time window with about 139 miliseconds duration.
of the outputs exceeds the decision threshold, the pattern Envisaging the online operation of a practical system, a
can be declared as "unknown" (not classified). Thus, with neural discriminating analysis [5, 71 was developed over
the class encapsulation, the neural network can be used to blocks to compact the high-dimensional (400 point spectra)
detect when a given pattern does not belong to any of the input data space. Previous analysis [5] showed that only
classes used in the training phase. four discriminating components are required for classify-
ing correctly more than 93% of the spectra, considering all
2.2. Clustering Neural Network classes. In this work, the four components were extracted
using data from three classes. The fourth class was used
Clustering methods can also be used to detect "unknown" for testing purposes only, being considered the "unkn"n"
classes, when the clusters are encapsulated into a limited class to be detected. Preprocessed data (spectra) from all
region. In this work, it is used a clustering neural network, classes were then projected onto such four discriminating
similar to the ART topology, being formed by hyperspher- components, and the resulting projections were the patterns
ical neurons with variable vigilance radius, constructed us- fed into the input nodes of the neural classifiers.
ing a supervised training algorithm.
In this network, a pattern gets classified by a given neuron
if it falls inside the vigilance radius of that neuron. When 3.1. Multilayer Neural Network
more than one neuron qualifies for a given input pattern, the Pattems from three of the four classes of ships were used to
winner will be the one with the highest similarity hnction. train and test a multilayer ne:ural network configured with
Pattems that fall outside all vigilance radii are considered only one hidden layer (6 neurons), and three output nodes.
not to belong to the classes used in the training phase, and All neurons were of the sigmoid type and each output node
are declared to be "unknown". was assigned to an input class, so that a target value of +1
was defined for an active node and 0 for inactive ones. The
3. NEW CLASS DETECTION fourth class was used only in the production phase of the
classifier, in order to test the detection capacity for new
Experimental data were used to test the detection of new classes.
classes. Data correspond to the radiated noise from ships Figure 2 shows the histogram of each output of the
from four different classes, in experimental runs in an trained classifier (rows 1 to 3), when the input nodes are
acoustic range with about 45 meter depth. The noise was fed with testing pattems of the four ship classes (columns 1
acquired using a unidirectional hydrophone positioned near to 4): the three classes for which the classifier was trained
the bottom of the range. The received signal was digitized on and the fourth one, simulating an "unknown" class. The
through an A/D converter with a sampling frequency of first histogram (first row, first column), for instance, cor-

111 - 106
Fig. 3. Classification efficiency for the three known classes
Fig. 2. A matrix of histograms for the network output for and the fraction of patterns of the "unknown" class detected
four ship classes. See text. as outliers. as a function of the decision threshold.

responds to the output of the first neuron for the patterns eraged and the fraction of patterns from the fourth class de-
that belong to the first class. In this histogram, the distri- tected as outliers was monitored as a function of the deci-
butions concentrate near +I, as expected. Considering only sion threshold. Tests confirmed that the number of detected
the 3x3 submatrix of histograms (columns 1 to 3, rows 1 outliers, for a given threshold, increases when the number
to 3), similar behavior is observed for active nodes, when of network responses used in the averaging increases too,
data from their corresponding classes feed the neural net- up to the limit imposed by process stationarity condition in
work. The histograms which are not in the main diagonal the wide sense. On the other hand, the classification effi-
represent distributions for inactive nodes with peaks close ciency for the three known classes suffers from small fluctu-
to 0, as expected. Thus, such 3 x 3 submatrix points out the ations, with respect to the single response procedure, when
high quality of the classifier performance for the testing set, averaged response is considered. Figure 4 shows results for
when data are projected onto the four discriminating com- averaging 10 network responses. For a minimum classifi-
ponents and the classifier is responding for classes that did cation efficiency of 90% on known classes, the detection
participate in the training phase. threshold is set to 0.7 and -85% of the patterns from the
The fourth column of histograms in Figure 2 corresponds fourth class are detected as outliers.
to the network response for patterns belonging to the fourth
class (unseen in the training phase). The distributions in 3.2. Clustering Neural Network
these histograms tend to spread along the dynamic range of
the classifier and do not show evidence of an active response The training phase is developed through a loop of two steps.
of any output node. If a decision threshold is defined, so that In the first, a subnetwork having ART topology is trained for
only output values exceeding the threshold can be accepted, each known class. The vigilance radii of neurons belong-
most patterns from the fourth class will be outliers and will ing to a subnetwork are equal and initially fixed, computed
not be assigned to any known class. from the peaking value of the distribution of Euclidean dis-
Figure 3 shows the classification efficiency of the neu- tances among training patterns from the corresponding ship
ral classifier for the testing set, which comprises the three class. In this way, training classes are first encapsulated
classes used in the training phase, and the fraction of pat- with a combination of neurons having fixed but class de-
terns from the fourth class detected as outliers, for decision pendable vigilance radii. In the second step, each subnet-
thresholds ranging from 0 to 1. For a minimum classifica- work is fed with training patterns from classes other than
tion efficiency of 90% for the three classes used for train- that one represented by the specific subnetwork under test,
ing, a decision threshold of 0.65 is selected and about 50% in order to identify possible invaders to class encapsulation.
of the patterns from the fourth class are detected as outliers. For instance, the subnetwork representing the first class is
The outlier detection efficiency will increase to about 70% if tested with training patterns from both the second and third
we allow the minimum classification efficiency for the three classes. If class invasion is detected, the vigilance radius
known classes to decrease to 80%. of the neuron responsible for the wrong capture of the in-
Since the input patterns that feed the neural network were vader is reduced to a value that keeps the invader out of
computed from samples from different time windows of the the radius of such neuron and thus allowing to recover from
received signal, the corresponding network responses might wrong class encapsulation condition. When invasion does
be combined to improve the efficiency of a classifier [8]. not occur any longer, the first training step of the loop is
Thus, a number of consecutive network responses were av- performed once more. Now, if a pattern lies out of its class

I11 - 107
sive sonar system. Preprocessed spectra from contiguous
time windows were projected onto four neural discriminat-
ing components, allowing compact and efficient neural net-
works to,be designed and making online operation feasible.
The neural detection capability was enlarged by developing
a methodology to allow neural classifiers to absorb the ar-
riving of a new class in a practical operation.
When comparing both neural processing approaches de-
veloped for new class detection, it can be seen that the multi-
layer perceptron network achieves better performance, iden-
tifying correctly 87% of the input patterns for classes known
beforehand and up to 85% of the new arriving class, when
experimental data from four clasijes are used.
The clustering network, on the other hand, will include
the newly detected class more easily than the multilayer
Fig. 4. Classification efficiency for the known classes and neural network. Based on these facts, we are testing a hy-
fraction of patterns of the ”unknown” class detected as out- brid approach where a multilayer neural network is respon-
liers, as a function of the decision threshold, considering an sible for the classification of the known classes and for the
average of 10 network responses. detection of outliers. When a new class is detected, its pat-
tems are fed into the clustering neural network that will try
to classify them using data from previously detected classes.
encapsulation, a new neuron has to be added for that subnet- If none of those classes corresponds to the newly detected
work and its radius is defined as the distance of the nearest class, a new class is included in the network. Experimental
neuron with respect to that pattern. The training phase pro- results are on the way.
ceeds with such loop of steps and the stopping criterium is
defined by the classification performance for the testing set 5. REFERElNCES
[9], which is evaluated by grouping together the trained sub-
networks and making all neurons to compete for each input Robert J. Urick, Principles of underwater sound,
pattern. McGraw-Hill, 1983.
When using three ship classes in the training phase, the
modified ART neural network achieved a classification ef- M.D. Richard and R.P. Lippmann, “Neural network
ficiency of 87%. When patterns from the fourth class were classifiers estimate Bayesian a posteriori probabilities,”
fed into the input nodes of this classifier, -45% of them Neural Computation, vol. 3, pp. 461-483, 1991.
were found to be out all class encapsulations and could be G.A. Carpenter and S . Grossberg, “Art2: Self-
detected as outliers. The averaging of consecutive network organization of stable category recognition codes for
responses was not efficient with the clustering procedure. analog input patterns,” Ap,plied Optics, vol. 26, pp.
If the operator decides that the detected outliers belong 49194930,1987.
to a new class, then these patterns will be used to update
the neural network. Two approaches may be followed here, K.S.Shammughan and A.M. Breipohl, Random Sig-
depending on the need of online operation. If this is the nals: Detection, Estimation, and Data Analysis, John
case, the training of the new subclass follows a similar pro- Wiley & Sons, 1988.
cedure used to train the classes known beforehand. The out- William Soares-Filho, Jose Manoel de Seixas, and
lier patterns are used to train the new subnetwork and then Luiz Pereira CalBba, “Principal component analysis fix
this trained subnetwork is integrated to the complete classi- classifying passive sonar signals,” in Int. Symp. on Cir-
fier. To avoid conflict between the encapsulation of the new cuit and Systems, Sidney, 200 1.
class and the previous encapsulations, the radii of the neu-
rons of the new subclass are reduced, eliminating any inva- A.V. Oppenheim and R.W. Schafer, Digital signal pro-
sion to the already defined encapsulations. Following such cessing, Prentice Hall, 1975.
approach, more than 63% of the patterns from the fourth
class were correctly identified with a minimum decrease in Luiz Pereira Caloba, F. Pereira, and Jose Manoel
the classification efficiency for the other three classes (from de Seixas, “Neural discriminating analysis for a second-
87% to 85%). level trigger system,” in Int. Conf: on Computing in
High Energy Physics, Rio de: Janeiro, Brazil, 1995.

4. CONCLUSIONS A.J.C. Sharkey, “On combining artificial neural nets,”


Connection Science, vol. 8, no. 314, pp. 299-3 14, 1996.
In this work, multilayer perceptron and modified ART neu- Simon Haykin, Neural networks: A comprehensive
ral network topologies were applied for the identification of foundation, second edition, ]+entice Hall, 1999.
the noise radiated from ships, which was received by a pas-

I11 - 108

You might also like