Professional Documents
Culture Documents
Stephane Gaudreault
Air Quality Modelling Applications
Canadian Meteorological Centre
Meteorological Service of Canada
2121, route Transcanadienne
Dorval, QC, Canada, H9P 1J3
E-Mail:
Stephane.Gaudreaultgec.gc.ca
Louis-Philippe Crevier,
Hugo Landry
Sylvain Menard
Air Quality Modelling Applications
Canadian Meteorological Centre
Meteorological Service of Canada
2121, route Transcanadienne
Dorval, QC, Canada, H9P 1J3
Joel Martin
Departement d'informatique
Universite de Sherbrooke
2500 boul. de l'Universite
Sherbrooke, QC, Canada, J 1K 2R1
Abstract A fundamental problem in the development of an air
quality forecast system is the implementation of an
evaluation protocol. Traditionally, statistics are
computed to compare the model output to the
observations. These methods are limited in that they
are generally unable to easily identify the nature of an
error (such as location and timing errors). In this
paper, we describe CALUMeT (Canadian pollution
monitoring tool), an experimental framework that
attempts to address these limitations. This framework
makes use of self-organizing feature maps to compute
the classification of feature vectors from the regions
of interest. It encompasses both formalism and a
software tool that is under active development. More
specifically, the framework allows the specification
and manipulation of invariants associated with
topological elements of an air quality forecast.
I. INTRODUCTION
2302
a4
R2
0.3786
Vy
2 99326
I wo 1w1! 0~~~~~T*
4.41S6
sVi0ons of ooe
Fiue3
Corelaio
diara
of teojeciv
forec~dzasBt(03).32
N~ ~ ~1
0b 7F340
concentration.
L IlI III
II
2303
(figure 3).
2304
B. Geometricalfeatures
1) Moments
The normalized central moment [8] is invariant
to translation and scale change. It is equivalent to normal
moments but in a way that the centroid of the ROI
represents the origin. Hu invariants are also computed
since they provide a way to identify an ROI despite of
scale change, rotation or translation.
2) Center ofMass
Another physical invariant is the center of mass
which is computed from the pollutant concentration level
in the ROI. The coordinates are computed from the
following relation:
centroid)
Moreover, simple geometrical features such as
area, perimeter and position of the centroid are also used
to classify the data and give a precise evaluation when
the final step of analysing the classification results is
made.
moo
where
FC-FCmin
FCm - FCm
FCmin =n-1
FCmax=(
=
Y=
mi
(4)
where ml0, inmo and mi0 are first order moments of the
ROI.
3) Normalized compacityfactor
Compacity factor [1] is a feature that is invariant
to rotation, translation and scale changes. It qualifies
compactness of a shape as a simple metric. A feature
more robust to noise is called the normalized compacity
factor and is defined by
FCN
imno
x-n0
D. Classification
Once the features extraction step is done, the
vectors are classified using a SOFM. The main
advantage of such a classifier is its ability to classify data
in an unsupervised manner. The actual problem does not
provide sufficient training data to train an efficient multilayer perceptron neural network. Additionally, this
unsupervised learning method gives enough flexibility to
the system to classify new shapes never seen before. Our
SOFM implementation is inspired from Kohonen's [5].
The distance function used to evaluate the
winning node is the Euclidian distance although other
functions (like Manhattan distance) might be used. The
(1)
(2)
(3)
Wi)2
(5)
[x - min(x)]
[max(x)- min(x)]
(6)
dE=
4) Ellipses
Basic geometrical shapes like ellipse
approximation [2] [3] are computed because they are
invariant to translation and rotation. The ratio of the two
radius of each ellipse is also invariant to scale change.
Additionally, it gives an important tool when comes the
operation of match evaluation and comparison, where the
axis orientation difference can be used.
N(
5) Pattern spectrum
The pattem spectrum [6] is a shape-size
descriptor that summarizes important shape
characteristics. The succession of morphological
operations with a structuring element increasing in size
gives a unique set of values for a given shape. For each
morphological opening made on a ROI, the area that is
subtracted is kept in a vector. After a given number of
operations (customisable by the user) this pattern
spectrum is added to the ROI feature vector.
C. Physicalfeatures
1) Density histogram
To evaluate the physical properties of the ROI, a
density histogram is computed. This feature helps
evaluating similarly dense regions that would have
important shape differences. The principle is to segment
2305
W,+h6(t)[
hcJ =a (t ).exp
2(t) J
on its
(8)
methods are:
1) SOM Radius
This method consists of searching for the
nearest neighbour of the node to which a shape is
matched. The radius of search is increased with each
iterations of the search, starting with the node itself, until
a match is found.
3) Score-Layer
2306
ACKNOWLEDGMENTS
The authors would like to thank the Air Quality
Modelling and Application Division of Environment
Canada, more precisely Veronique Bouchet, Sophie
Cousineau, Michel Jean, Richard Moffet, Jacinthe
Racine, Alain Robichaud and Mourad Sassi as well as
Jean-Philippe Gauthier-Bilodeau and Serge Trudel for
their help and support in this project.
REFERENCES
2307