Abstract
The study investigates the performance of image classifiers for forest and non-forest area classification. Remote sensing image classification is one of the most significant applications of remote sensing. A number of image classification algorithms have shown good precision in classifying remote sensing data. We experiment with both supervised and unsupervised classification, and compare the different classification methods and their performance. In particular, we test the Maximum Likelihood classifier, the Minimum Distance classifier and the Parallelepiped classifier on Landsat 7 ETM+ spectral data; they produced high accuracies of more than 75% with limited input information. Of the classified images, the maximum likelihood method is found to be the most applicable and reliable for satellite image classification, and the Parallelepiped method is found to give the least reliable results compared to the other methods.
Keywords: Forest classification; Maximum Likelihood Classifier; Minimum Distance; Parallelepiped; Classification accuracy.
1. INTRODUCTION:

Remote sensing, particularly from satellites, offers an immense source of data for studying the spatial and temporal variability of environmental parameters. Image classification is an important part of remote sensing, image analysis and pattern recognition. In some instances, the classification itself may be the object of the analysis. For example, classification of land use from remotely sensed data produces a map-like image as the final product of the analysis[1]. Image classification therefore forms an important tool for the examination of digital images.

Forest classification has evolved from the initial identification of forested areas to the determination of variation in species diversity. Initially, efforts at classifying forests were motivated by management objectives, i.e. to safeguard supplies of timber and other forest products, and to provide environmental services, particularly the protection of fragile mountain catchment areas[2]. Over the years, however, nature conservation, recreation, research and education have presented additional objectives. With the growing interest in developing conservation strategies for species diversity, there is a need to consider the variation within the vegetation cover (including non-forest areas). Deforestation is a major problem in the world today. Therefore, classifying forest and non-forest areas helps us to evaluate the areas that turn to non-forest.

2. SATELLITE IMAGE CLASSIFICATION

Image classification, in the field of remote sensing, is the process of assigning pixels, or the basic units of an image, to classes. It groups identical pixels found in remotely sensed data into classes that match the informational categories of user interest by comparing pixels to one another and to those of known identity. This categorized data may then be used to produce thematic maps of the land cover present in an image. Studies[3] indicate that spectral information is an effective means of achieving this goal. The spectral pattern present within the data for each pixel is used as the numerical basis for categorization. A multi-spectral image is one that captures image data at specific frequencies across the electromagnetic spectrum. The wavelengths may be separated by filters or by the use of instruments that are sensitive to particular wavelengths, including light from frequencies beyond the visible light range, such as infrared. Multi-spectral imaging can allow extraction of additional information that the human eye fails to capture with its receptors for red, green and blue. It was originally developed for space-based imaging.

Several methods of image classification exist. The two main classification methods are Supervised Classification and Unsupervised Classification [4]. Unsupervised classification is a method which examines a large number of unknown pixels and divides them into a number of classes based on natural groupings present in the image values. Unsupervised classification does not require analyst-specified training data. This classification is becoming increasingly popular in agencies involved in long-term GIS database maintenance. Supervised classification of remote-sensing images has been widely used as a powerful means to extract various kinds of information concerning the earth environment. The objective of supervised classification in remote sensing is to identify and partition the pixels comprising the noisy image of an area according to their class (e.g. forest and non-forest), with the parameters in the model for pixel values estimated from training samples (ground truths). Supervised classification is the procedure most often used for quantitative analysis of remote sensing image data. It rests upon using suitable algorithms to label the pixels in an image as representing particular ground cover types, or classes. A variety of algorithms is available for this, ranging from those based upon probability distribution models for the classes of interest to those in which the multispectral space is partitioned into class-specific regions using optimally located surfaces.

For nearly a century, forest and national park agencies have been engaged in mapping forest areas in order to provide managers with information and to monitor the condition of ecosystems over time. There have been attempts to improve the accuracy of the forest maps. This study instead attempts to identify the classifier best suited to forest and non-forest area classification. In this study three supervised classification methods were compared, i.e. the Maximum Likelihood classifier (MLC), the Minimum Distance classifier and the Parallelepiped classifier. These classification methods were chosen because MLC has been noted to have a more robust theoretical basis and higher accuracies[5], whereas the Minimum Distance classifier and the Parallelepiped classifier are found to be very simple to implement, with less complexity.

3. STUDY AREA

Rio de Janeiro lies on a strip of Brazil's Atlantic coast, close to the Tropic of Capricorn, where the shoreline is oriented east–west. Facing largely south, the city was founded on an inlet of this stretch of the coast, Guanabara Bay, and its entrance is marked by a point of land called Sugar Loaf, a "calling card" of the city. It lies at the coordinates 22°54′30″S 43°11′47″W[6].

This region retains good quality forest, including elfin forest, and has the most continuous well-preserved remnant of Atlantic forest in the state. Surveys occurred at four locations, each of which had montane forest, and usually elfin forest. Araras Biological Reserve is a state conservation unit located between Serra dos Órgãos National Park and Serra do Tinguá Biological Reserve. Fazenda Itatiba is within Três Picos State Park. Both sites are part of the Montane Central Region, covered mainly by Montane Atlantic Forest. The Fazenda Boa Esperança site is between Três Picos and Desengano State Park, but more than 30 km from both of these protected areas. The Desengano State Park is located more than 50 km from all other sites, with the forest being currently isolated due to deforestation in surrounding areas. All of these sites are part of a once continuous forest with one of the highest biodiversity levels in the world, including many threatened species[7].

The Tijuca Forest is a mountainous hand-planted rainforest in the city of Rio de Janeiro, Brazil. It is the world's largest urban forest, covering some 32 km² (12.4 mi²). The Tijuca Forest is home to hundreds of species of plants and wildlife, many threatened by extinction, found only in the Atlantic Rainforest.

4. DATA PREPARATION

In this research, we have made use of land cover images obtained from remote sensing for experimentation. Imagery from Landsat 7, which is equipped with an Enhanced Thematic Mapper Plus (ETM+), has been used, as in Fig. 3. Landsat 7's ETM+ differs from the earlier Thematic Mappers because it offers the following features:

1. A panchromatic band with 15 m spatial resolution.
2. On-board, full aperture, 5% absolute radiometric calibration.
3. A thermal IR channel with 60 m spatial resolution.
4. An on-board data recorder.

The data set consists of 800*625 pixels and covers Rio de Janeiro, Brazil[8], shown in Fig. 1. The advantage of using this dataset is the availability of a reference image produced from a field survey, which is used for accuracy assessment. The original remote sensing image is in false colors with RGB: 432; its characteristics are shown in Table 1. A ground truth image (reference image) is generated by a field study campaign as in Fig. 2. Random sampling is carried out to select the pixels for training and testing the classifiers.

Table 1. Characteristics of Landsat 7 ETM+
Sensor: L7 ETM+
Acquisition Date: February 28, 2000
Path/Row: 217/76
Lat/Long: -22.904/-43.210
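
The random selection of training and testing pixels mentioned under Data Preparation could be sketched as follows. This is only an illustration under stated assumptions, not the procedure used in the study: the function name `sample_train_test`, the synthetic `reference` array, the 625 x 800 row/column orientation, the label coding (1 = forest, 0 = non-forest) and the sample sizes are all hypothetical.

```python
import numpy as np

def sample_train_test(ground_truth, n_train, n_test, seed=0):
    """Randomly pick disjoint training and testing pixels for each class.

    ground_truth: 2-D integer array holding one class label per pixel.
    Returns two dicts mapping class label -> (row_indices, col_indices).
    """
    rng = np.random.default_rng(seed)
    train, test = {}, {}
    for label in np.unique(ground_truth).tolist():
        rows, cols = np.nonzero(ground_truth == label)
        if rows.size < n_train + n_test:
            raise ValueError(f"class {label} has too few pixels")
        # Shuffle the pixel positions of this class, then split them
        # so that no pixel is used for both training and testing.
        order = rng.permutation(rows.size)
        tr = order[:n_train]
        te = order[n_train:n_train + n_test]
        train[label] = (rows[tr], cols[tr])
        test[label] = (rows[te], cols[te])
    return train, test

# Hypothetical reference image (assumed 625 rows x 800 columns):
# 1 = forest, 0 = non-forest. Real labels would come from the field survey.
reference = np.zeros((625, 800), dtype=np.uint8)
reference[:300, :] = 1

train_px, test_px = sample_train_test(reference, n_train=500, n_test=500)
```

Sampling each class separately keeps both forest and non-forest represented in the training and testing sets even when one class covers far less of the image than the other.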
Fig. 1 Map of South America
Fig. 2 Rio de Janeiro
Fig. 3 Satellite image of Rio de Janeiro

4. METHODOLOGY

The main aim of the study is to evaluate the performance of the different classification algorithms for forest and non-forest area classification using the multispectral data. Irrespective of the classifier used, there are some basic steps in supervised image classification.

1. Decide the set of ground cover types into which the image is to be segmented. These are the information classes like forest, non-forest, water etc.
2. Choose representative or prototype pixels from each of the desired set of classes. These pixels are said to form training data. Training sets for each class can be established using site visits, maps, air photographs or even photo interpretation of a color composite product formed from the image data. Often the training pixels for a given class will lie in a common region enclosed by a border. That region is then often called a training field.
3. Use the training data to estimate the parameters of the particular classifier algorithm to be used; these parameters will be the properties of the probability model used or will be equations that define partitions in the multispectral space. The set of parameters for a given class is sometimes called the signature of that class.
4. Using the trained classifier, label or classify every pixel in the image into one of the desired ground cover types (information classes). Here the whole image segment of interest is typically classified. Whereas training in Step 2 may have required the user to identify perhaps 1% of the image pixels by other means, the computer will label the rest by classification.
5. Produce tabular summaries or thematic (class) maps which summarize the results of the classification.
6. Assess the accuracy of the final product using a labelled testing data set.

4.1 DEFINING THE TRAINING SAMPLES

Supervised classification is much more accurate for mapping classes, but depends heavily on the cognition and skills of the image specialist. The strategy is simple: the specialist must recognize conventional classes (real and familiar) or meaningful (but somewhat artificial) classes in a scene from prior knowledge, such as personal experience with what's present in the scene, or more generally, the region it's located in, by experience with thematic maps, or by on-site visits. This familiarity allows the individual(s) making the classification to choose and set up discrete classes (thus supervising the selection) and then assign them category names. As a rule, the classifying person also locates specific training samples on the image, either a print or a monitor display, to identify the classes. The resulting training samples are areas representing each known land cover category that appear fairly homogeneous on the image (as determined by similarity in tone or color within shapes delineating the category). In the computer display one must locate these samples and circumscribe them with polygonal boundaries drawn using the computer mouse[9]. More than one polygon is usually drawn for any class. The quality of a supervised classification [10] depends on the quality of the training sites.

4.2 SIGNATURE EXTRACTION

After the training site areas have been digitized, the next step is to create statistical characterizations of each information class. For each class thus outlined, the mean values, standard deviation and covariance of the selected area for each band used to classify them are calculated from all the pixels enclosed in each site.

4.2.1. Estimation of MEAN for Classes:

The true values of the mean and covariance matrix are not known and must be estimated from training samples. The mean is typically estimated by the sample mean

computations. We put the word 'accurate' in quotes because this assumes that classes in the input data have a Gaussian distribution and that signatures were well selected; this is not always a safe assumption.[11] Multivariate normal statistical theory describes the probability that an observation X will occur, given that it belongs to a class k, as the following function

$$\Phi_k(X_i) = (2\pi)^{-p/2} \, |\Sigma_k|^{-1/2} \exp\left\{ -\tfrac{1}{2} (X_i - \mu_k)^{T} \Sigma_k^{-1} (X_i - \mu_k) \right\}$$

where
µ_k is the parametric mean vector associated with the k-th class,
X is a p-dimensional random vector,
Σ_k is the covariance matrix associated with the k-th class,
Φ_k(X_i) is the probability density value associated with observation X_i, as evaluated for class k.

The quantitative product