Paper Presentation - Sathiyavani at 11th Feb.

CLASSIFICATION OF FOREST AND NON-FOREST AREAS
R.Sathiyavani* Prof V.Srinivasan.

Department of Computer Science, Annamalai University
Abstract
The study investigates the performance of image classifiers for forest and non-forest area classification.
Remote sensing image classification is one of the most significant application worlds for remote sensing. A few
number of image classification algorithms have proved good precision in classifying remote sensing data. We are
experimenting with both supervised and unsupervised classification. Here we compare the different classification
methods and their performances. Specially tested are performances of Maximum Likelihood classifier, Minimum
Distance classifier, Parallelopiped classifier based on Landsat7 ETM+ spectral data and produced high accuracies
of more than 75% with limited input information. Of the classified images, the maximum likelihood method is
found to be more applicable and reliable for the satellite image classification purposes and the Parallelopiped
method is found to give the least reliable results compared to the other methods.
Keywords: Forest classification; Maximum Likelihood Classifier; Minimum Distance; Parallelopiped
Classification accuracy.
1. INTRODUCTION: remotely sensed data, into classes that match the
informational categories of user interest by
Remote sensing, particularly satellites offer
comparing pixels to one another and to those of
an immense source of data for studying spatial and
known identity. This categorized data may then be
temporal variability of the environmental
used to produce thematic maps of the land cover
parameters. Image classification is an important part
present in an image. Studies[3] indicate that spectral
of the remote sensing, image analysis and pattern
information is an effective means of achieving this
recognition. In some instances, the classification
goal. The spectral pattern present within the data for
itself may be the object of the analysis. For example,
each pixel is used as the numerical basis for
classification of land use from remotely sensed data
categorization. . A Multi-spectral image is one that
produces a map like image as the final product of
captures image data at specific frequencies across
the analysis[1]. The image classification therefore
the electromagnetic spectrum. The wavelengths may
forms an important tool for examination of the
be separated by filters or by the use of instruments
digital images.
that are sensitive to particular wavelengths,
Forest classification has evolved from the including light from frequencies beyond the visible
initial identification of forested areas to the light range, such as infrared. Multi-spectral imaging
determination of variation in species diversity. can allow extraction of additional information that
Initially, efforts at classifying forests were the human eye fails to capture with its receptors for
motivated by management objectives, i.e. to red, green and blue. It was originally developed for
safeguard supplies of timber and other forest space-based imaging. Several methods of image
products, and to provide environmental services, classification exist. Two main classification
particularly the protection of fragile mountain methods are Supervised Classification and
catchments areas[2]. Over the years, however, Unsupervised Classification [4]. Unsupervised
nature conservation, recreation, research and classification is a method which examines a large
education have presented additional objectives. number of unknown pixels and divides into a
With the growing interest in developing number of classes based on natural groupings
conservation strategies for species diversity, there is present in the image values. Unsupervised
a need to consider the variation within the classification does not require analyst-specified
vegetation cover (including non-forest training data. This classification is becoming
areas).Deforestation is the major problem existing in increasingly popular in agencies involved in long
the world now. Therefore, classifying the forest and term GIS database maintenance. Supervised
non-forest areas help us to evaluate the areas that classification of remote-sensing images has been
turn to non-forest. widely used as a powerful means to extract various
kinds of information concerning the earth
2. SATELLITE IMAGE CLASSIFICATION
environment. The objective of supervised
Image classification in the field of remote classification in remote sensing is to identify and
sensing, is the process of assigning pixels or the partition the pixels comprising the noisy image of an
basic units of an image to classes. It is likely to area according to its class (e.g. forest and non-
assemble groups of identical pixels found in forest), with the parameters in the model for pixel
values estimated from training samples (ground
truths).Supervised classification is the procedure Fazenda Boa Esperança site is between Três Picos
most often used for quantitative analysis of remote and Desengano State Park, but more than 30 km
sensing image data. It rests upon using suitable from both of these protected areas. The Desengano
algorithms to label the pixels in an image as State Park is located more than 50 km from all other
representing particular ground cover types, or sites, with the forest being currently isolated due to
classes. A variety of algorithms is available for this, deforestation in surrounding areas. All of these sites
ranging from those based upon probability are part of a once continuous forest with one of the
distribution models for the classes of interest to highest biodiversity levels in the world including
those in which the multispectral space is partitioned many threatened species[7].
into class specific regions using optimally located
The Tijuca Forest is a mountainous hand-planted
surfaces.
rainforest in the city of Rio de Janeiro, Brazil. It is
For nearly a century, forest and national the world's largest urban forest, covering some
park agencies have been engaged in mapping forest 32km² (12.4mi²). The Tijuca Forest is home to
areas in order to provide managers information and hundreds of species of plants and wildlife, many
monitor the condition of ecosystems over time. threatened by extinction, found only in the Atlantic
There have been attempts to improve the accuracy Rainforest.
of the forest maps. But this is an attempt to evaluate
4. DATA PREPARATION
the best of the classifier that suits for forest and non-
forest area classification. In this study three In this research, we have made use of land
supervised classification methods were compared cover images obtained from remote sensing for
i.e. Maximum Likelihood classifier (MLC), experimentation. Landsat 7 is equipped with an
Minimum Distance classifier, Parallelopiped enhanced Thematic Mapper (ETM+) as in fig.3 has
classifier. These classification methods were chosen been used. Landsat 7’s ETM+ is different from
because MLC has been noted to have a more robust thematic mappers because it offers following
theoretical basis and higher accuracies[5], whereas features.1. A panchromatic band with 15m spatial
Minimum distance classifier and parallelopided resolution. 2. On-board, full aperture, 5% absolute
classifier are found to be very simple to implement radiometric calibration. 3. A thermal IR channel
with less complexity. with 60m spatial resolution. 4. An on-board data
recorder.
3. STUDY AREA
The data set consists of 800*625 pixels and
Rio de Janeiro lies on a strip of Brazil's
covers Rio de Janeiro of Brazil[8] shown in Fig.1.
Atlantic coast, close to the Tropic of Capricorn,
The advantage of using this dataset is the
where the shoreline is oriented east–west. Facing
availability of the referenced image produced from
largely south, the city was founded on an inlet of
field survey, which is used for the accuracy purpose.
this stretch of the coast, Guanabara Bay and its
The original remote sensing image in false colors
entrance is marked by a point of land called Sugar
with RGB: 432 with its characteristics Table 1ist
Loaf– a "calling card" of the city.It lies in the
shown in Table.1. A ground truth image (reference
22°54′30″S 43°11′47″W coordinates[6].
image) is generated by field study campaign as in
This region retains good quality forest, Fig. 2. Random sampling is carried out to select the
including elfin forest, and has the most continuous pixels for training and testing the classifiers.
well-preserved remnant of Atlantic forest in the
Table 1.characteristics of Landsat7+ETM
state. Surveys occurred at four locations, each of
which had montane forest, and usually elfin forest. Sensor:L7ETM+
Araras Biological Reserve is a state conservation AcquisitionDate:February28,2000
unit located between Serra dos Órgãos National Path/Row:217/76
Park and Serra do Tinguá Biological Reserve Lat/Long:-22.904/-43.210
Fazenda Itatiba is within Três Picos State Park. Both
sites are part of the Montane Central Region,
covered mainly by Montane Atlantic Forest. The
Fig. 1 Map of South America Fig. 2 Rio de Janeiro
3. Use the training data to estimate
the parameters of the particular classifier
algorithm to be used; these parameters will be
the properties of the probability model used
or will be equations that define partitions in
the multispectral space. The set of parameters
for a given class is sometimes called the
signature of that class.
4. Using the trained classifier, label or
classify every pixel in the image into one of
the desired ground cover types (information
classes). Here the whole image segment of
interest is typically classified. Whereas
training in Step 2 may have required the user
Fig. 3 Satellite image of Rio de Janeiro to identify perhaps 1% of the image pixels by
other means, the computer will label the rest
by classification.
4. METHODOLOGY
The main aim of the study is to evaluate the 5. Produce tabular summaries or
performance of the different classification thematic (class) maps which summarize the
algorithms for forest and non-forest areas results of the classification.
classification using the multispectral data. 6. Assess the accuracy of the final product
Irrespective of the classifier used there are some using a labelled testing data set.
basic steps used in supervised image classification.
4.1 DEFINING THE TRAINING SAMPLES
1. Decide the set of ground cover types into
which the image is to be segmented. These Supervised classification is much more
are the information classes like forest, non- accurate for mapping classes, but depends heavily
forest, water etc. on the cognition and skills of the image specialist.
The strategy is simple: the specialist must recognize
2. Choose representative or conventional classes (real and familiar) or
prototype pixels from each of the desired set meaningful (but somewhat artificial) classes in a
of classes. These pixels are said to form scene from prior knowledge, such as personal
training data. Training sets for each class can experience with what's present in the scene, or more
be established using site visits, maps, air generally, the region it's located in, by experience
photographs or even photo interpretation of a with thematic maps, or by on-site visits. This
color composite product formed from the familiarity allows the individual(s) making the
image data. Often the training pixels for a classification to choose and set up discrete classes
given class will lie in a common region (thus supervising the selection) and then, assign
enclosed by a border. That region is then them category names. As a rule, the classifying
often called a training field. person also locates specific training samples on the
image - either a print or a monitor display - to
identify the classes. The resulting Training samples
are areas representing each known land cover computations. We put the word `accurate' in quotes
category that appear fairly homogeneous on the because this assumes that classes in the input data
image (as determined by similarity in tone or color have a Gaussian distribution and that signatures
within shapes delineating the category). In the were well selected; this is not always a safe
computer display one must locate these samples and assumption.[11] Multivariate normal statistical
circumscribe them with polygonal boundaries drawn theory describes the probability that an observation
using the computer mouse[9]. More than one X will occur, given that it belongs to a class k, as the
polygon is usually drawn for any class. The quality following function
of a supervised classification [10] depends on the
quality of the training sites.
4.2 SIGNATURE EXTRACTION
After the training site areas have been
digitized, the next step is to create statistical
characterizations of each information. For each class Where µ k Parametric mean vector associated with
thus outlined, mean values ,standard deviation and the k th class.
covariance of the selected area for each band used to
classify them are calculated from all the pixels X p-dimensional random vector
enclosed in each site.
Σ k covariance matrix associated with the k th class.
4.2.1. Estimation of MEAN for Classes:
Φ k(Xi) Probability density value associated with
The true values of the mean and observation Xi as
covariance matrix are not known and must be
estimated from training samples. The mean is as evaluated for class k..
typically estimated by the sample mean The quantitative product
can be thought of as a squared distance function

which measures the distance between the
observation and the class mean as scaled and
where Xi,j the sample j from class i. corrected for mean and covariance of the class. It
Ni the number of training samples from class i can be shown that this expression is an X 2 variate
with p degrees of freedom.
mi the sample mean
As applied in a maximum likelihood
decision rule, expression (1) allows the calculation
4.2.2 Estimation of Covariance Matrix for of the probability that an observation is a member of
Classes: each of k classes. The individual is then assigned to
the class for which the probability value is greatest.
The covariance matrix is typically estimated by the In an operational context, we substitute observed
sample covariance matrix means, variances, and covariances and use the log
form of expression (1)
Where Σ i the sample covariance matrix
4.3 CLASSIFICATION METHODS Where Dk p by p dispersion matrix associated with a

4.3.1 Maximum Likelihood Classifier sample of observations belonging to the k th class
Maximum likelihood Classification is a Since the log of the probability is a

statistical decision criterion to assist in the monotonic increasing function of the probability, the
classification of overlapping signatures; pixels are decision can be made by comparing values for each
assigned to the class of highest probability. The class as calculated from the right hand side of this
maximum likelihood classifier is considered to give equation
more accurate results than parallelepiped 4.3.2 Minimum distance Classification
classification however it is much slower due to extra
Minimum distance classifier classifies The decision boundaries form an n-
image data on a database file using a set of 256 dimensional parallelepiped in feature space. If the
possible class signature segments as specified by pixel value lies above the lower threshold and below
signature parameter. Each segment specified in the high threshold for all n bands evaluated, it is
signature, for example, stores signature data assigned to an unclassified category. Although it is
pertaining to a particular class. Only the mean only possible to analyze visually up to three
vector in each class signature segment is used. Other dimensions, it is possible to create an n-dimensional
data, such as standard deviations and covariance parallelepiped for classification purposes. The
matrices, are ignored (though the maximum parallelepiped algorithm is a computationally
likelihood classifier uses this)[13]. efficient method of classifying remote sensor at a
[14]. Unfortunately, because some parallelepipeds
The result of the classification is a theme
overlap, it is possible that an unknown candidate
map directed to a specified database image channel.
pixel might satisfy the criteria of more than one
A theme map encodes each class with a unique gray
class. In such cases it is usually assigned to the first
level. The gray-level value used to encode a class is
class for which it meets all criteria. A more elegant
specified when the class signature is created. If the
solution is to take this pixel that can be assigned to
theme map is later transferred to the display, then a
more than one class and use a minimum distance to
pseudo-color table should be loaded so that each
means decision rule to assign it to just one class.
class is represented by a different color.
Classification Accuracy Assessment
The equation used by minimum Euclidian distance
classifier defined by the following equation[12]: Quantitatively assessing classification
T accuracy requires the collection of some in situ data
Gi(X) = (X-Ui) * (X-Ui)
or a priori knowledge about some parts of the terrain
= SUM [(xj-uj) **2] for j = 1 to d. which can then be compared with the remote
sensing derived classification map. Thus to assess
classification accuracy it is necessary to compare
Gi(X) is the result for class i on pixel X two classification maps 1) the remote sensing
derived map, and 2) assumed true map (in fact it
T indicates transposition of the elements in brackets may contain some error). The assumed true map
d is the number of pixels in the classification may be derived from in situ investigation or quite
often from the interpretation of remotely sensed data
X=(x1,…, xd) is the (d by 1) pixel vector of grey- obtained at a larger scale or higher resolution.[15]
levels
Classification Error Matrix
Ui=(u1,...,ud) is the (d by 1) mean vector for class i
One of the most common means of
J is the subscript of jth element of a vector expressing classification accuracy is the preparation
SUM[]is the total of elements inside brackets of classification error matrix sometimes called
confusion or a contingency table. Error matrices
The distances between the pixel to be compare on a category by category basis, the
classified and each class centre are compared. The relationship between known reference data (ground
pixel is assigned to the class whose centre is the truth) and the corresponding results of an automated
closest to the pixel. classification. Such matrices are square, with the
If for all i not equal j, Gj(X) < Gi(X), then X is number of rows and columns equal to the number of
classified as j. categories whose classification accuracy is being
assessed. error matrix that an image analyst has
Parallelepiped Classification Algorithm prepared to determine how well a Classification has
This is a widely used decision rule based categorized a representative subset of pixels used in
on simple Boolean “and/or” logic. the training process of a supervised classification.
This matrix stems from classifying the sampled
Training data in n spectral bands are used training set pixels and listing the known cover types
in performing the classification. used for training (columns) versus the Pixels
Brightness values from each pixel of the actually classified into each land cover category by
multispectral imagery are used to produce an n- the classifier (rows). An error matrix expresses
dimensional mean vector, Mc = (μc1, μc2, μc3, ... μcn) several characteristics about classification
with μck being the mean value of the training data performance. For example, one can study the
obtained for class c in band k out of m possible various classification errors of omission (exclusion)
classes, as previously defined. Sck is the standard and commission (inclusion). Several other measures
deviation of the training data class c of band k out of for e.g. the overall accuracy of classification can be
m possible classes. computed from the error matrix. It is determined by
dividing the total number correctly classified pixels
(sum of elements along the major diagonal) by the
Table 3 Maximum likelihood Classification Error
total number of reference pixels. Likewise, the
matrix (Percentage)
accuracies of individual categories can be calculated
by dividing the number of correctly classified pixels CLASS CLASS1 CLASS2 CLASS3 TOTAL
in each category by either the total number of pixels
in the corresponding rows or column. Producers Unclassified 0.21 0.33 0.2 0.21
accuracy which indicates how well the training sets Class1[Red] 99.79 0.07 0 96.06
pixels of a given cover type are classified can be
determined by dividing the number of correctly Class2[Blue] 0 99.61 0 2.22
classified pixels in each category by number of Class3[Green] 0 0 99.8 1.5
training sets used for that category (column total).
Total 100 100.01 100 100
Users accuracy is computed by dividing the number
of correctly classified pixels in each category by the Producer’s Accuracy and Users Accuracy
total number of pixels that were classified in that
category (row total). Red(non-forest)= 65367/65503 = 99.8%
Red= 65367/65368 = 99.9%
Kappa coefficient
Green(forest) = 1514/1520 = 99.6%
Kappa analysis is a discrete multivariate technique Green= 1514/1514 = 100%
for accuracy assessment
Blue (water) = 1022/1024 = 99.8%
Kappa analysis yields a Khat statistic that is the water = 1022/1022 = 100%
measure of agreement of accuracy. The Khat
statistic is computed as Overall accuracy = (65367+ 1514 + 1022)/68047=
99.7%
Kappa Coefficient =0.9716.
The error matrix shows the accuracy of
Maximum Likelihood classification method using
Where r is the number of rows in the matrix xii is the image provided. Similarly the error matrices for
the number of observations in row i and column i, other classification were found out. The minimum
and xi+ and x+i are the marginal totals for the row i distance classifier is found to be the least accurate
and column i respectively and N is the total number with the lowest accuracies. The output of the
of observations. classification is shown in figure [Fig. 4-6]. Overall,
A study of the performance of various the Maximum Likelihood classifier shows the
classifiers based on the overall accuracy, kappa highest accuracy assessment for this particular area.
coefficient, and confusion matrix and it is shown in 6. Conclusion
table 2 and 3 is made. It is observed that Maximum
Likelihood classification method is determined to be In this paper we have compared the
the most accurate. One of the reasons is it filters out performance of various classifiers and found that the
shadows and also it classifies the highly varied Maximum Likelihood classifier outperforms other
clusters. classifiers. This accurate but simple classifier shows
the importance of considering the data set -
classifier relationship for successful image
classification. Further studies are required to
improve the use of classifiers to increase the
Table.2 Maximum Classification Error matrix applicability of such methods. There is a need to
(Pixels) develop new work to compare the supervised
classifiers with unsupervised classifiers and identify
CLASS CLASS1 CLASS2 CLASS3 TOTAL
the pros and cons of among them.
Unclassified 136 5 2 143
Reference:
Class1[Red] 65367 1 0 65368
Class2[Blue] 0 1514 0 1514 [1] Campbell, “Introduction to Remote Sensing”,
GE-21,pp.383-392,2002.
Class3[Green 0 0 1022 1022
] [2] Grace Nangendo , Andrew K. Skidmore b,
Total 65503 1520 1024 68047 Henk van Oosten , “Mapping East African
tropical forests and woodlands — A
comparison of classifiers”, ISPRS Journal of
Photogrammetry & Remote Sensing 61
,pp.393–404, 2007
[3] Vieira, I.C.G., Almeida, A.S.d., Davidson, Alan H. Strahler, “The Use of Prior Probabilities in
E.A., Stone, T.A., Carvalho, C.J.R.d., Maximum Likelihood Classification of Remotely
Guerrero, J.B., “Classifying successional Sensed Data”, Remote Sensing Environment10,135-
forests using Landsat spectral properties and 163 (1980).
ecological characteristicsin eastern
Amazonia”,Remote Sensing of Environment [11] http://www.pcigeomatics.com/cgi-
87 (4), 470–481,2003. bin/pcihlp/MINDIS
[4] Sudhir Gupta, “Exploring The Utility Of [12] Aykut Akgun, A.Husnu Eronatb and Necdet
Moderate Resolution Time Series Remotely Turka, “Comparing different satellite image
Sensed Data For Land Use/Cover classification methods: An application in
Classification”, International Institute of Ayvalik district, western Turkey”,vol.7,678-
Information Technology, Hyderabad, INDIA, 697,2009.
December 2009.
[13] K Perumal and R Bhaskaran, “Supervised
[5] M. Heinl, J. Walde, G. Tappeiner, U. classification performance of multispectral
Tappeiner ,” Classifiers vs. input variables— images”, Journal of computing, vol. 2, issue 2,
The drivers in image classification for land 151-167, February 2010
cover mapping”, International Journal of
Applied Earth Observation and [14] Minakshi Kumar, “Digital Image Processing”,
Geoinformation 11 , pp.423–430, 2009 Photogrammetry and Remote Sensing
Division, Indian Institute of Remote Sensing,
[6] www.wikipedia.com Dehra Dun.
[7] Maria Alice S. Alves ,Clinton N.
Jenkins,Stuart L. Pimm ,Alline Storni,Marcos
A. Raposo ,M. de L. Brooke,Grant Harris
,Andy Foster, “LISTS OF SPECIES Birds,
Montane forest, State of Rio de Janeiro,
Southeastern Brazil”, ISSN: 1809-127X, 289–
299, 2009.
[8] http://landsat.usgs.gov
http://landsat.usgs.gov/gallery_view.php?
category=nocategory&thesort=mainTitle
[9] http://rst.gsfc.nasa.gov/Sect1
[10] C. Palaniswami, A. K. Upadhyay and H. P.

Maheswarappa, "Spectral mixture analysis for
subpixel classification of coconut", Current
Science, Vol. 91, No. 12, pp. 1706 -1711, 25
December 2006.
Fig-4 Parallelopiped classification

Fig-5 maximum Likelihood classification
Fig-6 Minimum distance classification

Paper Presentation - Sathiyavani at 11th Feb.

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Paper Presentation - Sathiyavani at 11th Feb.

Uploaded by

Copyright:

Available Formats

CLASSIFICATION OF FOREST AND NON-FOREST AREAS

R.Sathiyavani* Prof V.Srinivasan.

can be thought of as a squared distance function

Where Σ i the sample covariance matrix

4.3 CLASSIFICATION METHODS Where Dk p by p dispersion matrix associated with a

Maximum likelihood Classification is a Since the log of the probability is a

[10] C. Palaniswami, A. K. Upadhyay and H. P.

Fig-4 Parallelopiped classification

Fig-6 Minimum distance classification

You might also like