Professional Documents
Culture Documents
To cite this article: S. Abolfazl Hosseini & Hassan Ghassemian (2016) Rational function
approximation for feature reduction in hyperspectral data, Remote Sensing Letters, 7:2,
101-110, DOI: 10.1080/2150704X.2015.1101180
1. Introduction
Hyperspectral (HS) data sometimes called imaging spectroscopy (Ben-Dor et al. 2009)
contain hundreds of spatially co-registered images in the form of image cubes; each image
corresponds to a specic narrow spectral band usually in the wavelength range of
4002500 nm. For an N bands HS data set, the intensities measured for each pixel by the
HS sensor can be considered as the elements of an N-dimensional vector. The plot of the
sequence of intensity values corresponding to the reectance amounts of its ground surface
in adjacent wavelengths intervals (y = [y1, y2. . . yN]T, where T denotes vector transpose) as a
function of band numbers (x = [1, 2,. . ., N]T) is named as spectral signature (SS) of pixel. In
supervised classication of this type of data, some theoretical and practical problems like
Hughes phenomenon appear due to their high between bands correlation and high
dimensional spaces specications (Landgrebe 2002). There are four strategies to reduce
this phenomenon eect in HS images classication: semi-supervised classication
(Marconcini, Camps-Valls, and Bruzzone 2009; Shahshahani and Landgrebe 1994),
LDA. Also, optimal selection of spectral bands has been extensively discussed in the
literature (Landgrebe 2002). The features produced by feature reduction step are fed to
the classier. The maximum likelihood classier (MLC) is a widely used parametric
classier in HS data classication (Pal and Mather 2003).
Despite simple structure and relatively good results, many statistical analysis-based
methods like PCA and LDA have some deciencies when they are used in HS image
classication. For example, these methods do not consider the geometric aspects of SSs
and the ordinance of original features which is a rich source of information. For each
pixel of an HS image, we have a vector of measured quantities corresponding to
reectance in consecutive wavelengths. Therefore, the ordinance of measurements
might have some useful information. In other words, SS as a curve has some useful
information. Hosseini and Ghassemian (2012, 2013) introduced an FE technique based
on the fractal nature of SS that inherently depends on the ordinance of samples. Run
and King (1999) and Tsai and Philpot (1998) proposed a method for nding absorption
points of spectra based on derivative spectra. Since derivative computation is very
sensitive to noise, they have used a smoothing preprocessing step. Other disadvantage
of many FE methods is that they cannot be applied pixel by pixel, and rst we need to
know all data points to compute transformation matrix before transforming data to new
space and producing new features. The main contribution of this letter is to introduce an
FE method which regards the geometrical nature of the SSs and the ordinance informa-
tion existing in the SS that yields improvement of classication results. Indeed, we try to
t a rational function with polynomial numerator and denominator to the SS of each
pixel. Then the coecients of these polynomials are considered as new feature vectors
and are fed to an MLC. Although the concept of curve tting has already been used in
HS data processing in applications such as spectra smoothing (Run and King 1999; Tsai
and Philpot 1998), the novelty of the proposed framework is the use of the coecients
of the tted curves (not the amounts of the tted curves themselves) as the reduced
features in the new space, and using these reduced features in data classication and
compression. Results are compared to PCA, LDA, NWFE and MMP as four basic and
classic FE methods with relatively good performance for HS data classication. Unlike
these methods, our proposed method is applied pixel by pixel. Therefore, a parallel
REMOTE SENSING LETTERS 103
least absolute residuals and bisquare tting. In the LS method, f() is found by minimiz-
ing the following weighted mean squared error:
1X N
w f I 2 (1)
N 1
There are dierent curve-tting models like polynomial, linear, spline, etc. In this
letter, a method for tting a curve using rational functions is utilized in order to extract
new features for classication and compression purposes. Therefore, it is important that f
() has fewer parameters than the number of data samples.
Consider a function f and the two integers L 0 and M 0, the rational function
approximant of order (L, M) for f is dened as ^f
, !
X
L X
M
^f
L;M
f cjM1
j
1 cj
j
(2)
j0 j1
For the same model, dissimilarities of curves yield dierences in the coecients of the
tted curves. Therefore, it seems that these coecients could be used as discriminating
features for the curves. For example, in Figure 1(a) two dierent families of curves are
plotted, and the distributions of their rational function approximants coecients are
demonstrated in Figure 1(b)(f). In this curve-tting problem, we tted a rational func-
tion with L = 0 and M = 4 to both families of curves. Therefore, each curve can be
expressed with its own ve coecients. As can be seen, the histograms of some
coecients completely separate the two families. This fact can be considered as a
motivation for using the coecients of the rational function tted curves of the SSs as
discriminating features in HS data classication tasks.
Figure 1. Two families of curves and the histograms of the coecients of their corresponding Pad
approximants: (a) the curves families (family 1: solid blue lines, and family 2: dotted red lines); (b)
Downloaded by [Wuhan University] at 05:04 12 November 2015
1X N
2
E f^ = N f = N (4)
N 1
( j )
f N N j 1; :::; M
where ANML1 aj , aj j and bNx1
j M 1; :::; M L 1
f N1 ; f N2 ; ::::; f NN t . N
REMOTE SENSING LETTERS 105
4. Experimental results
4.1. HS data sets
The rst data set used in our experiments is a mixed forest/agricultural 145 145
pixels image from Indian Pine Site (IPS) in Indiana. It is captured by AVIRIS sensor.
The spatial resolution of this data set is 20 m. The image contains 220 reectance
bands in the wavelengths from 400 to 2500 nm with 10 nm resolution. After
removing water absorption bands, N = 200 bands were left. The scene contains
16 dierent land covers, and detailed information can be found in Universidad-del-
Pais-Vasco (2015).
The other data set was gathered over the urban area of the University of Pavia
(UP) by ROSIS sensor. It is 610 340 pixels, with the spatial resolution of 1.3 m, in
115 reectance bands in the range of 430860 nm. Discarding some noisy bands
yields N = 103 bands nally. This scene contains nine dierent land cover
(Universidad-del-Pais-Vasco 2015). Band 18 of these data sets in shown in Figures
2(a) and 3(a), respectively.
Figure 2. Comparing the original and the reconstructed image of IPS (band 18): (a) original data; (b)
obtained using the RFCF-FE method; (c) obtained using inverse PCA.
106 S. A. HOSSEINI AND H. GHASSEMIAN
Downloaded by [Wuhan University] at 05:04 12 November 2015
Figure 3. Comparing the original and the reconstructed image of UP (band 18): (a) Original data; (b)
obtained using the RFCF-FE method; (c) obtained using inverse PCA.
Figure 4. Spectral signature of a typical pixel and its approximation: (a) IPS data, image coordi-
nate = (50, 20), Land cover = corn; (b) UP data, image coordinate (70, 70), Land cover = self-blocking
Downloaded by [Wuhan University] at 05:04 12 November 2015
bricks.
results show that these exceptional points do not have a severe impact on the classica-
tion performance. This is because (1) these exceptional points in each SS are rare, if any;
(2) the locations of these points are not the same for dierent SSs, i.e. this phenomenon
does not destroy any specic band completely; and (3) for each SS, the coecients of
the corresponding tted curve are used as the features, not the curve itself.
Figure 5 demonstrates the accuracy assessment measures for IPS and UP data sets.
The optimum values of L (and so M) parameters of the RFCF-FE method dier for
dierent values of D and dierent iterations of the algorithm, but in most cases the
best classication results have occurred when values of L are 0, 1, D 2 and D 1. Note
that despite the other methods the maximum number of extracted features in the LDA
method is equal to Nc 1, where Nc is the number of classes.
The superiority of the RFCF-FE method compared to the other methods is appar-
ent from Figure 5. As can be seen, all measures (average accuracy, average validity,
overall accuracy and kappa statistics) have been dramatically improved by the RFCF-
FE algorithm in comparison to the other methods. The only exception is for the MMP
method in the UP case at D = 12, 13 and 14. This improvement in IPS is more than in
UP because IPS data contain agricultural scene with less details and larger ground
instantaneous eld of view in contrast to the urban scene of UP. Therefore, our
method outperforms PCA, NWFE and LDA for agricultural scenes as well as urban
ones and loses out to MMP by a few points for the urban data set.
Table 1 contains the peak signal-to-noise ratio (PSNR) values for reconstructed images
from PCA and the proposed method for IPS and UP, respectively, when D varies from 3 to 15.
PSNR for the proposed method corresponds to the values of L and M that yields to the
best result. These optimum values of L and M have been shown in these tables. The
superiority of the proposed method with respect to the PCA-based compressing method is
apparent from these tables. As demonstrated in Table 1, in most cases for the UP data set, the
best result is obtained when M = 0. It implies that Maclaurin series has a better performance
in these cases. PSNR is calculated using 10 log10 (S/N), where S is the energy of original signal
and N is the energy of dierence between the original and decompressed signal.
108 S. A. HOSSEINI AND H. GHASSEMIAN
Downloaded by [Wuhan University] at 05:04 12 November 2015
Figure 5. Comparison of the RFCF-FE method with those of PCA, NWFE, MMP and LDA in terms of
common accuracy measures. (a)(d) For IPS data, and (e)(h) for UP data. Reference ground truth
maps are available in Universidad-del-Pais-Vasco.
REMOTE SENSING LETTERS 109
Table 1. Comparing PSNR of the proposed method and inverse PCA for IPS and UP data sets.
Compression rate IPS Compression rate UP
(N/D) Best(L,M) PSNR IPCA PSNR (N/D) Best(L,M) PSNR IPCA PSNR
200/3 (0, 2) 34.95 24.39 103/3 (2, 0) 31.85 29.76
200/4 (0, 3) 27.42 25.74 103/4 (3, 0) 20.25 16.44
200/5 (0, 4) 58.70 24.20 103/5 (2, 2) 32.92 23.54
200/6 (0, 5) 29.63 23.28 103/6 (5, 0) 21.35 16.17
200/7 (4, 2) 24.17 24.46 103/7 (5, 1) 25.29 19.27
200/8 (0, 7) 22.78 25.13 103/8 (2, 5) 24.06 19.52
200/9 (6, 2) 55.11 25.21 103/9 (1, 7) 43.77 22.52
200/10 (2, 7) 34.38 25.90 103/10 (2, 7) 38.98 21.77
200/11 (6, 4) 28.46 27.03 103/11 (6, 4) 68.19 22.74
200/12 (8, 3) 28.17 28.99 103/12 (11, 0) 34.11 23.65
200/13 (5, 7) 31.47 29.43 103/13 (12, 0) 27.86 22.91
200/14 (2, 11) 41.01 31.47 103/14 (13, 0) 26.05 24.08
200/15 (10, 4) 42.38 30.14 103/15 (14, 0) 35.47 23.98
Downloaded by [Wuhan University] at 05:04 12 November 2015
5. Conclusion
A new FE method for HS data is proposed using rational function curve tting. The main
motivation for using the curve-tting approach for HS data FE is the utilization of the
information that exists in the sequence of original features (ordinance of reectance
coecients in SS) that are neglected by competing methods. The coecients of SSs
approximants are calculated through analytical operations. These extracted features are
then fed into an ML classier. The size of the training samples is selected as 10% of total
data volume with a minimum of 15 and maximum of 50 samples for each class. The
classication performance is compared to PCA as a traditional unsupervised FE method
and LDA and NWFE as two supervised FE methods and MMP as an unsupervised manifold-
based algorithm. The results show the superiority of the proposed method. Also, it has
been shown that this technique has satisfactory results for signal visualization and signal
representation, and can be considered as a good coding algorithm for lossy compression
of HS data. The proposed method can be applied pixel by pixel and does not need to
transform whole data to a new space simultaneously. In addition, this method is a novel
approach which can be used as a powerful base for developing more ecient FE methods.
Disclosure statement
No potential conict of interest was reported by the authors.
Funding
This work was supported by Iran communication research center [grant number 18133/500 T by
Identication code: 90-01-03].
References
Acevedo, J. C., H. Haneishi, M. Yamaguchi, N. Ohyamaa, and J. Baez. 2004. CIE-XYZ Fitting by
Multispectral Images and Mean Square Error Minimization with a Linear Interpolation Function.
Revista Mexicana De Fsica 50 (6): 601607.
Baudat, G., and F. Anouar. 2000. Generalized Discriminant Analysis Using a Kernel Approach.
Neural Computation 12 (10): 23852404. doi:10.1162/089976600300014980.
110 S. A. HOSSEINI AND H. GHASSEMIAN
Ben-Dor, E., S. Chabrillat, J. Dematt, G. Taylor, J. Hill, M. Whiting, and S. Sommer. 2009. Using
Imaging Spectroscopy to Study Soil Properties. Remote Sensing of Environment 113: S38S55.
doi:10.1016/j.rse.2008.09.019.
Camps-Valls, G., N. Shervashidze, and K. M. Borgwardt. 2010. Spatio-Spectral Remote Sensing
Image Classication With Graph Kernels. IEEE Geoscience and Remote Sensing Letters 7 (4): 741
745. doi:10.1109/LGRS.2010.2046618.
Fang, L., and D. C. Gossard. 1995. Multidimensional Curve Fitting to Unorganized Data Points by
Nonlinear Minimization. Computer-Aided Design 27 (1): 4858. doi:10.1016/0010-4485(95)
90752-2.
Hosseini, A., and H. Ghassemian. 2012. Classication of Hyperspectral and Multispectral Images by
Using Fractal Dimension of Spectral Response Curve. In 2012 20th Iranian Conference on
Electrical Engineering (ICEE), 14521457. Tehran: IEEE.
Hosseini, A., and H. Ghassemian. 2013. A New Hyperspectral Image Classication Approach Using
Fractal Dimension of Spectral Response Curve. In 2013 21st Iranian Conference on Electrical
Engineering (ICEE), 16. Mashad: IEEE.
Kuo, B.-C., and D. A. Landgrebe. 2004. Nonparametric Weighted Feature Extraction for
Downloaded by [Wuhan University] at 05:04 12 November 2015