Professional Documents
Culture Documents
Each of the face images belongs to one of the c classes. that X = ( X 1 , X 2 ,..., X n ) ⊂ ℜ m is a matrix containing the
2
( n )∑
by:
where Φ = (Φ1 , Φ 2 ,..., Φ n ) ⊂ ℜ m
2
×n n
Z= 1
( )( )
and Z i is c (5)
i =1
S B = ∑ ni X i − X X i − X
T
∑ ∑ ⎛⎜⎝ X − X ⎞⎟⎛⎜ X i − X ⎞⎟
i i
SW = (6)
corresponding to its related eigenface X ⊂ ℜ r×n can be i =1 X i∈ni
i
⎠⎝ ⎠
calculated by projecting Z in the eigenface space by: The within class scatter matrix defines the data of one class
X =UTZ (3)
across the average of the class. The optimal subspace is
computed by
ET SB E
III. DISCRETE COSINE TRANSFORM Eoptimal = argmaxE T = [c1, c2 ,...,cc−1 ] (7)
The DCT transforms spatial information to decoupled E SW E
frequency information in the form of DCT coefficients. Also it
exhibits excellent energy compaction. The definition of DCT Where [c1 , c2 ,..., cc−1 ] is the set of eigen vectors of S B and SW
for an N × N image is [4]:
corresponding to c − 1 greatest generalized eigen value λi
N −1 N −1
π (2x + 1)u π (2 y + 1)v
∑∑Z(x, y)cos
C(u, v) = α (u)α (v)
2N
cos
2N
and i = 1,2,..., c − 1
x=0 y =0 S B Ei = λi SW Ei i = 1, 2,..., c − 1 (8)
u, v = 0,1,2,...,N −1 (4) Eoptimal is an optimal matrix which maximize the proportion of
⎧ 1 u=0 between class scatter matrix to the within class scatter matrix.
⎪ N
α(u) = ⎨ This means that it maximize the scattering of the data that
⎪ 2 u≠0 belongs to different classes and minimize the scattering of the
⎩ N data belonging to the same class.
For an N × N image the matrix coefficients cover the whole Thus the most discriminant answer for face images X would
frequency space of image components. The DCT coefficients be [5]:
with higher values are placed in the upper left of the matrix. In P = ET ⋅X
optimal
(9)
order to convert the DCT matrix to a vector, we need to start
with the upper left side of the matrix and in a zigzag manner put
the elements in a vector. As a feature extraction method, DCT
would change the face images with high dimension to a
subspace with low dimension in which the most important V. RADIAL BASIS FUNCTION NEURAL NETWORK
features of face such as the lines belonging to hairs and face, the For classification we used a distance measure and also RBF
position of eyes, nose and mouth will remain in the DCT neural network in order to compare their classification power.
coefficients. As a distance measure we selected Euclidean distance. RBF
neural network is a powerful classification method for pattern
recognition problems. It doesn't have the drawbacks of multi
IV. LINEAR DISCRMINANT ANALYSIS layer perceptron neural networks and trains much faster than it.
LDA is used for projecting a set of training data. In here Fig. 1 shows an RBF neural network.
these training data are the vectors which have been extracted by
PCA and DCT in previous sections. In new face space which is
a m 2 dimension and m 2 is the vector dimension, consider
5 image from every person was used as training set and the rest
used as test set. Fig. 2 shows a sample of this database.
VII. CONCLUSION
In this paper we used DCT as a prior feature extraction for
LDA and showed the result on ORL database using Euclidean
distance and RBF neural networks as classifiers. The results
show that DCT extracts more discriminant features than PCA
and also show that RBF classifier is much stronger than
distance based classifiers such as Euclidean distance. This
work can be applied to other databases to see the result of
proposed method on other databases.
REFERENCES
[1] R. Hietmeyer, “Biometric identification promises fast and secure
processing of airline passengers,” The Int’l Civil Aviation Organization
Journal, vol. 55, no. 9, 2000, pp. 10–11.
[2] M. Turk, and A. Pentland,, “Eigenfaces for recognition,” J. Cognitiv
Neurosci., vol. 3, 1991, pp. 71–86.
[3] ORL database: http://www.camorl.co.uk.
[4] P. M. Embree, and B. Kimble, C language Algorithm for Digital Signal
Processing. Printice Hall, New Jercy , 1991
[5] K. Fukunaga,, Introduction to statistical pattern recognition, 2nd ed. San
Diego, CA: Academic Press, 1990, pp. 445-450.