Professional Documents
Culture Documents
Research Article
Applying Artificial Neural Networks for Face Recognition
Thai Hoang Le
Department of Computer Science, Ho Chi Minh University of Science, Ho Chi Minh City 70000, Vietnam
Correspondence should be addressed to Thai Hoang Le, lhthai@fit.hcmus.edu.vn
Received 25 January 2011; Revised 16 June 2011; Accepted 19 July 2011
Academic Editor: Naoyuki Kubota
Copyright 2011 Thai Hoang Le. This is an open access article distributed under the Creative Commons Attribution License,
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This paper introduces some novel models for all steps of a face recognition system. In the step of face detection, we propose
a hybrid model combining AdaBoost and Artificial Neural Network (ABANN) to solve the process eciently. In the next step,
labeled faces detected by ABANN will be aligned by Active Shape Model and Multi Layer Perceptron. In this alignment step,
we propose a new 2D local texture model based on Multi Layer Perceptron. The classifier of the model significantly improves the
accuracy and the robustness of local searching on faces with expression variation and ambiguous contours. In the feature extraction
step, we describe a methodology for improving the eciency by the association of two methods: geometric feature based method
and Independent Component Analysis method. In the face matching step, we apply a model combining many Neural Networks
for matching geometric features of human face. The model links many Neural Networks together, so we call it Multi Artificial
Neural Network. MIT + CMU database is used for evaluating our proposed methods for face detection and alignment. Finally, the
experimental results of all steps on CallTech database show the feasibility of our proposed model.
1. Introduction
Face recognition is a visual pattern recognition problem.
In detail, a face recognition system with the input of an
arbitrary image will search in database to output peoples
identification in the input image. A face recognition system
generally consists of four modules as depicted in Figure 1:
detection, alignment, feature extraction, and matching,
where localization and normalization (face detection and
alignment) are processing steps before face recognition
(facial feature extraction and matching) is performed [1].
Face detection segments the face areas from the background. In the case of video, the detected faces may
need to be tracked using a face tracking component. Face
alignment aims at achieving more accurate localization
and at normalizing faces thereby, whereas face detection
provides coarse estimates of the location and scale of
each detected face. Facial components, such as eyes, nose,
and mouth and facial outline, are located; based on the
location points, the input face image is normalized with
respect to geometrical properties, such as size and pose,
using geometrical transforms or morphing. The face is
usually further normalized with respect to photometrical
Image
Face
detection
tracking
Face
location
size & pose
Face
alignment
Aligned
face
Feature
extraction
Feature
Vector
Feature
matching
Face ID
Enrolled
users
One of the most popular and ecient learning machinebased approaches for detecting faces is AdaBoost approach
[6]. Viola and Jones designed a fast, robust face detection
system where AdaBoost learning is used to build nonlinear
classifiers. AdaBoost is used to solve the following three
fundamental problems: (1) learning eective features from
a large feature set; (2) constructing weak classifiers, each
of which is based on one of the selected features; (3)
boosting the weak classifiers to construct a strong classifier.
Viola and Jones make use of several techniques for eective
computation of a large number of such features under
varying scale and location which is important for realtime performance. Moreover, the cascade of strong classifiers
which form cascade tree will make the computation even
more ecient. Their system is the first real-time frontal-view
face detector. However, their system still has some drawbacks.
Since the detection results depend on weak classifiers, the
detection results often have many false positives. To decrease
the rate of false positives, it is compelled to increase
the number of strong classifiers and Haar-like features in
cascade tree, but this will cause a significant increase in
the performance time, and detection rate can be decreased.
Thus, to deal with the issue, we should combine AdaBoost
with other machine learning techniques to achieve the same
face detecting ratios but with the minimum number of false
positives and the running time.
One of the popular methods having the same achievement as well is artificial neural networks (ANNs) [7]. ANN
is the term on the method to solve problems by simulating neurons activities. In detail, ANNs can be most adequately characterized as computational models with particular properties such as the ability to adapt or learn, to
generalize, or to cluster or organize data, and which operation is based on parallel processing. However, many of the
previously mentioned properties can be attributed to nonneural models. A hybrid approach combining AdaBoost
and ANN is proposed to detect faces with the purpose of
decreasing the performance time but still achieving the desired faces detecting rate.
Our hybrid model is named ABANN. This is the model
of combining AB and ANN for detecting faces. In this model,
ABs have a role to quickly reject nonface images; then ANNs
continue filtering false negative images to achieve better
results. The final result is face/nonface.
The selected neural network here is three-layer feedforward neural network with back propagation algorithm.
The number of input neurons T is equivalent to the length of
extracted feature vector, and the number of output neurons
3
Image
AdaBoost combining artificial neural
network for face detection
Feature extraction using the association of geometric featurebased method and independent component analysis
Extract geometry face global and component feature vectors
Face ID
The image results of the step will be the inputs of the face
alignment step. The next section elaborates our proposed
method.
Image
C1
C2
Candidate
image
Face/
nonface
Nonface
AdaBoost
ANN
(a)
Window 20 20 pixels
(b)
Figure 3: (a) The process of detecting faces of ABANN and (b) input features for neural network.
Face alignment
T(X + Ps bs , xc , yc , sx , s y , )
Image
Model
Image
p1
b1
X=
1
Xi .
s i=1
(1)
s
T
1
Xi X Xi X .
s 1 i=1
(2)
(3)
(4)
E bs , xc , yc , sx , s y , = X T X + Ps bs , xc , yc , sx , s y , .
(6)
bs = PsT X X ,
bi 3 i + 3 i .
X = T X + Ps bs , xc , yc , sx , s y , .
2
(5)
Step 1. Computing normal vector at point (xi , yi ) and calculating tangent vector t,
tx = xi+1 xi1 ,
t y = yi+1 yi1 .
g(k)
(7)
tx
,
|t |
ty =
ty
.
|t |
x The strongest
edge
(8)
Point
n y = tx .
(9)
k [. . . , 2, 1, 0, 1, 2, . . .].
(10)
31
Feature
point
MLP
31
gkl
gkc
gkr
+
+
g(k) =
.
4
2
4
(11)
y y1 , y2 , . . . , ym ,
(12)
j
y Lj 1 wiLj + wi0L .
y = MLP(x, W);
(13)
f (x) = a x
1
x ,
a
1
1
<x<+ ,
a
a
1
x+ .
a
(14)
7
identifies the mixing matrix. (3) It finds an unnecessary
orthogonal basic which may reconstruct the data better than
PCA in the presence of noise such as variations lighting and
expressions of face. (4) It is sensitive to high-order statistics
in the data, not just to the covariance matrix.
The appearance-based method group has been found the
best performer in facial feature extraction problem because
it keeps the important information of face image, rejects
redundant information, and reflects face global structure.
In many applications of face recognition such as identity
authentication for credit card and video surveillance, accuracy of face recognition problem is the best important factor.
Therefore, besides principle components, independent components of data and face global structure are kept by PCA and
ICA method. Combination of these features with geometric
features such as nose, eyes, and mouth in recognition will
increase accuracy, confident of face recognition system.
In this section, we present architectures of ICA for face
recognition and combining ICA method with geometric
feature-based method (GICA) [24], then comparison of
GICA method and GPCA method on CalTech database
[4]. Since then, practicability of GICA method for face
recognition problem is demonstrated.
4.1. Geometric-Face Global Feature Vector. Labeled faces were
detected by ABANN20 (Section 2), which will be normalized
in a standard size of 30 30 pixels. After standardization,
the face images will be represented by vectors xface =
(x1 , x2 , . . . , x900 )T .
4.2. Geometric-Face Component Feature Vectors. Section 3
presented a method to align face by MLP-ASM (multilayer
perceptronactive shape model). After face alignment, we
can detect image regions that contain eyes and mouth, in face
images. To define face component feature vectors, we detect
two image regions: (a) region 1 contains eyes, which was
detected by coordinates of feature points: left temple and right
temple features, left outer top eye brow and left inner top eye
brow features, right outer top eye brow and right inner top eye
brow features, left eye bottom and right eye bottom features;
(b) region 2 contains labeled face without mouth, which was
detected by coordinates of feature points: Left Jaw and Right
Jaw features, Left Outer Top Eye Brow and Left Inner Top Eye
Brow features, Right Outer Top Eye Brow and Right Inner Top
Eye Brow features, and Left Nose Bottom, Nose base, and Right
Nose Bottom. For instance in Figure 11.
After locating image regions of face image, region 1 and
region 2 will be normalized in a standard size of 30 30
pixels. We have vectors that represent eyes image region:
xeyes = (x1 , x2 , . . . , x900 )T , and face without mouth image
region: xface no mouth = (x1 , x2 , . . . , x900 )T . These vectors are
called Geometric-component feature vector.
4.3. ICA Method for Facial Feature Extraction. Independent
component analysis (ICA) minimizes both second-order and
higher-order dependencies in the input data and attempts
to find the basis along which the data (when projected
onto them) are statistically independent. Bartlett et al. [22]
U
Coefficient 1
Pixel 1
(a)
(b)
A
Basic
images
Pixel 2
Coefficient 2
..
.
..
Weights
.
matrix was
Coefficient p
Pixel p
produced by
ICA Independent coefficients
Input face images
(15)
(16)
(17)
(18)
Figure 13: First row contains eight eigenfaces corresponding to the highest eight eigenvalues for PCA. Second row first of eight basic images
for architecture II ICA.
xface
xeyes
xface no mouth
ICA
yface
ICA
yeyes
ICA
yface no mouth
ycomb
10
Vector
V1
Training
loop for
SNN1
V1
SNN1
Vector
R1
Vector R1
Vector
Vector
V2
R2
Vector
Vector
Vm
RL
CNN1
Global
frame
V2
Vm
SNN2
Vector R2
CNN2
Vector RL
CNNL
SNN1m
V1
V1
SNN1
Vector R1
SNN1
CNN1
Global
frame
V2
Vm
SNN2
Vector R2
CNN2
Vector RL
CNNL
SNN1m
CNN1
Global
frame
V2
Vm
SNN2
Vector R2
CNN2
Vector RL
CNNL
SNN1m
xface
ICA
11
Table 1: The ANN structure for detecting faces.
yface
Name
xeyes
xface no mouth
ICA
ICA
ANN FACE
yeyes
Input
nodes
25
Hidden
nodes
25
Output
nodes
1
Learning rate
0.3
ycomb
yface no mouth
1 ex
,
1 + ex
(19)
12
Figure 20: Example of face images used for training AdaBoost detector.
Table 2: Performance of detecton on MIT + CMU test set of AdaBoost detector.
Method
AB20
AB25
Number of
stages
20
25
Number of
Haar-like
features used
1925
2913
Face
detected
Missed faces
(false rejection)
False detections
Detection rates
467
452
40
55
202
40
92.11%
89.15%
0.179
0.202
13
Face detected
Missed faces
(false rejection)
False detections
Detection rates
476
31
17
93.89%
71.56
AB-ANN
AB-ANN
AdaBoost structure
Number of
strong
classifiers
Number of
Haar-like
features
ANN structure
20
25
1925
2913
Rowleys model
Face detected
Missed faces
(false rejection)
False
detections
Detection
rate
466
450
41
57
13
3
91.91%
88.76%
Classical ASM
2 ms
MLP ASM
15 ms
GPCA
94.70
GICA
96.57
Average time
to process an
image
(second)
0.174
0.195
14
12
10
8
6
4
2
0
1
11
21
31
41
51
61
Index of feature point
71
81
MLP
Classical
7. Conclusions
This paper presented novel models for all steps of the
recognition of human faces in 2-dimentional digital images.
For face detection module, a model to combine three-layer
15
Initialization
Classical ASM
MLP ASM
100
98
96
References
94
92
1
Rank
GICA
GPCA
16
[11] S. Yan, M. Li, H. Zhang, and Q. Cheng, Ranking prior
likelihood distributions for Bayesian shape localization framework, in Proceedings of the 8th IEEE International Conference
on Computer Vision, pp. 5158, October 2003.
[12] J. Tu, Z. Zhang, Z. Zeng, and T. Huang, Face localization via
hierarchical CONDENSATION with Fisher Boosting feature
selection, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 04),
pp. I719I724, July 2004.
[13] S. Marcel, Artificial neural network for pattern recognition: application to face detection and recognition, 2004,
http://www.idiap.ch/marcel/.
[14] T. Kawaguchi, D. Hidaka, and M. Rizon, Detection of eyes
from human faces by Hough transform and separability
filter, in Proceedings of the International Conference on Image
Processing, vol. 1, no. 2000, pp. 4952, Vancouver, Canada,
2000.
[15] A. L. Yuille, D. S. Cohen, and P. W. Hallinan, Feature
extraction from faces using deformable template. Computer
vision and pattern recognition, in Proceedings of the IEEE
Computer Society Conference, pp. 104109, San Diego, CA ,
USA, June 1989.
[16] L. Zhang, Estimation of the mouth features using deformable
templates, in Proceedings of the International Conference on
Image Processing, vol. 3, pp. 328331, Santa Barbara, CA , USA,
October 1997.
[17] P. Kuo and J. Hannah, An improved eye feature extraction
algorithm based on deformable templates, in Proceedings of
the IEEE International Conference on Image Processing (ICIP
05), pp. 12061209, September 2005.
[18] S. L. Phung, A. Bouzerdoum, and D. Chai, Skin segmentation
using color and edge information. Signal processing and its
applications, in Proceedings of the 7th International Symposium, vol. 1, pp. 525528, July 2003.
[19] T. Sawangsri, V. Patanavijit, and S. Jitapunkul, Segmentation
using novel skin-color map and morphological technique, in
Proceedings of the World Academy of Science, Engineering and
Technology, vol. 2, January 2005.
[20] M. Turk and A. Pentland, Face recognition using eigenfaces,
in Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition, pp. 586591, 1991.
[21] B. A. Draper, K. Baek, M. S. Bartlett, and J. R. Beveridge,
Recognizing faces with PCA and ICA, Computer Vision and
Image Understanding, vol. 91, no. 1-2, pp. 115137, 2003.
[22] M. S. Bartlett, J. R. Movellan, and T. J. Sejnowski, Face recognition by independent component analysis, IEEE Transactions
on Neural Networks, vol. 13, no. 6, pp. 14501464, 2002.
[23] P. Comon, Independent component analysisa new concept? Signal Processing, vol. 36, no. 3, pp. 287314, 1994.
[24] T. T. Do and T. H. Le, Facial feature extraction using geometric feature and independent component analysis, in Proceedings of the Pacific Rim Knowedge Acquisition Workshop (PKAW
08), Hanoi, Vietnam, December 2008, (Revised Selected
Papers) in Knowledge Acquisition: Approaches, Algorithms
and Applications, Lecture Notes in Artificial Intelligence,
Springer, Berlin, Germany, pp.231241, 2009.
[25] A. Hyvarinen and E. Oja, Independent component analysis:
algorithms and applications, Neural Networks, vol. 13, no. 45, pp. 411430, 2000.
[26] O. A. Uwechue and A. S. Pandya, Human Face Recognition
using Third-Order Synthetic Neural Networks, The Springer
International Series in Engineering and Computer Science,
Springer, 1st edition, 1997.
Journal of
Advances in
Industrial Engineering
Multimedia
The Scientific
World Journal
Volume 2014
Volume 2014
Applied
Computational
Intelligence and Soft
Computing
International Journal of
Distributed
Sensor Networks
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Volume 2014
Volume 2014
Advances in
Fuzzy
Systems
Modelling &
Simulation
in Engineering
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Volume 2014
Journal of
Computer Networks
and Communications
Advancesin
Artificial
Intelligence
Hindawi Publishing Corporation
http://www.hindawi.com
HindawiPublishingCorporation
http://www.hindawi.com
Volume 2014
International Journal of
Biomedical Imaging
Volume2014
Advances in
Artificial
Neural Systems
International Journal of
Computer Engineering
Computer Games
Technology
Advances in
Volume 2014
Advances in
Software Engineering
Volume 2014
Volume 2014
Volume 2014
Volume 2014
International Journal of
Reconfigurable
Computing
Robotics
Hindawi Publishing Corporation
http://www.hindawi.com
Computational
Intelligence and
Neuroscience
Advances in
Human-Computer
Interaction
Journal of
Volume 2014
Volume 2014
Journal of
Volume 2014
Volume 2014