ISSN 2278-6856
Abstract
The number of images available on the web is growing
explosively. A large share of them are images of celebrities,
in the form of posters, photographs and pictures taken at
various events. Celebrity-related queries rank high, and most
end users are interested in celebrity-related data and
images. To better serve this demand we have developed an
application that provides information about a celebrity when
an image is given as input. Face detection uses the HAAR
cascade algorithm, which lowers the false positive rate
compared to Canny edge detection and thus increases accuracy
and detection rate.
1. INTRODUCTION
General web pages do not always contain the name of the
celebrity shown in an image. Because web data is noisy, it
is difficult to identify the celebrity's name from the web
page text. There are two main challenges. First, the text
surrounding a web image lacks standard grammatical
structure, so it is difficult to apply natural language
processing techniques to extract celebrity names from it.
Second, the celebrity's face in an image may vary in pose,
makeup and expression, and may be occluded by sunglasses or
fancy hairstyles. It is therefore difficult to identify the
celebrity in an image with visual analysis and a normal face
database. To address this challenge, the CFW dataset can be
used, which contains millions of celebrity images in
different poses, makeup and expressions. Work so far has
been conducted on news images, where descriptive captions
are usually provided and the caption most often contains the
name of the celebrity in the image. In this project we
provide not only the name of the celebrity but also other
information related to the celebrity.
This paper presents related work on image annotation in
Section 2 and introduces the proposed system, which uses the
HAAR cascade classifier and the CFW dataset, in Section 3.
2. RELATED WORK
Name annotation systems have been developed for family
albums, news images and the ARISTA project. Family
photographs are normally indexed according to when, where,
who and what. The advanced digital camera provides date and
3. IMPLEMENTATION
The work proposed in this project extends the above systems
by introducing celebrities through additional information
such as DOB, designation, achievements and family details,
rather than name tagging alone. The input to the system is
an image of a celebrity. The output is the name assigned to
the celebrity in the image, displayed together with other
related information. Therefore, to better serve end-user
demand and foster multimedia research, we propose:
1. A scalable and accurate face annotation approach to
name celebrities in general web images, and
2. Methods to infer more properties of the celebrity from
the web, such as DOB, designation, family background and
achievements.
The system presented in this paper uses the HAAR cascade
algorithm for face detection, which minimizes the false
positive rate and increases the accuracy as well as the
speed of detection. Because the MSRA-CFW dataset is a rich
source of images of various celebrities, it helps to detect
faces in any pose, makeup and hairstyle. Alternatively, we
can build our own database of celebrity images downloaded
from Google Images. The working of the system and the
algorithms used in each phase are explained further in this
section.
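HAAR cascade detectors are generally built on rectangular
features evaluated over an integral image, which is what makes
them fast. The following is a minimal sketch of that idea in
plain Python, not the implementation used in this system. The
function names (integral_image, rect_sum, two_rect_feature)
and the toy 4x4 image are assumptions made for illustration.

```python
from typing import List

Image = List[List[int]]


def integral_image(img: Image) -> Image:
    """Return a summed-area table padded with one leading row and column of zeros."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            # Sum of everything above this row plus the running sum of this row.
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii: Image, x: int, y: int, w: int, h: int) -> int:
    """Sum of the pixels in the rectangle with top-left (x, y), using four lookups."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii: Image, x: int, y: int, w: int, h: int) -> int:
    """Horizontal two-rectangle HAAR-like feature: left half minus right half."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)


# Toy image (hypothetical data): bright left half, dark right half.
toy = [[9, 9, 1, 1] for _ in range(4)]
toy_ii = integral_image(toy)
edge_response = two_rect_feature(toy_ii, 0, 0, 4, 4)
```

Once the integral image is built, every rectangle sum costs
four lookups regardless of the rectangle size, so a large
number of features can be evaluated per window cheaply.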
The input to the system is an image of a celebrity. First
the face is detected in the image; the same image is then
submitted to tineye.com for a similar-image search. This
yields a list of names of celebrities who may resemble the
query image. From this list the best match is found by face
recognition. Finally, details are extracted from the
Wikipedia link and displayed as the end result.
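The flow described above can be sketched as a small pipeline.
This is only an outline under stated assumptions: the function
annotate_celebrity, the Annotation type and the four stage
callables are hypothetical names, and the stages are passed in
as parameters because the source does not specify how the
detector, the tineye.com query, the recognizer or the Wikipedia
extraction are implemented.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Sequence


@dataclass
class Annotation:
    name: str
    details: Dict[str, str]


def annotate_celebrity(
    image: bytes,
    detect_face: Callable[[bytes], Optional[bytes]],
    search_similar: Callable[[bytes], Sequence[str]],
    match_score: Callable[[bytes, str], float],
    fetch_details: Callable[[str], Dict[str, str]],
) -> Optional[Annotation]:
    """Run the four stages in order; return None if any stage finds nothing."""
    # Stage 1: face detection on the query image.
    face = detect_face(image)
    if face is None:
        return None
    # Stage 2: similar-image search on the same image gives candidate names.
    candidates = list(search_similar(image))
    if not candidates:
        return None
    # Stage 3: face recognition picks the best-scoring candidate.
    best = max(candidates, key=lambda name: match_score(face, name))
    # Stage 4: look up the details to display for the chosen name.
    return Annotation(best, fetch_details(best))


# Stand-in stages (hypothetical) so the sketch runs without external services.
demo_scores = {"Person A": 0.4, "Person B": 0.9}
demo = annotate_celebrity(
    image=b"query-image",
    detect_face=lambda img: b"face-crop",
    search_similar=lambda img: ["Person A", "Person B"],
    match_score=lambda face, name: demo_scores[name],
    fetch_details=lambda name: {"source": "Wikipedia", "name": name},
)
```

Passing the stages in as callables keeps the control flow
separate from any particular detector or web service, so each
stage can be replaced or tested on its own.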
3.1 Algorithms used for face detection
In an image annotation system we first need to detect the
face in an image. Face detection methods are divided into
four categories [5]. These categories may overlap, so an
algorithm can belong to two or more of them. The
classification is as follows:
Knowledge-based (or rule-based) methods: Algorithms that
encode our knowledge of human faces using different rules.
Feature-invariant methods: Algorithms that try to find
features of a face that are invariant to its angle or
position.
Template matching methods: Algorithms that compare input
images with stored patterns of faces or features.
Appearance-based methods: Template matching methods whose
pattern database is learnt from a set of training images.
The strengths and limitations of the above methods are
summarized in Table 1.
| Method | Strengths | Limitations |
|---|---|---|
| Knowledge-based methods | | Difficulty in building an appropriate set of rules. Unable to find many faces in a complex image. |
| Feature-invariant methods | Success rate of 94%. | If the face has sunglasses, skin color detects the face. |
| Template matching methods | Define a face as a function. Simple to implement. | Limited to faces that are frontal; cannot achieve good results with variations in pose, scale and shape. |
| Appearance-based methods | Use a wide variety of classification methods. Sometimes two or more classifiers are combined to achieve better results. | Rely on techniques from statistical analysis and machine learning to find relevant characteristics of face images. |
| Feature | Positive Hit Rate | Negative Hit Rate |
|---|---|---|
| Eyes | 93% | 23% |
| Nose | 100% | 29% |
| Mouth | 67% | 28% |
d) Regionalized detection
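A common reading of regionalized detection, and the one
assumed here, is that each facial-feature classifier is run
only on the part of the detected face where that feature is
expected, instead of on the whole image. The sketch below
illustrates that idea only. The function name feature_regions
and the region proportions (upper half for the eyes, a central
block for the nose, lower half for the mouth) are assumptions
and are not taken from this paper.

```python
from typing import Dict, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)


def feature_regions(face: Box) -> Dict[str, Box]:
    """Split a detected face box into search regions for each facial feature.

    The proportions are illustrative assumptions, not measured values.
    """
    x, y, w, h = face
    return {
        # Eyes are searched for in the upper half of the face box.
        "eyes": (x, y, w, h // 2),
        # Nose is searched for in a central block half the width and height.
        "nose": (x + w // 4, y + h // 4, w // 2, h // 2),
        # Mouth is searched for in the lower half of the face box.
        "mouth": (x, y + h // 2, w, h - h // 2),
    }


# Hypothetical face box returned by a face detector.
regions = feature_regions((10, 20, 100, 120))
```

Restricting each classifier to a smaller region reduces the
area it has to scan and removes places where a false match
could otherwise occur.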
4. RESULT
Figure 6 shows how the system works. An input image is
passed to face detection and to tineye.com. The resulting
list shows possible names from the database. After correct
face recognition, a new window shows current details about
the celebrity from Wikipedia.
5. CONCLUSION
This system was developed to identify celebrities in images
such as posters, photographs and web images where the name
is missing. This paper introduces different approaches to
face detection, which can be combined for better results.
HAAR is a feature-based method for face detection. HAAR
features, integral images and regionalized detection of
features improve face detection in terms of speed and
accuracy. The HAAR algorithm also gives a low false positive
rate. We have also discussed the CFW dataset, which contains
a large number of celebrity images. CFW is open to all for
research purposes and is downloadable [11]. It gives better
results for name annotation, as it contains
References
[1] Lei Zhang, Longbin Chen, Mingjing Li, H. Zhang,
"Bayesian Face Annotation in Family Albums," Microsoft
Research Asia.
[2] P. Tirilly, V. Claveau, and P. Gros, "News image
annotation on a large parallel text-image corpus,"
presented at the LREC, Malta, 2010, pp. 2564-2569.
[3] Xin-Jing Wang, Lei Zhang, Ming Liu, Yi Li, Wei-Ying Ma,
"ARISTA - Image Search to Annotation on Billions of Web
Photos," in Proc. CVPR, 2010, pp. 2987-2994.
[4] Xiao Zhang, Lei Zhang, Xin-Jing Wang, Heung-Yeung Shum,
"Finding Celebrities in Billions of Web Images," IEEE
Transactions on Multimedia, Vol. 14, No. 4, August 2012,
pp. 995-1007.
[5] Ion Marques, "Face Recognition Algorithms," June 16,
2010.
[6] Abdallah S. Abdallah, "Investigation of New Techniques
for Face Detection," May 9, 2007, Blacksburg, Virginia.
[7] Phillip Ian Wilson, Dr. John Fernandez, "Facial Feature
Detection Using HAAR Classifiers," JCSC 21, 4 (April 2006),
pp. 127-133.
[8] TinEye.com
[9] A tutorial, "Face Recognition using Principal Component
Analysis."
[10] R. Padilla, C. F. F. Costa Filho and M. G. F. Costa,
"Evaluation of HAAR Cascade Classifiers Designed for Face
Detection," World Academy of Science, Engineering and
Technology, Vol: 6 2012-04-22, pp. 323-326.
[11] http://research.microsoft.com/en-us/projects/msracfw/default.aspx