You are on page 1of 5

2014 IIAI 3rd International Conference on Advanced Applied Informatics

Batik Motif Classification using Color-Texture-Based


Feature Extraction and Backpropagation Neural
Network

Nanik Suciati, Winny Adlina Pratomo, Diana Purwitasari


Depart ment of Informatics
Institut Teknologi Sepuluh Nopember
Surabaya, Indonesia
nanik@if.its.ac.id

Abstract—Batik is an Indonesian’s traditional cloth which has lungan, megamendung, and buketan. Beside the two streams,
been recognized as one of the world cultural heritage. Currently, there is also a group of batik motif wh ich can not be identified
there are hundreds of different batik motif which can be as geometry or non-geometry motifs, namely special mot if or
classified into 7 groups, i.e. Parang, Ceplok, Lereng, combination motif. The special motif usually combines two
Megamendung, Semen, Lunglungan, and Buketan. This research motifs or more. Fig. 1 shows some examples of batik.
develops a software to automatically identify motifs of batik
image using color-texture-based feature extraction and Parang Batik has basic patterns with rhombus shape which
backpropagation neural network. Color and texture features of are composed into slant lines. The basic pattern in Parang Batik
batik image is extracted using combination of Color Co- is usually named as mlinjon. Ceplo k Batik has basic patterns
occurence Matrix, Different Between Pixels of S can Pattern, and with comb ination of quadrangle and circle wh ich are usually
Color Histogram for K-Means methods. The extracted features composed into regular rows and columns. Basically, Lereng
vectors are furthermore classified into motifs using Batik has similar reguler slant lines shape like the Parang
Backpropagation Neural Network. The experiment shows that Batik, but the basic pattern of Lereng Batik does not have
the software can recognize batik motifs quite well, with rate of mlinjon ornament. The characteristics of basic pattern in
Tanimoto Distance 0,37. Semen Bat ik are shapes like a mountain or a place to grow
plants. Generally, Lung-lungan Batik is similar to Semen
Keywords—batik motif; color co-ocurence matrix;
Batik, except that ornament of Lung-lungan Batik does not
backpropagation neural network
contain mountain shape. The basic patterns in Buketan Batik
consist of flower arrangements, petals with butterfly, bird, or a
variety of small animals.
I. INT RODUCTION Several efforts for collecting and documenting batik data
Batik is a traditional cloth made by painting motifs with which spread in all regions in Indonesia, have been performed
Indonesian’s culture nuance on a plain cloth. The long history in order to conserve the batiks as one of Indonesian’s cultural
of batik is started since before the era of Prince Wijaya (1294 – heritages [3, 4, 5]. The batik data are usually organized based
1309), the first king of Majapahit Kingdom. In its on the place of origin and the motif name wh ich are manually
development, the complexion, style, and color of batik motif identified by an expert. In order to ease the indexing process of
are affected by the cultures of Hindu, Islam, Netherland, China, batik data based on the motif name, it is required to have a tool
and Japan. The book “Batik Spirits of Indonesia” [1] which can identify the name of motif group from the batik
mentioned that there are more than 181 batik mot ifs. Motifs in image automatically. The objective of this research is to
some regions in Indonesia, wh ich are continue increasing, develop a software that can be used to recognize motif group of
haven’t been counted in that number. batik image by inputting a batik image.
Batik motifs are categorized into two streams, geometry The using of color, texture, and shape combination features
and non-geometry motifs [2]. In geometry motif, a bas ic on developing a smart content-based image retrieval have been
pattern appears on cloth regulary. Based on the type of basic performed [6]. Moreover, the good results in developing image
pattern and the type of regular appeareance of basic pattern, retrieval system using some classification methods have also
geometry motifs are clustered into three motif groups, i.e., been reported. Huang et. al [7] developed a kansei-based image
parang, ceplok, and lereng. Non-geometry motifs are clustered retrieval for textile images by using Back Propagation Neural
only based on the appeareance of specific patterns on cloth. Network. Nilogiri et. al [8] co mpared the performance of Mult i
There are four motif groups of non-geometry, i.e., semen, lung- Label Probabilistic Neural Network (M LPNN) and

978-1-4799-4173-5/14 $31.00 © 2014 IEEE 517


DOI 10.1109/IIAI-AAI.2014.108
Backpropagation Neural Network (BPNN) in a kansei Motif name Batik Image
classification system for batik image. As they reported, the
performance of BPNN is better than MLPNN. Based on some
previous results, this research develop a software that can be Feature Extraction
used to recognize motif group of batik image using color- Process
texture-based feature extraction method and backpropagation
neural network classification method. Features Vector

Training Testing
Backpropagation
Neural network
Model

Classification Process

Parang Ceplok Lereng


(a) Motif name

Fig. 2. Structure of batik classification system.

Semen Lung-lungan Buketan


(b) III. COLOR-TEXT URE-BASED FEATURE EXT RACTION
The images of Batik Motif with their pattern complexity
will need more feature variations compare to usual images. In
this proposed method of classifying images of Batik with motif
names as the class labels, the features are based on color and
Ceplok+semen Parang+semen texture. Color Co-occurrence Matrix (CCM) and Difference
Parang+M ega
M endung Between Pixels of Scan Pattern (DBSP) are used to extract the
texture features while Color Histogram for K-Means (CHKM)
(c)
is used to extract the color features [6]. After the ext raction
Fig. 1. (a) Example of batiks with geometry motifs: parang, ceplok, lereng. phase, there will be 71 features for representing an image of
(b) Example of batiks with non-geometry motifs: semen, lung-lungan, Batik motif which are 49 values from CCM, 6 values from
buketan. (c) Example of batiks with special motifs: parang + mega mendung, DBSP and 16 values fro m CHKM.
ceplok+ semen, parang + semen.
CCM is a matrix that represents the pattern of color
differences between adjacent pixels in an RGB co lor image.
The color differences follow seven scan patterns thereby the
final result is a co-occurrence matrix with a dimension size of 7
II. OVERVIEW OF THE SYST EM
x 7. The steps of texture feature extractions are as follows [6]:
In this research, a batik image is classified into seven motif
groups, i.e. parang, ceplok, lereng, semen, lung-lungan, mega 1. Starting fro m the upper left until the lower right in a color
mendung and buketan. A batik image can have one or more image of one Batif motif parses a window of 2 x 2 pixels,
motif. The examp les of batik images and their motif name are creates block matrices with each cell has sub-matrix of 2 x
shown in Fig. 1. 2 pixels based on four kinds of convolution mask. It means
each [i,j] pixel of the original image will correspond to four
Structure of the batik classification system is shown in Fig. kinds of convolution mask.
2. There are two main processes, i.e. feature extraction and
classification. Color and texture features of batik image are Input: an image matrix with a dimension size of n x m and
extracted using combination of three methods, i.e. Co lor Co- its cell contains value of pixel color. Output: four block
occurence Matrix (CCM), Different Between Pixels of Scan matrices for each of four convolution masks. Each b lock
Pattern (DBPSP), and Co lor Histogram for K-Means (CHKM). matrix has dimension size of n x m and each cell contains a
Features vectors generated fro m the features extraction process sub-matrix of 2 x 2 pixels.
and motif names are inputted to training process. Output of the 2. Identifies scan pattern of each cell in the four b lock
training process is a backprogration neural network model that matrices
is furthermore used in testing process to recognize motifs of
batik features vectors.

518
Input: four block matrices with a dimension size of n x m Input: six summarized DBSP values. Output: six DBSP
and its cell contains sub-matrix of 2 x 2 pixels. Output: four features or a feature vector that represents texture-based
matrices with each dimension size of n x m and the cell feature with a dimension size of 6 x 1.
contains value of scan pattern identifier. Those resulted four
matrices are called as scanned matrices. The combination of CCM feature vector and DBSP feature
vector is a feature vector with dimension size of 55 x 1
3. Create a co-occurrence matrix (CCM) with a dimension representing texture-based features of an image of Batik motif.
size of 7 x 7 based on seven scan patterns.
CHKM is used to extract color-based features of an image
Input: -- (none). Output: an empty matrix with a dimension of Batik motif. Color-based features show the color distribution
size of 7 x 7. of an image. Basically CHKM is a clustering approach with K -
Means algorithm. The steps of texture feature extractions are as
4. Fills in the [i, j] cell of CCM with the co-occurrence
follows [6]:
number of scan pattern-ith next to scan pattern-jth from four
scanned matrices in Step-2. 1. Clusters the pixels fro m a RGB image of Batik motif with
the K value is set to 16.
Input: four scanned matrices in Step-2 and CCM in Step-2.
Output: a filled CCM. Input: an image matrix with a dimension size of n x m and
its cell contains value of pixel color. Output: 16 color
5. Normalizes CCM such that the cell value ranges from 0..1. clusters from n x m data items.
Input: CCM in Step-4. Output: a normalized CCM.
2. Calculates the 16 values of CHKM features with each value
6. Linearizes a normalized CCM such that all values from the representing a color cluster. Each value of CHKM features
49 cells become texture features of an image. is obtained from dividing the number of members in each
cluster with the total number of pixels (n x m) in the
Input: a normalized CCM with a dimension size of 7 x 7. original image.
Output: a CCM feature vector that represents texture-based
feature with a dimension size of 49 x 1.
DBPSP feature represents the complexity of the texture in
the image. Two windows of 2 x 2 p ixels with any of the four IV. BACKPROPAGATION NEURAL NETWORK
convolution masks can result the same values of scan pattern CLASSIFICAT ION
identifier. However there could be value difference in the pixel The used classification method in this paper is a state of the
content of those two windows. The higher the difference in art of Back-propagation Neural Network (BPNN). For training
value means the pattern in a certain window area has more phase there are some classes of Batik motifs with each class
complex texture. The DBSP features are extracted fro m CCM label is the motif name. It is mentioned in the introduction that
in the previous steps. There are seven scan patterns with there will be seven motif classes of (i) Parang, (ii) Ceplok, (iii)
identifier 0..6 but DBSP extraction only considers pattern 1..6 Lereng, (iv) Semen, (v) Lung-lungan, (vi) Mega-Mendung, and
and as a result there are six kinds of DBSP values. In order to (vii) Buketan. The real problem lies on classifying Batik motifs
extract DBSP features, the steps are as follows [6]: which has some co mbinations like Parang + Mega-Mendung or
1. Fro m the four block matrices in the steps of CCM feature Ceplok + Semen or Parang + Semen. Hypothetically the
selected class for an image of a co mbination of Batik Motifs is
extraction, calculates the appropriate DBSP value of pixel
contents according to their values of scan pattern identifier. based on its stand-out motif fro m seven classes. Here it is
difficult to state a rigid classifier in order to recognize the stand
Input: (1) four block matrices with each dimension size of n out motif, therefore the classifier determination follows a black
x m and its cell contains sub-matrix of 2 x 2 pixels; (2) four box process of neural network. In addition to that, the used
scanned matrices with each dimension size of n x m and its BPNN classification could generate mult i classes especially for
cell contains value of scan pattern identifier as references of the combination motifs.
the appropriate DBSP values. Output: four matrices with
The used BPNN has four layers which are one input layer,
each dimension size of n x m and its cell contains the
appropriate DBSP value. Those resulted four matrices are two hidden layers and one output layer. The input layer
contains 72 nodes fro m the number of features plus one bias.
called as DBSP mat rices.
The number of nodes in the hidden layers are varied and being
2. Fro m the four DBSP matrices, summarizes the six kinds of examined in the experiments. The output layer contains seven
DBSP values. nodes representing seven classes of Batik motif. Bipolar
Sig moid activation function is used in the hidden layers while
Input: four DBSP matrices. Output: six summarized DBSP
Binary Sig moid activation function is used in the output layer.
values.
The experiments are about analyzing the variation values of
3. Calculates the six DBSP features fro m the six summarized mo mentum, learning rate, and epoch in BPNN in order to
DBSP values by dividing them with a constant value of 256 know the reco mmended structure for classifying Batik Motifs
since it is the maximu m value of d ifferences in the pixel with color-texture-based features.
contents of RGB color. Then average the six values with
the occurrence numbers of each scan pattern.

519
It is reco mmended that the number of hidden nodes follows consisted of 49 CCM features, 6 DBPSP features, and 16
a certain rule [9, 10]. With m is the number of output nodes CHKM features. The total 70 features vectors are generated
and n is the number of input nodes, fro m 70 batik images. Among the total, 54 features vetors are
used as input in training process to develop a Backpropagation
n Neural Network (BPNN) model. The rest 16 features vectors
# nd .1st.hidden  m  2n  2 (1)
are used for testing. The testing is performed to evaluate
m2
performance of BPNN model in recognizing batik motif. The
n performance of batik motif recognition is measured using
# nd .2nd .hidden  m (2)
m2 Tanimoto Distance, which is defined by the following Equation
3:
Therefore in the experiments, with m=7 and n=71 the
number of nodes in the first hidden layer ranges from 25..71 Pj  C j  2 Pj  C j
while the number of nodes in the second hidden layer ranges Tanimoto j 
Pj  C j  Pj  C j
fro m 0..30. Zero node means that the used NN has only one (3)
hidden layer.

V. EXPERIMENT
The number of batik data used in experiment is 70, which is
consisted of 25 geometry motif, 37 non-geometry motif, and 8
special (co mbination) motif, as shown in detail in Table I. In
addition to the seven motifs, i.e. Buketan, Ceplok, Lunglungan,
Megamendung, Parang, Semen, and Lereng, the seven others
combination motifs are also used in the experiment, i.e. Ceplo k
+ Semen, Cep lok + Lunglungan, Ceplok + Parang,
Megamendung + Parang, Megamendung + Lunglungan,
Semen + Parang, and Lereng + Semen. Examp le of some batik
images with Megamendung motif wh ich are used in
experiment are shown in Fig. 3. The size of all batik image is
190 x 190 pixels.

TABLE I. BATIK DATA SP ECIFICATION USED FOR EXPERIMENT


Number of
Motif Fig. 3. Batik with Megamendung motif
Batik Data
Buketan 10
Ceplok 6
Lunglungan 10 Pj is motif estimated by the system, Cj is the actual motif.
Mega Mendung 8 If Pj is equal to Cj (the motif is recognized correctly), then the
Parang 10 result of Tanimoto Distance is 0. If Pj is not equal to Cj, the
Semen 9 value of Tanimoto Distance is 0. If the value of Tan imoto
Distance is getting closer to 0, then performance of the
Lereng 9
recognition system is better.
Ceplok + Semen 1
Ceplok + Lunglungan 1 The training process is performed 30 times, with one
Ceplok + Parang 2 generated BPNN model in each training. All BPNN model is
Mega Mendung + Parang 1
developed using 2 hidden layers (31 nodes in the first hidden
layer and 20 nodes in the second hidden layer), learning rate
Mega Mendung + Lunglungan 1
0,5, mo mentum 0,1, and epoch 10000. Performance of each
Semen + Parang 1 BPNN model to recognize motif of batik image is tested using
Lereng + Semen 1 two image groups as input, i.e. train ing-testing data (54 training
Total 70 images are used as input in testing process) and testing data
(only 16 testing images are used as input in testing process).
Table II shows time for training, recognition rate for training -
testing data (TT-Data) and testing-only data (TO-Data). The
Color and texture features of each batik image is extracted recognition rate is measured using Tanimoto Distance.
using CCM, DBPSP, and CHKM method. The size of
extracted features vector for each image is 71, which is

520
TABLE II. TRAINING T IME AND RECOGNITION RATE IN 30 VI. CONCLUSION
EXPERIMENTS
A software that can be used to recognize batik motif
Re cognition Recognition automatically has been developed in this research. The
Train Train
Exp. Rate Exp. Rate experiment shows that the software can recognize batik motifs
Time Time
No. TT- TO - No. TT- TO -
(s)
Data Data
(s)
Data Data
quite well with performance rate 0,37. The performance rate
1 6,74 0,00 0,26 16 5,76 0,00 0,42 which is measured using Tanimoto Distance is obtained using
BPNN model with 2 h idden layers (31 nodes in the first hidden
2 6,77 0,00 0,32 17 6,56 0,00 0,27
layer and 20 nodes in the second hidden layer), learning rate
3 6,58 0,00 0,39 18 5,68 0,00 0,35
0,5, mo mentum 0,1, and epoch 10000. In order to improve the
4 6,35 0,00 0,38 19 6,06 0,00 0,42 performance of batik motif recognition, further exploration to
5 6,77 0,00 0,32 20 5,81 0,00 0,33 some other methods in features extraction and classification
6 6,45 0,00 0,34 21 6,17 0,00 0,40 can be carried out.
7 6,52 0,00 0,33 22 5,99 0,00 0,30
8 6,33 0,00 0,41 23 6,51 0,00 0,33 A CKNOWLEDGMENT
9 6,25 0,00 0,36 24 6,56 0,00 0,32
This research is a part of Research on Science, Knowledge,
10 6,78 0,00 0,36 25 6,20 0,00 0,38 Technology and Art supported by Directorat General of Higher
11 6,29 0,00 0,42 26 6,14 0,00 0,36 Education, Ministry of National Education, Indonesia and
12 6,89 0,00 0,50 27 6,10 0,00 0,46 JICA Technical Cooperation Project for Research and
13 6,45 0,00 0,39 28 6,28 0,00 0,51 Education Development on ICT in ITS (JICA PREDICT ITS).
14 5,79 0,00 0,41 29 6,38 0,00 0,29
15 6,42 0,00 0,44 30 6,74 0,00 0,38
Rate for 30 experiments 6,34 0,00 0,37 REFERENCES

As shown in Table II, the time rate required for 30 training [1] Achjadi, J., 1999. Batik Spirits of Indonesia. Yayasan Batik Indonesia,
PT. Buku Antar Bangsa.
processes is 6,34 seconds. The recognition rate for TT-Data is
[2] H. Santosa Doellah. 2002. Batik : Pengaruh Zaman dan Lingkungan.
0,00, meaning that all 54 images which are previously used as Batik Danar Hadi Solo.
training data, can be correctly recognized during testing. And [3] Nanik Suciati, Chastine Fatichah, Anny Yuniarti, Rizky Januar Akbar,
the recognition rate for TO-Data is 0,37, meaning that the “Spline and Color Representation for Batik Design Modification”,
BPNN model can recognize motif of the 16 input images quite International Conference on Computer, Control, Informatics, and It’s
well. The recognition rate is measured using Tanimoto Application (IC3INA) 2013, 19-20 Nopember 2013, LIPI, Jakarta.
Distance, because a batik image can have one or more motif . [4] Aris Fanani, Anny Yuniarti, Nanik Suciati, “Geometric Features
Table III shows examples of Tanimoto Distance measurement Extraction of Batik Image using Cardinal Spline Curve Representation”,
Telkomnika Journal, Vol. 12, No. 2, 2014.
for some cases in motif recognition. Range of Tanimoto
[5] Agus Eko Minarno, Nanik Suciati, “Batik Image Retrieval Based on
Distance is from 0 (the motifs recognized by the system is Color Difference Histogram and Gray Level Co-Occurence
same to the actual motifs) to 1 (the motifs recognized by the Matrix”, Telkomnika Journal, Vol. 12, No. 3, 2014.
system is completely different to the actual motifs). [6] C. H. Lin, R. T. Chen dan Y. K. Chan, “A Smart Content-based Image
Retrieval System Based on Color and T exture Feature”, Journal Science
Direct, pp. 658-665, 2009.
TABLE III. EXAMPLES OF TANIMOTO DISTANCE MEASUREMENTS
[7] X. Huang, S. Sobue dan Y.-W. Chen, “Retrieval of Clothing Fabric
Motifs Recognized The Actual Motifs Tanimoto Images Based on KANSEI Words,” Information, vol. 6, pp. 215-230,
Case 2003.
by the System (P) (C) Distance
1 Parang Parang 0 [8] A. Nilogiri, N. Suciati dan D. Purwitasari, “Klasifikasi Kansei Multi
2 Ceplok Megamendung 1 Label Dengan Probabilistic Neural Network pada Citra batik
menggunakan Kombinasi Fitur Warna, Tekstur dan Bentuk”, Master
Parang + Parang +
3 0 Tesis, Teknik Informatika, Institut T eknologi Sepuluh Nopember, 2012.
Megamendung Megamendung
Parang + [9] D. Stathakis, “ How many hidden layers and nodes?”, International
4 Parang + Ceplok 0,67 Journal of Remote Sensing, vol. 30, p. 2133–2147, 2009.
Megamendung
Parang + [10] E. Oliveira, P.M. Ciarelli, C.Badue,A.F. De Souza, “A Comparison
Parang + between a KNN Based Approach and a PNN Algorithm for a Multi-label
5 Megamendung + 0,33
Megamendung Classification Problem”,The Eight International ConferenceIntelligent
Ceplok
Systems Design and Applications (ISDA '08), vol.2, pp.628-633, 26-28
Nov. 2008.

521

You might also like