Professional Documents
Culture Documents
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
1 INTRODUCTION
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
Channel 1
Channel 2
Quantization
(Single channel)
Binary Pattern
Histogram
Concatenated
Binary Pattern
Histogram
(a)
Channel 1
Binary Pattern 1
Channel 2
Binary Pattern 2
(b)
Channel 1
Binary Pattern 1
Histogram 1
Channel 2
Binary Pattern 2
Histogram 2
Concatenated
Histogram
Channel
1
Binary
Pattern 1
Channel
2
Binary
Pattern 2
Transfo
rmation
Binary
Pattern 1
Histogram
1
Binary
Pattern t
Histogram
t
Concatenated
Histogram
(c)
(d)
Fig. 1. Illustration of four types of the multichannel feature extraction
technique using two input channels, (a) Each channel is quantized and merged
to form a single channel and then descriptor is computed over it, (b) Binary
patterns extracted over each channel are concatenated to form a single binary
pattern and then histogram is computed over it, obviously this mechanism
results the high dimensional feature vector, (c) Histograms of binary patterns
extracted over each channel are concatenated to from the final feature vector,
obviously the mutual information among each is not utilized, and (d) Binary
patterns extracted over each channel are converted into other binary patterns
using some processing and finally histogram of generated binary patterns are
concatenated to form the final feature vector (generalized versions are
proposed in this paper).
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
channel are just concatenated, there is no cross-channel cooccurrence information, whereas, if we want to preserve the
cross-channel co-occurrence information then the dimension
of the final descriptor will be too high. So, in order to capture
the cross-channel co-occurrence information to some extent,
we proposed the adder and decoder based method with lower
dimensions. Moreover, the joint information of each channel
is captured in each of the output channels of adder and
decoder before the computation of the histogram. We
validated the proposed approach against the image retrieval
experiments over ten benchmark databases including natural
scenes and color textures.
The rest of the paper is organized in following manner;
Section II introduces the multichannel decoded Local Binary
Patterns; Section III discusses the distance measures and
evaluation criteria. Image retrieval experiments using
proposed methods are performed in section IV with results
discussion; and finally section V concludes the paper.
2 MULTICHANNEL DECODED LOCAL BINARY PATTERNS
In this section, we proposed two multichannel decoded local
binary pattern approaches namely multichannel adder based
local binary pattern () and multichannel decoder based
local binary pattern () to utilize the local binary
pattern information of multiple channels in efficient manners.
Total + 1 and 2 number of output channels are generated
by using multichannel adder and decoder respectively from
number of input channels for 2.
Let is the channel of any image of size ,
where 1, and is the total number of channels. If the
neighbors equally-spaced at radius of any pixel (, ) for
1, and 1, are defined as (, ) also depicted
in Fig. 2, where 1, . Then, according to the definition
of the Local Binary Pattern (LBP) [6], a local binary pattern
(, ) for a particular pixel (, ) in channel is
generated by computing a binary value , given by
the following equation,
, =
=1
, ,
1,
(1)
where,
1, , ,
(2)
0,
is a weighting function defined by the following
, =
and
equation,
(1)
= (2)
, [1, ]
(3)
(, )
2 (, )
1 (, )
,
+1 (, )
(, )
1 (, )
Fig. 2. The
local
neighbors
(, )
of
a
center
pixel
, in channel in polar coordinate system for [1, ] and [1, ].
TABLE I
TRUTH TABLE OF ADDER AND DECODER MAP WITH 3 INPUT CHANNELS
, , , (, ) (, )
0
0
0
0
0
0
0
1
1
1
0
1
0
1
2
0
1
1
2
3
1
0
0
1
4
1
0
1
2
5
1
1
0
2
6
1
1
1
3
7
(, ) =
=1
()
=1 2
(4)
,
(5)
1, (, ) = (1 1)
0,
(6)
1, (, ) = (2 1)
0,
(7)
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
0
0
0
0
1
0
0
0
0
0
LBP 3
0
0
0
1
1
0
0
1
0
1
a
0
maLBPn3
0
0
0
1
0
0
1
0
mdLBPn2
mdLBPn3
0
0
0
1
1
0
0
0
72
161
20
maLBP 1
maLBP 2
maLBP 3
maLBP 4
72
128
32
mdLBP 1
mdLBP 2
mdLBP 3
mdLBP 4
0
0
0
0
0
0
16
mdLBPn5
mdLBPn6
mdLBPn7
mdLBPn8
Fig. 3.
f
(c) Weighting function
mdLBPn4
a
0
DM
2
4
mdLBPn1
AM
a
0
(e) Multichannel adder based local binary pattern for each output channels
0
128
16
maLBPn4
0
1
32
maLBPn2
maLBPn1
64
1
0
0
LBP 2
LBP 1
0
1
mdLBP 5
mdLBP 6
mdLBP 7
mdLBP 8
(g) Multichannel decoder based local binary pattern for each output channels
An illustration of the computation of the adder/decoder binary patterns, and adder/decoder decimal values for = 3 and = 8.
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
(j)
(k)
(l)
(m)
(n)
(o)
(p)
(q)
(r)
(s)
Fig. 4. (a) RGB image, (b) R channel, (c) G channel, (d) B channel, (e) LBP
map over R channel, (f) LBP map over G channel, (g) LBP map over B
channel, (h-k) 4 output channels of the adder, and (l-s) 8 output channels of
the decoder using 3 input LBP map of R, G and B channels.
=1 1
(8)
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
Color
Image
Red (R)
Channel
Green (G)
Channel
Blue (B)
Channel
LBP1
LBP2
LBP3
1 , 2 =
1,
0,
1 = 2
(11)
Adder
maLBP1
maLBP1
Histogram
Decoder
maLBP4
mdLBP1
maLBP4
Histogram
mdLBP1
Histogram
mdLBP8
mdLBP8
Histogram
2 , =
=1 2
(9)
1
2 2
1 (, ),
=+1 =+1
(10)
1
2 2
2 (, ),
(12)
=+1 =+1
(13)
(14)
=1
=1
& =
(15)
=
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
TABLE II
IMAGE DATABASES SUMMARY
Database Name
Image
Size
# Class
# Images in
Each Class
# Images
(Total)
Corel-1k [27]
MIT-VisTex [28]
STex-512S [29]
USPTex [30]
FTVL [31]
KTH-TIPS [32]
KTH-TIPS2a [32]
Corel-Tex [27]
ALOT [33]
ZuBuD [34]
Colored Brodatz [57]
ALOT-Complete [33]
384256
128128
128128
128128
154116
200200
200200
12080
192128
640480
4040
192128
10
40
26
191
15
10
11
6
250
1001
112
250
100
16
Vary
12
Vary
81
396 or 432
100
16
5
256
100
1000
640
7616
2292
2612
810
4608
600
4000
1005
28672
25000
(a)
(b)
Fig. 6.
Example images of the (a) Corel-1k [27], (2) FTVL database [30].
=1
& =
=1
1, (16)
=
& =
1,
(17)
2
67.08
69.60
71.73
70.96
73.43
74.93
Cosine
60.47
63.22
66.30
64.24
65.77
64.98
1
67.22
69.09
71.16
69.81
72.31
74.05
TABLE IV
ARP (%) USING DIFFERENT DISTANCE MEASURES ON MIT-VISTEX DATABASE
Category
2
69.36
71.94
73.52
72.64
74.86
80.08
Cosine
63.44
64.98
68.14
66.52
67.37
73.05
1
68.95
71.08
73.22
71.28
73.66
79.89
TABLE V
ARP (%) USING DIFFERENT DISTANCE MEASURES ON USPTEX DATABASE
Category
2
68.44
73.69
75.66
70.74
75.67
82.50
Cosine
63.98
70.29
73.48
64.25
66.51
74.24
1
66.98
72.37
74.69
68.97
72.20
80.56
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
81
74
R,G
R,B
G,B
R,G,B
80
73
72
71
76
maLBP
72
mdLBP
maLBP
(a)
Corel-Tex database
ALOT database
64
R,G
R,B
G,B
R,G,B
62
60
ARP (%)
ARP (%)
mdLBP
(b)
68
66
65
R,G
R,B
G,B
R,G,B
58
56
54
64
63
52
maLBP
50
mdLBP
maLBP
(c)
mdLBP
(d)
Fig. 7. ARP (%) for different combinations of channels of RGB color space
using maLBP and mdLBP descriptors over (a) Corel-1k, (b) MIT-VisTex, (c)
Corel-Tex, and (d) ALOT database when 10 similar images are retrieved.
MIT-VisTex database
75
75
72
72
69
69
ARP
ARP
Corel-1k database
66
mCENTRIST
maLBP
mdLBP
63
60
RG
RB
66
mCENTRIST
maLBP
mdLBP
63
60
GB
RG
Channels
(a)
(b)
55
75
52
ARP
72
69
mCENTRIST
maLBP
mdLBP
63
60
RG
RB
Channels
(c)
GB
ALOT database
78
66
RB
Channels
USPTex database
ARP
78
R,G
R,B
G,B
R,G,B
74
67
MIT-VisTex database
Corel-1k database
75
ARP (%)
ARP (%)
GB
49
46
mCENTRIST
maLBP
mdLBP
43
40
RG
RB
GB
Channels
(d)
Fig. 8. ARP (%) for different combinations of two channels of RGB color
space using mCENTRIST, maLBP and mdLBP descriptors over (a) Corel-1k,
(b) MIT-VisTex, (c) USPTex, and (d) ALOT database when 10 similar
images are retrieved.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
75
ARP (%)
73
71
69
81
80
67
65
74
71
68
Corel-1k
65
MIT-VisTex
Corel-1k
(a)
(b)
75
80
ARP (%)
69
66
RGB
HSV
L*a*b*
YCbCr
77
ARP (%)
RGB
HSV
L*a*b*
YCbCr
72
74
71
68
63
Corel-1k
65
MIT-VisTex
Corel-1k
MIT-VisTex
Database
Database
(c)
(d)
Using mdLBPriu2 descriptor
65
60
80
RGB
HSV
L*a*b*
YCbCr
75
ARP (%)
RGB
HSV
L*a*b*
YCbCr
70
ARP (%)
MIT-VisTex
Database
Database
60
77
ARP (%)
RGB
HSV
L*a*b*
YCbCr
70
65
55
60
50
Corel-1k
MIT-VisTex
Corel-1k
MIT-VisTex
Database
Database
(e)
(f)
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
Corel-1k database
75
95
95
95
90
90
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
85
80
75
70
65
1
100
70
1
10
FTVL database
75
5
10
75
5
70
1
10
90
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
96
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
94
92
10
10
Corel-Tex database
98
KTH-TIPS2a database
95
90
100
80
1
10
80
100
85
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
85
100
ARP (%)
ARP (%)
ARP (%)
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
88
KTH-TIPS database
96
90
90
98
92
80
100
94
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
85
70
1
ARP (%)
80
100
ARP (%)
85
USPTex database
100
ARP (%)
ARP (%)
90
ARP (%)
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
95
STex-512S database
MIT-VisTex database
100
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
80
70
60
1
10
10
Fig. 10. The performance comparison of proposed maLBP and mdLBP descriptor with existing approaches such as LBP, cLBP, mscLBP, and mCENTRIST
descriptors using ARP vs number of retrieved images over Corel-1k, MIT-VisTex, STex, USPTex, FTVL, KTH-TIPS, KTH-TIPS2a, and Corel-Tex databases.
Corel-1k database
MIT-VisTex database
80
75
95
95
95
90
90
85
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
80
75
70
65
1
100
70
2
65
1
10
97
95
85
1
ARP (%)
ARP (%)
100
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
88
2
75
70
5
65
1
10
10
65
1
10
10
10
Corel-Tex database
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
90
96
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
94
100
90
6
75
KTH-TIPS2a database
92
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
80
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
85
70
5
98
90
80
1
100
85
80
KTH-TIPS database
FTVL database
91
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
100
94
90
85
ARP (%)
85
USPTex database
100
ARP (%)
90
STex-512S database
100
ARP (%)
ARP (%)
95
ARP (%)
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
ARP (%)
100
80
70
60
1
10
10
Fig. 11. The performance comparison of proposed maLBP and mdLBP descriptor with existing approaches such as LBP, cLBP, mscLBP, and mCENTRIST
descriptors under uniform transformation (u2) using ARP vs number of retrieved images plot over Corel-1k, MIT-VisTex, STex, USPTex, FTVL, KTH-TIPS,
KTH-TIPS2a, and Corel-Tex databases.
MIT-VisTex database
60
1
10
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
FTVL database
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
70
65
1
ARP (%)
ARP (%)
90
75
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
70
60
50
1
10
10
80
60
5
50
1
10
95
90
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
90
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
85
80
10
10
Corel-Tex database
KTH-TIPS2a database
95
75
1
100
85
100
90
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
70
100
80
5
90
80
KTH-TIPS database
95
80
90
100
85
USPTex database
100
ARP (%)
70
STex-512S database
100
ARP (%)
80
100
95
90
85
80
75
70
65
60
1
ARP (%)
ARP (%)
90
ARP (%)
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
ARP (%)
Corel-1k database
100
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
80
70
60
10
50
1
10
Fig. 12. The performance comparison of proposed multichannel decoded local binary patterns with existing approaches under rotation invariant uniform
transformation (riu2) using ARP vs number of retrieved images curve over Corel-1k, MIT-VisTex, STex, USPTex, FTVL, KTH-TIPS, KTH-TIPS2a, and CorelTex databases.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
ALOT database
ZuBuD database
40
70
60
30
25
ARR (%)
ARR (%)
35
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
20
15
10
5
1
50
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
40
30
20
1
10
(a)
10
ZuBuD database
40
70
35
60
30
25
ARR (%)
ARR (%)
(b)
ALOT database
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
20
15
10
5
1
50
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
40
30
20
1
10
10
(c)
(d)
ALOT database
ZuBuD database
48
63
60
42
36
30
ARR (%)
ARR (%)
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
24
18
12
6
1
50
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
40
30
20
1
10
10
(e)
(f)
Corel-1k database
100
100
80
60
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
40
20
0
1
90
80
70
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
60
50
40
30
1
10
7 10 13 16 19 22 25 28 31 34 37 40
Image Categories
Image Categories
(a)
(b)
90
80
70
60
50
40
30
20
10
0
1
STex-512S database
40
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
FTVL database
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
35
30
25
20
15
10
5
9 10 11 12 13 14 15
Image Categories
(c)
10 12 14 16 18 20 22 24 26
Image Categories
(d)
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
Fig. 15. Top 10 retrieved images using each descriptor from Corel-1k database by considering the query image from (a) Building, (b) Bus, (c) Dinosaurs,
(d) Elephant, (e) Flower, (f) Food, (g) Horse, (h) Africans, and (i) Beaches categories. Note that 6 rows in each subfigure corresponds to the different
descriptor such as LBP (1st row), cLBP (2nd row), mscLBP (3rd row), mCENTRIST (4th row), maLBP (5th row) and mdLBP (6th row) and 10 columns in each
subfigure corresponds to the 10 retrieved images in decreasing similarity order; images in the 1st column are query images as well as the top most similar images.
TABLE VI
FEATURE DIMENSIONS VS TIME COMPLEXITY IN SECONDS
Method
Feature
Feature
Dimension Extraction
Time
(Corel-1k
database)
Total
Retrieval
Time
(Corel1k
database)
Feature
Extraction
Time
(MITVisTex
database)
Total
Retrieval
Time
(MITVisTex
database)
LBP
cLBP
mscLBP
mCENTRIST
maLBP
mdLBP
LBPu2
cLBPu2
mscLBPu2
mCENTRISTu2
maLBPu2
mdLBPu2
LBPriu2
cLBPriu2
mscLBPriu2
mCENTRISTriu2
maLBPriu2
mdLBPriu2
256
3256
9256
6256
4256
8256
59
359
959
659
459
859
10
310
910
610
410
810
2.73
11.75
56.40
33.31
16.49
47.93
0.74
1.85
5.50
3.72
2.41
4.71
0.28
0.46
1.02
0.74
0.55
0.91
4.67
6.44
13.03
9.72
9.24
12.48
5.11
7.16
19.50
10.46
9.41
13.63
5.27
6.95
19.13
10.08
9.32
13.23
1.19
3.38
10.07
6.7
4.39
7.8
0.41
0.88
2.43
1.64
1.11
2.03
0.23
0.31
0.53
0.42
0.35
0.48
31.00
89.52
310.95
128.01
102.45
228.97
32.00
88.78
316.34
132.20
105.49
233.93
31.73
88.64
316.11
130.94
105.09
233.10
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
100
30
LBP
cLBP
mscLBP
mCENTRIST
mSIFT
mGIST
CDH
maLBP
mdLBP
LBP
cLBP
mscLBP
mCENTRIST
mSIFT
mGIST
CDH
maLBP
mdLBP
60
40
20
0
10
20
30
40
50
60
70
80
90
ARR (%)
ARP (%)
80
20
10
0
10
100
20
30
40
50
60
70
80
(a)
(b)
30
80
60
50
40
30
ARR (%)
LBP
cLBP
mscLBP
mCENTRIST
mSIFT
mGIST
CDH
maLBP
mdLBP
70
10
10
LBP
cLBP
mscLBP
mCENTRIST
mSIFT
mGIST
CDH
maLBP
mdLBP
20
10
TABLE VII
THE ARP VALUES FOR NR=10 USING NON-LBP AND PROPOSED DESCRIPTORS
Descriptors
Database
Corel-1k
MIT-VisTex
STex-512S
USPTex
FTVL
KTH-TIPS
KTH-TIPS2a
Corel-Tex
ALOT
ZuBuD
20
30
40
50
60
70
80
90
100
0
10
mSIFT
OpponentSIFT
mGIST
CDH
maLBP
mdLBP
45.14
19.63
22.55
16.80
64.99
41.07
44.72
32.80
23.52
25.15
49.93
19.47
21.41
17.67
77.05
43.19
40.24
31.57
26.38
29.03
62.83
43.98
46.86
43.93
61.70
89.56
89.50
52.83
42.44
33.49
74.08
62.94
59.08
55.89
85.04
85.23
89.73
66.47
44.35
32.67
73.43
74.86
77.54
75.67
89.74
85.12
92.23
66.93
53.74
33.03
74.93
80.08
83.89
82.50
95.02
86.91
94.87
67.68
63.04
34.32
20
30
40
50
60
70
80
90
100
(c)
(d)
Fig. 16. Comparison of proposed descriptors maLBP and mdLBP with LBP,
cLBP, mscLBP, mCENTRIST, mSIFT, mGIST and CDH over large
databases such as (a-b) Colored Brodatz and (c-d) ALOT-Complete in terms
of the ARP and ARR.
TABLE VIII
PERFORMANCE ANALYSIS OF PROPOSED IDEA WITH CLBP IN TERMS OF THE
ARP WHEN THE NUMBER OF RETRIEVED IMAGES IS 10
Method
Database
Corel-1k KTH-TIPS ALOT ZuBuD Corel-Tex USPTex
maLBPu2
maCLBPu2
mdLBPu2
mdCLBPu2
71.73
73.11
73.60
75.29
83.21
87.00
85.84
88.69
52.37
59.21
62.69
68.73
32.40
33.96
33.88
35.15
65.02
65.23
66.45
67.75
73.34
76.16
81.59
81.89
70
60
100
ALOT_Complete database
20
90
ALOT_Complete database
ARP (%)
50
40
30
20
10
0
LBP
cLBP
mscLBP
mCENTRIST
mSIFT
mGIST
CDH
maLBP
mdLBP
Illumination
Rotation
Scale
Effect
Fig. 17. Comparison of proposed descriptors maLBP and mdLBP with LBP,
cLBP, mscLBP, mCENTRIST, mSIFT, mGIST and CDH over the databases
having uniform illumination, rotation and image scale changes.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
35. S.R. Dubey, S.K. Singh and R.K. Singh, Rotation and scale invariant
hybrid image descriptor and retrieval, Computers & Electrical
Engineering, vol. 46, pp. 288-302, 2015.
36. S.R. Dubey, S.K. Singh and R.K. Singh, Local Bit-plane Decoded
Pattern: A Novel Feature Descriptor for Biomedical Image
Retrieval, IEEE Journal of Biomedical and Health Informatics, 2015.
(In Press)
37. S.R. Dubey, S.K. Singh and R.K. Singh, Local Wavelet Pattern: A New
Feature Descriptor for Image Retrieval in Medical CT Databases, IEEE
Transactions on Image Processing, vol. 24, no. 12, pp. 5892-5903, 2015.
38. G.H. Liu and J.Y. Yang, Content-based image retrieval using color
difference histogram, Pattern Recognition, vol. 46, no. 1, pp. 188-198,
2013.
39. Koen E.A. Van De Sande, T. Gevers and Cees G.M. Snoek, Evaluating
color descriptors for object and scene recognition, IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 32, no. 9, pp. 15821596, 2010.
40. A. Oliva and A. Torralba, Modeling the shape of the scene: A holistic
representation of the spatial envelope, International Journal of
computer vision, vol. 42, no. 3, pp. 145-175, 2001.
41. M. Brown and S. Ssstrunk, Multi-spectral SIFT for scene category
recognition, In IEEE Conference on Computer Vision and Pattern
Recognition, pp. 177-184, 2011.
42. M. Douze, H. Jgou, H. Sandhawalia, L. Amsaleg and C. Schmid,
Evaluation of gist descriptors for web-scale image search, In ACM
International Conference on Image and Video Retrieval, pp. 19-27,
2009.
43. H. Jgou, M. Douze, C. Schmid and P. Prez, Aggregating local
descriptors into a compact image representation, In IEEE Conference
on Computer Vision and Pattern Recognition, pp. 3304-3311, 2010.
44. J. Yue-Hei Ng, F, Yang and L.S. Davis, Exploiting Local Features from
Deep Networks for Image Retrieval, In IEEE International Conference
on Computer Vision and Pattern Recognition, DeepVision Workshop
(CVPRW), pp. 53-61, 2015.
45. H. Jgou, M. Douze and C. Schmid, Improving bag-of-features for
large scale image search, International Journal of Computer Vision, vol.
87, no. 3, pp. 316-336, 2010.
46. V. Sydorov, M. Sakurada and C.H. Lampert, Deep Fisher Kernels--End
to End Learning of the Fisher Kernel GMM Parameters, In IEEE
Conference on Computer Vision and Pattern Recognition, pp. 14021409, 2014.
47. K. Simonyan, A. Vedaldi and A. Zisserman, Deep fisher networks for
large-scale image classification, In Advances in neural information
processing systems, pp. 163-171. 2013.
48. A. Krizhevsky, I. Sutskever and G.E. Hinton, Imagenet classification
with deep convolutional neural networks, In Advances in neural
information processing systems, pp. 1097-1105. 2012.
49. F. Perronnin and D. Larlus, Fisher Vectors Meet Neural Networks: A
Hybrid Classification Architecture, In IEEE Conference on Computer
Vision and Pattern Recognition, pp. 3743-3752. 2015.
50. X. Zhou, K. Yu, T. Zhang and T.S. Huang, Image classification using
super-vector coding of local image descriptors, In European
Conference of Computer Vision, pp. 141-154, 2010.
51. X. Bai, C. Yan, P. Ren, L. Bai and J. Zhou, Discriminative sparse
neighbor coding, Multimedia Tools and Applications, 2015. (In Press)
52. N. Inoue and K. Shinoda, Fast Coding of Feature Vectors using
Neighbor-To-Neighbor Search, IEEE Transactions on Pattern Analysis
and Machine Intelligence, 2015. (In Press)
53. X. Li, M. Fang and J.J. Zhang, Projected transfer sparse coding for
cross domain image representation, Journal of Visual Communication
and Image Representation, vol. 33, pp. 265272, 2015.
54. C. Zhang, J. Cheng, J. Liu, J. Pang, Q. Huang and Q. Tian, Beyond
explicit codebook generation: Visual Representation Using Implicitly
Transferred codebooks, IEEE Transactions on Image Processing, vol.
24, no. 12, pp. 5777 - 5788, 2015.
55. S. Murala and QM Jonathan Wu, Local ternary co-occurrence patterns:
a new feature descriptor for MRI and CT image retrieval,
Neurocomputing, vol. 119, pp. 399-412, 2013.
56. www.cs.columbia.edu/~mmerler/project/code/pdist2.m.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
57. Colored
Brodatz
Database,
http://multibandtexture.recherche.usherbrooke.ca/colored%20_brodatz.ht
ml.
58. http://www.ifp.illinois.edu/~jyang29/LLC.htm.
59. J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong, Locality
constrained linear coding for image classification, In IEEE Conference
on Computer Vision and Pattern Recognition, pp. 33603367, 2010.
60. http://people.csail.mit.edu/torralba/code/spatialenvelope/.
61. L. Liu, M. Yu, and L. Shao, Multiview alignment hashing for efficient
image search, IEEE Transactions on Image Processing, vol. 24, no. 3,
pp. 956-966, 2015.
62. J. Tang, Z. Li, M. Wang, and R. Zhao, Neighborhood discriminant
hashing for large-scale image retrieval, IEEE Transactions on Image
Processing, vol. 24, no. 9, pp. 2827-2840, 2015.
63. L. Liu and L. Shao, Sequential Compact Code Learning for
Unsupervised Image Hashing, IEEE Transactions on Neural Networks
and Learning Systems, 2015. (In Press)
64. L. Liu, M. Yu, and Ling Shao, Unsupervised local feature hashing for
image similarity search, IEEE Transactions on Cybernetics, 2015. (In
Press)
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.