LBP Image Processing
LBP Image Processing
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4018 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
Abstract— Local binary pattern (LBP) is widely adopted for effectiveness in several applications [7]–[10]. Inspired from the
efficient image feature description and simplicity. To describe recognition of LBP, several other LBP variants are proposed
the color images, it is required to combine the LBPs from each in the literature [11]–[17], [36], [37]. These approaches are
channel of the image. The traditional way of binary combination
is to simply concatenate the LBPs from each channel, but it introduced basically for gray images, in other words only for
increases the dimensionality of the pattern. In order to cope one channel and performed well but most of the times in real
with this problem, this paper proposes a novel method for image cases the natural color images are required to be characterize
description with multichannel decoded LBPs. We introduce which are having multiple channel.
adder- and decoder-based two schemas for the combination of the A performance evaluating of color descriptors such as
LBPs from more than one channel. Image retrieval experiments
are performed to observe the effectiveness of the proposed color SIFT (we have termed mSIFT for color SIFT in
approaches and compared with the existing ways of multichannel this paper), Opponent SIFT, etc. are made for object and
techniques. The experiments are performed over 12 bench- scene Recognition in [39]. These descriptors first find the
mark natural scene and color texture image databases, such regions in the image using region detectors, then compute
as Corel-1k, MIT-VisTex, USPTex, Colored Brodatz, and so on. the descriptor over each region and finally the descriptor is
It is observed that the introduced multichannel adder- and
decoder-based LBPs significantly improve the retrieval per- formed by using bag-of-words (BoW) model. Researchers
formance over each database and outperform the other are also working to upgrade the BoW model [45]. Another
multichannel-based approaches in terms of the average retrieval interesting descriptor is GIST which is basically a holistic
precision and average retrieval rate. representation of features and has gained wider publicity due
Index Terms— Image retrieval, local patterns, multichannel, its high discriminative ability [40]–[42]. In order to encode
LBP, color, texture. the region based descriptors into a single descriptor, a vector
locally aggregated descriptors (VLAD) has been proposed in
I. I NTRODUCTION
the literature [43]. Recently, it is used with deep networks
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
DUBEY et al.: MULTICHANNEL DECODED LBPs FOR CONTENT-BASED IMAGE RETRIEVAL 4019
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4020 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
TABLE I
T RUTH TABLE OF A DDER AND D ECODER M AP W ITH 3 I NPUT C HANNELS
Fig. 2. The local neighbors Itn (x, y) of a center pixel It (x, y) in t th channel
in polar coordinate system for n ∈ [1, N ] and t ∈ [1, c]. and f n is a weighting function defined by the following
equation,
co-occurrence information, whereas, if we want to preserve the
f n = (2)(n−1) , ∀n ∈ [1, N] (3)
cross-channel co-occurrence information then the dimension
of the final descriptor will be too high. So, in order to capture We have set of N binary values LBPnt (x, y) for a partic-
the cross-channel co-occurrence information to some extent, ular pixel (x, y) corresponding to each neighbor Itn (x, y) of
we proposed the adder and decoder based method with lower t t h channel. Now we apply the proposed concept of multichan-
dimensions. Moreover, the joint information of each channel is nel LBP adder and multichannel LBP decoder by considering
captured in each of the output channels of adder and decoder LBPnt (x, y) |∀t ∈ [1, c] as the c input channels.
before the computation of the histogram. We validated the Let, the multichannel adder based local binary patterns
proposed approach against the image retrieval experiments maLBPnt1 (x, y) and multichannel decoder based local binary
over twelve benchmark databases including natural scenes and patterns mdLBPnt2 (x, y) are the outputs of the multichannel
color textures. LBP adder and multichannel LBP decoder respectively, where
The rest of the paper is organized in following manner; t1 ∈ [1, c + 1] and t2 ∈ [1, 2c ]. Note that the values
Section II introduces the multichannel decoded Local Binary of LBPnt (x, y) are in the binary form (i.e. either 0 or 1).
Patterns; Section III discusses the distance measures and Thus, the values of maLBPnt1 (x, y) and mdLBPnt2 (x, y) are
evaluation criteria. Image retrieval experiments using proposed also in the binary form generated from the multichan-
methods are performed in section IV with results discussion; nel adder map maM n (x, y) and multichannel decoder map
and finally section V concludes the paper. mdM n (x, y) respectively corresponding to the each neighbor
n of pixel (x, y).
II. M ULTICHANNEL D ECODED L OCAL B INARY PATTERNS The truth map of maM n (x, y) and mdM n (x, y) for c = 3
are shown in Table 1 are having 4 and 8 distinct values
In this section, we proposed two multichannel decoded local respectively. Mathematically, the maM n (x, y) and mdM n (x, y)
binary pattern approaches namely multichannel adder based are defined as,
local binary pattern (maLBP) and multichannel decoder based c
local binary pattern (mdLBP) to utilize the local binary pattern maM n (x, y) = LBPnt (x, y) (4)
information of multiple channels in efficient manners. Total tc=1
mdM n (x, y) = 2(c−t ) ×LBPnt (x, y) (5)
c+1 and 2c number of output channels are generated by using t =1
multichannel adder and decoder respectively from c number We denote (x, y) for ∀n ∈ [1, N] and ∀t ∈ [1, c]
LBPnt
of input channels for c ≥ 2. by input patterns, maLBPnt1 (x, y) for ∀n ∈ [1, N] and ∀t1 ∈
Let It is the t t h channel of any image I of size u × v × c, [1, c+1] by adder patterns and mdLBPnt2 (x, y) for ∀n ∈ [1, N]
where t ∈ [1, c] and c is the total number of channels. If the and ∀t2 ∈ [1, 2c ] by decoder patterns respectively.
N neighbors equally-spaced at radius R of any pixel It (x, y) The multichannel adder based local binary pattern
for x ∈ [1, u] and y ∈ [1, v] are defined as Itn (x, y) also maLBPnt1 (x, y) for pixel (x, y) from multichannel adder map
depicted in Fig. 2, where n ∈ [1, N]. Then, according to the maM n (x, y) and t1 is defined as,
definition of the Local Binary Pattern (LBP) [6], a local binary
pattern LBPt (x, y) for a particular pixel (x, y) in t t h channel 1, if maM n (x, y) = (t 1 − 1)
maLBPt1 (x, y) =
n
(6)
is generated by computing a binary value LBPnt (x, y) given 0, otherwise
by the following equation,
for ∀t1 ∈ [1, c + 1] and ∀n ∈ [1,N].
N Similarly, the multichannel decoder based local binary pat-
LBPt (x, y) = LBPnt (x, y) × f n , ∀t ∈ [1, c] (1) tern mdLBPnt2 (x, y) for pixel (x, y) from multichannel decoder
n=1 map mdM n (x, y) and t2 can be computed as,
where, 1, if mdM n (x, y) = (t 2 − 1)
mdLBPnt2 (x, y) = (7)
1, Itn (x, y) ≥ It (x, y) 0, otherwise
LBPnt (x, y) = (2)
0, otherwise for ∀t2 ∈ [1, 2c ] and ∀n ∈ [1,N].
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
DUBEY et al.: MULTICHANNEL DECODED LBPs FOR CONTENT-BASED IMAGE RETRIEVAL 4021
Fig. 3. An illustration of the computation of the adder/decoder binary patterns, and adder/decoder decimal values for c = 3 and N = 8.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
DUBEY et al.: MULTICHANNEL DECODED LBPs FOR CONTENT-BASED IMAGE RETRIEVAL 4023
TABLE II
I MAGE D ATABASES S UMMARY
Fig. 6. Example images of the (a) Corel-1k [27], (2) FTVL database [31].
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4024 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
TABLE III
ARP (%) U SING D IFFERENT D ISTANCE M EASURES
ON C OREL -1k D ATABASE
TABLE IV
ARP (%) U SING D IFFERENT D ISTANCE M EASURES
ON MIT-VisTex D ATABASE
Fig. 7. ARP (%) for different combinations of channels of RGB color space
using maLBP and mdLBP descriptors over (a) Corel-1k, (b) MIT-VisTex,
(c) Corel-Tex, and (d) ALOT database when 10 similar images are retrieved.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
DUBEY et al.: MULTICHANNEL DECODED LBPs FOR CONTENT-BASED IMAGE RETRIEVAL 4025
Fig. 8. ARP (%) for different combinations of two channels of RGB color
space using mCENTRIST, maLBP and mdLBP descriptors over (a) Corel-1k,
(b) MIT-VisTex, (c) USPTex, and (d) ALOT database when 10 similar images
are retrieved.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4026 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
Fig. 10. The performance comparison of proposed maLBP and mdLBP descriptor with existing approaches such as LBP, cLBP, mscLBP, and mCENTRIST
descriptors using ARP vs number of retrieved images over Corel-1k, MIT-VisTex, STex, USPTex, FTVL, KTH-TIPS, KTH-TIPS2a, and Corel-Tex databases.
Fig. 11. The performance comparison of proposed maLBP and mdLBP descriptor with existing approaches such as LBP, cLBP, mscLBP, and mCENTRIST
descriptors under uniform transformation (u2) using ARP vs number of retrieved images plot over Corel-1k, MIT-VisTex, STex, USPTex, FTVL, KTH-TIPS,
KTH-TIPS2a, and Corel-Tex databases.
Fig. 12. The performance comparison of proposed multichannel decoded local binary patterns with existing approaches under rotation invariant uniform
transformation (riu2) using ARP vs number of retrieved images curve over Corel-1k, MIT-VisTex, STex, USPTex, FTVL, KTH-TIPS, KTH-TIPS2a, and
Corel-Tex databases.
existing approaches in most of the results of Fig. 13, while the degree of improvement in the performance of mdLBP is
the ARR values using proposed mdLBP are higher than higher over the ALOT database as compared to the ZuBuD
other approaches in each result of the Fig. 13, moreover, database. We also explored the categorical performance
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
DUBEY et al.: MULTICHANNEL DECODED LBPs FOR CONTENT-BASED IMAGE RETRIEVAL 4027
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4028 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
Fig. 15. Top 10 retrieved images using each descriptor from Corel-1k database by considering the query image from (a) ‘Building’, (b) ‘Bus’, (c) ‘Dinosaurs’,
(d) ‘Elephant’, (e) ‘Flower’, (f) ‘Food’, (g) ‘Horse’, (h) ‘Africans’, and (i) ‘Beaches’ categories. Note that 6 rows in each subfigure corresponds to the different
descriptor such as LBP (1st row), cLBP (2nd row), mscLBP (3rd row), mCENTRIST (4th row), maLBP (5th row) and mdLBP (6th row) and 10 columns
in each subfigure corresponds to the 10 retrieved images in decreasing similarity order; images in the 1st column are query images as well as the top most
similar images.
in the Fig. 15(f). The number of correct images retrieved computer having Intel(R) Core(TM) i5 CPU 650@3.20 GHz
using LBP, cLBP, mscLBP, mCENTRIST, maLBP and mdLBP processor, 4 GB RAM, and 32-bit Windows 7 Ultimate operat-
descriptors for a query image from ‘Horse’ category are 7, 7, ing system with 4-cores active. The feature dimension of each
6, 6, 9, and 10 (see the Fig. 15(g)). The retrieval precision descriptor is mentioned in the Table VI with feature extraction
gained by proposed descriptors are also high as compared to and retrieval times over Corel-1k and MIT-VisTex databases.
the existing descriptors for the query images from categories The feature extraction time of mdLBP is {2.55, 1.93}, {0.73,
‘Africans’ and ‘Beaches’ as demonstrated in the Fig. 15(h-i) 0.96} and {1.78, 1.28} times slower than the feature extraction
respectively. time of cLBP, mscLBP and mCENTRIST respectively over
It is deduced from the retrieval results that the precision {Corel-1k, MIT-VisTex} databases. While at the other end, the
and recall using proposed multichannel based maLBP and feature extraction time of maLBP is nearly {−13%, −30%},
mdLBP descriptor is high as compared to the same using LBP {204%, 41%} and {25%, 5%} faster than the feature extrac-
and existing multichannel based approaches such as cLBP, tion time of cLBP, mscLBP and mCENTRIST respectively
mscLBP and mCENTRIST descriptors. It is also observed over {Corel-1k, MIT-VisTex} databases. The retrieval time
that the performance of mdLBP is better than the maLBP. using mdLBP is {4, 2.31}, {0.85, 0.77}, {1.44, 1.16}, and
It is shown by the experiments that proposed mdLBP method {2.9, 1.78} times slower than the retrieval time using cLBP,
outperforms other methods because mdLBP encodes each mscLBP and mCENTRIST, and maLBP descriptors over
combination of the red, green and blue channels locally from {Corel-1k, MIT-VisTex} databases. The feature extraction
its LBP binary values. The color in images is depicted by three time of each descriptor is nearly equal with u2 and riu2
values but most of methods process these values separately transformations also. The retrieval time using mdLBPu2 and
which loss the cross channel information. Whereas, mdLBP mdLBPriu2 is nearly 10 and 50 times better respectively than
takes all the combinations of LBP binary value computed over the retrieval time using mdLBP over Corel-1k database. The
each channel using a decoder based methodology. feature extraction time and retrieval time using each descriptor
is more over Corel-1k database because this database is having
E. Analysis Over Feature Extraction and Retrieval Time more number of images with large resolution as compared to
We analyzed the feature extraction time as well as the MIT-VisTex database images. From the Table VI, it is
the retrieval time for each descriptor over Corel-1k and explored that the maLBP is more time efficient whereas
MIT-VisTex databases. Both the feature extraction and mdLBP is less time efficient as the dimension of mdLBP is
retrieval time are computed in seconds using a personal higher than others except mscLBP.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
DUBEY et al.: MULTICHANNEL DECODED LBPs FOR CONTENT-BASED IMAGE RETRIEVAL 4029
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4030 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
V. C ONCLUSION
In this paper, two multichannel decoded local binary pat-
terns are introduced namely multichannel adder local binary
pattern (maLBP) and multichannel decoder local binary
pattern (mdLBP). Basically both maLBP and mdLBP have
utilized the local information of multiple channels on the basis
of the adder and decoder concepts. The proposed methods are
evaluated using image retrieval experiments over ten databases
having images of natural scene and color textures. The results
are computed in terms of the average precision rate and aver-
age retrieval rate and improved performance is observed when
Fig. 16. Comparison of proposed descriptors maLBP and mdLBP with LBP, compared with the results of the existing multichannel based
cLBP, mscLBP, mCENTRIST, mSIFT, mGIST and CDH over large databases approaches over each database. From the experimental results,
such as (a-b) Colored Brodatz and (c-d) ALOT-Complete in terms of the it is concluded that the maLBP descriptor is not showing the
ARP and ARR.
best performance in most of the cases while mdLBP descriptor
outperforms the existing state-of-the-art multichannel based
TABLE VIII
descriptors. It is also deduced that Chi-square distance measure
P ERFORMANCE A NALYSIS OF P ROPOSED I DEA W ITH CLBP IN T ERMS
OF THE ARP W HEN THE N UMBER OF R ETRIEVED I MAGES I S 10 is better suited with the proposed image descriptors. The
performance of the proposed descriptors is much improved
for three input channels and also in the RGB color space.
The performance of mdLBP is also superior to non-LBP
descriptors. It is also pointed out that mdLBP outperforms the
state-of-the-art descriptors over large databases. Experiments
also suggested that the introduced approach is generalized and
can be applied over any LBP based descriptor. The increased
dimension of the decoder based descriptor slows down the
retrieval time which is the future direction of this research. One
future aspect of this research is to make the descriptors noise
robust which can be achieved by using the noise robust binary
patterns over each channel as the input to the adder/decoder.
R EFERENCES
[1] A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain,
“Content-based image retrieval at the end of the early years,” IEEE
Trans. Pattern Anal. Mach. Intell., vol. 22, no. 12, pp. 1349–1380,
Dec. 2000.
[2] Y. Liu, D. Zhang, G. Lu, and W. Y. Ma, “A survey of content-based
Fig. 17. Comparison of proposed descriptors maLBP and mdLBP with LBP, image retrieval with high-level semantics,” Pattern Recognit., vol. 40,
cLBP, mscLBP, mCENTRIST, mSIFT, mGIST and CDH over the databases no. 1, pp. 262–282, 2007.
having uniform illumination, rotation and image scale changes. [3] S. R. Dubey, S. K. Singh, and R. K. Singh, “Rotation and illumination
invariant interleaved intensity order based local descriptor,” IEEE Trans.
Image Process., vol. 23, no. 12, pp. 5323–5333, Dec. 2014.
the first 25 images of each category of Corel-1k with angle [4] S. R. Dubey, S. K. Singh, and R. K. Singh, “A multi-channel based
illumination compensation mechanism for brightness invariant image
0, 90, 180, and 270 degrees. The Scale database is obtained retrieval,” Multimedia Tools Appl., vol. 74, no. 24, pp. 11223–11253,
by scaling the first 20 images of each category of Corel-1k at 2015.
the scales of 0.5, 0.75, 1, 1.25, and 1.5. Thus, each database [5] T. Ojala, M. Pietikäinen, and D. Harwood, “A comparative study of
texture measures with classification based on featured distributions,”
consists of the 1000 images with 100 images per category. Pattern Recognit., vol. 29, no. 1, pp. 51–59, 1996.
The retrieval results over these databases are displayed [6] T. Ojala, M. Pietikäinen, and T. Mäenpää, “Multiresolution gray-scale
in Fig. 17 in terms of ARP (%) when 20 images are and rotation invariant texture classification with local binary patterns,”
IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 7, pp. 971–987,
retrieved using each descriptor. The mdLBP outperforms the Jul. 2002.
remaining descriptors in case of uniform intensity change. [7] A. Hadid and G. Zhao, Computer Vision Using Local Binary Patterns,
The performance of mdLBP is also better in rotation and vol. 40. New York, NY, USA: Springer, 2011.
[8] T. Ahonen, A. Hadid, and M. Pietikäinen, “Face description with local
scaling conditions except CDH. The performance of CDH binary patterns: Application to face recognition,” IEEE Trans. Pattern
is very well in rotation and scaling conditions because Anal. Mach. Intell., vol. 28, no. 12, pp. 2037–2041, Dec. 2006.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
DUBEY et al.: MULTICHANNEL DECODED LBPs FOR CONTENT-BASED IMAGE RETRIEVAL 4031
[9] D. Huang, C. Shan, M. Ardabilian, Y. Wang, and L. Chen, “Local binary [34] Zurich Buildings Database (ZuBuD). [Online]. Available: http://www.
patterns and its application to facial image analysis: A survey,” IEEE vision.ee.ethz.ch/datasets/index.en.html
Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 41, no. 6, pp. 765–781, [35] S. R. Dubey, S. K. Singh, and R. K. Singh, “Rotation and scale invariant
Nov. 2011. hybrid image descriptor and retrieval,” Comput. Electr. Eng., vol. 46,
[10] C. Shan, S. Gong, and P. W. McOwan, “Facial expression recognition pp. 288–302, Sep. 2015.
based on local binary patterns: A comprehensive study,” Image Vis. [36] S. R. Dubey, S. K. Singh, and R. K. Singh, “Local bit-plane decoded
Comput., vol. 27, no. 6, pp. 803–816, 2009. pattern: A novel feature descriptor for biomedical image retrieval,” IEEE
[11] S. R. Dubey, S. K. Singh, and R. K. Singh, “Local diagonal extrema J. Biomed. Health Inf., in press.
pattern: A new and efficient feature descriptor for CT image retrieval,” [37] S. R. Dubey, S. K. Singh, and R. K. Singh, “Local wavelet pattern:
IEEE Signal Process. Lett., vol. 22, no. 9, pp. 1215–1219, Sep. 2015. A new feature descriptor for image retrieval in medical CT databases,”
[12] M. Heikkilä, M. Pietikäinen, and C. Schmid, “Description of interest IEEE Trans. Image Process., vol. 24, no. 12, pp. 5892–5903,
regions with local binary patterns,” Pattern Recognit., vol. 42, no. 3, Dec. 2015.
pp. 425–436, 2009. [38] G. H. Liu and J. Y. Yang, “Content-based image retrieval using color
[13] S. Liao, M. W. K. Law, and A. C. S. Chung, “Dominant local binary difference histogram,” Pattern Recognit., vol. 46, no. 1, pp. 188–198,
patterns for texture classification,” IEEE Trans. Image Process., vol. 18, 2013.
no. 5, pp. 1107–1118, May 2009. [39] K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek, “Evaluating
[14] Z. Guo, L. Zhang, and D. Zhang, “Rotation invariant texture classi- color descriptors for object and scene recognition,” IEEE Trans. Pattern
fication using LBP variance (LBPV) with global matching,” Pattern Anal. Mach. Intell., vol. 32, no. 9, pp. 1582–1596, Sep. 2010.
Recognit., vol. 43, no. 3, pp. 706–719, 2010. [40] A. Oliva and A. Torralba, “Modeling the shape of the scene: A holistic
[15] Z. Guo and D. Zhang, “A completed modeling of local binary pattern representation of the spatial envelope,” Int. J. Comput. Vis., vol. 42,
operator for texture classification,” IEEE Trans. Image Process., vol. 19, no. 3, pp. 145–175, 2001.
no. 6, pp. 1657–1663, Jan. 2010. [41] M. Brown and S. Süsstrunk, “Multi-spectral SIFT for scene category
[16] X. Tan and B. Triggs, “Enhanced local texture feature sets for face recognition,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit.,
recognition under difficult lighting conditions,” IEEE Trans. Image Jun. 2011, pp. 177–184.
Process., vol. 19, no. 6, pp. 1635–1650, Jun. 2010. [42] M. Douze, H. Jégou, H. Sandhawalia, L. Amsaleg, and C. Schmid,
[17] B. Zhang, Y. Gao, S. Zhao, and J. Liu, “Local derivative pattern versus “Evaluation of gist descriptors for Web-scale image search,” in Proc.
local binary pattern: Face recognition with high-order local pattern ACM Int. Conf. Image Video Retr., Jul. 2009, pp. 19–27.
descriptor,” IEEE Trans. Image Process., vol. 19, no. 2, pp. 533–544, [43] H. Jégou, M. Douze, C. Schmid, and P. Pérez, “Aggregating local
Feb. 2010. descriptors into a compact image representation,” in Proc. IEEE Conf.
[18] S. R. Dubey, S. K. Singh, and R. K. Singh, “Local neighborhood based Comput. Vis. Pattern Recognit., Jun. 2010, pp. 3304–3311.
robust colour occurrence descriptor for colour image retrieval,” IET [44] J. Y.-H. Ng, F. Yang, and L. S. Davis, “Exploiting local features from
Image Process., vol. 9, no. 7, pp. 578–586, Jul. 2015. deep networks for image retrieval,” in Proc. IEEE Int. Conf. Com-
[19] W. T. Chu, C. H. Chen, and H. N. Hsu, “Color CENTRIST: Embedding put. Vis. Pattern Recognit., DeepVis. Workshop (CVPRW), Apr. 2015,
color information in scene categorization,” J. Vis. Commun. Image pp. 53–61.
Represent., vol. 25, no. 5, pp. 840–854, 2014. [45] H. Jégou, M. Douze, and C. Schmid, “Improving bag-of-features
[20] C. K. Heng, S. Yokomitsu, Y. Matsumoto, and H. Tamura, “Shrink for large scale image search,” Int. J. Comput. Vis., vol. 87, no. 3,
boost for selecting multi-LBP histogram features in object detection,” pp. 316–336, May 2010.
in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit., Jun. 2012, [46] V. Sydorov, M. Sakurada, and C. H. Lampert, “Deep Fisher kernels–end
pp. 3250–3257. to end learning of the Fisher kernel GMM parameters,” in Proc. IEEE
[21] J. Y. Choi, K. N. Plataniotis, and Y. M. Ro, “Using colour local binary Conf. Comput. Vis. Pattern Recognit., Jun. 2014, pp. 1402–1409.
pattern features for face recognition,” in Proc. 17th IEEE Int. Conf. [47] K. Simonyan, A. Vedaldi, and A. Zisserman, “Deep Fisher networks
Image Process. (ICIP), Sep. 2010, pp. 4541–4544. for large-scale image classification,” in Proc. Adv. Neural Inf. Process.
[22] C. Zhu, C. E. Bichot, and L. Chen, “Multi-scale color local binary Syst., 2013, pp. 163–171.
patterns for visual object classes recognition,” in Proc. IEEE Int. Conf. [48] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification
Pattern Recognit., Aug. 2010, pp. 3065–3068. with deep convolutional neural networks,” in Proc. Adv. Neural Inf.
[23] C. Zhu, C.-E. Bichot, and L. Chen, “Image region description using Process. Syst., 2012, pp. 1097–1105.
orthogonal combination of local binary patterns enhanced with color [49] F. Perronnin and D. Larlus, “Fisher vectors meet neural networks:
information,” Pattern Recognit., vol. 46, no. 7, pp. 1949–1963, Jul. 2013. A hybrid classification architecture,” in Proc. IEEE Conf. Comput. Vis.
[24] S. Banerji, A. Verma, and C. Liu, “Novel color LBP descriptors for Pattern Recognit., Jun. 2015, pp. 3743–3752.
scene and image texture classification,” in Proc. 15th Int. Conf. Image [50] X. Zhou, K. Yu, T. Zhang, and T. S. Huang, “Image classification using
Process., Comput. Vis., Pattern Recognit., Las Vegas, NV, USA, 2011, super-vector coding of local image descriptors,” in Proc. Eur. Conf.
pp. 537–543. Comput. Vis., Sep. 2010, pp. 141–154.
[25] S. H. Lee, J. Y. Choi, Y. M. Ro, and K. N. Plataniotis, “Local color vector [51] X. Bai, C. Yan, P. Ren, L. Bai, and J. Zhou, “Discriminative sparse
binary patterns from multichannel face images for face recognition,” neighbor coding,” Multimedia Tools Appl., vol. 75, no. 7, pp. 4013–
IEEE Trans. Image Process., vol. 21, no. 4, pp. 2347–2353, Apr. 2012. 4037, 2016.
[26] Y. Xiao, J. Wu, and J. Yuan, “mCENTRIST: A multi-channel feature [52] N. Inoue and K. Shinoda, “Fast coding of feature vectors using neighbor-
generation mechanism for scene categorization,” IEEE Trans. Image to-neighbor search,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 38,
Process., vol. 23, no. 2, pp. 823–836, Feb. 2014. no. 6, pp. 1160–1184, 2016.
[27] Corel Photo Collection Color Image Database, accessed on Aug. 2014. [53] X. Li, M. Fang, and J. J. Zhang, “Projected transfer sparse coding for
[Online]. Available: http://wang.ist.psu.edu/docs/realted/ cross domain image representation,” J. Vis. Commun. Image Represent.,
[28] MIT Vision and Modeling Group, Cambridge, U.K. Vision vol. 33, pp. 265–272, Nov. 2015.
Texture, accessed on Aug. 2014. [Online]. Available: [54] C. Zhang, J. Cheng, J. Liu, J. Pang, Q. Huang, and Q. Tian, “Beyond
http://vismod.media.mit.edu/pub/ explicit codebook generation: Visual representation using implicitly
[29] Salzburg Texture Image Database, accessed on Aug. 2014. [Online]. transferred codebooks,” IEEE Trans. Image Process., vol. 24, no. 12,
Available: http://www.wavelab.at/sources/STex/ pp. 5777–5788, Dec. 2015.
[30] A. R. Backes, D. Casanova, and O. M. Bruno, “Color texture analy- [55] S. Murala and Q. J. Wu, “Local ternary co-occurrence patterns: A new
sis based on fractal descriptors,” Pattern Recognit., vol. 45, no. 5, feature descriptor for MRI and CT image retrieval,” Neurocomputing,
pp. 1984–1992, 2012. vol. 119, no. 7, pp. 399–412, 2013.
[31] FTVL Database, accessed on Aug. 2014. [Online]. Available: [56] Distance Measures MATLAB Code, accessed on Aug. 2014.
http://www.ic.unicamp.br/~rocha/pub/downloads/tropical-fruits-DB- [Online]. Available: http://www.cs.columbia.edu/~mmerler/project/code/
1024x768.tar.gz/ pdist2.m
[32] KTH-TIPS Texture Image Database. [Online]. Available: http://www. [57] Colored Brodatz Database, accessed on Aug. 2014. [Online]. Available:
nada.kth.se/cvap/databases/kth-tips/index.html http://multibandtexture.recherche.usherbrooke.ca/colored%20_brodatz.
[33] G. J. Burghouts and J.-M. Geusebroek, “Material-specific adaptation html
of color invariant features,” Pattern Recognit. Lett., vol. 30, no. 3, [58] LLC MATLAB Code, accessed on Aug. 2014. [Online]. Available:
pp. 306–313, 2009. http://www.ifp.illinois.edu/~jyang29/LLC.htm
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIP.2016.2577887, IEEE
Transactions on Image Processing
4032 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016
[59] J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong, “Locality- Satish Kumar Singh (M’11–SM’14) received the
constrained linear coding for image classification,” in Proc. IEEE Conf. B.Tech., M.Tech., and Ph.D. degrees in 2003, 2005,
Comput. Vis. Pattern Recognit., Jun. 2010, pp. 3360–3367. and 2010, respectively. He is an Assistant Professor
[60] GIST MATLAB Code, accessed on Aug. 2014. [Online]. Available: with the Indian Institute of Information Technology
http://people.csail.mit.edu/torralba/code/spatialenvelope/ at Allahabad, India. He is having more than ten
[61] L. Liu, M. Yu, and L. Shao, “Multiview alignment hashing for efficient years of experience in academic and research insti-
image search,” IEEE Trans. Image Process., vol. 24, no. 3, pp. 956–966, tutions. He has several publications in international
Mar. 2015. journal and conference proceedings of repute. He is
[62] J. Tang, Z. Li, M. Wang, and R. Zhao, “Neighborhood discriminant a member of various professional societies, such as
hashing for large-scale image retrieval,” IEEE Trans. Image Process., the IEEE and IETE. He was an Executive Committee
vol. 24, no. 9, pp. 2827–2840, Sep. 2015. Member of the IEEE Uttar Pradesh Section-2014.
[63] L. Liu and L. Shao, “Sequential compact code learning for unsupervised He is serving as an Editorial Board Member and a Reviewer for many
image hashing,” IEEE Trans. Neural Netw. Learn. Syst., in press. international journals. His current research interests are in the areas of digital
[64] L. Liu, M. Yu, and L. Shao, “Unsupervised local feature hashing for image processing, pattern recognition, multimedia data indexing and retrieval,
image similarity search,” IEEE Trans. Cybern., in press. watermarking, and biometrics.
1057-7149 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.