A NOVEL CONTENT BASED IMAGE RETRIEVAL MODEL BASED ON THE MOST RELEVANT FEATURES USING PARTICLE SWARM OPTIMIZATION

P.K.Bhargavi; S.Bhuvana; Dr.R.Radhakrishnan

A NOVEL CONTENT BASED IMAGE RETRIEVAL MODEL BASED ON THE MOST RELEVANT FEATURES USING PARTICLE SWARM OPTIMIZATION

P.K.Bhargavi^*1, S.Bhuvana², Dr.R.Radhakrishnan³

Computer Science and Engineering, Sri Krishna College of Technology, Coimbatore, Tamilnadu, India
Computer Science and Engineering, Sri Krishna College of Technology, Coimbatore, Tamilnadu, India
Computer Science and Engineering, Maharaja Institute of Engineering & Technology, Coimbatore,Tamilnadu,India

Corresponding Author: P.K.Bhargavi, E-mail: pkbhargavi31@gmail.com

Related article at Pubmed, Scholar Google

Visit for more related articles at Journal of Global Research in Computer Sciences

Abstract

Content Based Image Retrieval (CBIR) is the application of computer vision techniques to the image retrieval problem, that is, the problem of searching for digital images in large databases. Content-based image retrieval (CBIR) depends on extracting the most relevant features according to a feature selection technique. The integration of multiple features may cause the curse of dimensionality and the consumed time in the retrieval process. The proposed model includes the following steps: (i) Feature Extraction from images database using color coherence vector (CCV) and Gabor filter algorithm to extract the color and texture features (ii) Feature Discrimination using maximum entropy method for replacing numerical features with nominal features that represent intervals of numerical domains with discrete values using Class Attribute Interdependence Maximization (CAIM) algorithm (iii) Feature Selection using Particle Swarm Optimization (PSO) algorithm for extracting the most relevant features from the original features set. CBIR based applications are used in Internet and law enforcement markets for the purpose of identifying and censoring the images.

Keywords

Content-based image retrieval, Feature Discrimination, Feature Selection, Ant Colony Optimization (ACO) algorithm

INTRODUCTION

Content Based Image Retrieval (CBIR) is the application of computer vision techniques to the image retrieval problem, that is, the problem of searching for digital images in large databases. CBIR systems retrieve relevant images in a database using visual content of the images. Researchers are mainly developed based on the high-level semantic analysis of the image content along with the visual content of the image such as colors, textures, and shapes. Color features include the color histogram, the color coherence vector, the color co-occurrence matrix, vector quantization, and color moments. Texture features are derived from the gray-level co-occurrence matrix, the Tamura feature, and wavelet coefficients, and Gabor filter-based features. Once image features are extracted, another problem arouse that is which features are relevant in the retrieval process. Having more features implies more discriminative power in classification. In addition, high dimensionality of data may cause the „„curse of dimensionality” problem.

Feature reduction refers to the study of methods for reducing the number of dimensions describing data. Its general purpose is to employ fewer features to represent data and reduce computational cost, without deteriorating discriminative capability. Since feature reduction can bring lots of advantages, such as avoiding over fitting, resisting noise and strengthening prediction performance. Feature transform constructs new features by projecting the original feature space to a lower dimensional one. Principal component analysis and independent component analysis are two widely used feature transform methods. Although feature transform can obtain the least dimension, its major drawbacks are that its computational over-head are high and the output is hard to be interpreted for users.

Feature selection is the process of choosing a subset of the original feature spaces according to discrimination capability to improve the quality of data. The present paper proposes a suggested image retrieval model based on extracting the most relevant features from the whole features according to a feature selection technique. The extracted features are color and texture features using the invariant features, color coherence vector and Gabor wavelets.

The color coherence vector (CCV) is an indication of how similar colors are oriented in the image. A color coherence vector (CCV) stores the number of coherent versus incoherent pixels with each color [2]. Regions which have a collection of similar color pixels are termed as coherent regions. The CCV computes the present number of coherent and incoherent pixels. Vastly different images can have the same color histograms. The CCV will differentiate the images on the basis of how many regions are present that have a sizeable contiguous color patches. In order to compute the CCV, we specified the number of bins in which the colors can be classified. Then, for each pixel the neighboring pixels are compared and the region of coherence is marked. If the size of this region is above a threshold, then the region is marked coherent, else it is marked non-coherent. Thus, by dividing the entire blocks of color into either coherent or incoherent, we get information regarding the scattering of color information.

The Gabor wavelet is widely adopted to extract texture features from the images for the image retrieval process. Basically, Gabor filters are a group of wavelets, with each wavelet capturing energy at a specific frequency and a specific orientation. The scale and orientation tunable property of Gabor filter makes it useful especially for texture analysis [3]. Gabor filters are a group of wavelets capturing energy at a specific frequency and a specific direction. The expansion of a signal using this basis provides a localized frequency description, therefore, capturing local features/energy of the signal.

RELATED WORKS

A color coherence vector (CCV) stores the number of coherent versus incoherent pixels with each color. By separating coherent pixels from incoherent pixels, CCV's provide finer distinctions than color histograms [2].Color histograms also have some limitations.

Color coherence as the degree to which pixels of that color are members of large similarly-colored regions. We refer to these significant regions as coherent regions, and observe that they are of significant importance in characterizing images. Color coherence measure classifies pixels as either coherent or incoherent. Coherent pixels are a part of some sizable contiguous region, while incoherent pixels are not. A color coherence vector represents this classification for each color in the image.

In computing, a CCV is similar to the computation of a color histogram. We first blur the image slightly by replacing pixel values with the average value in a small local neighborhood (currently including the 8 adjacent pixels). This eliminates small variations between neighboring pixels. Then the color space is discretized, such that there are only n distinct colors [2] in the image.

Gabor wavelet is widely adopted to extract texture features from the images for retrieval and has been shown to be very efficient. Gabor filters are a group of wavelets, with each wavelet capturing energy at a specific frequency and specific orientation. The scale and orientation tunable property of Gabor filter makes its especially useful for texture analysis. The use of Gabor filters in extracting textured image features is motivated by various factors. The Gabor representation has been shown to be optimal in the sense of minimizing the joint two-dimensional uncertainty in space and frequency. These filters can be considered as orientation and scale tunable edge and line (bar) detectors, and the statistics of these micro-features in a given region are often used to characterize the underlying texture information. Gabor features have been used in several image analysis applications including texture classification and segmentation, image recognition, image registration, and motion tracking.

Support Vector Machines is built on the structural risk minimization principle to seek a decision surface that can separate the data points into two classes with a maximal margin between them. This allows for formulating the classifier training as a constrained optimization problem.

The Genetic Algorithms (GA) is efficient methods for function minimization. In descriptor selection context, the prediction error of the model built upon a set of features is optimized. The genetic algorithm mimics the natural evolution by modeling a dynamic population of solutions. The members of the population, referred to as chromosomes, encode the selected features. The encoding usually takes form of bit strings with bits corresponding to selected features set and others cleared. Each chromosome leads to a model built using the encoded features. The parameters steering the crossover, mutation and survival of chromosomes should be carefully chosen to allow the population to explore the solution space and to prevent early convergence to homogeneous population occupying a local minimum. The choice of initial population is also important in genetic feature selection. In this paper, proposed system uses Maximum Entropy method for feature discrimination and Ant Colony Optimization (ACO), which is used to find the optimal solution for feature selection process.

PROPOSED SYSTEM

In this paper, a suggested image retrieval model is proposed based on extracting the most relevant features from the whole features according to a feature selection technique. The extracted features are color and texture features using the invariant features, color coherence vector and Gabor wavelets. These features are discriminated by using CAIM method for replacing the numerical features with nominal features that represent intervals of numerical domains with discrete values. These features pass to a feature selection technique for extracting the most relevant features and dropping the irrelevant ones using PSO algorithm. The relevant features can not only achieve maximum recognition rate but can also simplify the calculation and reduce the consumed time in the retrieval process. The overall process of the system is shown in Fig.1.

Feature Extraction

Color Extraction:

The features which are extracted in this paper are color and texture representation. The color coherence vector (CCV) is an indication of how similar colors are oriented in the image. Regions which have a collection of similar color pixels are termed as coherent regions. The CCV computes The features which are extracted in this paper are color and texture representation. The color coherence vector (CCV) is an indication of how similar colors are oriented in the image. Regions which have a collection of similar color pixels are termed as coherent regions. The CCV computes

INPUT: Color image

OUTPUT: The values of coherent and incoherent pixels are displayed using color coherence vector

STEPS:

a. Color image is given as input

b. The image is divided into eight regions

c. Regions having a collection of similar color pixels are coherent regions

d. The neighboring pixels are compared and the region of coherence is marked

e. If the size of this region is above a threshold, then the region is marked coherent, else it is marked non-coherent.

Texture Extraction:

Gabor wavelet is used to extract texture features from the images for retrieval and has been shown to be very efficient. Gabor filters are a group of wavelets, with each wavelet capturing energy at a specific frequency and specific orientation. The scale and orientation tunable property of Gabor filter makes its especially useful for texture analysis.

INPUT: Color image

OUTPUT: Texture image

STEPS:

a. Color image is given as input

b. Color image is converted into grey scale image

c. Then the image is extracted into texture representation using gabor filter function,

Feature Discrimination:

The discrimination process is an essential step for the proposed feature selection technique. Once image features are extracted, the feature discrimination technique [4] must be performed to transform continuous values into discrete ones in accordance with certain thresholds. The discrimination process converts continuous features into discrete ones by yielding intervals in which the feature value can reside instead of singleton values, and by associating a discrete numerical value with each interval. Class Attribute Interdependence Maximization (CAIM) algorithm is used for discrimination process. The goal of the CAIM algorithm is to maximize the class-attribute interdependence and to generate a (possibly) minimal number of discrete intervals.

Feature Selection:

Feature selection is the process of choosing a subset of the original feature spaces according to discrimination capability to improve the quality of data. This paper proposes a suggested image retrieval model based on extracting the most relevant features from the whole features according to a feature selection technique [6]. The goal of feature selection technique is to select the best features from the original ones. It can not only achieve maximum recognition rate but can also simplify the calculation of the image retrieval process. The goal of feature selection technique is to select the best features from the original ones. Particle Swarm Optimization (PSO) is used for feature selection technique. PSO is a computational method that optimizes a problem by iteratively trying to improve a candidate solution with regard to a given measure of quality. The result of PSO is shown in Fig.4.

Retrieval Process:

The features‟ vectors are saved in the database with a pointer to the model image to which they are belonging. In the retrieval process, a query image is presented to the system and the features of the image are extracted. Each individual feature vector from the query image is compared using the Euclidean distance measure [8] with all the other features‟ vectors stored in the database of the model images.

RESULTS AND DISCUSSIONS

The first images database was downloaded from www.google.com. This images database consists of 1000 images. These images are grouped into 10 clusters, each cluster contains 100 images. The feature extraction technique was applied on the proposed images database for extracting the color and texture features. A color coherence vector (CCV) stores the number of coherent versus incoherent pixels with each color. Coherent pixels are a part of some sizable contiguous region, while incoherent pixels are not. A color coherence vector represents this classification for each color in the image. CCV's prevent coherent pixels in one image from matching incoherent pixels in another. Color image is given as input.The color image is divided into regions and then the coherent and incoherent pixels are differentiated using color coherence vector. For each pixel the neighboring pixels are compared and the region of coherence is marked. Threshold value is set, if the size is above threshold value then the region is marked as coherent else it is marked as incoherent, which is shown in Fig.2.

Gabor wavelet is widely adopted to extract texture features from the images for retrieval and has been shown to be very efficient. Gabor filters are a group of wavelets, with each wavelet capturing energy at a specific frequency and specific orientation. The use of Gabor filters in extracting textured image features is motivated by various factors. The Gabor representation has been shown to be optimal in the sense of minimizing the joint two-dimensional uncertainty in space and frequency. Color image is given as input. Color image is converted into grey scale image. Then the image is extracted into texture representation using gabor filter function. The values of the texture image are displayed using gabor wavelet function, which is shown in Fig.3. The method used in discrimination technique is Class-Attribute Interdependence Maximization (CAIM). The goal of the CAIM algorithm is to maximize the class-attribute interdependence and to generate a (possibly) minimal number of discrete intervals. The results of discrimination technique are shown in Fig.4. The features discriminated are given as input for feature selection technique.

Feature Selection technique is done by using Particle Swarm Optimization (PSO). PSO is a computational method that optimizes a problem by iteratively trying to improve a candidate solution with regard to a given measure of quality. The selected features are displayed as output using Particle Swarm Optimization (PSO) algorithm.

The feature vectors are saved in the database with a pointer to the model image to which they are belonging. In the retrieval process, a query image is presented to the system and the features of the image are extracted as described above. Each individual feature vector from the query image is compared using the Euclidean distance measure with all the other features‟ vectors stored in the database of the model images. When improved positions are being discovered these will then come to guide the movements of the swarm. The process is repeated and by doing so it is hoped, but not guaranteed, that a satisfactory solution will eventually be discovered. The images are retrieved from database which is relevant to the given input image shown in Fig.6.

$\image$

CONCLUSION

In this paper, the color coherence vector and Gabor wavelets are used to extract color and texture feature using feature extraction technique. In discrimination process, exogenous method is used and to obtain the optimal boundaries, Genetic algorithm is used for feature selection. Therefore, we introduced a proposed feature discrimination technique using Maximum entropy method for transforming the extracted continuous features to the nominal features. The effectiveness of the proposed approach has been also analyzed by optimizing it with the Particle Swarm Optimization (PSO) algorithm. The proposed model reduces the search space for the most relevant features set and reduces the consume time in the retrieval process.

The performance improvement of the proposed method has been successfully demonstrated the effectiveness and efficiency of the proposed model when it is compared with other models using precision and recall. Further this can be improved by using unsupervised algorithms such as equal-width interval, equal-frequency interval, k-mean clustering and also by some other optimization methods.

References

M.E. ElAlami, Knowledge-Based Systems 24 (2011) 23ÃÂ¢Ãâ¬Ãâ32, Elsevier on “A novel image retrieval model based on the most relevant features”.
Greg Pass, RaminZabih , Justin Miller, “Comparing Images Using Color Coherence Vectors”.
ArtiKhaparde et al., Content based image retrieval using independent component analysis, International Journal of Computer Science and Network Security (IJCSNS) 8 (4) (2008).
L. Kurgan, K. Cios, CAIM discretization algorithm, IEEE Transactions on Knowledge and Data Engineering 16 (2) (2004) 145ÃÂ¢Ãâ¬Ãâ153
James Dougherty et al., Supervised and unsupervised discretization of continuous features, in: A. Preditis, S. Russell (Eds.), Proceedings of the Twelfth International Conference on Machine Learning, Morgan Kaufmann Publishers, San Francisco, 1995.
EsinGuldogan, MoncefGabbouj, Feature selection for content-based image retrieval, Signal, Image and Video Processing 2 (3) (2008).
Jianjiang Lu, Tianzhong Zhao, Yafei Zhang, Feature selection based on genetic algorithm for image annotation, Knowledge-Based Systems 21 (8) (2008) 887ÃÂ¢Ãâ¬Ãâ 891.
H. MÃÆÃÂ¼ller et al., A framework for benchmarking in visual information retrieval, International Journal on Multimedia Tools and Applications 21 (2003) 55ÃÂ¢Ãâ¬Ãâ73.
N. Jhanwar et al., Content based image retrieval using motif co-occurrence matrix, Image and Vision Computing 22 (2004) 1211ÃÂ¢Ãâ¬Ãâ1220.
P.W. Huang, S.K. Dai, Image retrieval by texture similarity, Pattern Recognition 36 (3) (2003) 665ÃÂ¢Ãâ¬Ãâ679.