ISSN ONLINE(2320-9801) PRINT (2320-9798)

All submissions of the EM system will be redirected to Online Manuscript Submission System. Authors are requested to submit articles directly to Online Manuscript Submission System of respective journal.

An Approach for Telugu Numeral Recognition by Moment Invariants in Wavelet Transform Domain

M. Radhika Mani1, R. Kavitha Lakshmi2
  1. Associate Professor, Dept. of CSE, Pragati Engineering College, Surampalem, A.P., India
  2. Assistant Professor, Dept. of CSE, Pragati Engineering College, Surampalem, A.P., India
Related article at Pubmed, Scholar Google

Visit for more related articles at International Journal of Innovative Research in Computer and Communication Engineering

Abstract

Optical Character Recognition (OCR) is the task of recognizing the characters which are present in a digital image of text, whose domain can be machine print or handwriting. OCR is one of the most fascinating and challenging areas of pattern recognition with various practical application potentials. The present paper proposes a method for recognizing the Telugu Numerals from zero to nine by using the Moment Invariants. For this, the proposed method uses boundary based shape representation scheme. The input Telugu numeral is decomposed in to one level Haar wavelet transformed sub bands and the Boundary Moment Invariants are measured for each of these four sub bands. The experimental results show the superiority of the proposed method to recognize the Telugu numerals.

Keywords

Boundary, Shape, Wavelet Transform, Feature Vector and Object Recognition.

INTRODUCTION

In recent years, the glory of Document Image Analysis and Optical Character Recognition (OCR) has strongly increased since a paper document is still the most dominant medium for exchanging information. The computer is the most appropriate device for processing this information. Most of the works related to OCR are done in English, Chinese, Japanese and Arabic. However, some preliminary work has also been done on Indian scripts. A comprehensive review of the state of the art in the automatic processing of handwriting is reported in [12]. This paper reports many recent advances and changes that have occurred in this field. Various psychophysical aspects of the generation and perception of handwriting are presented to highlight the different sources of variability that make handwriting processing difficult. Major successes and promising applications of both online and offline approaches are indicated. State of the art Analysis and recognition of Asian scripts was reported in [14]. The paper summarizes the research activities of the past decades on the recognition of handwritten scripts used in China, Japan and Korea. It presents the recognition methodologies, features explored, databases used and classification schemes. In addition, it includes a description of the performance of numerous recognition systems found in both academic and industrial research laboratories. A handwritten character magically survives serious distortions in size, orientation and even structure. The general problem of defining the shape of a 2D line diagram, with character as a significant special case is addressed in [2]. The paper argues that the global shape of a character is determined by a set of local shapes. The local shapes, which are few in number, combine variously to give rise to a great diversity of characters. An unconstrained handwritten character recognition based on fuzzy logic is described in [5]. The approach uses the box method for feature extraction. Two recognition strategies are implemented for comparison. The recognition based on fuzzy logic outweighs that using back propagation neural network (BPNN). A hybrid classification system with neural network and decision tree as the classifiers for handwritten numeral recognition is reported in [11]. First, a variety of stable and reliable global features are defined and extracted based on the character geometric structures. A novel floating detector is then proposed to detect segments along the left and right profile of a character image used as local features. Finally, the recognition system consists of a hierarchical coarse classification and fine classification. A distance feature for neural network-based recognition of handwritten characters is described in [10]. Two new features, which are based on distance information, one on distance transformation and another on directional distance distribution, are described. Experimentation has been done on three standard distinct sets of characters (i.e., numerals, English capital letters, and Hangul initial sounds). Multiresolution recognition of unconstrained handwritten numerals using wavelet transform and a simple multilayer cluster neural network is reported in [7]. The scheme consists of two stages: a feature extraction stage for extracting multi resolution features with wavelet transform, and a classification stage for classifying unconstrained handwritten numerals with a single multilayer cluster neural network. Work on multi wavelets and neural networks can also be seen in [3]. A MLP Classifier for both printed and handwritten Bangla numeral recognition is proposed in [8]. Pixel-based and shape-based features are chosen for the purpose of recognition. Multi-layer neural network architecture was chosen as classifiers of the mixed class of handwritten and printed numerals. An offline hand printed Bangla numeral recognition scheme using a multistage classifier system comprising of multilayer Perceptron (MLP) neural network is described in [1]. The scheme considers multiresolution features based on wavelet transforms. The recognition scheme is robust to various writing styles and size. Method based on multiresolution analysis for Telugu character recognition can also be seen in [13]. Online handwritten character recognition of Devnagari and Telugu characters using Support Vector Machines is reported in [15]. The input to the recognition system consists of features of the strokes in each written character. The present paper has proposed a method to recognize ten Telugu numerals by using boundary moment invariant descriptors. The proposed method is using Wavelet transform domain for evaluating the feature vector. The organization of the present paper follows as section II gives the methodology, section III gives the results and discussions and section IV gives the conclusions.

METHODOLOGY

The character recognition is an important task in the OCR system. The present paper proposes a method to recognize the Telugu numerals from zero to nine. The input object can be represented by using either boundary or region based representations. The present paper uses the boundary based representation approach. On the represented shape of the boundary, the features can be described. The feature description is performed by using the boundary moment invariants (BMI). The size of the feature vector is 7. The BMI1 to BMI7 are given by the Equations (1)-(7).
image
The present paper computes the BMI features in transform domain. The transform domain is achieved by using the Wavelet. A wavelet is a kind of mathematical function used to divide a given function or continuous-time signal into components of different frequency and to study each component with a resolution that matches its scale. A wavelet transform is the representation of a function by wavelets. In the wavelet transform [4, 6, 9] an image signal can be analyzed by passing it through an analysis filter bank followed by a decimation operation. This analysis filter bank consists of a low pass and a high pass filter at each decomposition stage. When the signal passes through these filters it splits into two bands. The low pass filter, which corresponds to an averaging operation, extracts the coarse information of a signal. The high pass filter, which corresponds to a differencing operation, extracts the detail information of the signal. The output of the filtering operations is then decimated by two. A two-dimensional transform can be accomplished by performing two separate one-dimensional transforms. Firstly, the image is filtered along the xdimension using low pass and high pass analysis filters and decimated by two. Low pass filtered coefficients are stored on the left part of the matrix and high pass filtered coefficients are stored in the right part of the matrix. Because of decimation the total size of the transformed image is same as the original image. Then, it is followed by filtering the subimage along the y-dimension and decimated by two. Finally, the image splits into four bands denoted by low-low (LL), high-low (HL), low-high (LH) and high-high (HH) after one-level decomposition. Fig. 1 shows one level of filtering. This process of filtering the image is called ‘Pyramidal decomposition’ of image.
The present paper proposes a method for recognizing the Telugu numerals from zero to nine by using boundary moment invariant (BMI) descriptors. The original images are shown in Fig. 2.
The present paper has used boundary based shape representation scheme. By using the Haar Wavelet transform, the original image is decomposed into four sub bands LL, Hl, LH and HH and within each sub band, seven BMI features are calculated and the results are given in Tables I-X. The average value for all these four sub bands is calculated for each numeral. The Table XI gives the average wavelet boundary descriptor for each numeral. The results are plotted on to a graph as shown in Fig. 3 and from this graph, clearly it is visible that each Telugu numeral has its own unique descriptor value i.e. the average value from Table XI. From the Graph shown in Fig. 3, it is clear that the Telugu Numeral 0 is having the highest BMI value and Numeral 6 is having the lowest BMI value. The Numeral 1 and 8 BMI values are in high range. The Numeral 7 BMI value is in the low range and the remaining numerals BMI value is in the middle range.

CONCLUSION

The character recognition plays a vital role in the document image analysis. The present paper proposes an approach to recognize ten Telugu numerals from zero to nine. The proposed method computes seven moment invariant features on the boundary shape representation. The computation of features is further extended to four decomposed sub bands obtained by the Haar Wavelet transform. The average of all these values will yield a unique feature for each of the ten Telugu numerals. From the results, we can conclude that the Telugu numerals which are having the maximum shape as the full/semi circle, those possess the highest feature value where as the numerals which are not having the circle shapes are possess the lowest feature value where the remaining numerals which are having less amount of circle shapes possess the medium feature values. With this approach, the present paper proposed a method for recognizing ten Telugu numerals.

Tables at a glance

Table icon Table icon Table icon Table icon
Table 1 Table 2 Table 3 Table 4
Table icon Table icon Table icon Table icon
Table 5 Table 6 Table 7 Table 8
Table icon Table icon Table icon
Table 9 Table 10 Table 11

Figures at a glance

Figure 1 Figure 2 Figure 3
Figure 1 Figure 2 Figure 3

References