Human Eye Blink Detection using YCbCr
Color Model, Haar-Like Features and
Template Matching

Sukhwinder Kaur; Hari Singh

Human Eye Blink Detection using YCbCr Color Model, Haar-Like Features and Template Matching

Sukhwinder Kaur¹, Hari Singh²

M. Tech. Student, Dept. of ECE DAV Institute of Engineering and Technology, Jalandhar, India
Assistant Professor, Dept. of ECE, DAV Institute of Engineering and Technology, Jalandhar, India

Related article at Pubmed, Scholar Google

Visit for more related articles at International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering

Abstract

This paper presents comparison of two image processing algorithms used for eye blink detection. The motivation of this research work is the need of disabled persons who are unable to move their body parts except eyes. The process of blink detection is divided into three parts viz.face localization, eye pair localization and template matching method. In method 1, YCbCr color model and morphological operations are used for the face and eyes localization. In method 2 face and eyes pair localization is performed by using Viola Jones method. After eye pair localization, the concept of template matching is applied for blink detection, in both the methods. A performance comparison is made for both the methods based upon detection accuracy and processing time. It is observed that method 1, gives better accuracy (80.75%) with low processing time (0.38sec.). The overall success rate of method 1 and method 2 is 71% and55% respectively.

Keywords

Face localization, Eye localization, Eye blink detection, YCbCr Color models, Viola Jones, Morphological operations and Template matching.

INTRODUCTION

Paralysis is the complete loss of muscle function for one or more muscle groups. Paralysis can cause loss of feeling or mobility in the affected areas. Paralysis is most often caused by damage to the nervous system, especially the spinal cord [2]. Fully paralysed patients require 24 hour support. But, it is not possible for anyone to be available at all times. In this paper an improved efficiency of eye blink is given. Before starting work on eye blink we have to go through from some steps as shown in fig.1. There are some techniques devised for blink detection as well. This research work is to improve the efficiency of eye blink detection rate. Before doing work on eye blink we have to go through a survey as explained below.

LITRATURE SURVEY

Ijaz Khan, Hadi Abdullah et al [2013]: In this paper the author presents improved algorithms for face, eyes and mouth detection in an image. Viola Jones and skin color pixel detection as face detection techniques are widely used. Viola Jones gives accurate face detection but consumes more time whereas skin color pixel detection technique consumes less time but lacks in accuracy.

Atish Udayashankar et al [2012]: The main aim of this paper is to design a real time interactive system that can assist the paralyzed to control appliances such as lights, fans etc. or by playing pre-recorded audio messages, through a predefined number of eye blinks. Image processing techniques have been implemented in order to detect the eye blinks.

K Takahashi et al [2012]: This paper proposed a practical method for eye blink detection using a monocular system. Eye blink detection is an important technology in many situations such as facial action analysis and signal processing. However, automatic eye blink detection is quite difficult since the eye blink is occurred fast

Kohei Aai and Ronny Mardiyanto et al [2011]: In this paper blink detection is used for a variety of applications such as Human-Computer Interaction, wearable computing, etc. Blink detection method with Gabor filter is proposed to realize high accurate blink detection. The blink detection accuracy is evaluated by several people and compared to the other existing conventional methods. Through the comparison, it is found that the proposed blink detection method with Gabor filter is superior to the other methods.

A. Human Face Localization

For human face localization Viola Jones and skin color pixel detection as face detection techniques are widely used. Viola Jones gives accurate face detection but consumes more time whereas skin color pixel detection technique consumes less time but lacks in accuracy. In this research both the techniques which are to be used will increases accuracy while consuming less time. Viola Jones and other methods can accurately detect faces but in case of facial features detection their accuracy decreases [1, 6, 14, 16].

B. Eye Localization

The localization of eyes in facial images has many applications in computer vision such as gaze estimation, pose estimation, face detection, face recognition, human computer interaction, eye blink detection etc. Eyes can be extracted from facial images using different eye features such as color, illumination, shape, geometry, edges etc.[10,12,14].Once two eye blocks appear from the segmented image and spread to a certain size; they will be detected by the determination criterion of eye region[3,19]. However, sometimes, it is found that in the detection process the two detected eyes might not be real eyes, furthermore, it is difficult to seek precise centre of eye [31].

C. Eye Blink Detection

As the users eye closes during the process of a blink, its similarity to the open eye template decreases. Likewise, it regains its similarity to the template as the blink ends and the user’s eye becomes fully open again. This decrease and increase in similarity corresponds directly to the correlation scores returned by the template matching procedure [1, 3, 17].

PROBLEM FORMULATION

Need of Proposed Research Work

The main need of eye blink detection system is to help those people who are unable to move any part of their body excepting their eyes. Fully paralysed patients require 24 hour support. But in present day this is not possible for anyone to be available full time. There are some techniques devised for blink detection as well [2]. The detection of blinking and analysis of blink detection will support the people who are not able to move their body parts.

Significance of Proposed Research Work

Significance of this research work is to improve detection time of eye blink as we know that Human face image analysis, detection and recognition have become some of the most important research topics in the field of image processing and pattern classification. The potential applications involve topics such as face detection, face identification and recognition, and facial expression analysis. Among these research topics, one fundamental but very important problem to be solved is automatic eye detection. The eye is the most significant and important feature in a human face, as extraction of the eyes are often easier as compared to other facial features.

METHODOLOGY/PLANNING OF WORK

Two methods are used for eye blink detection in this research. In both the methods different techniques are applied to localize human face and eye and a common method (Template Matching) is used for eye blink detection. Finally, processing time and accuracy of both the methods is compared. Because, processing time is very important parameter when eye blink detection is to be implemented in real-time (in future). Steps we followed in both the methods are as:

Method 1

a) Human Face Localization and Eye Localization using YCbCr Color Model.

b) Template Matching for Blink Detection.

Method 2

a) Haar - like features for Face Localization and Eye Localization.

b) Template Matching for Blink Detection.

YCbCr COLOR MODEL

YCbCr is a family of color spaces used as a part of the color image pipeline in video and digital photography systems. Y is the luma component and CB and CR are the blue-difference and red difference chrome components. Y (with prime) is distinguished from Y which is luminance, meaning that light intensity is non-linearly encoded using gamma correction. YCbCr color model wedge the digital image into two parts, luma or luminance component and chrome or chrominance component. Due to luminance and chrominance effect, it becomes most important in digital video camera to handle video information [9, 23, 25] The YCbCr color space is widely used for digital video. In this paper, luminance information is stored as a single component (Y), and chrominance information is stored as two colordifference components (Cb and Cr). Cb represents the difference between the blue component and a reference value Cr represents the difference between the red component and a reference value.

VIOLA-JONES METHOD

The Viola Jones object detection framework proposed in by Paul Viola and Michael Jones is the first object detection framework to provide competitive object detection rates in real-time .Viola Jones and skin color pixel detection as face detection techniques are widely used.[1,21,27] They proposed a face detection method based on the adaboost learning algorithm using haar features that detected the face successfully with high accuracy.[26] Viola Jones algorithm are:

A. Integral images

B. Haar Features

C. Cascade of boosted classifier

A. Integral images

The simple rectangular features of an image are calculated using an intermediate representation of an image, called the integral image [9]. The integral image is an array containing the sums of the pixels’ intensity values located directly to the left of a pixel and directly above the pixel at location (x, y) inclusive. So if A[x, y] is the original image and AI[x, y] is the integral image.

B. Haar Classifier

The core basis for Haar classifier object detection is the Haar-like features. These features, rather than using the intensity values of a pixel, use the change in contrast values between adjacent rectangular groups of pixels. The contrast variances between the pixel groups are used to determine relative light and dark areas. Two or three adjacent groups with a relative contrast variance form a Haar-like feature. Haar-like features, as shown in Fig.3 are used to detect an image [8, 16, 27]. Haar features can easily be scaled by increasing or decreasing the size of the pixel group being examined. This allows features to be used to detect objects of various sizes.

C. Cascade of boosted classifier

The overall form of the detection process is that of a degenerate decision tree, what we call a “cascade”. A positive result from the first classifier triggers the evaluation of a second classifier which has also been adjusted to achieve very high detection rates. A positive result from the second classifier triggers a third classifier, and so on Stages in the cascade are constructed by training classifiers using AdaBoost and then adjusting the threshold to minimize false negatives as shown in fig.4 has also been adjusted to achieve very high detection rates. A positive result from the second classifier triggers a third classifier, and so on Stages in the cascade are constructed by training classifiers using AdaBoost and then adjusting the threshold to minimize false negatives as shown in fig.4.A series of classifiers are applied to every sub-window. The initial classifier eliminates a large number of negative examples with very little processing. Subsequent layers eliminate additional negatives but require additional computation. After several stages of processing the numbers of sub-windows have been reduced radically.

MORPHOLOGICAL OPERATIONS

Morphological techniques are used for boundary detection. Dilation, followed by erosion and the calculation of differences between the two produces an image with boundaries. The structuring element used in dilation and erosion has a large matrix, so that clear and thick boundaries are detected [5, 25].

TEMPLATE MATCHING

For eye blink detection template matching is used here.In the template matching method, segments of an input image are compared to previously stored images, to evaluate the similarity of the counterpart using correlation values[2, 5].

Diff; Image

D = (Tc-Ts)

s = sum (sum (d));

Tc = current frame

Ts = stored frame

*Threshold is set by empirical method.

Frame differencing is applied to find correlation between current template and stored template. Hence the value of ‘s’ less than threshold value indicates template matching.

Method 1

(a) Human Face and Eye Localization using YCbCr Color Model: In this part following steps are to be followed:

A. I/P frame

B. Convert RGB to YCbCr

C. Morphological operations

D. Face localization

E. Eye pair localization

F. Eye blink detection(Template Matching Method)

A. I/P frame: This is the very first step in which an input frame are to be given to the YCbCr Color Model for the purpose of whole procedure.

B. Conversion RGB to YCbCr: As we know that RGB frame is acquired for the process but we have to convert RGB frame to YCbCr because this transformation is useful in detecting blobs and reduces the computational complexity.

C. Morphological operations

Dilation: The value of the output pixel is the maximum value of all the pixels in the input pixel's neighbourhood. In a binary image, if any of the pixels is set to the value 1, the output pixel is set to 1.

Erosion: The value of the output pixel is the minimum value of all the pixels in the input pixel's neighbourhood. In a binary image, if any of the pixels is set to 0, the output pixel is set to 0.

Structuring Elements: An essential part of the dilation and erosion operations is the structuring element used to probe the input image.

D. Face localization: After applying morphological operation on an input frame at the end face is detected as shown.

E. Eye pair localization: This is the next step i.e. eye pair detection and edge detection is involved in it. Morphological techniques are used for boundary detection. Dilation, followed by erosion and the calculation of differences between the two produces an image with boundaries. Diamond structuring element is used.

Template matching is a technique for finding areas of frame that match (are similar) to a template frame (patch) it has two primary components:

Source frame (I): The frame in which we expect to find a match to the template frame as shown below:

Template frame (T): The patch frame which will be compared to the template frame as shown below:

Method 2

(a) Haar - like features for Face Localization and Eye Localization using Viola Jones Algorithm.

A. I/P frame

B. Face localization

C. Eye pair localization

D. Eye blink detection (Template Matching Method)

Template matching is a technique for finding areas of frame that match (are similar) to a template frame (patch) it has two primary components:

Source frame (I): The frame in which we expect to find a match to the template frame as shown below:

Template frame (T): The patch frame which will be compared to the template frame as shown

RESULTS AND DISCUSSION

Detection Accuracy (DA) =TP/TP+FN…………. (1)

False Alarm Rate (FAR) =FP/TP+FP……………… (2)

Success Rate (%) =DA/DA+FAR…… (3)

TP: Correctly detected blinks

FP: False detection of blinks

FN: Missed blink

Average success rate: 71%

Average Accuracy: 80.75%

Processing Time (Per blink): 0.38 sec.

Average success rate: 55%

Average Accuracy: 60.37%

Processing Time (Per blink): 0.65 sec

CONCLUSION

Two different methods are implemented in this research for Blink Detection, which follows face and eye pair localization in a frame and template matching to check the status of eyes(opened or closed).In Method 1, face and eyes pair localization is performed usingYCbCr skin color model and morphological operations. Then template matching is applied for blink detection. In Method 2, face and eyes localization is performed using Viola Jones Computer-Vision Toolbox and template matching for blink detection. From result analysis, it is observed that, average detection accuracy and processing time for Method 1 and 2 are 80.75%, 60.37% and 0.38 sec., 0.65sec., respectively. Therefore, it can be concluded that the accuracy of Method 1 is better as compared to Method 2 and method one require less processing time too. The fall in accuracy is due to inability of the method to detect face and eye pair area, especially is case of tilted faces. Further, the overall success rate is 71% and 55% for method one and two, respectively. The fall in the success rate is due to large no. of “False Positives” i.e. false detection of blinks. Finally, it can be concluded that, Method 1 is more suitable choice for blink detection for subjects with tilted faces too.

References

Aleksandra Krolak, PawelStrumillo ÃÂ¢Ãâ¬ÃÅEye-Blink Detection System for Human-Computer InteractionÃÂ¢Ãâ¬ÃÂ Institute of Electronics, Technical University of Lodz, Wolczanska.pp.90-924, Lodz, Poland,2012.

A. Yuille, P. Hallinan, D. Cohen, ÃÂ¢Ãâ¬ÃÅFeature extraction from faces using deformable templatesÃÂ¢Ãâ¬ÃÂ, International Journal of Computer Vision, pp. 99-111, 1992.

Bhaskar, T.N., Foo TunKeat ,Ranganath, S., Venkatesh, Y.V., 2003. ÃÂ¢Ãâ¬ÃÅBlink Detection and Eye Tracking for Eye LocalizationÃÂ¢Ãâ¬ÃÂ.Conference on Convergent Technologies for Asia-Pacific Region,10oct 2003.

CihanTopal, ÃÆÃâmerNezihGerek and Atakan ÃÂ¢Ãâ¬ÃÅA Head- Mounted Sensor-Based Eye Tracking Device: Eye Touch SystemÃÂ¢Ãâ¬ÃÂ. Proeedings of the Symposium on Eye tracking research & applications. Savannah, GA, USA, pp. 87-90, 2008.

CC.Hen, H.Y.M.Liao, G.J.Yu, L.H.Chen, ÃÂ¢Ãâ¬ÃÅFast face detection via morphology based pre processing. Pattern RecognitionÃÂ¢Ãâ¬ÃÂ pp.1701-1712, National Science Council of Taiwan 2000.

Cai J, Goshtasby.A. and Yu C, ÃÂ¢Ãâ¬ÃÅDetecting Human Faces in Color ImagesÃÂ¢Ãâ¬ÃÂ Proceedings of International Workshop on Multi-Media Database Management Systems, pp. 124-131,1998.

Constantine KotropoulosIoannis Pitas ÃÂ¢Ãâ¬ÃÅRule-Based Face Detection In Frontal ViewsÃÂ¢Ãâ¬ÃÂ (IEEE) International conference, volume-4,pp. 2537-2540,1997.

Duan-Sheng Chen ,Zheng-Kai Liu, Aug 2011. ÃÂ¢Ãâ¬ÃÅGeneralized Haar-like features for Fast Face DetectionÃÂ¢Ãâ¬ÃÂ. Conference on Machine Learning and Cybernetics, Hong Kong, pp. 2131-2135, 2007.

Hsu, R.L., Abdel-Mottaleb, M. & Jain, A,ÃÂ¢Ãâ¬ÃÂFace detection in color imagesÃÂ¢Ãâ¬ÃÂ, IEEE Transactions on Pattern Analysis and Machine Intelligence , vol. 24, no. 5, pp. 696-706,2002.

Ijaz Khan, Hadi Abdullah and MohdShamian Bin Zainal ÃÂ¢Ãâ¬ÃÅEfficient eyes and mouth detection algorithm use combination of Viola Jones and skin color pixel detectionÃÂ¢Ãâ¬ÃÂ International Journal of Engineering and Applied Sciences. June 2013.

Inho Choi, Seungchul Han, and Daijin Kim, ÃÂ¢Ãâ¬ÃÅEye Detection and Eye Blink Detection using Adaboost learning and groupingÃÂ¢Ãâ¬ÃÂ computer communication and networks (ICCCN), proceeding of 20th international conference .,pp.1-4 July31,2011.

J.Huang ,H.Wechsler, ÃÂ¢Ãâ¬ÃÅVisual routines for eye localization using learning and evolutionÃÂ¢Ãâ¬ÃÂ.IEEETrans.Evol.Comput. 73-82,2000.

K.M. Lam, H. Yan, ÃÂ¢Ãâ¬ÃÅAn analytic-to-holistic approach for face recognition based on a single frontal viewÃÂ¢Ãâ¬ÃÂ, IEEE Trans. Pattern Anal. Mach. Intell. 673- 686 20, 1998.

KoheiAai and Ronny Mardiyanto , ÃÂ¢Ãâ¬ÃÅComparative Study on Blink Detection and Gaze Estimation Methods for HCI, in Particular, Gabor Filter Utilized Blink Detection MethodÃÂ¢Ãâ¬ÃÂ .Proceedings of 18th International Conference on Information Technology: New Generations. Las Vegas, USA, pp. 441-446,2011.

Kwok-Wai Wong, Kin-Man Lam, Wan-Chi Siu ÃÂ¢Ãâ¬ÃÅAn efficient algorithm for human face detection and facial feature extraction under different conditionÃÂ¢Ãâ¬ÃÂ pp.1-12,The Journal of The Pattern Recognition 2004.

Lae-Kyoung Lee, Su-Yong An , Se-Young Oh, Aug 2011. ÃÂ¢Ãâ¬ÃÅEfficient. Face Detection and Tracking with extended camshift and haar-like featuresÃÂ¢Ãâ¬ÃÂ. International Conference on Mechatronics and Automation (ICMA), Pohang, South Korea, pp. 507-513, 2011.

Liting Wang ; Xiaoqing Ding Chi Fang ; Changsong Liu and Kongqiao Wang "Eye blink detection based on eye contour extraction".2009, Proc. SPIE 7245, Image Processing: Algorithms and Systems VII, 72450R ,February 10, 2009.

Michael Chau and MargritBetke, .ÃÂ¢Ãâ¬ÃÅReal Time Eye Tracking and Blink Detection with USB CamerasÃÂ¢Ãâ¬ÃÂ2005. Boston University, USA. May 12, 2005.

Muhammad Shafi and Paul W. H.ChungÃÂ¢Ãâ¬ÃÅA Hybrid Method for Eyes Detection in Facial ImagesÃÂ¢Ãâ¬ÃÂ World Academy of Science, Engineering and Technology.pp.1-6, 2008.

M. Takagi, K. Mohri, M. Katoh and S. Yoshino.ÃÂ¢Ãâ¬ÃÂMagnet-Displacement Sensor Using Magneto-Inductive Elements for Sensing Eyelid MovementÃÂ¢Ãâ¬ÃÂ. IEEE Translation Journal On Magnetics In Japan, Vol. 9, No.2, pp .78-83, 1994.

Paul Viola ÃÂ¢Ãâ¬ÃÅRobust Real-Time Face DetectionÃÂ¢Ãâ¬ÃÂ International Journal of Computer Vision 57(2), 137ÃÂ¢Ãâ¬Ãâ154, 2004.

S Saravanakumar, and N Selvaraju, ÃÂ¢Ãâ¬ÃÅEye Tracking and Blink Detection for Human Computer InterfaceÃÂ¢Ãâ¬ÃÂ, International Journal of Computer Application, VOl. 2, No. 2, pp 7-9, May 2010.

Sanjay Kr. Singh, D. S. Chauhan, MayankVatsa, Richa Singh ÃÂ¢Ãâ¬ÃÅA robust skin color based face detection algorithmÃÂ¢Ãâ¬ÃÂ Tamkang Journal of Science and Engineering, Vol. 6, No. 4, pp. 227-234, 2003.

Takahashi, K., Mitsukura, Y. ÃÂ¢Ãâ¬ÃÅEye blink detection using monocular system and its applicationsÃÂ¢Ãâ¬ÃÂ Grad. Sch. of Sci. & Technol., Keio Univ., Yokohama, Japan. , pp 743 ÃÂ¢Ãâ¬Ãâ 747, 9-13 Sept. 2012

TanmayRajpathak,Ratnesh Kumar and Eric Schwartz ÃÂ¢Ãâ¬ÃÅEye Detection Using Morphological and Color Image ProcessingÃÂ¢Ãâ¬ÃÂFlorida Conference on Recent Advances in Robotics, FCRAR.pp.1-6,2009.

Udayashankar, A. ;Kowshik, A.R. ; Chandramouli, S. ; Prashanth, H.S. ÃÂ¢Ãâ¬ÃÅ Assistance for the Paralyzed Using Eye Blink DetectionÃÂ¢Ãâ¬ÃÂ Fourth International Conference,Digital Home (ICDH), Pp 104 ÃÂ¢Ãâ¬Ãâ 108, 2012.

Viola, P. and Jones, M., ÃÂ¢Ãâ¬ÃÅRapid Object Detection using a Boosted Cascade of Simple FeaturesÃÂ¢Ãâ¬ÃÂ Proc. of the Conf. On Computer Vision and Pattern Recognition (CVPR), Hawaii, USA, Vol. 1, pp. 511-518, December 9-14, 2001.

W. Zhang,ÃÂ¢Ãâ¬ÃÅPrecise eye localization with AdaBoost and fast radial symmetry,ÃÂ¢Ãâ¬ÃÂ Computational Intelligence and Security, pp. 1068-1077, 2006.

YihuYi ,DaokuiQu , Fang Xu ÃÂ¢Ãâ¬ÃÅA Hybrid Algorithm of Eyes Localization in Color Facial RegionÃÂ¢Ãâ¬ÃÂInternational Conference on Artificial Intelligence and Computational Intelligence.pp.1-6,2009.

Yoav Freund and Robert E. Schapire ÃÂ¢Ãâ¬ÃÅA Decision-Theoretic Generalization of On-Line Learning and an Application to BoostingÃÂ¢Ãâ¬ÃÂ journal of computer and system sciences 55, pp.119-139, 1997.

ZhihengNiu, Shiguang Shan, Shengye Yan, Xilin Chen, and Wen Gao, ÃÂ¢Ãâ¬ÃÅ2D Cascaded AdaBoost for Eye LocalizationÃÂ¢Ãâ¬ÃÂ ICPR (International Conference on Pattern Recognitition),pp.1216-1219,(IEEE)2006.