Professional Documents
Culture Documents
AbstractThis paper presents an accurate method of uses a Haar based cascade classifier based method for eye
drowsiness detection for the images obtained using low resolution tracking, Histogram of Oriented Gradients (HOG) and Support
consumer grade web cameras under normal lighting conditions. Vector machines (SVM) for blink detection. After blink
The drowsiness detection method uses Haar based cascade detection the PERCLOS value is calculated. If the PERCLOS
classifier for eye tracking and combination of Histogram of
value is greater than 6000ms then the person is said to be
oriented gradient (HOG) features combined with Support Vector
Machine (SVM) classifier for blink detection. Once the eye blinks drowsy.
are detected then the PERCLOS is calculated from it. If the The presented system is very much user friendly, non
PERCLOS value is greater than 6 seconds then the person is said intrusive and doesnt require any specialized hardware. It
to be drowsy. The presented system was validated by comparing performs well in uncontrolled lighting conditions under normal
the prediction of the system with that of a human rater. The resolutions of an USB web camera. The prototype of the
system matched with the human observer with 91.6 % accuracy. system was developed using MATLAB 2014a. The reliability
of the system was verified by comparing its performance with
Index Terms Drowsiness detection, Eye tracking, blink
the judgments of a human rater. The system has shown a
detection, PERCLOS, Haar based cascade classifier, SVM, HOG
feature
91.6% match with the judgments of that of a human rater.
181
1) Haar features: The face detection method uses Haar process is illustrated in detail in section III during the
features for face detection. Haar features are extracted by using discussion of implementation.
a set of rectangular black and white windows. The black color
has a weight of -1 and the white region has weight 0. The
windows are first applied to the image and corresponding
values are multiplied with the pixel intensities. Then these
values are added together, and the Haar feature corresponding
to the window used is obtained. But all the Haar features
extracted are not required for successful detection of faces.
Hence a boosting algorithm [14] is used to find the most
important features that can be used for face detection.
2) Cascade classifiers: Once the Haar features are
obtained then individual classifiers are built based on the
values of each Haar feature. These individual classifiers are
then arranged into a cascade classifier. A cascaded classifier is Fig 2: Geometric ratios of human face
combination of several classifiers arranged in the different
stages cascaded on after one another. The number of classifiers
Once the eye regions are obtained then the next step is to
in each stage and their threshold values are determined by the
detection of eyes in those regions. The left and right eye
boosting algorithm during the training of the classifiers with
regions are processed separately. Eye detection is performed
labeled face images. The cascade classifier used here has 22
using Haar based cascade classifier described in section II.B.
stages and a total of 2135 features.
Here a cascade classifier trained with eye images is used. The
3) Face detection: The general structure of the cascade classifier classifies the eye region into portions that contains
classifier is given in Fig 1. A certain number of features are eye and those portions that do not contain an eye. Then the
checked at each stage of the cascade classifier. When a region portions containing an eye are extracted and thus the eyes are
of the image is taken it is first checked using the first stage of detected. These eye images are then sent to the next stage for
the cascade classifier. If the image region fails to pass the first further processing.
stage it will be rejected as it does not contain a face else it will
But this detection method fails when the eyes are closed. In
be passed to the next stage. If the face region satisfies all the
such situations the eyes images are cut out from the positions
stages then the region is classified to have a face in it. The
of eyes in the extracted eye region from the preceding frame in
advantage of using a cascade classifier is that it can reduce
the video sequence.
computational load as each stage need only a certain set of the
features to be extracted rather than the entire set of 2135
D. Blink detection:
features extracted together.
Once the eyes are detected next stage is the blink
detection. The presented system uses a Histograms of Oriented
Gradients (HOG) as features and Support Vector Machines as
binary classifiers for eye blink detection from the eye images.
182
feature values. Then these feature values of all the cells inside problem and it is solved using Lagrange multipliers i (i =
each block are concatenated to obtain the final feature 1,...l). In case the data is not linearly separable then it is first
descriptor. The Fig 3 shows the HOG features extracted from mapped into another feature space where it is linearly separable
the open and closed eye images. A more detailed description of using Kernel function. Then the equation becomes:
the HOG features can be seen in [16]. n
f ( h) = y V (h , h(i)) + b
i =1
i i i
E. Measurement of PERCLOS
F. Drowsiness Detection
Once the PERCLOS value of the user is calculated next
step is to detect whether the person in a drowsy state or not.
Basic principle is that the eyes are closed for longer durations
when the person is drowsy than when the person is in active
state. Hence if the value of PERCLOS exceeds above a
particular threshold then the person can be said to be in a
Fig 3: HOG features extracted from the open and closed eyes drowsy state else the person can be said to be active. According
to [19] the average blink duration of a human being is 100-400
ms and the number of blinks per second is 10-15 from [20].
2) SVM classifier: Support Vector Machines were initially From these values the time interval for which human eyes will
developed by Vapnik and his team. It was later improved by be closed in 1 minute would be = 400 x 15 = 6000ms for a
other researchers. It is a statistical learning model used normal person when he is in the active state. So the threshold
commonly for classification problems. A detailed description value for PERCLOS is estimated to be 6.00 seconds. If the
of the SVM is given in [17]. PERCLOS duration exceeds this limit then the person is
Let the data points of the eye images be represented by: estimated to be in a drowsy state else he is said to be active.
(h1,y1), (h2,y2)..... (hn,yn) were hi represent the HOG feature
vector representing the nth eye image and yn represent the class
of the nth eye image. y can have two values 0 or a 1. 0 III. RESULTS & DISCUSSIONS
represents the closed eye image and 1 represents the open eye A. Implementation
image. The basic idea of the HOG features is to find a hyper In the presented system the video frames are captured using
plane with the maximum margin that separates the two classes. an ordinary CMOS web camera which has a resolution of
In case of linearly separable data the hyper plane in terms of 640480 pixels and has a frame rate of 5ps. Since the captured
support vectors is given by: video is as an array of video frames (images), each of these
n image frames are separated and processed individually. The
f ( h) = y h (.)h(i) + b
i i i Fig 4 shows a single video frame extracted from the video
i =1 captured by the web camera.
were yi denotes the class of the data point hi and h(i) represents
the support vector machines. This is a Lagrange optimization
183
(b)
Then the face is detected from the image using Haar based (c)
cascade classifier described in section 2.2. The classifier After that just like the face detection is performed by the
classifies the frames into portions that contain a face and into Haar based cascade classifiers, these are used for detection of
that does not contain a face. The portion containing a face is eyes from the eye region as explained in section II.D. The Fig 7
extracted and thus the faces are detected. The Fig 5 shows the shows the left and right eyes extracted from the two eye
face detection algorithm working on the frame captured by the regions. These eye images are passed on to the next stage for
web camera. further classification.
B. Validation
For validating the efficiency of the method, twelve test
videos of different durations and frame rates captured under
(a) normal lighting conditions of an ordinary room was used. The
Fig 8 shows the sample frames from the test videos used.
184
Table 1: Comparison of predictions made by the developed
and system and human rater
REFERENCES
[1] Sandberg, David, et al. The characteristics of sleepiness during
real driving at nighta study of driving performance,
physiology and subjective experience, Sleep 34.no.10, pp.1317
2011.
[2] Lin, C. T., Chang, C. J., Lin, B. S., Hung, S. H., Chao, C. F., &
Wang, I. J. A real-time wireless braincomputer interface
185
system for drowsiness detection, In IEEE Transactions on [18] Wierwille, W. W. Historical perspective on slow eyelid
Biomedical Circuits and Systems, 4(4), 214-222, 2010. closure: Whence PERCLOS, Ocular Measures of Driver
[3] Lin, C. T., Chen, Y. C., Huang, T. Y., Chiu, T. T., Ko, L. W., Alertness, Technical Conference Proceedings. 1999.
Liang, S. F., ... & Duann, J. R. Development of wireless brain [19] Schiffman, H.R., Sensation and Perception. An Integrated
computer interface with embedded multitask scheduling and its Approach, New York: John Wiley and Sons, Inc., 2001
application on real-time driver's drowsiness detection and [20] https://www.ucl.ac.uk/media/library/blinking
warning, In IEEE Transactions on Biomedical Engineering,
[21] Murray W Johns, What is excessive day time sleeping?, 2009.
55(5), 1582-1591, 2008.
[22] Trutschel, Udo, et al. PERCLOS: An alertness measure of the
[4] Picot, Antoine, Sylvie Charbonnier, and Alice Caplier. "On-line
past In Proceedings of the Sixth International Driving
automatic detection of driver drowsiness using a single
Symposium on Human Factors in Driver Assessment, Training
electroencephalographic channel, In proceedings of 30th
and Vehicle Design. 2011.
Annual International Conference of the Engineering in Medicine
and Biology Society, 2008. EMBS, IEEE, 2008.
[5] Pal, Nikhil R., et al. EEG-based subject-and session-
independent drowsiness detection: an unsupervised approach,
EURASIP Journal on Advances in Signal Processing, 2008.
[6] Lin, Chin-Teng, et al. EEG-based drowsiness estimation for
safety driving using independent component analysis, Circuits
and Systems I: Regular Papers, IEEE Transactions on 52.12, pp
2726-2738, 2005.
[7] Sahayadhas, Arun, Kenneth Sundaraj, and Murugappan
Murugappan. "Drowsiness detection during different times of
day using multiple features." Australasian Physical &
Engineering Sciences in Medicine 36.2, pp.243-250, 2013.
[8] Kurian, D., Johnson Joseph, P. L., Radhakrishnan, K., &
Balakrishnan, A. Drowsiness Detection using
Photoplethysmography Signal. In Fourth International
Conference on Advances in Computing and Communications
(ICACC), pp. 73-76, IEEE, 2014.
[9] Patel, S. P., Patel, B. P., Sharma, M., Shukla, N., & Patel, H. M.
Detection of Drowsiness and Fatigue level of Driver, In
International Journal for Innovative Research in Science and
Technology, 1(11), 133-138, 2015.
[10] Jo, J., Lee, S. J., Jung, H. G., Park, K. R., & Kim, J. Vision-
based method for detecting driver drowsiness and distraction in
driver monitoring system, Optical Engineering, 50(12), 2011.
[11] Dasgupta, A., George, A., Happy, S. L., & Routray, A. A
vision-based system for monitoring the loss of attention in
automotive drivers, In IEEE Transactions on Intelligent
Transportation Systems, 14(4), 1825-1838, 2013.
[12] You, Chuang-Wen, et al. CarSafe: a driver safety app that
detects dangerous driving behavior using dual-cameras on
smartphones, In Proceedings of the 2012 ACM Conference on
Ubiquitous Computing, ACM, 2012.
[13] Viola, Paul, and Michael Jones. Rapid object detection using a
boosted cascade of simple features, In Proceedings of the 2001
IEEE Computer Society Conference on Computer Vision and
Pattern Recognition (CVPR), Vol. 1. IEEE, 2001.
[14] Freund, Y., Schapire, R.E.: A short introduction to boosting
J. Jap. Soci. Artif. Intell. 14(5), 771780, 1999.
[15] Oguz, . "The proportion of the face in younger adults using the
thumb rule of Leonardo da Vinci." Surgical and Radiologic
Anatomy 18.2, pp.111-114, 1996.
[16] Dalal, Navneet, and Bill Triggs. Histograms of oriented
gradients for human detection In proceedings of IEEE
Computer Society Conference on Computer Vision and Pattern
Recognition (CVPR) 2005, vol. 1, pp. 886-893. IEEE, 2005.
[17] Cristianini, Nello, and John Shawe-Taylor. An introduction to
support vector machines and other kernel-based learning
methods. Cambridge university press, 2000.
186