Hand Gesture Recognition with Skin Detection and Deep
Learning Method
                     Hanwen Huang1, Yanwen Chong2*, Congchong Nie2, Shaoming Pan2
1 College of Remote Sensing and Information Engineering, Wuhan University, 129 Luoyu Road, Wuhan 430079, China
2 State Key Laboratory for Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 129 Luoyu Road, Wuhan 430079, China
*Corresponding author: ywchong@whu.edu.cn
Abstract. Gesture recognition, although it has been studied for many years, remains a challenging problem. Complex backgrounds, camera angles and illumination conditions make the problem more difficult. This paper therefore presents a fast and robust method for hand gesture recognition based on RGB video. First, we detect skin regions based on their color. Then we extract the contours and segment the hand region. Finally, we recognize the gesture. The experimental results demonstrate that the proposed method recognizes gestures efficiently, with higher accuracy than the state of the art.
1. Introduction
There has been great emphasis in Human-Computer-Interaction research on creating easy-to-use interfaces that directly employ the natural communication and manipulation skills of humans [1]. Since the hand is an important part of the body, recognizing hand gestures is essential for Human-Computer-Interaction, and in recent years there has been a tremendous amount of research on the topic [2]. Although numerous studies have focused on hand gesture recognition, several problems remain to be solved. Speed and accuracy are the two main characteristics of such an algorithm, so a robust and fast method is needed to improve the user experience.
    A contour-based method for recognizing hand gestures using depth image data is presented in [3]. Using depth data, such methods can distinguish the hand from the background easily without being confused by background colors; however, depth data are not as common or as easily available as RGB data. In [4], a hierarchical method for static hand gesture recognition that combines finger detection and histogram of oriented gradients (HOG) features is proposed. An algorithm for locating fingertips in hand regions extracted by Bayesian-rule-based skin color segmentation is proposed in [5]. In [6], the continuous gesture recognition problem is tackled with two-stream Recurrent Neural Networks (2S-RNN) on RGB-D input. These methods all achieve good results, but they are not efficient enough. This paper provides a more efficient approach based on deep learning.
2. Proposed gesture recognition system
The proposed gesture recognition system (shown in figure 1) is composed of three main parts. In the first part, we capture an RGB video of the user; after threshold segmentation, the video becomes a binary video. Meanwhile, some preprocessing, such as dilation and erosion, is needed. In the second part, we extract
all the contours and identify the hand contour based on several criteria. Finally, the gesture is recognized using the pyramidal pooling module and attention mechanism.
                       Figure 1. Gesture recognition system based on RGB video.
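To make the flow of figure 1 concrete, the following sketch wires the three stages together for an RGB video stream. It is only an illustrative skeleton: the stage functions are passed in as callables (minimal versions of each are sketched in the corresponding sections below), and none of the names here come from the original implementation.

```python
import cv2


def run_pipeline(video_path, detect_skin, preprocess, extract_hand, classify_gesture):
    """Apply the three stages of figure 1 to every frame of an RGB video.

    The four stage functions are supplied by the caller; this wrapper only
    expresses the ordering described in section 2."""
    cap = cv2.VideoCapture(video_path)
    predictions = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        mask = detect_skin(frame)             # stage 1: skin color thresholding
        mask = preprocess(mask)               # stage 1: dilation / erosion cleanup
        hand = extract_hand(frame, mask)      # stage 2: contour-based hand segmentation
        if hand is not None:
            predictions.append(classify_gesture(hand))  # stage 3: CNN-based recognition
    cap.release()
    return predictions
```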
3. Skin Color Detection
In order to detect skin in the video, we need to identify the characteristics of skin color. According to [7], detecting skin-colored pixels, although it seems a straightforward task, has proven quite challenging in images captured under complex, unconstrained imaging conditions. We therefore developed a criterion based on the color features shared by most human skin. A pixel is classified as skin if all of the following hold:
                                     R > 85,    R − B > 10,    R − G > 10
    Based on this criterion, we can efficiently segment skin regions, which can be regarded as human body parts, from the background, and convert the image into a binary image. The results are shown in figure 2.
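As an illustration, the criterion above can be applied per pixel with vectorized NumPy operations. The sketch below is our own minimal interpretation, operating on an OpenCV BGR frame; the function name is a placeholder.

```python
import numpy as np


def detect_skin(frame_bgr):
    """Return a binary mask (255 = skin, 0 = background) using the rule-based criterion."""
    frame = frame_bgr.astype(np.int16)        # signed type so channel differences can be negative
    b, g, r = frame[..., 0], frame[..., 1], frame[..., 2]
    skin = (r > 85) & ((r - b) > 10) & ((r - g) > 10)
    return skin.astype(np.uint8) * 255
```

For example, `mask = detect_skin(cv2.imread("frame.png"))` yields the kind of binary image shown in figure 2.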
                                          Figure 2. Binary image.
4. Preprocessing
Due to the nature of the image, the hand region may contain holes and cracks, which degrade the accuracy of hand gesture recognition. The binary image is usually noisy, so preprocessing that fills these holes is necessary. According to [8], there are two main families of digital image restoration methods: texture-based methods and non-texture-based methods. In [8], an exemplar-based image completion method driven by an evolutionary algorithm is proposed, together with an improved total variation algorithm (a typical non-texture-based method) in which the diffusion coefficients are defined according to the distance and direction between a damaged pixel and its neighborhood pixels. Other inpainting methods [9] also achieve good results. However, such methods are generally too sophisticated and time-consuming. We therefore employ simple morphological operations, namely dilation and erosion, to meet our requirements. In practice, we dilate twice and then erode twice.
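A minimal sketch of this cleanup step with OpenCV is shown below. The dilate-twice-then-erode-twice order follows the description above; the 3×3 structuring element is our own assumption, since the paper does not state a kernel size.

```python
import cv2
import numpy as np


def preprocess(mask, kernel_size=3):
    """Fill small holes and cracks in the binary skin mask.

    Dilation followed by erosion (a closing-like operation) removes holes
    without noticeably changing the overall hand shape."""
    kernel = np.ones((kernel_size, kernel_size), np.uint8)  # assumed 3x3 structuring element
    mask = cv2.dilate(mask, kernel, iterations=2)
    mask = cv2.erode(mask, kernel, iterations=2)
    return mask
```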
5. Contour Extraction and Hand Region Segmentation
After removing the noise in the image, we extract contours, treating each connected point cluster as one contour. Among these contours, only one represents the hand region; moreover, the hand region and the face region are the two largest contours. Based on this fact, finding the hand among the contours reduces to separating hands from faces. We therefore collected 100 samples each of face regions and hand regions and used the VGGNet of [10] to classify them. VGGNet is a deep convolutional neural network developed by the Visual Geometry Group at Oxford together with researchers at Google DeepMind. It explores the relationship between the depth and the performance of convolutional neural networks: by repeatedly stacking 3×3 convolution kernels and 2×2 max-pooling layers, VGGNet builds deep networks of 16 to 19 layers (counting convolutional and fully connected layers). In this paper, we use the 16-layer VGGNet.
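The sketch below illustrates this stage: the two largest contours are found with OpenCV, and each candidate crop is classified as hand or face by a VGG-16 with a two-class head. The framework choice (PyTorch/torchvision), the label mapping, and the input preprocessing are our own assumptions; the paper only states that a 16-layer VGGNet is used.

```python
import cv2
import torch
import torchvision.transforms as T
from torchvision.models import vgg16

# Two-class VGG-16 head (hand vs. face); weights would come from fine-tuning
# on the 100 + 100 collected samples described above.
model = vgg16(weights=None)
model.classifier[6] = torch.nn.Linear(4096, 2)
model.eval()

to_tensor = T.Compose([T.ToPILImage(), T.Resize((224, 224)), T.ToTensor()])


def extract_hand(frame_bgr, mask):
    """Return the cropped region of the largest contour that the CNN labels as a hand."""
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    # Keep the two largest contours: in this setting they are the hand and the face.
    contours = sorted(contours, key=cv2.contourArea, reverse=True)[:2]
    for c in contours:
        x, y, w, h = cv2.boundingRect(c)
        crop = frame_bgr[y:y + h, x:x + w]
        crop_rgb = cv2.cvtColor(crop, cv2.COLOR_BGR2RGB)
        with torch.no_grad():
            logits = model(to_tensor(crop_rgb).unsqueeze(0))
        if logits.argmax(1).item() == 0:   # assumed convention: label 0 = hand
            return crop
    return None
```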
6. Gesture Recognition
In this paper, a pyramidal pooling module and an attention mechanism are used to increase the receptive field and classify details more efficiently. As can be seen from figure 3, the original input image passes through a 3 × 3 convolution layer and a max-pooling layer to obtain a feature map at 1/2 the original resolution. Spatial pyramid pooling at four different scales then produces feature maps at 1/4, 1/8, 1/16 and 1/32 of the original resolution, so that features at different scales are captured. Global average pooling is then used to obtain channel-wise weights over the global abstract features for the lower levels. Finally, the probability score of each class is obtained with a fully connected layer and a softmax. Compared with a stacked convolution-and-pooling structure, the proposed structure not only obtains feature maps with different receptive fields quickly, but also pools the high-resolution feature maps globally and then reduces the channel dimension with a 1 × 1 convolution, which can effectively serve as weights for the adjacent low-resolution feature channels and guide the formation of abstract features.
Extensive experiments on our gesture data show that this structure accelerates the convergence of the network and improves recognition accuracy.
                                   Figure 3. Training network structure.
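The module below is our own minimal reading of the structure in figure 3, not the authors' exact network: a stem that halves the resolution, pyramid branches at 1/4 to 1/32 of the input size, channel weights obtained by global average pooling and a 1 × 1 convolution applied to the finer branch, and a fully connected classifier. All channel counts and layer sizes are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class PyramidAttentionNet(nn.Module):
    """Sketch of a pyramidal pooling module with channel attention (assumed sizes)."""

    def __init__(self, num_classes=9, channels=64):
        super().__init__()
        self.stem = nn.Sequential(                      # input -> 1/2 resolution
            nn.Conv2d(3, channels, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
        )
        self.pyramid = nn.ModuleList([                  # 1/4, 1/8, 1/16, 1/32 of the input
            nn.Sequential(
                nn.MaxPool2d(2 ** k),
                nn.Conv2d(channels, channels, 3, padding=1),
                nn.ReLU(inplace=True),
            )
            for k in range(1, 5)
        ])
        self.reduce = nn.Conv2d(channels, channels, 1)  # 1x1 conv on globally pooled features
        self.fc = nn.Linear(channels, num_classes)

    def forward(self, x):
        x = self.stem(x)                                # B x C x H/2 x W/2
        feats = [branch(x) for branch in self.pyramid]  # multi-scale feature maps
        # Global average pooling of the coarsest map gives channel-wise weights
        # that modulate the finest (highest-resolution) pyramid level.
        w = torch.sigmoid(self.reduce(F.adaptive_avg_pool2d(feats[-1], 1)))
        attended = feats[0] * w
        pooled = F.adaptive_avg_pool2d(attended, 1).flatten(1)
        return F.log_softmax(self.fc(pooled), dim=1)


# Example: scores = PyramidAttentionNet()(torch.randn(1, 3, 224, 224))
```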
7. Results
To validate the proposed method, we conducted two experiments. The first experiment is hand region segmentation. We use 100 hand region samples with different gestures and 100 face region samples; 70 samples are taken as test data and 30 as validation data. The results are shown in figures 4 and 5. The segmentation accuracy is 98.48%, which is sufficient for practical use.
       Figure 4. Hand region segmentation accuracy.           Figure 5. Hand region segmentation loss.
    The second experiment is gesture recognition. To detect 9 common gestures (shown in figure 6), a total of 900 test samples from 5 people were tested. Each gesture has 70 test samples and 30 validation samples. The recognition results are shown in figures 7 and 8.
                                       Figure 6. 9 Common gestures.
     Figure 7. Gesture recognition accuracy.           Figure 8. Gesture recognition loss.
    As these figures show, the proposed method achieves high accuracy. The overall accuracy reaches 98.41%, which demonstrates the effectiveness of the method. Furthermore, we also report the processing speed: the method runs at 10 fps, which enables real-time gesture recognition. Compared with other recent research, this method achieves higher accuracy. According to [11], at low dimensionality the recognition rate of the MAOMP algorithm remains above 85%, the recognition rates of SAMP, SP and SWOMP stay at 80-85%, and those of ROMP and OMP are below 80%. In [12], a deep learning-based fast hand gesture recognition method using representative frames achieves an accuracy of 95.18%. In [13], appearance features and 3D point clouds are used to recognize 9 gestures with an overall accuracy of 94.7%. These results show the accuracy and efficiency of the proposed method.
                           Table 1. Accuracy comparison of different methods.
       Method                         Accuracy
       MAOMP [11]                     >85%
       SAMP, SP, SWOMP [11]           80%-85%
       ROMP, OMP [11]                 <80%
       Method in [12]                 95.18%
       Method in [13]                 94.7%
       Method in this paper           98.41%
8. Conclusion
Gestures play an indispensable role in Human-Computer-Interaction. This paper presents an accurate and efficient method for gesture recognition. First, skin is detected using empirical color rules and the image is converted into a binary image. Then dilation and erosion are applied. After that, all contours are extracted and the hand contour is identified. Finally, the gesture is recognized using the pyramidal pooling module and attention mechanism.
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant Nos. 61572372,
41671382), State Key Laboratory for Information Engineering in Surveying, Mapping and Remote
Sensing Special Research Funding.
References
[1] Ren Z, Meng J, Yuan J. Depth camera based hand gesture recognition and its application in Human-Computer-Interaction[C]// Communications and Signal Processing. IEEE, 2011: 1-5.
[2] Kang S K, Mi Y N, Rhee P K. Color Based Hand and Finger Detection Technology for User Interaction[C]// International Conference on Convergence and Hybrid Information Technology. IEEE, 2008: 229-236.
[3] Le T N, Cong D T, Ba T N, et al. Contour Based Hand Gesture Recognition Using Depth Data[C]// International Conference on Signal Processing, Image Processing and Pattern Recognition. 2013: 60-65.
[4] Liu S, Liu Y, Yu J, et al. Hierarchical static hand gesture recognition by combining finger detection
        and HOG features[J]. Journal of Image & Graphics, 2015.
[5] Bhuyan M K, Neog D R, Kar M K. Fingertip Detection for Hand Pose Recognition[J]. International
        Journal on Computer Science & Engineering, 2012, 4(3).
[6] Chai X, Liu Z, Yin F, et al. Two streams Recurrent Neural Networks for Large-Scale Continuous
        Gesture Recognition[C]// International Conference on Pattern Recognition. IEEE, 2017:31-36.
[7] Chen W, Wang K, Jiang H, et al. Skin color modeling for face detection and segmentation: a review and a new approach[J]. Multimedia Tools & Applications, 2016, 75(2): 1-24.
[8] Li K, Wei Y, Yang Z, et al. Image inpainting algorithm based on TV model and evolutionary
        algorithm[J]. Soft Computing - A Fusion of Foundations, Methodologies and Applications, 2016,
        20(3):885-893.
[9] Alexandru Telea. An Image Inpainting Technique Based on the Fast Marching Method[J]. Journal
        of Graphics Tools, 2004, 9(1):23-34.
[10] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[C]// International Conference on Learning Representations. 2015.
[11] Li B, Sun Y, Li G, et al. Gesture recognition based on modified adaptive orthogonal matching
        pursuit algorithm[J]. Cluster Computing, 2017(3):1-10.
[12] John V, Boyali A, Mita S, et al. Deep Learning-Based Fast Hand Gesture Recognition Using
        Representative Frames[C]// International Conference on Digital Image Computing: Techniques
        and Applications. IEEE, 2016.
[13] Chong Y, Huang J, Pan S. Hand Gesture Recognition Using Appearance Features Based on 3D Point Cloud[J]. Journal of Software Engineering and Applications, 2016, 9: 103-111. doi:10.4236/jsea.2016.94009.