Journal of Shanghai University(Natural Science Edition) ›› 2018, Vol. 24 ›› Issue (5): 675-685.doi: 10.12066/j.issn.1007-2861.2073

• Digital Film and Television Technology •     Next Articles

Hand segmentation from a single depth image based on histogram threshold selection and shallow CNN

XU Zhengze1,2, ZHANG Wenjun1()   

  1. 1. Shanghai Film Academy, Shanghai University, Shanghai 200072, China
    2. School of Communication, East China Normal University, Shanghai 200241, China
  • Received:2018-06-15 Online:2018-10-30 Published:2018-10-26
  • Contact: ZHANG Wenjun E-mail:wjzhang@mail.shu.edu.cn

Abstract:

Real-time hand gesture recognition technology significantly improves the user's experience for virtual reality/augmented reality (VR/AR) applications, which relies on the identification of the orientation of the hand in captured images or videos. A new three-stage pipeline approach for fast and accurate hand segmentation for the hand from a single depth image is proposed. Firstly, a depth frame is segmented into several regions by histogram-based threshold selection algorithm and by tracing the exterior boundaries of objects after thresholding. Secondly, each segmentation proposal is evaluated by a three-layers shallow convolutional neural network (CNN) to determine whether or not the boundary is associated with the hand. Finally, all hand components are merged as the hand segmentation result. Compared with algorithms based on random decision forest (RDF), the experimental results demonstrate that the approach achieves better performance with high-accuracy (88.34% mean intersection over union, mIoU) and a shorter processing time ($\le $8 ms).

Key words: hand segmentation, histogram threshold selection, convolutional neural network (CNN), depth map

CLC Number: