上海大学学报(自然科学版) ›› 2021, Vol. 27 ›› Issue (3): 454-465.doi: 10.12066/j.issn.1007-2861.2247

• 研究论文 • 上一篇    下一篇

一种基于深度图像的左右手同步分割改进方法

徐正则1,2, 张文俊1()   

  1. 1.上海大学 上海电影学院, 上海 200072
    2.华东师范大学 传播学院, 上海 200241
  • 收稿日期:2020-02-03 出版日期:2021-06-30 发布日期:2021-06-27
  • 通讯作者: 张文俊 E-mail:wjzhang@shu.edu.cn
  • 作者简介:张文俊(1959—), 男, 教授, 博士生导师, 博士, 研究方向为数字图像处理、数字多媒体技术等. E-mail: wjzhang@shu.edu.cn

Improved approach to simultaneous left- and right-hand segmentation from a single depth image

XU Zhengze1,2, ZHANG Wenjun1()   

  1. 1. Shanghai Film Academy, Shanghai University, Shanghai 200072, China
    2. School of Communication, East China Normal University, Shanghai 200241, China
  • Received:2020-02-03 Online:2021-06-30 Published:2021-06-27
  • Contact: ZHANG Wenjun E-mail:wjzhang@shu.edu.cn

摘要:

基于深度图像的手势识别技术是下一代数字媒体设备的主要交互手段, 从深度图像中准确定位出"干净"的手部图像显得尤为重要. 提出了一种同步进行左右手分割的改进方法, 在传统 SegNet 算法的基础上, 加入了类别权重、转置卷积、混合式空洞卷积组合和编解码器之间的拼接合并跳层连接, 使左右手的 F2-Score 相较基准方法分别提高了 7.6% 和 5.9%. 推理速度在 GPU 上达到了 20.5 ms/帧, 可以实时处理深度图像序列. 实验证明本方法对深度图像进行左右手同步分割时可以得到更加精准的分割结果.

关键词: 深度图像, 手部分割, 改进方法

Abstract:

Hand gesture recognition technology based on depth image, which relies on the accurate identification of "clean" hand in the captured depth image, is the primary interactive mode for digital media devices of future generation. We propose an improved approach to simultaneous left- and right-hand segmentation, extending the traditional SegNet algorithm by strategies including class weight, transposed convolution, hybrid dilated convolution, and skip-connection between the encoder and decoder performed by concatenation. Our approach achieves higher F2-Score than the existing baseline by 7.6% for the left and 5.9% for the right hand. The processing on the GPU reaches 20.5 ms per frame at inference time, making real-time hand tracking in depth image sequences feasible. The results of the experiment demonstrate that our approach can considerably improve the performance of simultaneous left- and right-hand segmentation from a single depth map.

Key words: depth image, hand segmentation, improved approach

中图分类号: