Journal of Shanghai University (Natural Science Edition), 2022, Vol. 28, Issue (2): 261-269. doi: 10.12066/j.issn.1007-2861.2287

• Research Articles •

Image matting based on deep learning

WANG Rongrong1,2, XU Shugong2, HUANG Jianbo1,3

    1. Shanghai Film Academy, Shanghai University, Shanghai 200072, China
    2. Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai 200444, China
    3. Shanghai Engineering Research Center of Motion Picture Special Effects, Shanghai University, Shanghai 200072, China
  • Received: 2020-03-13 Online: 2022-04-30 Published: 2022-04-28
  • Contact: HUANG Jianbo E-mail: huangjianbo110@shu.edu.cn

Abstract:

Image matting underpins image editing technology, which is widely used in film and television post-production and in daily life. This study proposes a deep-learning image matting network that takes the original image and a trimap as input and estimates the alpha (opacity) value of each pixel. The network builds on the original down- and up-sampling architecture. To address the slow convergence caused by the large differences among images in matting datasets, batch normalisation (BN) is applied after each convolution layer. The normalisation layer normalises its input data, which speeds up convergence of the model and makes the update direction of the parameters more consistent with the overall characteristics of the dataset. Because object edges must be handled carefully in the matting task, the standard convolution layers are replaced with deformable convolution layers. A deformable convolution layer adaptively learns the shape of its convolution kernel from the input data, which effectively enlarges the receptive field and improves prediction in finely detailed image regions.
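
The two ideas named in the abstract, batch normalisation after a convolution and deformable convolution, can be illustrated with a minimal NumPy sketch. It is not the authors' network. The function names (`batch_norm`, `bilinear`, `deformable_conv3x3_at`), the 3x3 kernel size, and the choice to pass the offsets in as an argument are assumptions made for illustration; in a deformable convolution layer the offsets are normally predicted from the input by an additional convolution, which is omitted here.

```python
import numpy as np


def batch_norm(x, gamma, beta, eps=1e-5):
    """Normalise an (N, C, H, W) batch per channel, then scale and shift.

    This is the training-time form that uses the statistics of the current
    batch; running averages for inference are left out of this sketch.
    """
    mean = x.mean(axis=(0, 2, 3), keepdims=True)
    var = x.var(axis=(0, 2, 3), keepdims=True)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma.reshape(1, -1, 1, 1) * x_hat + beta.reshape(1, -1, 1, 1)


def bilinear(img, y, x):
    """Sample a 2-D array at a fractional (y, x); points outside count as zero."""
    h, w = img.shape
    y0, x0 = int(np.floor(y)), int(np.floor(x))
    value = 0.0
    for yy, xx in ((y0, x0), (y0, x0 + 1), (y0 + 1, x0), (y0 + 1, x0 + 1)):
        if 0 <= yy < h and 0 <= xx < w:
            weight = (1 - abs(y - yy)) * (1 - abs(x - xx))
            value += weight * img[yy, xx]
    return value


def deformable_conv3x3_at(img, kernel, offsets, cy, cx):
    """Response of a 3x3 deformable kernel centred at (cy, cx).

    offsets has shape (3, 3, 2) and holds a (dy, dx) shift for each kernel
    tap. With all offsets at zero this reduces to an ordinary 3x3
    convolution (cross-correlation) at that position.
    """
    out = 0.0
    for i in range(3):
        for j in range(3):
            dy, dx = offsets[i, j]
            out += kernel[i, j] * bilinear(img, cy + i - 1 + dy, cx + j - 1 + dx)
    return out
```

Because each tap is read through bilinear interpolation, the offsets can take fractional values and the sampling grid can bend towards an object edge instead of staying on the fixed 3x3 lattice, which is how the receptive field is enlarged.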

Key words: deep learning, image matting, semantic segmentation, prediction
