基于深度学习的图像抠图技术

doi:10.12066/j.issn.1007-2861.2287

Abstract

Abstract:

Image editing technology, which is widely used in the post-production of film and television and in daily life, is based on image matting. In this study, an image matting network based on deep learning which estimates the value of each pixel by inputting the original image and trimap is proposed. Based on the original down- and up-sampling network and to address the problem of slow network convergence caused by the large difference between matting dataset pictures, batch normalisation (BN) is applied after each convolution layer in this study. In the normalisation layer, the input data are normalised to speed up the convergence of the model. This enables the update direction of the parameters to be more consistent with the overall characteristics of the dataset. Because the edge of the object should be carefully considered in the matting task, a deformable convolution layer is used instead of the custom convolution layer. The deformable convolution layer can adaptively learn the shape of the convolution kernel according to different input data, effectively expand the range of the receptive field, and improve the prediction effect in detailed image parts.

Key words: deep learning, image matting, semantic segmentation, prediction

CLC Number:

TP391.4

WANG Rongrong, XU Shugong, HUANG Jianbo. Image matting based on deep learning[J]. Journal of Shanghai University（Natural Science Edition）, 2022, 28(2): 261-269.

Figures/Tables 7

Fig. 1

Fig. 2

Fig. 3

Fig. 4

Table 1

Fig. 5

Table 2

References 13

[1]	孙国星. 全自动抠图技术的研究[D]. 济南: 山东师范大学, 2017.
[2]	栗大智, 孙冰心, 朱少强, 等. 基于 image matting 的旅游相片处理[J]. 福建电脑, 2017, 33(5): 129, 153.
[3]	Russakovsky O, Deng J, Su H, et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252. doi: 10.1007/s11263-015-0816-y
[4]	Porter T K, Duff T D. Compositing digital images[J]. ACM Siggraph Computer Graphics, 1984, 18(3): 253-259. doi: 10.1145/964965.808606
[5]	Wang J, Michael F. Cohen. Optimized Color Sampling for Robust Matting[C]// 2007 IEEE Conference on Computer Vision and Pattern Recognition. 2007.
[6]	Chen Q F, Ding Z Y, Tang C K. KNN matting[C]// Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2012.
[7]	Levin A, Lischinski D, Weiss Y. A closed form solution to natural image matting[C]// IEEE Computer Society. 2006: 61-68.
[8]	Zheng Y J, Kambhamettu C. Learning based digital matting[C]// IEEE 12th International Conference on Computer Vision. 2009.
[9]	Cho D, Tai Y W, Kweon I. Natural image matting using deep convolutional neural networks[C]// European Conference on Computer Vision. 2016.
[10]	Ning X, Brian P, Scott C, et al. Deep image matting[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 2970-2979.
[11]	Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift[C]// International Conference on Machine Learning. 2015.
[12]	Dai J F, Qi H Z, Xiong Y W, et al. Deformable convolutional networks[C]// Proceedings of the IEEE International Conference on Computer Version. 2017: 764-773.
[13]	Rhemann C, Rother C, Wang J, et al. A perceptually motivated online benchmark for image matting[C]// IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009). 2009.

[1]	LI Chengfan, ZHAO Junjuan. Improved approach to detect small sample target based on remote sensing image [J]. Journal of Shanghai University（Natural Science Edition）, 2022, 28(2): 314-323.
[2]	XU Genglin, RAN Feng, DENG Liang, SHI Huakang, GUO Aiying. Application of lightweight neural network and Hash tracking algorithm in embedded face capture system [J]. Journal of Shanghai University（Natural Science Edition）, 2021, 27(6): 1018-1028.
[3]	XING Yixue, ZHU Yonghua, GAO Haiyan, ZHOU Jin, ZHANG Ke. Distant supervision for relation extraction via attention CNNs [J]. Journal of Shanghai University（Natural Science Edition）, 2021, 27(5): 983-992.
[4]	SHAO Huixiang, ZENG Dan. Classification and recognition of underwater small targets based on improved YOLOv3 algorithm [J]. Journal of Shanghai University（Natural Science Edition）, 2021, 27(3): 481-491.
[5]	CHEN Yu, DING Youdong, YU Bing, XU Min. Video colourisation based on voxel flow [J]. Journal of Shanghai University（Natural Science Edition）, 2021, 27(1): 18-27.
[6]	ZHAO Huanli, HE Youhua. Bayesian inference for semiparametric ordinal regression [J]. Journal of Shanghai University（Natural Science Edition）, 2021, 27(1): 218-226.
[7]	SHI Yunyang, MIAO Yang, XI Yinfei, ZHANG Qi, LIU Zhiyuan. Dynamic path planning method considering traffic environment and battery capacity of pure electric vehicles [J]. Journal of Shanghai University（Natural Science Edition）, 2020, 26(3): 353-366.
[8]	Fang YU, Ping AN, Xule YAN. Video coding for 3D-HEVC based on saliency information and view synthesis prediction [J]. Journal of Shanghai University（Natural Science Edition）, 2019, 25(5): 679-691.
[9]	Jiacheng HU, Xiangyang WANG, Han LIU. Defect detection of continuous casting slabs based on deep learning [J]. Journal of Shanghai University（Natural Science Edition）, 2019, 25(4): 445-452.
[10]	SU Yaling, HE Youhua. Bayesian estimation for nonparametric regression [J]. Journal of Shanghai University（Natural Science Edition）, 2018, 24(6): 1022-1029.
[11]	XIE Zhifeng, YE Guanhua, YAN Shuqi, HE Shaorong, DING Youdong. HDR image style transfer technique based on generative adversarial networks [J]. Journal of Shanghai University（Natural Science Edition）, 2018, 24(4): 524-534.
[12]	ZHONG Wang, LI Chunxiang. Predicting of nonstationary downburst wind velocity based on extreme learning machines [J]. Journal of Shanghai University（Natural Science Edition）, 2018, 24(3): 446-455.
[13]	WANG Sicong, SUN Dean. Time effect of shear strengths of unsaturated fly-ash and its prediction [J]. Journal of Shanghai University（Natural Science Edition）, 2018, 24(1): 108-117.
[14]	JIANG Lei, LI Chunxiang, DENG Ying. Simulation of non-Gaussian fluctuating wind pressure based on LPZ spectral analysis [J]. Journal of Shanghai University（Natural Science Edition）, 2017, 23(4): 600-608.
[15]	LANG Yue1, ZHOU Jiting1, LIANG Xiaolong2, ZHANG Wenjun1. Automatic annotation for film and Television drama shots and recut system based on face identification [J]. Journal of Shanghai University（Natural Science Edition）, 2017, 23(3): 353-363.

Image matting based on deep learning

RichHTML

PDF

Knowledge

Abstract

Cite this article

share this article

Figures/Tables 7

References 13

Related Articles 15

Recommended Articles

Metrics

Comments