Foreground object perception and location algorithm based on semantic feature propagation model in MR

FANG Zhe, ZHANG Jinyi, JIANG Yuxi

doi:10.12066/j.issn.1007-2861.2413

Journal of Shanghai University >

0 41 - 55

DOI: https://doi.org/10.12066/j.issn.1007-2861.2413

Research Articles

Foreground object perception and location algorithm based on semantic feature propagation model in MR

Expand

1. Key laboratory of Specialty Fiber Optics and Optical Access Networks, Shanghai University, Shanghai 200444, China
2. Joint International Research Laboratory of Specialty Fiber Optics and Advanced Communication, Shanghai University, Shanghai 200444, China
3. Shanghai Sansi Institute for System Integration, Shanghai 201100, China

Received date: 2022-05-18

Online published: 2023-03-28

Fold

Abstract

Accurate location information obtained by mobile agents is the key to building a stable mixed reality (MR) system. However, foreground objects in an MR scene have a significant impact on the accuracy of traditional location algorithms. At present, location algorithms based on deep learning show relatively improved accuracy by identifying foreground objects, but the time consumption of a deep learning model is too high, resulting in a decline in the real-time performance of the algorithms. To solve this problem, this paper proposes a foreground object-aware location algorithm based on an MR semantic feature propagation model. The algorithm builds a semantic feature propagation model based on a semantic segmentation network and the oriented FAST and rotated BRIEF feature extraction algorithm to realize high-speed semantic feature extraction. The model and a geometric feature detection method are fused to realize the foreground object perception layer in the algorithm, which eliminates the feature points on the foreground objects in MR, and to construct a background feature point set to realize high precision and high real-time location. Experimental results show that the proposed algorithm reduces the relative pose error by 60.5% and improves the real-time location performance by 39.5% compared to the dynamic scenes simultaneous localization and mapping location algorithm in the high-dynamic foreground object scene of the Technical University of Munich public dataset. Therefore, this algorithm has high application value in MR scenes.

Key words： mixed reality (MR); foreground object; location; semantic feature

Cite this article

FANG Zhe, ZHANG Jinyi, JIANG Yuxi . Foreground object perception and location algorithm based on semantic feature propagation model in MR[J]. Journal of Shanghai University, 0 : 41 -55 . DOI: 10.12066/j.issn.1007-2861.2413

References

[1]	夏铁男, 刘金鑫, 陈挺, 等. 混合现实技术在腹膜后肿瘤手术中的应用[J]. 中国临床研究, 2021, 34(8): 1053-1056.
[2]	Wang P, Bai X, Billinghurst M, et al. An MR remote collaborative platform based on 3D CAD models for training in industry[C]// 2019 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct). 2019: 91-92.
[3]	Dalim C S C, Piumsomboon T, Dey A, et al. TeachAR: an interactive augmented reality tool for teaching basic english to non-native children[C]// 2016 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct). 2016: 344-345.
[4]	Younes G, Asmar D, Shammas E, et al. Keyframe-based monocular SLAM: design, survey, and future directions[J]. Robotics and Autonomous Systems, 2017, 98: 67-88.
[5]	高兴波, 史旭华, 葛群峰, 等. 面向动态物体场景的视觉SLAM 综述[J]. 机器人, 2021, 43(6): 733-750.
[6]	Mur-Artal R, Tardós J D. ORB-SLAM2: an open-source SLAM system for monocular, stereo, and RGB-D cameras[J]. IEEE Transactions on Robotics, 2017, 33(5): 1255-1262.
[7]	Wang R, Wan W, Wang Y, et al. A new RGB-D SLAM method with moving object detection for dynamic indoor scenes[J]. Remote Sensing, 2019, 11(10): 1143. 1-1143.19.
[8]	Bescos B, FÁcil J M, Civera J, et al. DynaSLAM: tracking, mapping, and inpainting in dynamic scenes[J]. IEEE Robotics and Automation Letters, 2018, 3(4): 4076-4083.
[9]	Yu C, Liu Z, Liu X J, et al. DS-SLAM: a semantic visual SLAM towards dynamic environ- ments[C]// 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2018: 1168-1174.
[10]	Badrinarayanan V, Kendall A, Cipolla R. Segnet: a deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495.
[11]	Rublee E, Rabaud V, Konolige K, et al. Orb: an efficient alternative to sift orsurf[C]// 2011 IEEE International Conference on Computer Vision (ICCV). 2011: 2564-2571.
[12]	He K, Gkioxari G, Dollvr P, et al. Mask R-CNN[C]// Proceedings of the IEEE International Conference on Computer Vision. 2017: 2961-2969.
[13]	王榕榕, 徐树公, 黄剑波. 基于深度学习的图像抠图技术[J]. 上海大学学报(自然科学版), 2022, 28(2): 261-269.
[14]	Hartley R I. In defense of the eight-point algorithm[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(6): 580-593.
[15]	Chum O, Matas J, Kittler J. Locally optimized RANSAC[C]// Proc of Joint Pattern Recognition Symposium. 2003: 236-243.
[16]	Sturm J, Engelhard N, Endres F, et al. A benchmark for the evaluation of RGB-D SLAM systems[C]// 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems. 2012: 573-580.

Options

Outlines

模态框（Modal）标题

Abstract

Cite this article

References