Foreground object perception and location algorithm based on semantic feature propagation model in MR
Received date: 2022-05-18
Online published: 2023-03-28
Accurate location information obtained by mobile agents is the key to building a stable mixed reality (MR) system. However, foreground objects in an MR scene have a significant impact on the accuracy of traditional location algorithms. At present, location algorithms based on deep learning show relatively improved accuracy by identifying foreground objects, but the time consumption of a deep learning model is too high, resulting in a decline in the real-time performance of the algorithms. To solve this problem, this paper proposes a foreground object-aware location algorithm based on an MR semantic feature propagation model. The algorithm builds a semantic feature propagation model based on a semantic segmentation network and the oriented FAST and rotated BRIEF feature extraction algorithm to realize high-speed semantic feature extraction. The model and a geometric feature detection method are fused to realize the foreground object perception layer in the algorithm, which eliminates the feature points on the foreground objects in MR, and to construct a background feature point set to realize high precision and high real-time location. Experimental results show that the proposed algorithm reduces the relative pose error by 60.5% and improves the real-time location performance by 39.5% compared to the dynamic scenes simultaneous localization and mapping location algorithm in the high-dynamic foreground object scene of the Technical University of Munich public dataset. Therefore, this algorithm has high application value in MR scenes.
Key words: mixed reality (MR); foreground object; location; semantic feature
FANG Zhe, ZHANG Jinyi, JIANG Yuxi . Foreground object perception and location algorithm based on semantic feature propagation model in MR[J]. Journal of Shanghai University, 0 : 41 -55 . DOI: 10.12066/j.issn.1007-2861.2413
| [1] | 夏铁男, 刘金鑫, 陈挺, 等. 混合现实技术在腹膜后肿瘤手术中的应用[J]. 中国临床研究, 2021, 34(8): 1053-1056. |
| [2] | Wang P, Bai X, Billinghurst M, et al. An MR remote collaborative platform based on 3D CAD models for training in industry[C]// 2019 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct). 2019: 91-92. |
| [3] | Dalim C S C, Piumsomboon T, Dey A, et al. TeachAR: an interactive augmented reality tool for teaching basic english to non-native children[C]// 2016 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct). 2016: 344-345. |
| [4] | Younes G, Asmar D, Shammas E, et al. Keyframe-based monocular SLAM: design, survey, and future directions[J]. Robotics and Autonomous Systems, 2017, 98: 67-88. |
| [5] | 高兴波, 史旭华, 葛群峰, 等. 面向动态物体场景的视觉SLAM 综述[J]. 机器人, 2021, 43(6): 733-750. |
| [6] | Mur-Artal R, Tardós J D. ORB-SLAM2: an open-source SLAM system for monocular, stereo, and RGB-D cameras[J]. IEEE Transactions on Robotics, 2017, 33(5): 1255-1262. |
| [7] | Wang R, Wan W, Wang Y, et al. A new RGB-D SLAM method with moving object detection for dynamic indoor scenes[J]. Remote Sensing, 2019, 11(10): 1143. 1-1143.19. |
| [8] | Bescos B, FÁcil J M, Civera J, et al. DynaSLAM: tracking, mapping, and inpainting in dynamic scenes[J]. IEEE Robotics and Automation Letters, 2018, 3(4): 4076-4083. |
| [9] | Yu C, Liu Z, Liu X J, et al. DS-SLAM: a semantic visual SLAM towards dynamic environ- ments[C]// 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2018: 1168-1174. |
| [10] | Badrinarayanan V, Kendall A, Cipolla R. Segnet: a deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495. |
| [11] | Rublee E, Rabaud V, Konolige K, et al. Orb: an efficient alternative to sift orsurf[C]// 2011 IEEE International Conference on Computer Vision (ICCV). 2011: 2564-2571. |
| [12] | He K, Gkioxari G, Dollvr P, et al. Mask R-CNN[C]// Proceedings of the IEEE International Conference on Computer Vision. 2017: 2961-2969. |
| [13] | 王榕榕, 徐树公, 黄剑波. 基于深度学习的图像抠图技术[J]. 上海大学学报(自然科学版), 2022, 28(2): 261-269. |
| [14] | Hartley R I. In defense of the eight-point algorithm[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(6): 580-593. |
| [15] | Chum O, Matas J, Kittler J. Locally optimized RANSAC[C]// Proc of Joint Pattern Recognition Symposium. 2003: 236-243. |
| [16] | Sturm J, Engelhard N, Endres F, et al. A benchmark for the evaluation of RGB-D SLAM systems[C]// 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems. 2012: 573-580. |
/
| 〈 |
|
〉 |