收稿日期: 2018-07-02
网络出版日期: 2018-07-04
基金资助
上海市科委基金资助项目(14590500500)
A graph-based method for multi-feature entity linking
Received date: 2018-07-02
Online published: 2018-07-04
周金, 朱永华, 张铁男, 邢毅雪, 张克 . 基于图的联合特征实体链接方法[J]. 上海大学学报(自然科学版), 2020 , 26(5) : 747 -755 . DOI: 10.12066/j.issn.1007-2861.2068
Entity linking refers to the process of linking entity mentioned in text with knowledge base entity, which is one of the key steps in knowledge graph and knowledge fusion. This paper proposes a graph-based method for multi-feature entity linking. This method first preprocesses the knowledge base and the text, then identifies the named entity references in the text, and then combines the semantic similarity of multiple features such as topics, context, metadata, etc. In the expanded graph model, the probability of restarting random walk is used, and the target candidate entity is selected by joint disambiguation. The results of experiment show that the joint feature-based entity linking method based on graphs effectively improves the effectiveness of entity linking.
| [1] | Cornolti M, Ferragina P, Ciaramita M. A piggyback system for joint entity mention detection and linking in web queries[C]// International Conference on World Wide Web. 2016: 567-578. |
| [2] | 鄂世嘉, 林培裕, 向阳. 自动化构建的中文知识图谱系统[J]. 计算机应用, 2016,36(4):992-996. |
| [3] | Dong X L, Gabrilovich E, Heitz G, et al. From data fusion to knowledge fusion[J]. Proceedings of the VLDB Endowment, 2014,7(10):881-892. |
| [4] | 林海伦, 王元卓, 贾岩涛, 等. 面向网络大数据的知识融合方法综述[J]. 计算机学报, 2017(1):1-27. |
| [5] | Chen R C, Bau C T, Yeh C J. Merging domain ontologies based on the WordNet system and fuzzy formal concept analysis techniques[J]. Applied Soft Computing, 2011,11(2):1908-1923. |
| [6] | Gattani A, Lamba D S, Garera N, et al. Entity extraction, linking, classification, and tagging for social media: a wikipedia-based approach[J]. Proceedings of the VLDB Endowment, 2013,6(11):1126-1137. |
| [7] | Mihalcea R, Csomai A. Wikify!: linking documents to encyclopedic knowledge[C]// Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management. 2007: 233-242. |
| [8] | Shen W, Han J, Wang J, et al. SHINE+: a general framework for domain-specific entity linking with heterogeneous information networks[J]. IEEE Transactions on Knowledge and Data Engineering, 2018,30(2):353-366. |
| [9] | Zhu G, Iglesias C A. Exploiting semantic similarity for named entity disambiguation in knowledge graphs[J]. Expert Systems with Applications, 2018,101(6):8-24. |
| [10] | 刘峤, 钟云, 李杨, 等. 基于图的中文集成实体链接算法[J]. 计算机研究与发展, 2016,53(2):270-283. |
| [11] | 李超. 一种基于BTM 主题模型的命名实体链接方法研究[D]. 大连: 大连海事大学, 2017. |
| [12] | 谭咏梅, 李晓光, 吕学强. 融合卷积神经网络和重启随机游走的实体链接方法[J]. 北京邮电大学学报, 2018,41(1):65-69. |
| [13] | Han X, Sun L, Zhao J, et al. Collective entity linking in web text: a graph-based method[C]// International ACM SIGIR Conference on Research and Development in Information Retrieval. 2011: 765-774. |
| [14] | Hoffart J, Yosef M A, Bordino I, et al. Robust disambiguation of named entities in text[C]// Conference on Empirical Methods in Natural Language Processing. 2011: 782-792. |
| [15] | 李茂林. 基于主题敏感的重启随机游走实体链接方法[J]. 北京大学学报(自然科学版), 2016,52(1):17-24. |
| [16] | 高艳红, 李爱萍, 段利国. 面向实体链接的多特征图模型实体消歧方法[J]. 计算机应用研究, 2017,34(10):2909-2914. |
| [17] | 谭咏梅, 郑迪, 刘姝雯, 等. 基于重启随机游走的实体识别与链接方法[J]. 北京邮电大学学报, 2017,40(6):115-119. |
| [18] | Kalender M, Korkmaz E E. THINKER-entity linking system for Turkish language[J]. IEEE Transactions on Knowledge and Data Engineering, 2018,30(2):367-380. |
| [19] | Globerson A, Lazic N, Chakrabarti S, et al. Collective entity resolution with multi-focal attention[C]// Meeting of the Association for Computational Linguistics. 2016: 621-631. |
/
| 〈 |
|
〉 |