上海大学学报(自然科学版) ›› 2021, Vol. 27 ›› Issue (3): 535-543.doi: 10.12066/j.issn.1007-2861.2223

• 研究论文 • 上一篇    下一篇

基于组合神经网络的中文事件抽取

季忠祥, 吴悦()   

  1. 上海大学 计算机工程与科学学院, 上海 200444
  • 收稿日期:2020-02-23 出版日期:2021-06-30 发布日期:2021-06-27
  • 通讯作者: 吴悦 E-mail:ywu@shu.edu.cn
  • 作者简介:吴悦(1960—), 女, 教授, 博士生导师, 博士, 研究方向为自然语言处理. E-mail: ywu@shu.edu.cn

Event extraction of Chinese text based on composite neural network

JI Zhongxiang, WU Yue()   

  1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
  • Received:2020-02-23 Online:2021-06-30 Published:2021-06-27
  • Contact: WU Yue E-mail:ywu@shu.edu.cn

摘要:

中文事件抽取通常使用循环神经网络(recurrent neural network, RNN)来进行事件和事件要素的抽取, 但 RNN 在处理长度较长的词语时容易丢失重要信息, 为此提出一种组合卷积神经网络(convolutional neural network, CNN)与双向长短期记忆(bidirectional long short-term memory, Bi-LSTM)网络的中文事件抽取模型 CNN-Bi-LSTM-CRF, 其中 CRF (conditional random field) 为条件随机场. 采用基于注意力机制和语义特征生成的字词联合向量, 使用 CNN 和 Bi-LSTM 模型对字词联合向量进行处理, 以获取其隐含表示, 最后通过 CRF 得出预测结果. 实验结果表明, 所提出的方法与其他现有的中文事件抽取方法相比, 准确率有明显提升.

关键词: 事件抽取, 卷积神经网络, 长短期记忆

Abstract:

The recurrent neural network is widely used in the event extraction of Chinese text to extract events and event elements, but it usually loses essential information when processing long words. In this study, the convolutional neural network (CNN) and the bidirectional long short-term memory (Bi-LSTM) network were combined to develop a novel event extraction model known as CNN-Bi-LSTM-conditional random field (CRF). A joint vector of characters and words was adopted based on the attention mechanism and semantic features, and the CNN and Bi-LSTM models were used to process the vector to obtain its implicit representation. Finally, the CRF was used to obtain the prediction results. The experimental results show that the proposed method is more accurate than other existing event extraction methods in extracting Chinese text.

Key words: event extraction, convolutional neural network, long short-term memory

中图分类号: