数字与影视技术

一种结合文章信息的新闻评论情感分析方法

展开
  • 1.上海大学 上海电影学院, 上海 200072
    2.上海大学 生命科学学院, 上海 200444
朱永华(1967—), 男, 副教授, 博士, 研究方向为计算机、人工智能和数据科学. E-mail: zyh@shu.edu.cn

收稿日期: 2019-12-17

  网络出版日期: 2020-07-16

基金资助

"十三五"国家重点研发计划项目(2017YFD0400101)

Incorporating article information for sentiment analysis of news comments

Expand
  • 1. Shanghai Film Academy, Shanghai University, Shanghai 200072, China
    2. School of Life Sciences, Shanghai University, Shanghai 200444, China

Received date: 2019-12-17

  Online published: 2020-07-16

摘要

新闻评论表达了人们对新闻事件的看法与态度, 因此对新闻评论进行分析具有潜在的应用价值. 传统的情感分析方法仅对评论文本进行分析, 忽略了新闻文章主题及语义信息对评论的影响. 针对这个问题, 提出了一种基于支持向量机和 $K$ 均值聚类的情感分析方法, 将新闻文章信息对评论情感的影响因素引入到新闻评论的情感分类中. 实验结果证明了该方法在新闻评论情感分析任务中的有效性.

本文引用格式

杨一璞, 朱永华, 高海燕, 高文靖 . 一种结合文章信息的新闻评论情感分析方法[J]. 上海大学学报(自然科学版), 2022 , 28(1) : 170 -178 . DOI: 10.12066/j.issn.1007-2861.2252

Abstract

News comments reflect people's opinions or sentiments toward news events. Therefore, analysis of news comments is potentially useful for many applications. Traditional methods of sentiment analysis focus on the contents of comments while ignoring the influence of news topics and semantics information from news articles. This study proposes a sentiment analysis approach using support vector machine and $K$-means clustering that considers the impact of news articles on the sentiments of news comments. Experimental results on a news comment dataset demonstrate the effectiveness of our proposed method.

参考文献

[1] Turney P D. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews [C]// Meeting of the Association for Computational Linguistics. 2002: 417-424.
[2] Pang B, Lee L, Vaithyanathan S, et al. Thumbs up? Sentiment classification using machine learning techniques [C]// Empirical Methods in Natural Language Processing. 2002: 79-86.
[3] Gamon M. Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis [C]// International Conference on Computational Linguistics. 2004.
[4] 吴杰胜, 陆奎. 基于多部情感词典和规则集的中文微博情感分析研究[J]. 计算机应用与软件, 2019, 36(9): 93-99.
[5] 王志涛, 於志文, 郭斌, 等. 基于词典和规则集的中文微博情感分析[J]. 计算机工程与应用, 2015, 51(8): 218-225.
[6] 姜杰, 夏睿. 机器学习与语义规则融合的微博情感分类方法[J]. 北京大学学报 (自然科学版), 2017, 53(2): 247-254.
[7] 马丽菲, 莫倩, 杜辉. 面向中文短影评的分类技术研究[J]. 山东大学学报 (理学版), 2016, 51(1): 52-57.
[8] Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space [C]// International Conference on Learning Representations. 2013.
[9] Mikolov T, Sutskever I, Chen K, et al. Distributed representations of words and phrases and their compositionality [C]// Advances in Neural Information Processing Systems. 2013: 3111-3119.
[10] Kim Y. Convolutional neural networks for sentence classification [C]// Empirical Methods in Natural Language Processing. 2014: 1746-1751.
[11] 朱晓亮, 石昀东. 基于 TextRank 和字符级卷积神经网络的小学作文素材自动分类模型研究[J]. 计算机应用与软件, 2019, 36(1): 220-226.
[12] Johnson R, Zhang T. Effective use of word order for text categorization with convolutional neural networks[EB/OL]. [2020-10-08]. https://xueshu.baidu.com/usercenter/paper/show?paperid=be21496c000b08ae337b7da98e06558b&site=xueshu_se.
[13] Huang M, Cao Y, Dong C, et al. Modeling rich contexts for sentiment classification with LSTM[EB/OL]. [2020-11-05]. https://xueshu.baidu.com/usercenter/paper/show?paperid=98bde1bdd380df1603d8cce2b84281ff&site=xueshu_se.
[14] Xu J, Chen D, Qiu X, et al. Cached long short-term memory neural networks for document-level sentiment classification [C]// Empirical Methods in Natural Language Processing. 2016: 1660-1669.
[15] Yang Z, Yang D, Dyer C, et al. Hierarchical attention networks for document classification [C]// North American Chapter of the Association for Computational Linguistics. 2016: 1480-1489.
[16] Tang D, Qin B, Liu T, et al. Document modeling with gated recurrent neural network for sentiment classification [C]// Empirical Methods in Natural Language Processing. 2015: 1422-1432.
[17] Shi S, Zhao M, Guan J, et al. A hierarchical LSTM model with multiple features for sentiment analysis of sina weibo texts [C]// International Conference on Asian Language Processing. 2017: 379-382.
[18] Wang B. Attention-based hierarchical LSTM model for document sentiment classification [C]// Proceedings of 2018 2nd International Conference on Artificial Intelligence Applications and Technologies (AIAAT2018). 2018: 436-441.
[19] 白静, 李霏, 姬东鸿. 基于注意力的 BiLSTM-CNN 中文微博立场检测模型[J]. 计算机应用与软件, 2018, 35(3): 266-274.
[20] Napoles C, Tetreault J, Pappu A, et al. Finding good conversations online: the yahoo news annotated comments corpus [C]// Linguistic Annotation Workshop. 2017: 13-23.
[21] Napoles C, Pappu A, Tetreault J, et al. Automatically identifying good conversations online (Yes, they do exist!) [C]// International Conference on Weblogs and Social Media. 2017: 628-631.
文章导航

/