首页> 外文期刊>Neural computing & applications >How textual quality of online reviews affect classification performance: a case of deep learning sentiment analysis
【24h】

How textual quality of online reviews affect classification performance: a case of deep learning sentiment analysis

机译:在线评论的文本质量如何影响分类绩效:一个深入学习情绪分析的情况

获取原文
获取原文并翻译 | 示例
           

摘要

Cognitive computing is an interdisciplinary research field that simulates human thought processes in a computerized model. One application for cognitive computing is sentiment analysis on online reviews, which reflects opinions and attitudes toward products and services experienced by consumers. A high level of classification performance facilitates decision making for both consumers and firms. However, while much effort has been made to propose advanced classification algorithms to improve the performance, the importance of the textual quality of the data has been ignored. This research explores the impact of two influential textual features, namely the word count and review readability, on the performance of sentiment classification. We apply three representative deep learning techniques, namely SRN, LSTM, and CNN, to sentiment analysis tasks on a benchmark movie reviews dataset. Multiple regression models are further employed for statistical analysis. Our findings show that the dataset with reviews having a short length and high readability could achieve the best performance compared with any other combinations of the levels of word count and readability and that controlling the review length is more effective for garnering a higher level of accuracy than increasing the readability. Based on these findings, a practical application, i.e., a text evaluator or a website plug-in for text evaluation, can be developed to provide a service of review editorials and quality control for crowd-sourced review websites. These findings greatly contribute to generating more valuable reviews with high textual quality to better serve sentiment analysis and decision making.
机译:认知计算是一种跨学科研究领域,用于在计算机化模型中模拟人类思维过程。一种用于认知计算的应用是关于在线评论的情感分析,这反映了消费者所经历的产品和服务的意见和态度。高水平的分类性能促进了消费者和公司的决策。但是,虽然已经努力提出了先进的分类算法来提高性能,但忽略了数据的文本质量的重要性。本研究探讨了两个有影响力的文本特征的影响,即关于情绪分类的表现。我们在基准电影评论数据集上应用三个代表性深度学习技术,即SRN,LSTM和CNN,在基准电影评论上的情感分析任务。进一步用于统计分析的多元回归模型。我们的调查结果表明,具有短长度和高可读性的评论的数据集可以实现最佳性能,而单词计数和可读性水平的任何其他组合相比,控制审查长度更有效地获得更高水平的准确性。增加可读性。基于这些发现,可以制定实际应用,即文本评估员或文本评估的网站插件,为人群审查网站提供评论编辑和质量控制的服务。这些发现极大地有助于以高文本质量产生更有价值的评论,以更好地提供情感分析和决策。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号