...
首页> 外文期刊>Expert Systems with Application >Document-level sentiment classification: An empirical comparison between SVM and ANN
【24h】

Document-level sentiment classification: An empirical comparison between SVM and ANN

机译:文档级情感分类:SVM与ANN的经验比较

获取原文
获取原文并翻译 | 示例
           

摘要

Document-level sentiment classification aims to automate the task of classifying a textual review, which is given on a single topic, as expressing a positive or negative sentiment. In general, supervised methods consist of two stages: (i) extraction/selection of informative features and (ii) classification of reviews by using learning models like Support Vector Machines (SVM) and Naive Bayes (NB). SVM have been extensively and successfully used as a sentiment learning approach while Artificial Neural Networks (ANN) have rarely been considered in comparative studies in the sentiment analysis literature. This paper presents an empirical comparison between SVM and ANN regarding document-level sentiment analysis. We discuss requirements, resulting models and contexts in which both approaches achieve better levels of classification accuracy. We adopt a standard evaluation context with popular supervised methods for feature selection and weighting in a traditional bag-of-words model. Except for some unbalanced data contexts, our experiments indicated that ANN produce superior or at least comparable results to SVM's. Specially on the benchmark dataset of Movies reviews, ANN outperformed SVM by a statistically significant difference, even on the context of unbalanced data. Our results have also confirmed some potential limitations of both models, which have been rarely discussed in the sentiment classification literature, like the computational cost of SVM at the running time and ANN at the training time.
机译:文档级情感分类旨在使对单个主题的文本评论进行分类的任务自动化,以表达积极或消极的情感。通常,监督方法包括两个阶段:(i)信息特征的提取/选择和(ii)使用支持向量机(SVM)和朴素贝叶斯(NB)等学习模型对评论进行分类。支持向量机已广泛且成功地用作情感学习方法,而在情感分析文献中的比较研究中很少考虑使用人工神经网络(ANN)。本文提出了SVM和ANN在文档级情感分析方面的经验比较。我们讨论了需求,结果模型和上下文,在这两种方法中,分类精度都达到了更高的水平。在传统的词袋模型中,我们采用标准的评估环境以及流行的监督方法进行特征选择和加权。除了某些不平衡的数据上下文外,我们的实验表明ANN可以产生比SVM更好或至少可比的结果。特别是在电影评论的基准数据集上,即使在数据不平衡的情况下,人工神经网络也具有统计学上的显着差异,优于SVM。我们的研究结果还证实了这两种模型的某些潜在局限性,在情感分类文献中很少讨论,例如运行时SVM的计算成本和训练时ANN的计算成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号