Document-level sentiment classification: An empirical comparison between SVM and ANN

Rodrigo Moraes; Joao Francisco Valiati; Wilson P. Gaviao Neto

首页> 外文期刊>Expert Systems with Application >Document-level sentiment classification: An empirical comparison between SVM and ANN

【24h】

Document-level sentiment classification: An empirical comparison between SVM and ANN

机译：文档级情感分类：SVM与ANN的经验比较

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document-level sentiment classification aims to automate the task of classifying a textual review, which is given on a single topic, as expressing a positive or negative sentiment. In general, supervised methods consist of two stages: (i) extraction/selection of informative features and (ii) classification of reviews by using learning models like Support Vector Machines (SVM) and Naive Bayes (NB). SVM have been extensively and successfully used as a sentiment learning approach while Artificial Neural Networks (ANN) have rarely been considered in comparative studies in the sentiment analysis literature. This paper presents an empirical comparison between SVM and ANN regarding document-level sentiment analysis. We discuss requirements, resulting models and contexts in which both approaches achieve better levels of classification accuracy. We adopt a standard evaluation context with popular supervised methods for feature selection and weighting in a traditional bag-of-words model. Except for some unbalanced data contexts, our experiments indicated that ANN produce superior or at least comparable results to SVM's. Specially on the benchmark dataset of Movies reviews, ANN outperformed SVM by a statistically significant difference, even on the context of unbalanced data. Our results have also confirmed some potential limitations of both models, which have been rarely discussed in the sentiment classification literature, like the computational cost of SVM at the running time and ANN at the training time.

机译：文档级情感分类旨在使对单个主题的文本评论进行分类的任务自动化，以表达积极或消极的情感。通常，监督方法包括两个阶段：（i）信息特征的提取/选择和（ii）使用支持向量机（SVM）和朴素贝叶斯（NB）等学习模型对评论进行分类。支持向量机已广泛且成功地用作情感学习方法，而在情感分析文献中的比较研究中很少考虑使用人工神经网络（ANN）。本文提出了SVM和ANN在文档级情感分析方面的经验比较。我们讨论了需求，结果模型和上下文，在这两种方法中，分类精度都达到了更高的水平。在传统的词袋模型中，我们采用标准的评估环境以及流行的监督方法进行特征选择和加权。除了某些不平衡的数据上下文外，我们的实验表明ANN可以产生比SVM更好或至少可比的结果。特别是在电影评论的基准数据集上，即使在数据不平衡的情况下，人工神经网络也具有统计学上的显着差异，优于SVM。我们的研究结果还证实了这两种模型的某些潜在局限性，在情感分类文献中很少讨论，例如运行时SVM的计算成本和训练时ANN的计算成本。

著录项

来源
《Expert Systems with Application》 |2013年第2期|621-633|共13页
作者
Rodrigo Moraes; Joao Francisco Valiati; Wilson P. Gaviao Neto;
展开▼
作者单位

Programa lnterdistiplinar de Pos-Graduacao em Computacao Aplicada - P1PCA, Universidade do Vale do Rio dos Sinos - UNISINOS, Av. Unisinos, 950 Sao Leopoldo, RS, Brazil;

Programa lnterdistiplinar de Pos-Graduacao em Computacao Aplicada - P1PCA, Universidade do Vale do Rio dos Sinos - UNISINOS, Av. Unisinos, 950 Sao Leopoldo, RS, Brazil;

Programa lnterdistiplinar de Pos-Graduacao em Computacao Aplicada - P1PCA, Universidade do Vale do Rio dos Sinos - UNISINOS, Av. Unisinos, 950 Sao Leopoldo, RS, Brazil;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
sentiment classification; opinion mining; text classification; artificial neural networks; support vector machines; comparative study;

机译：情绪分类;意见挖掘;文字分类人工神经网络;支持向量机;比较研究;

相似文献

外文文献
中文文献
专利

1. Unsupervised Sentiment-Bearing Feature Selection for Document-Level Sentiment Classification [J] . Yan LI, Zhen QIN, Weiran XU, IEICE transactions on information and systems . 2013,第12期

机译：用于文档级情感分类的无监督情感特征选择
2. Unsupervised Sentiment-Bearing Feature Selection for Document-Level Sentiment Classification [J] . Yan LI, Zhen QIN, Weiran XU, IEICE Transactions on Information and Systems . 2013,第12期

机译：用于文档级情感分类的无监督情感特征选择
3. User's Review Habits Enhanced Hierarchical Neural Network for Document-Level Sentiment Classification [J] . Chen Jie, Yu Jingying, Zhao Shu, Neural processing letters . 2021,第3期

机译：用户的评论习惯增强了文档级情绪分类的分层神经网络
4. Comparison of SVM classification method and semantic similarity method for sentiment classification [C] . Changqin Quan, Xiquan Wei, Fuji Ren IEEE International Conference on Cloud Computing and Intelligent Systems . 2014

机译：支持向量机分类法与语义相似度法在情感分类中的比较
5. An empirical comparison of tabu search, simulated annealing, and genetic algorithms for facilities location problems. [D] . Arostegui, Marvin Antonio, Jr. 1997

机译：禁忌搜索，模拟退火和遗传算法对设施选址问题的经验比较。
6. Multilingual Twitter Sentiment Classification: The Role of Human Annotators [O] . Igor Mozetič, Miha Grčar, Jasmina Smailović -1

机译：多语言Twitter情感分类：人类注释者的作用
7. Figure 3: Classification of tea cultivars in the study region, with image pre-processing and classification method combinations of: (A) None+MLC (B) None+MDC (C) None+ANN (D) None+SVM (E) MNF+MLC (F) MNF+MDC (G) MNF+ANN (H) MNF+SVM (I) PCA+MLC (J) PCA+MDC (K) PCA+ANN (L) PCA+SVM (M) ICA+MLC (N) ICA+MDC (O) ICA+ANN (P) ICA+SVM. [O] . -1

机译：图3：研究区域茶叶种类分类，图像预处理和分类方法组合：（a）无+ mlc（b）无+ mdc（c）none + Ann（d）无+ svm（e） MNF + MLC（F）MNF + MDC（G）MNF + ANN（H）MNF + SVM（I）PCA + MLC（j）PCA + MDC（k）PCA + ANN（L）PCA + SVM（M）ICA + MLC（n）ICA + MDC（O）ICA + ANN（P）ICA + SVM。

Document-level sentiment classification: An empirical comparison between SVM and ANN

摘要

著录项

相似文献

相关主题

期刊订阅