...
首页> 外文期刊>International journal of data mining, modelling and management >Analysing sentiments based on multi feature combination with supervised learning
【24h】

Analysing sentiments based on multi feature combination with supervised learning

机译:基于多特征结合监督学习的情感分析

获取原文
获取原文并翻译 | 示例
           

摘要

Researches on sentiment analysis are growing to a great extent and attracting wide ranges of attention from academics and industries as well. Feature generation and selection are consequent for text mining as the high dimensional feature set can affect the performance of sentiment analysis. This paper exhibits the efficacy of the proposed combined feature selection technique on machine learning classification algorithms over their individual usefulness. Initially, we transform the review datasets into the feature vector of unigram features along with bi-tagged features based on POS pattern. Next, information gain (IG), Chi squared (χ~(2)) and minimum redundancy maximum relevancy (mRMR) feature selection methods are applied to obtain an optimal feature subset for further functionality. These features are then given input to multiple machine learning classifiers, namely, support vector machine (SVM), multinomial Naïve Bayes (MNB), Bernoulli Naïve Bayes (BNB) and logistic regression (LR) on multi domain product review datasets. The performance of the algorithm is measured by evaluation methods such as precision, recall, and F-measure. Experimental results show that the feature selection method mRMR with SVM achieved a better accuracy of 91.39, which is encouraging and comparable to the related research.
机译:情绪分析的研究正在迅速发展,并引起了学术界和行业的广泛关注。特征生成和选择对于文本挖掘是必然的,因为高维特征集会影响情感分析的性能。本文展示了在机器学习分类算法上所提出的组合特征选择技术的有效性,超过了它们的各自用途。最初,我们将评论数据集转换为unigram特征的特征向量以及基于POS模式的双标签特征。接下来,应用信息增益(IG),卡方(χ〜(2))和最小冗余最大相关性(mRMR)特征选择方法,以获得用于进一步功能的最佳特征子集。然后,将这些功能输入多个机器学习分类器,即支持向量机(SVM),多项式朴素贝叶斯(MNB),伯努利朴素贝叶斯(BNB)和多域产品评论数据集上的逻辑回归(LR)。算法的性能是通过评估方法(例如精度,召回率和F量度)来衡量的。实验结果表明,带支持向量机的特征选择方法mRMR达到了91.39的较高准确度,这是令人鼓舞的,可与相关研究相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号