首页> 外文会议>International conference on advances in computing, communications and informatics >Is sentiment analysis an art or a science? Impact of lexical richness in training corpus on machine learning
【24h】

Is sentiment analysis an art or a science? Impact of lexical richness in training corpus on machine learning

机译:情感分析是一门艺术还是一门科学?训练语料库中词汇丰富度对机器学习的影响

获取原文

摘要

Social Media is exploding with data - that can help you derive an optimal marketing strategy in the internet world, engage with your audience on the fly, and protect your reputation from smearing campaigns if it is processed and analyzed in a timely fashion. Digital marketing analysts and data scientists rely on social media analytics tools to deduce customer sentiment from countless opinions and reviews. While numerous attempts have been made to improve their accuracy in the past, yet we know surprisingly little about how accurate their results are. We present an unbiased study of users' tweets and the methods that leverage the available tools & technologies for opinion mining. Our prime focus is on improving the consistency of text classifiers used for linguistic analysis. We also measure the impact of lexical richness in the sample data on the trained algorithm. This paper attempts to improve the reliability of sentiment classification process by the creation of a custom vote classifier using natural language processing techniques and various machine learning algorithms.
机译:社交媒体数据爆炸式增长-如果及时进行处理和分析,可以帮助您在互联网世界中制定最佳的营销策略,与受众互动,并保护您的声誉免于抹黑广告活动。数字营销分析师和数据科学家依靠社交媒体分析工具从无数的意见和评论中推断出客户的情绪。尽管过去曾进行过许多尝试来提高其准确性,但令人惊讶的是,我们对其结果的准确性知之甚少。我们对用户的推文以及利用现有工具和技术进行观点挖掘的方法进行了公正的研究。我们的主要重点是提高用于语言分析的文本分类器的一致性。我们还测量了样本数据中词汇丰富度对训练算法的影响。本文试图通过使用自然语言处理技术和各种机器学习算法创建自定义投票分类器来提高情感分类过程的可靠性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号