首页> 外文期刊>EPJ Data Science >Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs
【24h】

Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs

机译:理解大规模文本的情感分析方法:使用Continuum-Iscored单词和Word Shift图表的情况

获取原文
           

摘要

The emergence and global adoption of social media has rendered possible the real-time estimation of population-scale sentiment, an extraordinary capacity which has profound implications for our understanding of human behavior. Given the growing assortment of sentiment-measuring instruments, it is imperative to understand which aspects of sentiment dictionaries contribute to both their classification accuracy and their ability to provide richer understanding of texts. Here, we perform detailed, quantitative tests and qualitative assessments of 6 dictionary-based methods applied to 4 different corpora, and briefly examine a further 20 methods. We show that while inappropriate for sentences, dictionary-based methods are generally robust in their classification accuracy for longer texts. Most importantly they can aid understanding of texts with reliable and meaningful word shift graphs if (1)?the dictionary covers a sufficiently large portion of a given text’s lexicon when weighted by word usage frequency; and (2)?words are scored on a continuous scale.
机译:社交媒体的出现和全球采用具有人口规模情绪的实时估计,这是对我们对人类行为的理解产生深远影响的非凡能力。鉴于各种各样的各种各样的情绪测量仪器,必须了解情绪词典的哪个方面有助于其分类准确性以及他们提供对文本的更丰富的理解的能力。在这里,我们在4种不同的基础上进行了对6个字典的方法的详细,定量测试和定性评估,并简要检查了另外20种方法。我们展示,虽然不适合句子,但对于更长的文本,基于字典的方法通常是它们的分类准确性的强大。最重要的是,如果(1)?字典涵盖单词使用频率的加权时,可以帮助了解具有可靠和有意义的单词换档图表的文本的文本,如果(1)?字典覆盖了一系列给定文本的Lexicon的足够大的部分; (2)?在连续规模上得分的单词。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号