首页> 外文会议>Conference on empirical methods in natural language processing >The Rating Game: Sentiment Rating Reproducibility from Text
【24h】

The Rating Game: Sentiment Rating Reproducibility from Text

机译:评级游戏:情绪评级从文本中的再现性

获取原文

摘要

Sentiment analysis models often use ratings as labels, assuming that these ratings reflect the sentiment of the accompanying text. We investigate (ⅰ) whether human readers can infer ratings from review text, (ⅱ) how human performance compares to a regression model, and (ⅲ) whether model performance is affected by the rating "source" (i.e. original author vs. annotator). We collect IMDb movie reviews with author-provided ratings, and have them re-annotated by crowdsourced and trained annotators. Annotators reproduce the original ratings better than a model, but are still far off in more than 5% of the cases. Models trained on annotator-labels outperform those trained on author-labels, questioning the usefulness of author-rated reviews as training data for sentiment analysis.
机译:假设这些评级反映了随附文本的情绪,这些感应分析模型通常使用评级作为标签。我们调查(Ⅰ)人类读者是否可以从审查文本推断评级,(Ⅱ)如何对回归模型进行比较,(Ⅲ)是否受评级“来源”(即原作者与注释器)影响模型性能。我们通过作者提供的评级收集IMDB电影评论,并通过众群和培训的注释器重新注释。注释器比模型更好地重现了原始评级,但在超过5%的情况下仍然远远超过。在注释器 - 标签上培训的型号优于作者标签培训的模型,质疑作者评级审查的有用性作为情感分析的培训数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号