首页> 外文学位 >Aspect-based opinion mining in online reviews.
【24h】

Aspect-based opinion mining in online reviews.

机译:在线评论中基于方面的观点挖掘。

获取原文
获取原文并翻译 | 示例

摘要

Other people's opinions are important piece of information for making informed decisions. Today the Web has become an excellent source of consumer opinions. However, as the volume of opinionated text is growing rapidly, it is getting impossible for users to read all reviews to make a good decision. Reading different and possibly even contradictory opinions written by different reviewers even make them more confused. In the same way, monitoring consumer opinions is getting harder for the manufactures and providers. These needs have inspired a new line of research on mining consumer reviews, or opinion mining. Aspect-based opinion mining, is a relatively new sub-problem that attracted a great deal of attention in the last few years. Extracted aspects and estimated ratings clearly provides more detailed information for users to make decisions and for suppliers to monitor their consumers.;In this thesis, we address the problem of aspect-based opinion mining and seek novel methods to improve limitations and weaknesses of current techniques. We first propose a method, called Opinion Digger, that takes advantages of syntactic patterns to improve the accuracy of frequency-based techniques. We then move on to model-based approaches and propose an LDA-based model, called ILDA, to jointly extract aspects and estimate their ratings. In our next work, we compare ILDA with a series of increasingly sophisticated LDA models representing the essence of the major published methods in the literature. A comprehensive evaluation of these models indicates that while ILDA works best for items with large number of reviews, it performs poorly when the size of the training dataset is small, i.e., for cold start items. The cold start problem is critical as in real-life data sets around 90% of items are cold start. We address this problem in our last work and propose an LDA-based model, called FLDA. It models items and reviewers by a set of latent factors and learns them using reviews of an item category. Experimental results on real life data sets show that FLDA achieve significant gain for cold start items compared to the state-of-the-art models.
机译:他人的意见是做出明智决定的重要信息。今天,Web已成为消费者意见的绝佳来源。但是,随着带有评论的文本量的迅速增长,用户无法阅读所有评论来做出正确的决定。阅读由不同审稿人撰写的不同甚至可能矛盾的观点,甚至会使他们更加困惑。同样,对于制造商和供应商而言,监视消费者的意见变得越来越困难。这些需求激发了有关挖掘消费者评论或观点挖掘的新研究领域。基于方面的意见挖掘是一个相对较新的子问题,在最近几年中引起了很多关注。提取的方面和估计的评分显然为用户做出决定以及供供应商监视他们的消费者提供了更详细的信息。;在本文中,我们解决了基于方面的观点挖掘问题,并寻求新颖的方法来改善当前技术的局限性和弱点。我们首先提出一种称为Opinion Digger的方法,该方法利用句法模式的优势来提高基于频率的技术的准确性。然后,我们继续基于模型的方法,并提出一个基于LDA的模型,称为ILDA,以联合提取方面并评估其等级。在我们的下一个工作中,我们将ILDA与一系列越来越复杂的LDA模型进行比较,这些模型代表了文献中主要已发表方法的实质。对这些模型的综合评估表明,虽然ILDA最适合具有大量评论的项目,但是当训练数据集的大小较小时(即对于冷启动项目),ILDA的效果较差。冷启动问题非常关键,因为在现实生活中,大约90%的项目都是冷启动。我们在上一份工作中解决了这个问题,并提出了一个基于LDA的模型FLDA。它通过一组潜在因素对项目和审阅者进行建模,并使用对项目类别的审阅来学习它们。实际数据集上的实验结果表明,与最新模型相比,FLDA在冷启动项目上获得了可观的收益。

著录项

  • 作者

    Abbasi Moghaddam, Samaneh.;

  • 作者单位

    Simon Fraser University (Canada).;

  • 授予单位 Simon Fraser University (Canada).;
  • 学科 Computer science.;Information technology.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 156 p.
  • 总页数 156
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 能源与动力工程;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号