首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Semi-supervised collective extraction of opinion target and opinion word from online reviews based on active labeling
【24h】

Semi-supervised collective extraction of opinion target and opinion word from online reviews based on active labeling

机译:基于主动标签的在线评价半监督集体提取意见目标和意见词

获取原文
获取原文并翻译 | 示例
           

摘要

Online reviews play important roles in many Web Applications like e-business and government intelligence, since such user-generated-contents (UGC) contain rich user opinion. Opinion target and opinion word are a pair of core objects for user opinion expression in reviews. Extracting these two objects from reviews is crucial for the tasks of opinion mining. However, traditional extraction methods have various limitations such as ignoring the opinion relationship, the restriction of word span, the error propagation caused by iterative expansion, which would reduce the extraction performance. For the above deficiencies, we propose a supervised method based on the constrained word alignment model to extract opinion target and opinion word collectively at first. To tackle the time-consuming and error-prone problem of manual annotation encountered by the supervised method, we further devise a semi-supervised extraction method based on active learning. In this method, we design the sample uncertainty-based sampling strategy and the feature evidence-based one to choose the most informative samples for labeling manually. At last, a series of experiments on a real-world dataset show that our approaches outperform several state-of-the-art baselines significantly.
机译:在线评论在许多Web应用程序中扮演重要角色,如电子商务和政府智能,因为这种用户生成的内容(UGC)包含丰富的用户意见。意见目标和意见单词是一个用于评论中的用户意见表达的一对核心对象。从评论中提取这两个对象对于意见挖掘任务至关重要。然而,传统的提取方法具有各种局限性,如忽略意见关系,单词跨度的限制,由迭代扩展引起的误差传播,这将降低提取性能。对于上述缺陷,我们提出了一种基于受约束词对齐模型的监督方法,首先是共同提取意见目标和意见词。为了解决受监督方法遇到的手动注释的耗时和错误的问题,我们进一步设计了一种基于主动学习的半监督的提取方法。在这种方法中,我们设计了基于样本的不确定性的采样策略和基于特征的证据,以便手动选择最具信息性的样本。最后,一系列关于现实世界数据集的实验表明,我们的方法显着优于若干最先进的基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号