首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Improving Citation Polarity Classification with Product Reviews
【24h】

Improving Citation Polarity Classification with Product Reviews

机译:通过产品评论改善引证极性分类

获取原文

摘要

Recent work classifying citations in scientific literature has shown that it is possible to improve classification results with extensive feature engineering. While this result confirms that citation classification is feasible, there are two drawbacks to this approach: (ⅰ) it requires a large annotated corpus for supervised classification, which in the case of scientific literature is quite expensive; and (ⅱ) feature engineering that is too specific to one area of scientific literature may not be portable to other domains, even within scientific literature. In this paper we address these two drawbacks. First, we frame citation classification as a domain adaptation task and leverage the abundant labeled data available in other domains. Then, to avoid over-engineering specific citation features for a particular scientific domain, we explore a deep learning neural network approach that has shown to generalize well across domains using unigram and bigram features. We achieve better citation classification results with this cross-domain approach than using in-domain classification.
机译:最近的工作级别在科学文献中进行分类,表明,可以通过广泛的特征工程来改善分类结果。虽然该结果证实了引文分类是可行的,但这种方法有两个缺点:(Ⅰ)它需要一个用于监督分类的大型注释语料库,这在科学文学中是非常昂贵的; (Ⅱ)特定于科学文学领域的特点工程,即使在科学文献中,也可能无法携带其他领域。在本文中,我们解决了这两个缺点。首先,我们将引文分类为域适应任务,并利用其他域中可用的丰富标记数据。然后,为了避免针对特定科学域的过度工程特定引文特征,我们探讨了一个深入的学习神经网络方法,这些方法已经显示出使用Unigram和Bigram功能跨越域的概括。使用这种跨域方法,我们实现了更好的引文分类结果,而不是使用域的分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号