首页> 外文会议>ACM/IEEE Joint Conference on Digital Libraries >What Drives Research Efforts? Find Scientific Claims that Count!
【24h】

What Drives Research Efforts? Find Scientific Claims that Count!

机译:是什么推动研究工作?查找重要的科学主张!

获取原文

摘要

Researchers often struggle to solve a common problem: how does one know whether a research hypothesis is worth investigating? Given the increasing number of research publications, it is complicated to guide such decisions. Previous work has shown how predicting generally emerging research topics can provide some help. Yet, in specialized scientific domains, only little is known about how to provide a service that allows users to ease the identification of scientific claims worth investigating. Scientific claims here means a natural language sentence that expresses a relationship between two entities. In particular, how one of them affects, manipulates, or causes the other entity. In this paper, we propose a data-driven approach aiming at filling this gap and empowering users at query level: given the results of a query, we deliver a characterization of clusters of the query results to discover the contextualization of scientific claims and the identification of those claims that may be worth more research efforts. To do so, we cluster documents with scientific claims that share the same context by leveraging co-clustering. After that, we characterize the clusters to annotate them. Our annotation focuses on two core aspects: controversy and diversity of claims in a given cluster. Controversy arises when two or more claims semantically contradict each other; diversity means the presence of different semantics of the claims that do not contradict each other but provide different insights expressed by some paper. To evaluate the benefits of our approach, we performed an extensive retrospective analysis on PubMed.
机译:研究人员经常努力解决一个普遍的问题:如何知道一项研究假设是否值得研究?鉴于研究出版物的数量不断增加,指导此类决策很复杂。先前的工作表明,预测一般出现的研究主题将如何提供帮助。然而,在专门的科学领域,关于如何提供一种服务,使用户能够简化对值得研究的科学主张的识别,知之甚少。这里的科学主张是指表达两个实体之间关系的自然语言句子。特别是其中一个如何影响,操纵或导致另一个实体。在本文中,我们提出了一种数据驱动的方法,旨在填补这一空白并在查询级别赋予用户权限:给定查询结果,我们对查询结果的簇进行表征,以发现科学主张和识别的上下文这些主张可能值得进行更多的研究。为此,我们利用协同聚类将具有相同上下文的科学主张归类为文档。之后,我们对聚类进行表征以对其进行注释。我们的注释集中在两个核心方面:给定集群中的争议和主张的多样性。当两个或多个声明在语义上相互矛盾时,就会引起争议。多样性意味着权利要求存在不同的语义,这些语义并不相互矛盾,而是提供了某些论文表达的不同见解。为了评估我们方法的好处,我们对PubMed进行了广泛的回顾性分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号