【24h】

Evidence-based Trustworthiness

机译:循证诚信

获取原文

摘要

The information revolution brought with it information pollution. Information retrieval and extraction help us cope with abundant information from diverse sources. But some sources are of anonymous authorship, and some are of uncertain accuracy, so how can we determine what we should actually believe? Not all information sources are equally trustworthy, and simply accepting the majority view is often wrong. This paper develops a general framework for estimating the trustworthiness of information sources in an environment where multiple sources provide claims and supporting evidence, and each claim can potentially be produced by multiple sources. We consider two settings: one in which information sources directly assert claims, and a more realistic and challenging one, in which claims are inferred from evidence provided by sources, via (possibly noisy) NLP techniques. Our key contribution is to develop a family of probabilistic models that jointly estimate the trustworthiness of sources, and the credibility of claims they assert. This is done while accounting for the (possibly noisy) NLP needed to infer claims from evidence supplied by sources. We evaluate our framework on several datasets. showing strong results and significant improvement over baselines.
机译:信息革命带来了信息污染。信息检索和提取有助于我们应对来自各种来源的大量信息。但是有些来源是匿名作者,有些来源的准确性不确定,那么我们如何确定我们应该真正相信的呢?并非所有信息源都同样值得信赖,仅接受多数意见常常是错误的。本文开发了一个通用框架,用于估计在多个来源提供索赔和支持证据且每个索赔都可能由多个来源产生的环境中信息源的可信赖性。我们考虑两种设置:一种是信息源直接声明主张,另一种是更现实和更具挑战性的设置,其中主张是通过NLP技术(可能是嘈杂的)从信息源提供的证据中推断出来的。我们的主要贡献是开发了一个概率模型系列,可以共同评估来源的可信赖性以及他们主张的主张的可信度。这样做是在考虑从来源提供的证据推断索赔所需的(可能有噪声的)NLP的情况下进行的。我们在几个数据集上评估我们的框架。显示出强劲的结果,并且比基准有了显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号