首页> 外文会议>Advanced information systems engineering >Probabilistic Models to Reconcile Complex Data from Inaccurate Data Sources
【24h】

Probabilistic Models to Reconcile Complex Data from Inaccurate Data Sources

机译:从不准确的数据源调和复杂数据的概率模型

获取原文
获取原文并翻译 | 示例

摘要

Several techniques have been developed to extract and integrate data from web sources. However, web data are inherently imprecise and uncertain. This paper addresses the issue of characterizing the uncertainty of data extracted from a number of inaccurate sources. We develop a probabilistic model to compute a probability distribution for the extracted values, and the accuracy of the sources. Our model considers the presence of sources that copy their contents from other sources, and manages the misleading consensus produced by copiers. We extend the models previously proposed in the literature by working on several attributes at a time to better leverage all the available evidence. We also report the results of several experiments on both synthetic and real-life data to show the effectiveness of the proposed approach.
机译:已经开发了几种技术来从Web来源提取和集成数据。但是,Web数据本质上是不精确且不确定的。本文解决了表征从许多不准确来源提取的数据的不确定性的问题。我们开发了一个概率模型来计算提取值的概率分布以及源的准确性。我们的模型考虑了从其他来源复制其内容的来源的存在,并管理了复印机产生的误导性共识。我们通过一次研究多个属性来扩展先前文献中提出的模型,以更好地利用所有可用证据。我们还报告了关于合成和真实数据的几个实验的结果,以显示所提出方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号