首页> 外文会议>ACM international conference on information and knowledge management >On the Foundations of Probabilistic Information Integration
【24h】

On the Foundations of Probabilistic Information Integration

机译:概率信息集成的基础

获取原文

摘要

Information integration has been a subject of research for several decades and still remains a very active research area. Many new applications depend or benefit from large scale integration. Examples include large research projects in life sciences, need for data sharing among government agencies, reliance of corporations on business intelligence (which requires data integration from many heterogeneous sources), and integration of information on the web. The importance of information integration with uncertainty has been observed in recent years. Frequently, information from multiple sources are uncertain and possibly inconsistent. Further the process of integration often depends on approximate schema mappings, another source of uncertainty. An integration system is useful only to the extent that the information it produces can be trusted. Hence, providing a measure of certainty for integrated information is of crucial importance in many important applications. In this paper we study the problem of integration of uncertain information. We present a simple and intuitive approach to the representation and integration of uncertain information from multiple sources, and show that our integration approach coincides with a recent formalism for uncertain information integration. We extend the model to probabilistic possible-worlds, and show certain unintuitive constraints are imposed upon probabilities of possible-worlds of sources. In particular, we show the probabilities of possible worlds of a source are not independent, rather, they are dependent on probabilities of other sources. We study the problem of determining the probabilities for the result of integration. Finally, we present a practical approach to relaxing probabilistic constraints in integration.
机译:信息集成已经成为数十年来的研究主题,并且仍然是一个非常活跃的研究领域。许多新应用程序都依赖于大规模集成或从中受益。例如,生命科学领域的大型研究项目,政府机构之间数据共享的需求,公司对商业智能的依赖(这需要来自许多不同来源的数据集成)以及网络上信息的集成。近年来,已经观察到信息集成具有不确定性的重要性。通常,来自多个来源的信息不确定且可能不一致。此外,集成过程通常取决于近似模式映射,这是不确定性的另一个来源。集成系统仅在其生成的信息可以信任的范围内才有用。因此,在许多重要应用中,为集成信息提供确定性措施至关重要。在本文中,我们研究了不确定信息的整合问题。我们提出了一种简单直观的方法来表示和集成来自多个来源的不确定信息,并表明我们的集成方法与最近的不确定信息集成形式主义相吻合。我们将模型扩展到概率可能世界,并表明对源可能世界的概率施加了某些不直观的约束。特别是,我们显示了一个来源的可能世界的概率不是独立的,而是取决于其他来源的概率。我们研究确定积分结果概率的问题。最后,我们提出了一种放松集成中的概率约束的实用方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号