International Conference on Current Trends in Theory and Practice of Computer Science

Pay-as-you-go Data Integration: Experiences and Recurring Themes

Abstract

Data integration typically seeks to provide the illusion that data from multiple distributed sources comes from a single, well-managed source. Providing this illusion in practice tends to involve the design of a global schema that captures the users' data requirements, followed by manual (with tool support) construction of mappings between the sources and the global schema. This overall approach can provide high-quality integrations but at high cost, and tends to be unsuitable for areas with large numbers of rapidly changing sources, where users may be willing to cope with a less than perfect integration. Pay-as-you-go data integration has been proposed to overcome the need for costly manual data integration. It tends to involve two steps: initialisation, the automatic creation of mappings (generally of poor quality) between sources; and improvement, the obtaining of feedback on some aspect of the integration and the application of this feedback to revise the integration. There has been considerable research in this area over a ten-year period. This paper reviews some experiences with pay-as-you-go data integration, providing a framework that can be used to compare or develop pay-as-you-go data integration techniques.
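
The two steps named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's framework: the name-similarity matcher, the `threshold` parameter, the boolean feedback format, and the function names `initialise` and `improve` are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def initialise(source_a, source_b, threshold=0.5):
    """Initialisation: propose attribute mappings automatically.

    Assumption: string similarity of attribute names stands in for a real
    matcher, so the candidates are expected to be of poor quality.
    """
    mappings = {}
    for a in source_a:
        for b in source_b:
            score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
            if score >= threshold:
                mappings[(a, b)] = score
    return mappings


def improve(mappings, feedback):
    """Improvement: revise the mappings using feedback on single candidates.

    Assumption: feedback maps a candidate pair to True (correct) or
    False (incorrect).
    """
    revised = dict(mappings)
    for pair, is_correct in feedback.items():
        if is_correct:
            revised[pair] = 1.0      # confirmed by the user
        else:
            revised.pop(pair, None)  # rejected by the user
    return revised


# Hypothetical attribute names from two sources.
candidates = initialise(["name", "postcode"], ["full_name", "zip_code", "surname"])
revised = improve(
    candidates,
    {("name", "surname"): False, ("name", "full_name"): True},
)
```

In this sketch the automatic step proposes a wrong candidate (`name` to `surname`) alongside plausible ones, and a small amount of feedback removes it, which is the trade-off the abstract describes: low upfront cost, with quality improving as feedback accumulates.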
