...
首页> 外文期刊>Communications in Statistics. B, Simulation and Computation >Imputation in Data Fusion of Heterogeneous Data Sets A Model-Based Numerical Experiment
【24h】

Imputation in Data Fusion of Heterogeneous Data Sets A Model-Based Numerical Experiment

机译:异构数据集数据融合中的归因基于模型的数值实验

获取原文
获取原文并翻译 | 示例
           

摘要

Given the very large amount of data obtained everyday through population surveys, much of the new research again could use this information instead of collecting new samples. Unfortunately, relevant data are often disseminated into different files obtained through different sampling designs. Data fusion is a set of methods used to combine information from different sources into a single dataset. In this article, we are interested in a specific problem: the fusion of two data files, one of which being quite small. We propose a model-based procedure combining a logistic regression with an Expectation-Maximization algorithm. Results show that despite the lack of data, this procedure can perform better than standard matching procedures.
机译:考虑到每天通过人口调查获得的大量数据,许多新研究再次可以使用此信息来代替收集新样本。不幸的是,相关数据经常散布到通过不同抽样设计获得的不同文件中。数据融合是用于将来自不同来源的信息组合到单个数据集中的一组方法。在本文中,我们对一个特定问题感兴趣:两个数据文件的融合,其中一个很小。我们提出了一个基于模型的程序,将逻辑回归与期望最大化算法结合在一起。结果表明,尽管缺少数据,但该过程的性能要优于标准匹配过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号