Imputation in Data Fusion of Heterogeneous Data Sets A Model-Based Numerical Experiment

ANDRE BERCHTOLD; ANDRE JEANNIN

首页> 外文期刊>Communications in Statistics. B, Simulation and Computation >Imputation in Data Fusion of Heterogeneous Data Sets A Model-Based Numerical Experiment

【24h】

Imputation in Data Fusion of Heterogeneous Data Sets A Model-Based Numerical Experiment

机译：异构数据集数据融合中的归因基于模型的数值实验

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Given the very large amount of data obtained everyday through population surveys, much of the new research again could use this information instead of collecting new samples. Unfortunately, relevant data are often disseminated into different files obtained through different sampling designs. Data fusion is a set of methods used to combine information from different sources into a single dataset. In this article, we are interested in a specific problem: the fusion of two data files, one of which being quite small. We propose a model-based procedure combining a logistic regression with an Expectation-Maximization algorithm. Results show that despite the lack of data, this procedure can perform better than standard matching procedures.

机译：考虑到每天通过人口调查获得的大量数据，许多新研究再次可以使用此信息来代替收集新样本。不幸的是，相关数据经常散布到通过不同抽样设计获得的不同文件中。数据融合是用于将来自不同来源的信息组合到单个数据集中的一组方法。在本文中，我们对一个特定问题感兴趣：两个数据文件的融合，其中一个很小。我们提出了一个基于模型的程序，将逻辑回归与期望最大化算法结合在一起。结果表明，尽管缺少数据，但该过程的性能要优于标准匹配过程。

著录项

来源
《Communications in Statistics. B, Simulation and Computation》 |2008年第7期|p.1316-1328|共13页
作者
ANDRE BERCHTOLD; ANDRE JEANNIN;
展开▼
作者单位

Groupe de Recherche sur la Sante des Adolescents, University Hospital Center and University of Lausanne, Lausanne, Switzerland;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类统计学;
关键词
binary variable; data fusion; data structure; expectationmaximization algorithm; logistic regression; matching.;

机译：二进制变量;数据融合;数据结构;期望最大化算法;逻辑回归;匹配;

相似文献

外文文献
中文文献
专利

1. Initializing numerical weather prediction models with satellite-derived surface soil moisture: Data assimilation experiments with ECMWF's Integrated Forecast System and the TMI soil moisture data set [J] . M. Drusch Journal of Geophysical Research, D. Atmospheres: JGR . 2007,第d3期

机译：使用卫星衍生的地表土壤湿度初始化数值天气预报模型：使用ECMWF的综合预报系统和TMI土壤湿度数据集进行数据同化实验
2. Exploring and Determining Missing-data Imputation Method for Socio-economic Data by Way of Designing a Simulation Study in the Context of Gross National Happiness (GNH) Data Set [J] . Sonam Tshering, Takeo Okazaki, Satoshi Endo International journal of computer science and network security . 2012,第5期

机译：通过设计国民幸福总值（GNH）数据集中的模拟研究，探索和确定社会经济数据的缺失数据估算方法
3. Exploring and Determining Missing-data Imputation Method for Socio-economic Data by Way of Designing a Simulation Study in the Context of Gross National Happiness (GNH) Data Set [J] . Sonam Tshering, Takeo Okazaki, Satoshi Endo International journal of computer science and network security . 2012,第5期

机译：通过设计国民幸福总值（GNH）数据集中的模拟研究，探索和确定社会经济数据的缺失数据估算方法
4. A Rough Set Approach to Data Imputation and Its Application to a Dissolved Gas Analysis Dataset [C] . Junzo Watada, Chen Shi, Yoshiyuki Yabuuchi, International Conference on Computing Measurement Control and Sensor Network . 2017

机译：一种粗略的数据归档方法及其在溶解气体分析数据集的应用方法
5. Multiple Imputation Methods for Large Multi-Scale Data Sets with Missing or Suppressed Values [D] . Cao, Jian. 2018

机译：具有缺失或抑制值的大型多尺度数据集的多重估算方法
6. Model-Based Heterogeneous Data Fusion for Reliable Force Estimation in Dynamic Structures under Uncertainties [O] . Babak Khodabandeloo, Dyan Melvin, Hongki Jo 2017

机译：不确定条件下动态结构可靠力估计的基于模型的异构数据融合
7. Model-Based Heterogeneous Data Fusion for Reliable Force Estimation in Dynamic Structures under Uncertainties [O] . Babak Khodabandeloo, Dyan Melvin, Hongki Jo 2017

机译：基于模型的非均匀数据融合在动态结构不确定性下的可靠力估计

Imputation in Data Fusion of Heterogeneous Data Sets A Model-Based Numerical Experiment

摘要

著录项

相似文献

相关主题

期刊订阅