首页> 外文期刊>Statistical Journal of the IAOS: Journal of the International Association for Official Statistics >A first step towards a German SynLBD: Constructing a German Longitudinal Business Database
【24h】

A first step towards a German SynLBD: Constructing a German Longitudinal Business Database

机译:迈向德国SynLBD的第一步:构建德国纵向业务数据库

获取原文
获取原文并翻译 | 示例
           

摘要

One major criticism against the use of synthetic data has been that the efforts necessary to generate useful synthetic data are so intense that many statistical agencies cannot afford them. We argue many lessons in this evolving field have been learned in the early years of synthetic data generation, and can be used in the development of new synthetic data products, considerably reducing the required investments. The final goal of the project described in this paper will be to evaluate whether synthetic data algorithms developed in the U.S. to generate a synthetic version of the Longitudinal Business Database (LBD) can easily be transferred to generate a similar data product for other countries. We construct a German data product with information comparable to the LBD - the German Longitudinal Business Database (GLBD) - that is generated from different administrative sources at the Institute for Employment Research, Germany. In a future step, the algorithms developed for the synthesis of the LBD will be applied to the GLBD. Extensive evaluations will illustrate whether the algorithms provide useful synthetic data without further adjustment. The ultimate goal of the project is to provide access to multiple synthetic datasets similar to the SynLBD at Cornell to enable comparative studies between countries. The Synthetic GLBD is a first step towards that goal.
机译:对使用合成数据的一种主要批评是,生成有用的合成数据所需的努力如此之大,以至于许多统计机构无法负担得起。我们认为,在合成数据生成的早期阶段已经汲取了这一不断发展的领域中的许多经验教训,这些经验教训可用于开发新的合成数据产品,从而大大减少了所需的投资。本文所述项目的最终目标将是评估在美国开发的用于生成纵向业务数据库(LBD)的合成版本的综合数据算法是否可以轻松地转移以生成其他国家的类似数据产品。我们构建的德国数据产品具有与LBD相当的信息-德国纵向业务数据库(GLBD),该信息是从德国就业研究所的不同管理来源生成的。在未来的步骤中,为LBD的合成而开发的算法将应用于GLBD。广泛的评估将说明该算法是否无需进一步调整即可提供有用的合成数据。该项目的最终目标是提供与康奈尔大学SynLBD相似的多个综合数据集的访问权限,以实现国家间的比较研究。合成GLBD是朝着这个目标迈出的第一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号