首页> 美国卫生研究院文献>International Journal of Epidemiology >DataSHIELD: taking the analysis to the data not the data to the analysis
【2h】

DataSHIELD: taking the analysis to the data not the data to the analysis

机译:DataSHIELD:将分析带到数据而不是数据带到分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Background: Research in modern biomedicine and social science requires sample sizes so large that they can often only be achieved through a pooled co-analysis of data from several studies. But the pooling of information from individuals in a central database that may be queried by researchers raises important ethico-legal questions and can be controversial. In the UK this has been highlighted by recent debate and controversy relating to the UK’s proposed ‘care.data’ initiative, and these issues reflect important societal and professional concerns about privacy, confidentiality and intellectual property. DataSHIELD provides a novel technological solution that can circumvent some of the most basic challenges in facilitating the access of researchers and other healthcare professionals to individual-level data.>Methods: Commands are sent from a central analysis computer (AC) to several data computers (DCs) storing the data to be co-analysed. The data sets are analysed simultaneously but in parallel. The separate parallelized analyses are linked by non-disclosive summary statistics and commands transmitted back and forth between the DCs and the AC. This paper describes the technical implementation of DataSHIELD using a modified R statistical environment linked to an Opal database deployed behind the computer firewall of each DC. Analysis is controlled through a standard R environment at the AC.>Results: Based on this Opal/R implementation, DataSHIELD is currently used by the Healthy Obese Project and the Environmental Core Project (BioSHaRE-EU) for the federated analysis of 10 data sets across eight European countries, and this illustrates the opportunities and challenges presented by the DataSHIELD approach.>Conclusions: DataSHIELD facilitates important research in settings where: (i) a co-analysis of individual-level data from several studies is scientifically necessary but governance restrictions prohibit the release or sharing of some of the required data, and/or render data access unacceptably slow; (ii) a research group (e.g. in a developing nation) is particularly vulnerable to loss of intellectual property—the researchers want to fully share the information held in their data with national and international collaborators, but do not wish to hand over the physical data themselves; and (iii) a data set is to be included in an individual-level co-analysis but the physical size of the data precludes direct transfer to a new site for analysis.
机译:>背景:现代生物医学和社会科学的研究要求样本量如此之大,以至于通常只能通过对若干项研究的数据进行汇总联合分析来实现。但是研究人员可能会查询中央数据库中来自个人的信息,这引起了重要的伦理法律问题,并且可能会引起争议。在英国,有关英国拟议的“ care.data”计划的最新辩论和争议突出表明了这一点,这些问题反映出社会和专业人士对隐私,机密性和知识产权的关注。 DataSHIELD提供了一种新颖的技术解决方案,可以克服一些最基本的挑战,以便利研究人员和其他医疗保健专业人员访问个人数据。>方法:命令从中央分析计算机(AC)发送)到存储要进行共同分析的数据的几台数据计算机(DC)。同时但并行地分析数据集。单独的并行分析通过DC和AC之间来回传输的非公开摘要统计信息和命令进行链接。本文介绍了使用修改后的R统计环境链接到每个DC的计算机防火墙后面部署的Opal数据库的DataSHIELD的技术实现。分析是通过AC上的标准R环境进行控制的。>结果:基于此Opal / R实施,Healthy肥胖项目和环境核心项目(BioSHaRE-EU)目前正在将DataSHIELD用于联合分析了八个欧洲国家的10个数据集,这说明了DataSHIELD方法带来的机遇和挑战。>结论: DataSHIELD促进了以下环境中的重要研究:(i)对个体进行共同分析科学上有必要从几项研究中获得最高水平的数据,但治理限制禁止释放或共享某些所需数据,和/或使数据访问速度过慢而无法接受; (ii)一个研究小组(例如在发展中国家)特别容易遭受知识产权损失-研究人员希望与国家和国际合作者充分共享其数据中保存的信息,但不希望移交物理数据他们自己; (iii)数据集应包含在个人级别的协同分析中,但数据的物理大小无法将其直接传输到新站点进行分析。

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号