DataSHIELD: taking the analysis to the data not the data to the analysis

机译：DataSHIELD：将分析带到数据而不是数据带到分析

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

>Background: Research in modern biomedicine and social science requires sample sizes so large that they can often only be achieved through a pooled co-analysis of data from several studies. But the pooling of information from individuals in a central database that may be queried by researchers raises important ethico-legal questions and can be controversial. In the UK this has been highlighted by recent debate and controversy relating to the UK’s proposed ‘care.data’ initiative, and these issues reflect important societal and professional concerns about privacy, confidentiality and intellectual property. DataSHIELD provides a novel technological solution that can circumvent some of the most basic challenges in facilitating the access of researchers and other healthcare professionals to individual-level data.>Methods: Commands are sent from a central analysis computer (AC) to several data computers (DCs) storing the data to be co-analysed. The data sets are analysed simultaneously but in parallel. The separate parallelized analyses are linked by non-disclosive summary statistics and commands transmitted back and forth between the DCs and the AC. This paper describes the technical implementation of DataSHIELD using a modified R statistical environment linked to an Opal database deployed behind the computer firewall of each DC. Analysis is controlled through a standard R environment at the AC.>Results: Based on this Opal/R implementation, DataSHIELD is currently used by the Healthy Obese Project and the Environmental Core Project (BioSHaRE-EU) for the federated analysis of 10 data sets across eight European countries, and this illustrates the opportunities and challenges presented by the DataSHIELD approach.>Conclusions: DataSHIELD facilitates important research in settings where: (i) a co-analysis of individual-level data from several studies is scientifically necessary but governance restrictions prohibit the release or sharing of some of the required data, and/or render data access unacceptably slow; (ii) a research group (e.g. in a developing nation) is particularly vulnerable to loss of intellectual property—the researchers want to fully share the information held in their data with national and international collaborators, but do not wish to hand over the physical data themselves; and (iii) a data set is to be included in an individual-level co-analysis but the physical size of the data precludes direct transfer to a new site for analysis.

机译：>背景：现代生物医学和社会科学的研究要求样本量如此之大，以至于通常只能通过对若干项研究的数据进行汇总联合分析来实现。但是研究人员可能会查询中央数据库中来自个人的信息，这引起了重要的伦理法律问题，并且可能会引起争议。在英国，有关英国拟议的“ care.data”计划的最新辩论和争议突出表明了这一点，这些问题反映出社会和专业人士对隐私，机密性和知识产权的关注。 DataSHIELD提供了一种新颖的技术解决方案，可以克服一些最基本的挑战，以便利研究人员和其他医疗保健专业人员访问个人数据。>方法：命令从中央分析计算机（AC）发送）到存储要进行共同分析的数据的几台数据计算机（DC）。同时但并行地分析数据集。单独的并行分析通过DC和AC之间来回传输的非公开摘要统计信息和命令进行链接。本文介绍了使用修改后的R统计环境链接到每个DC的计算机防火墙后面部署的Opal数据库的DataSHIELD的技术实现。分析是通过AC上的标准R环境进行控制的。>结果：基于此Opal / R实施，Healthy肥胖项目和环境核心项目（BioSHaRE-EU）目前正在将DataSHIELD用于联合分析了八个欧洲国家的10个数据集，这说明了DataSHIELD方法带来的机遇和挑战。>结论： DataSHIELD促进了以下环境中的重要研究：（i）对个体进行共同分析科学上有必要从几项研究中获得最高水平的数据，但治理限制禁止释放或共享某些所需数据，和/或使数据访问速度过慢而无法接受；（ii）一个研究小组（例如在发展中国家）特别容易遭受知识产权损失-研究人员希望与国家和国际合作者充分共享其数据中保存的信息，但不希望移交物理数据他们自己; （iii）数据集应包含在个人级别的协同分析中，但数据的物理大小无法将其直接传输到新站点进行分析。

著录项

期刊名称 International Journal of Epidemiology
作者
Amadou Gaye; Yannick Marcon; Julia Isaeva; Philippe LaFlamme; Andrew Turner; Elinor M Jones; Joel Minion; Andrew W Boyd; Christopher J Newby; Marja-Liisa Nuotio; Rebecca Wilson; Oliver Butters; Barnaby Murtagh; Ipek Demir; Dany Doiron; Lisette Giepmans; Susan E Wallace; Isabelle Budin-Ljøsne; Carsten Oliver Schmidt; Paolo Boffetta; Mathieu Boniol; Maria Bota; Kim W Carter; Nick deKlerk; Chris Dibben; Richard W Francis; Tero Hiekkalinna; Kristian Hveem; Kirsti Kvaløy; Sean Millar; Ivan J Perry; Annette Peters; Catherine M Phillips; Frank Popham; Gillian Raab; Eva Reischl; Nuala Sheehan; Melanie Waldenberger; Markus Perola; Edwin van den Heuvel; John Macleod; Bartha M Knoppers; Ronald P Stolk; Isabel Fortier; Jennifer R Harris; Bruce HR Woffenbuttel; Madeleine J Murtagh; Vincent Ferretti; Paul R Burton;
展开▼
作者单位

展开▼
年(卷),期 -1(43),6
年度 -1
页码 1929–1944
总页数 16
原文格式 PDF
正文语种
中图分类公共卫生工程;
关键词
DataSHIELD pooled analysis ELSI privacy confidentiality disclosure distributed computing intellectual property bioinformatics;

机译：DataSHIELD;汇总分析;ELSI;隐私;机密性;披露;分布式计算;知识产权;生物信息学;

相似文献

外文文献
中文文献
专利

1. DataSHIELD: taking the analysis to the data, not the data to the analysis [J] . Gaye Amadou, Marcon Yannick, Isaeva Julia, International Journal of Epidemiology: Official Journal of the International Epidemiological Association . 2014,第Pta6期

机译：DataSHIELD：将分析带到数据，而不是数据带到分析
2. DataSHIELD: taking the analysis to the data, not the data to the analysis [J] . Gaye Amadou, Marcon Yannick, Isaeva Julia, International Journal of Epidemiology: Official Journal of the International Epidemiological Association . 2014,第Pta6期

机译：DataShield：将分析分析到数据，而不是分析的数据
3. DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data. [J] . Wolfson M, Wallace SE, Masca N, International Journal of Epidemiology: Official Journal of the International Epidemiological Association . 2010,第5期

机译：DataSHIELD：解决当代生物科学中的冲突-在不共享数据的情况下对个人数据进行汇总分析。
4. On the Analysis of Utility and Risk for Masked Data in Big Data: A Small Data Analysis [C] . Vicenc TORRA International Conference of the Catalan Association for Artificial Intelligence . 2018

机译：在大数据中屏蔽数据的实用性和风险分析：小数据分析
5. Scaling the Technology Opportunity Analysis text data mining methodology: Data extraction, cleaning, online analytical processing analysis, and reporting of large multi-source datasets. [D] . George, Richard Peyton. 2006

机译：扩展技术机会分析文本数据挖掘方法：数据提取，清理，在线分析处理分析以及大型多源数据集的报告。
6. DataSHIELD: resolving a conflict in contemporary bioscience—performing a pooled analysis of individual-level data without sharing the data [O] . Michael Wolfson, Susan E Wallace, Nicholas Masca, -1

机译：DataSHIELD：解决当代生物科学中的冲突-在不共享数据的情况下对个人数据进行汇总分析
7. DataSHIELD : taking the analysis to the data, not the data to the analysis [O] . Gaye Amadou, Marcon Yannick, Isaeva Julia, 2014

机译：DataSHIELD：将分析带到数据，而不是数据带到分析
8. Pump Failure Data Analysis. VTT's Contribution to the EuReDatA Benchmark Exercise on Data Analysis (Pumppujen Vikatietojen Analyysi. VTT:n osuus EuReDatAn Benchmark-Tutkimuksessa) [R] . Simola, K., Huovinen, T., Lehtinen, E., 1991

机译：泵故障数据分析。 VTT对EuReData数据分析基准测试的贡献（pumppujen Vikatietojen analyysi.VTT：n osuus EuReDatan Benchmark-Tutkimuksessa）

DataSHIELD: taking the analysis to the data not the data to the analysis

摘要

著录项

相似文献

相关主题

期刊订阅