The federated database--a basis for biobank-based post-genome studies, integrating phenome and genome data from 600,000 twin pairs in Europe.

Muilu J; Peltonen L; Litton JE

首页> 外文期刊>European journal of human genetics: EJHG >The federated database--a basis for biobank-based post-genome studies, integrating phenome and genome data from 600,000 twin pairs in Europe.

【24h】

The federated database--a basis for biobank-based post-genome studies, integrating phenome and genome data from 600,000 twin pairs in Europe.

机译：联邦数据库-基于生物库的后基因组研究的基础，整合了来自欧洲60万对双胞胎的表型和基因组数据。

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Integration of complex data and data management represent major challenges in large-scale biobank-based post-genome era research projects like GenomEUtwin (an international collaboration between eight Twin Registries) with extensive amounts of genotype and phenotype data combined from different data sources located in different countries. The challenge lies not only in data harmonization and constant update of clinical details in various locations, but also in the heterogeneity of data storage and confidentiality of sensitive health-related and genetic data. Solid infrastructure must be built to provide secure, but easily accessible and standardized, data exchange also facilitating statistical analyses of the stored data. Data collection sites desire to have full control of the accumulation of data, and at the same time the integration should facilitate effortless slicing and dicing of the data for different types of data pooling and study designs. Here we describe how we constructed a federated database infrastructure for genotype and phenotype information collected in seven European countries and Australia and connected this database setting via a network called TwinNET to guarantee effortless data exchange and pooled analyses. This federated database system offers a powerful facility for combining different types of information from multiple data sources. The system is transparent to end users and application developers, since it makes the set of federated data sources look like a single system. The user need not be aware of the format or site where the data are stored, the language or programming interface of the data source, how the data are physically stored, whether they are partitioned and/or replicated or what networking protocols are used. The user sees a single standardized interface with the desired data elements for pooled analyses.

机译：复杂数据和数据管理的集成代表了大规模的基于生物库的后基因组时代研究项目（如GenomEUtwin（八个孪生注册管理机构之间的国际合作））的重大挑战，该项目具有大量基因型和表型数据，这些数据来自不同地点的不同数据源国家。挑战不仅在于数据协调和在各个位置不断更新临床细节，还在于数据存储的异质性和敏感的健康相关基因数据的机密性。必须建立坚实的基础架构，以提供安全但易于访问和标准化的数据交换，还必须促进对存储数据的统计分析。数据收集站点希望完全控制数据的积累，同时，集成应有助于轻松地对不同类型的数据池和研究设计进行数据的切片和切块。在这里，我们描述了我们如何构建用于收集七个欧洲国家和澳大利亚的基因型和表型信息的联邦数据库基础结构，以及如何通过称为TwinNET的网络连接此数据库设置，以确保轻松进行数据交换和汇总分析。该联合数据库系统提供了强大的功能，可以组合来自多个数据源的不同类型的信息。该系统对最终用户和应用程序开发人员透明，因为它使联合数据源集看起来像一个系统。用户不需要知道数据的存储格式或站点，数据源的语言或编程接口，如何物理存储数据，是否对其进行分区和/或复制或使用何种网络协议。用户将看到一个单一的标准化界面，其中包含用于合并分析的所需数据元素。

著录项

来源
《European journal of human genetics: EJHG》 |2007年第7期|共6页
作者
Muilu J; Peltonen L; Litton JE;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类医学遗传学;
关键词
Database Management Systems; Databases; Genetic; Registries; 数据库管理系统; 登记;

机译：Database Management Systems;Databases;Genetic;Registries;数据库管理系统;登记;

相似文献

外文文献
中文文献
专利

1. The federated database--a basis for biobank-based post-genome studies, integrating phenome and genome data from 600,000 twin pairs in Europe. [J] . Muilu J, Peltonen L, Litton JE European journal of human genetics: EJHG . 2007,第7期

机译：联邦数据库-基于生物库的后基因组研究的基础，整合了来自欧洲60万对双胞胎的表型和基因组数据。
2. Advancing Post-Genome Data and System Integration through Machine Learning [J] . FranciscoAzuaje Comparative and functional genomics . 2002,第1期

机译：通过机器学习推进后基因组数据和系统集成
3. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data [J] . Joshua C Denny, Lisa Bastarache, Marylyn D Ritchie, Nature biotechnology . 2013,第12期

机译：电子病历数据的全基因组关联研究与全基因组的关联研究数据的系统比较
4. INVESTIGATION OF THE MOLECULAR BASIS OF MOONLIGHTING PROTEIN, ENOLASE, IN THE POST-GENOME STUDY [C] . Natsuko MIURA, Kazuma MATSUI, Satoshi ENDO, Progress on post-genome technologies and modern natural products . 2011

机译：基因组后研究中月光照蛋白烯醇酶的分子基础研究
5. Encore: A computational framework for the integrative analysis of genome-wide association study data and other biological data. [D] . Davis, Nicholas A. 2012

机译：Encore：用于对全基因组关联研究数据和其他生物学数据进行综合分析的计算框架。
6. A Genome-Wide Integrative Association Study of DNA Methylation and Gene Expression Data and Later Life Cognitive Functioning in Monozygotic Twins [O] . Mette Soerensen, Dominika Marzena Hozakowska-Roszkowska, Marianne Nygaard, 2020

机译：一种全基因组甲基化和基因表达数据和后期生命认知在单卵双胞胎的后期生命认知研究
7. Advancing Post-Genome Data and System Integration Through Machine Learning [O] . Francisco Azuaje 2006

机译：通过机器学习推进后基因组数据和系统集成

The federated database--a basis for biobank-based post-genome studies, integrating phenome and genome data from 600,000 twin pairs in Europe.

摘要

著录项

相似文献

相关主题

期刊订阅