首页> 外文学位 >Alternatives to relational databases in precision medicine: Comparison of NoSQL approaches for big data storage using supercomputers.
【24h】

Alternatives to relational databases in precision medicine: Comparison of NoSQL approaches for big data storage using supercomputers.

机译:精密医学中关系数据库的替代方案:使用超级计算机的NoSQL存储大数据方法的比较。

获取原文
获取原文并翻译 | 示例

摘要

Improvements in medical and genomic technologies have dramatically increased the production of electronic data over the last decade. As a result, data management is rapidly becoming a major determinant, and urgent challenge, for the development of Precision Medicine. Although successful data management is achievable using Relational Database Management Systems (RDBMS), exponential data growth is a significant contributor to failure scenarios. Growing amounts of data can also be observed in other sectors, such as economics and business, which, together with the previous facts, suggests that alternate database approaches (NoSQL) may soon be required for efficient storage and management of big databases. However, this hypothesis has been difficult to test in the Precision Medicine field since alternate database architectures are complex to assess and means to integrate heterogeneous electronic health records (EHR) with dynamic genomic data are not easily available.;In this dissertation, we present a novel set of experiments for identifying NoSQL database approaches that enable effective data storage and management in Precision Medicine using patients' clinical and genomic information from the cancer genome atlas (TCGA). The first experiment draws on performance and scalability from biologically meaningful queries with differing complexity and database sizes. The second experiment measures performance and scalability in database updates without schema changes. The third experiment assesses performance and scalability in database updates with schema modifications due dynamic data. We have identified two NoSQL approach, based on Cassandra and Redis, which seems to be the ideal database management systems for our precision medicine queries in terms of performance and scalability. We present NoSQL approaches and show how they can be used to manage clinical and genomic big data. Our research is relevant to the public health since we are focusing on one of the main challenges to the development of Precision Medicine and, consequently, investigating a potential solution to the progressively increasing demands on health care.
机译:在过去的十年中,医学和基因组技术的进步极大地提高了电子数据的生成。结果,数据管理正迅速成为精密医学发展的主要决定因素和紧迫挑战。尽管使用关系数据库管理系统(RDBMS)可以实现成功的数据管理,但是指数数据增长是导致失败情况的重要因素。在其他领域,例如经济和商业领域,也可以观察到越来越多的数据,这与以前的事实一起表明,可能很快需要备用数据库方法(NoSQL)来有效地存储和管理大型数据库。然而,由于备用数据库架构的评估很复杂,并且难以将异构电子健康记录(EHR)与动态基因组数据整合在一起的方法,因此在精密医学领域很难对此假设进行检验。用于识别NoSQL数据库方法的一组新颖的实验,这些方法可以使用来自癌症基因组图谱(TCGA)的患者的临床和基因组信息,在Precision Medicine中实现有效的数据存储和管理。第一个实验从具有不同复杂性和数据库大小的生物学意义上的查询中获取性能和可伸缩性。第二个实验在不更改架构的情况下测量了数据库更新中的性能和可伸缩性。第三个实验通过对由于动态数据引起的架构修改来评估数据库更新中的性能和可伸缩性。我们已经确定了两种基于Cassandra和Redis的NoSQL方法,就性能和可伸缩性而言,这似乎是我们精确医学查询的理想数据库管理系统。我们将介绍NoSQL方法,并展示如何将其用于管理临床和基因组大数据。我们的研究与公共卫生息息相关,因为我们专注于精确医学发展的主要挑战之一,因此,正在研究对医疗保健日益增长的需求的潜在解决方案。

著录项

  • 作者

    Velazquez, Enrique Israel.;

  • 作者单位

    University of Pittsburgh.;

  • 授予单位 University of Pittsburgh.;
  • 学科 Science education.;Health education.;Information science.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 232 p.
  • 总页数 232
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号