首页> 外文会议>IEEE EMBS International Conference on Biomedical and Health Informatics >Fast and efficient genotype encoding using sparse 2D bitmaps for database-driven genomics applications
【24h】

Fast and efficient genotype encoding using sparse 2D bitmaps for database-driven genomics applications

机译:使用稀疏2D位图对数据库驱动的基因组学应用进行快速有效的基因型编码

获取原文

摘要

Data management is a main challenge facing many genomics applications. A central target for genomic research is identifying and storing genetic variants present in human populations. Recently, there has been increasing interest in adopting a database representation for variant information. However, the massive scale of variant data pose many storage and access time challenges for database-driven genomic applications. Efficient database-driven variant encoding techniques need to be developed to address this problem. In this paper we propose a variant encoding technique for Single Nucleotide Polymorphisms (SNPs) based on 2D sparse bitmaps. The proposed encoding technique was designed to achieve high compressibility while minimizing access time. Using this approach, we were able to reduce the database storage space of the 1000 Genome dataset pilot data to 4.75GB from the 45.24GB required in a basic implementation. Our approach achieved this reduction while reducing database access time by around 100 times. Furthermore, we compared our approach to the popular Ensembl Variant Database and achieved database size reductions reaching up to 47.33% without compromising access time.
机译:数据管理是许多基因组学应用程序面临的主要挑战。基因组研究的中心目标是识别和存储人类群体中存在的遗传变异。近来,对于采用用于变体信息的数据库表示形式已经引起了越来越多的兴趣。但是,大规模的变体数据给数据库驱动的基因组应用带来了许多存储和访问时间的挑战。需要开发有效的数据库驱动的变体编码技术来解决此问题。在本文中,我们提出了一种基于2D稀疏位图的单核苷酸多态性(SNP)的变体编码技术。提出的编码技术旨在在最大程度减少访问时间的同时实现高可压缩性。使用这种方法,我们能够将1000个基因组数据集试验数据的数据库存储空间从基本实现所需的45.24GB减少到4.75GB。我们的方法实现了这种减少,同时将数据库访问时间减少了约100倍。此外,我们将我们的方法与流行的Ensembl Variant数据库进行了比较,并在不影响访问时间的情况下将数据库大小减少了多达47.33%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号