首页> 外文会议>International conference on very large data bases >Massive Genomic Data Processing and Deep Analysis
【24h】

Massive Genomic Data Processing and Deep Analysis

机译:大规模基因组数据处理和深度分析

获取原文

摘要

Today large sequencing centers are producing genomic data at the rate of 10 terabytes a day and require complicated processing to transform massive amounts of noisy raw data into biological information. To address these needs, we develop a system for end-to-end processing of genomic data, including alignment of short read sequences, variation discovery, and deep analysis. We also employ a range of quality control mechanisms to improve data quality and parallel processing techniques for performance. In the demo, we will use real genomic data to show details of data transformation through the workflow, the usefulness of end results (ready for use as testable hypotheses), the effects of our quality control mechanisms and improved algorithms, and finally performance improvement.
机译:如今,大型测序中心正在每天10磅的速度产生基因组数据,并且需要复杂的处理来将大量的嘈杂的原始数据转化为生物信息。为了解决这些需求,我们开发一个用于基因组数据的端到端处理系统,包括短读序列的对齐,变异发现和深度分析。我们还采用了一系列质量控制机制,以提高性能的数据质量和并行处理技术。在演示中,我们将使用实际基因组数据通过工作流程来显示数据转换的详细信息,最终结果的有用性(准备用作可测试假设),我们的质量控制机制和改进算法的影响,以及最终的性能改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号