首页> 美国卫生研究院文献>BMC Bioinformatics >BM-BC: a Bayesian method of base calling for Solexa sequence data
【2h】

BM-BC: a Bayesian method of base calling for Solexa sequence data

机译:BM-BC:贝叶斯方法的基础要求Solexa序列数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Base calling is a critical step in the Solexa next-generation sequencing procedure. It compares the position-specific intensity measurements that reflect the signal strength of four possible bases (A, C, G, T) at each genomic position, and outputs estimates of the true sequences for short reads of DNA or RNA. We present a Bayesian method of base calling, BM-BC, for Solexa-GA sequencing data. The Bayesian method builds on a hierarchical model that accounts for three sources of noise in the data, which are known to affect the accuracy of the base calls: fading, phasing, and cross-talk between channels. We show that the new method improves the precision of base calling compared with currently leading methods. Furthermore, the proposed method provides a probability score that measures the confidence of each base call. This probability score can be used to estimate the false discovery rate of the base calling or to rank the precision of the estimated DNA sequences, which in turn can be useful for downstream analysis such as sequence alignment.
机译:碱基检出是Solexa下一代测序程序中的关键步骤。它比较了特定位置的强度测量结果,这些测量结果反映了每个基因组位置上四个可能碱基(A,C,G,T)的信号强度,并输出了短读DNA或RNA的真实序列的估计值。我们为Solexa-GA测序数据提供了一种贝叶斯基本调用方法BM-BC。贝叶斯方法建立在一个层次模型的基础上,该模型考虑了数据中的三个噪声源,已知这三个噪声源会影响碱基检出的准确性:衰落,定相和通道之间的串扰。我们表明,与当前领先的方法相比,该新方法提高了基本调用的精度。此外,所提出的方法提供了一种概率分数,用于测量每个碱基检出的置信度。此概率分数可用于估计碱基检出的错误发现率或对估计的DNA序列的精度进行排名,这反过来可用于下游分析(例如序列比对)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号