首页> 美国卫生研究院文献>RNA >RNAcode: Robust discrimination of coding and noncoding regions in comparative sequence data
【2h】

RNAcode: Robust discrimination of coding and noncoding regions in comparative sequence data

机译:RNA码:比较序列数据中对编码区和非编码区的鲁棒区分

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

With the availability of genome-wide transcription data and massive comparative sequencing, the discrimination of coding from noncoding RNAs and the assessment of coding potential in evolutionarily conserved regions arose as a core analysis task. Here we present RNAcode, a program to detect coding regions in multiple sequence alignments that is optimized for emerging applications not covered by current protein gene-finding software. Our algorithm combines information from nucleotide substitution and gap patterns in a unified framework and also deals with real-life issues such as alignment and sequencing errors. It uses an explicit statistical model with no machine learning component and can therefore be applied “out of the box,” without any training, to data from all domains of life. We describe the RNAcode method and apply it in combination with mass spectrometry experiments to predict and confirm seven novel short peptides in Escherichia coli and to analyze the coding potential of RNAs previously annotated as “noncoding.” RNAcode is open source software and available for all major platforms at .
机译:随着全基因组转录数据的可用性和大量的比较测序,区分非编码RNA的编码和评估进化保守区域的编码潜力成为核心分析任务。在这里,我们介绍RNAcode,该程序可检测多个序列比对中的编码区,该程序针对当前蛋白质基因发现软件未涵盖的新兴应用进行了优化。我们的算法在一个统一的框架中结合了来自核苷酸取代和缺口模式的信息,还处理了诸如序列比对和测序错误等现实问题。它使用没有机器学习成分的显式统计模型,因此无需任何培训即可“开箱即用”应用于来自生活各个领域的数据。我们描述了RNAcode方法,并将其与质谱实验结合应用,以预测和确认大肠杆菌中的七个新型短肽,并分析先前标注为“非编码”的RNA的编码潜力。 RNAcode是开放源代码软件,可从以下网站下载到所有主要平台。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号