...
首页> 外文期刊>Nucleic Acids Research >Identification of transcribed protein coding sequence remnants within lincRNAs
【24h】

Identification of transcribed protein coding sequence remnants within lincRNAs

机译:Lincrnas内转录蛋白质编码序列残留的鉴定

获取原文
获取原文并翻译 | 示例
           

摘要

Long intergenic non-coding RNAs (lincRNAs) are non-coding transcripts 200 nucleotides long that do not overlap protein-coding sequences. Importantly, such elements are known to be tissue-specifically expressed and to play a widespread role in gene regulation across thousands of genomic loci. However, very little is known of the mechanisms for the evolutionary biogenesis of these RNA elements, especially given their poor conservation across species. It has been proposed that lincRNAs might arise from pseudogenes. To test this systematically, we developed a novel method that searches for remnants of protein-coding sequences within lincRNA transcripts; the hypothesis is that we can trace back their biogenesis from protein-coding genes or posterior transposon/retrotransposon insertions. Applying this method, we found 203 human lincRNA genes with regions significantly similar to protein-coding sequences. Our method provides a visualization tool to trace the evolutionary biogenesis of lincRNAs with respect to protein-coding genes by sequence divergence. Subsequently, we show the expression correlation between lincRNAs and their identified parental protein-coding genes using public RNA-seq repositories, hinting at novel gene regulatory relationships. In summary, we developed a novel computational methodology to study non-coding gene sequences, which can be applied to identify the evolutionary biogenesis and function of lincRNAs.
机译:长性非编码RNA(LincrNA)是非编码转录物& 200个核苷酸长度,不重叠蛋白质编码序列。重要的是,已知这些元素是特异性表达的组织和在数千个基因组基因座的基因调节中起着广泛的作用。然而,众所周知的是这些RNA元素的进化生物发生的机制,特别是鉴于它们在物种中的差。已经提出了Lincrnas可能来自伪原。为了系统地测试,我们开发了一种新的方法,用于搜索LincrNA转录物中的蛋白质编码序列的残余物;假设是我们可以从蛋白质编码基因或后转座/回复转换插入中追踪它们的生物发生。应用这种方法,我们发现203个人LincrNA基因与蛋白质编码序列显着类似的区域。我们的方法提供了一种可视化工具,以通过序列发散追踪蛋白质编码基因的Lincrnas的进化生物发生。随后,我们展示了利用公共RNA-SEQ储存库的LincrNA和其鉴定的父母编码基因之间的表达相关性,暗示了新的基因调节关系。总之,我们开发了一种研究非编码基因序列的新型计算方法,其可以应用于鉴定Lincrnas的进化生物发生和功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号