Drosophila GRAIL: an intelligent system for gene recognition in Drosophila DNA sequences

机译：果蝇GRAIL：用于果蝇DNA序列中基因识别的智能系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

An AI-based system for gene recognition in Drosophila DNA sequences was designed and implemented. The system consists of two main modules, one for coding exon recognition and one for single gene model construction. The exon recognition module finds a coding exon by recognition of its splice junctions (or translation start) and coding potential. The core of this module is a set of neural networks which evaluate an exon candidate for the possibility of being a true coding exon using the "recognized" splice junction (or translation start) and coding signals. The recognition process consists of four steps: generation of an exon candidate pool, elimination of improbable candidates using heuristic rules, candidate evaluation by trained neural networks, and candidate cluster resolution and final exon prediction. The gene model construction module takes as input the clustered exon candidates and builds a "best" possible single gene model using an efficient dynamic programming algorithm. 129 Drosophila sequences consisting of 441 coding exons including 216358 coding bases were extracted from GenBank and used to build statistical matrices and to train the neural networks. On this training set the system recognized 97% of the coding messages and predicted only 5% false messages. Among the "correctly" predicted exons, 68% match the actual exon exactly and 96% have at least one edge predicted correctly. On an independent test set consisting of 30 Drosophila sequences, the system recognized 96% of the coding messages and predicted 7% false messages.

机译：设计并实现了一个基于AI的果蝇DNA序列基因识别系统。该系统由两个主要模块组成，一个模块用于编码外显子识别，另一个模块用于单基因模型构建。外显子识别模块通过识别其剪接点（或翻译起点）和编码电位来找到编码外显子。该模块的核心是一组神经网络，这些神经网络使用“已识别的”剪接点（或翻译起点）和编码信号来评估外显子候选对象是否可能成为真正的编码外显子。识别过程包括四个步骤：生成外显子候选物库，使用启发式规则消除不可能的候选物，通过经过训练的神经网络进行候选物评估以及候选物簇解析和最终外显子预测。基因模型构建模块将聚集的外显子候选物作为输入，并使用有效的动态编程算法构建“最佳”可能的单基因模型。从GenBank中提取了由441个编码外显子（包括216358个编码碱基）组成的129个果蝇序列，用于构建统计矩阵和训练神经网络。在此训练集上，系统识别出97％的编码消息，并且仅预测5％的错误消息。在“正确”预测的外显子中，有68％的人与实际外显子完全匹配，而96％的人至少有一个边缘被正确预测。在由30个果蝇序列组成的独立测试集上，系统识别出96％的编码消息，并预测了7％的错误消息。

著录项

来源
《Intelligence in Neural and Biological Systems, 1995. INBS'95, Proceedings., First International Symposium on》|1995年|P.128-135|共8页
会议地点
作者
Ying Xu; Helt; G.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Intraspecific diversity of nucleotide sequences within the adenine + thymine-rich region of mitochondrial DNA molecules of Drosophila mauritiana, Drosophila melanogaster and Drosophila simulans [J] . Christiane M.R. Fauron, David R. Wolstenholme Nucleic acids research . 1980,第22期

机译：毛果蝇，果蝇和拟果蝇线粒体DNA分子的腺嘌呤+胸腺嘧啶丰富区域内核苷酸序列的种内多样性。
2. Intraspecific diversity of nucleotide sequences within the adenine + thymine-rich region of mitochondrial DNA molecules of Drosophila mauritiana, Drosophila melanogaster and Drosophila simulans [J] . Christiane M.R. Fauron, David R. Wolstenholme Nucleic acids research . 1980,第22期

机译：毛果蝇，果蝇和拟果蝇线粒体DNA分子的腺嘌呤+胸腺嘧啶丰富区域内核苷酸序列的种内多样性。
3. Essential role of grim-led programmed cell death for the establishment of corazonin-producing peptidergic nervous system during embryogenesis and metamorphosis in Drosophila melanogaster Essential role of grim-led programmed cell death for the establishment of corazonin-producing peptidergic nervous system during embryogenesis and metamorphosis in Drosophila melanogaster Essential role of grim-led programmed cell death for the establishment of corazonin-producing peptidergic nervous system during embryogenesis and metamorphosis in Drosophila melanogaster [J] . Zixing Wang, Jae H. Park, Chun-Hong Chen, Biology Open . 2013,第3期

机译：严峻的程序性细胞死亡在果蝇胚胎发生和变态过程中建立corazonin产生的肽能神经系统中的重要作用严峻的程序性细胞死亡在果蝇的胚胎发生和变形过程中建立corazonin产生的肽能神经系统中的重要作用。果蝇致死的程序性细胞死亡在果蝇胚胎发生和变态过程中建立可乐宁生产肽能神经系统的建立中的重要作用
4. Drosophila GRAIL: an intelligent system for gene recognition in Drosophila DNA sequences [C] . Ying Xu, Helt G., Institute of Electric and Electronic Engineer International symposium on intelligence in neural and biological systems . 1995

机译：果蝇Grail：奇节炎DNA序列中基因识别的智能系统
5. An analysis of population genetic structure and species history of Drosophila melanogaster and Drosophila simulans using restriction fragment length polymorphisms of mitochondrial-DNA. [D] . Hale, Lawrence Richard. 1990

机译：利用线粒体DNA的限制性片段长度多态性分析果蝇和果蝇的种群遗传结构和物种历史。
6. Intraspecific diversity of nucleotide sequences within the adenine + thymine-rich region of mitochondrial DNA molecules of Drosophila mauritiana Drosophila melanogaster and Drosophila simulans. [O] . C M Fauron, D R Wolstenholme 1980

机译：毛果蝇果蝇和拟果蝇线粒体DNA分子的腺嘌呤+胸腺嘧啶丰富区域内核苷酸序列的种内多样性。
7. Drosophila GRAIL: an intelligent system for gene recognition in Drosophila DNA sequences [O] . G. Helt, J.R. Einstein, G. Rubin, -1

机译：果蝇Grail：奇节炎DNA序列中基因识别的智能系统
8. Drosophila GRAIL: An intelligent system for gene recognition in Drosophila DNA sequences [R] . Xu, Y. , Einstein, J. R. , Uberbacher, E. C. , 1995

机译：Drosophila GRaIL：果蝇DNa序列中基因识别的智能系统

Drosophila GRAIL: an intelligent system for gene recognition in Drosophila DNA sequences

摘要

著录项

相似文献

相关主题

期刊订阅