首页> 外文学位 >De novo protein structure modeling from cryoem data through a dynamic programming algorithm in the secondary structure topology graph.
【24h】

De novo protein structure modeling from cryoem data through a dynamic programming algorithm in the secondary structure topology graph.

机译:从低温蛋白质数据通过二级结构拓扑图中的动态编程算法从头进行蛋白质结构建模。

获取原文
获取原文并翻译 | 示例

摘要

Proteins are the molecules carry out the vital functions and make more than the half of dry weight in every cell. Protein in nature folds into a unique and energetically favorable 3-Dimensional (3-D) structure which is critical and unique to its biological function. In contrast to other methods for protein structure determination, Electron Cryo-rricroscopy (CryoEM) is able to produce volumetric maps of proteins that are poorly soluble, large and hard to crystallize. Furthermore, it studies the proteins in their native environment. Unfortunately, the volumetric maps generated by current advances in CryoEM technique produces protein maps at medium resolution about (~5 to 10Å) in which it is hard to determine the atomic-structure of the protein. However, the resolution of the volumetric maps is improving steadily, and recent works could obtain atomic models at higher resolutions (~3Å).;De novo protein modeling is the process of building the structure of the protein using its CryoEM volumetric map. Thereupon, the volumetric maps at medium resolution generated by CryoEM technique proposed a new challenge. At the medium resolution, the location and orientation of secondary structure elements (SSE) can be visually and computationally identified. However, the order and direction (called protein topology) of the SSEs detected from the CryoEM volumetric map are not visible. In order to determine the protein structure, the topology of the SSEs has to be figured out and then the backbone can be built. Consequently, the topology problem has become a bottle neck for protein modeling using CryoEM.;In this dissertation, we focus to establish an effective computational framework to derive the atomic structure of a protein from the medium resolution CryoEM volumetric maps. This framework includes a topology graph component to rank effectively the topologies of the SSEs and a model building component. In order to generate the small subset of candidate topologies, the problem is translated into a layered graph representation. We developed a dynamic programming algorithm (TopoDP) for the new representation to overcome the problem of large search space. Our approach shows the improved accuracy, speed and memory use when compared with existing methods. However, the generating of such set was infeasible using a brute force method. Therefore, the topology graph component effectively reduces the topological space using the geometrical features of the secondary structures through a constrained K-shortest paths method in our layered graph. The model building component involves the bending of a helix and the loop construction using skeleton of the volumetric map. The forward-backward CCD is applied to bend the helices and model the loops.
机译:蛋白质是执行重要功能的分子,在每个细胞中占干重的一半以上。自然界中的蛋白质折叠成独特且在能量上有利的3维(3-D)结构,这对于其生物学功能至关重要且独特。与其他用于蛋白质结构确定的方法相比,电子低温立体成像(CryoEM)能够生成蛋白质的体积图,这些蛋白质的溶解度差,大且难以结晶。此外,它还研究了其天然环境中的蛋白质。不幸的是,由CryoEM技术的最新进展所产生的体积图会以中等分辨率(约5至10Å)生成蛋白质图,在其中很难确定蛋白质的原子结构。然而,体积图的分辨率正在稳步提高,并且最近的工作可以得到更高分辨率(约3Å)的原子模型。从头蛋白质建模是利用其CryoEM体积图构建蛋白质结构的过程。于是,CryoEM技术生成的中等分辨率的体积图提出了新的挑战。在中等分辨率下,可以从视觉和计算上识别二级结构元素(SSE)的位置和方向。但是,从CryoEM体积图检测到的SSE的顺序和方向(称为蛋白质拓扑)不可见。为了确定蛋白质结构,必须弄清楚SSE的拓扑,然后可以构建骨架。因此,拓扑问题已成为使用CryoEM进行蛋白质建模的瓶颈。本文旨在建立一个有效的计算框架,以从中等分辨率的CryoEM体积图导出蛋白质的原子结构。该框架包括用于有效排名SSE拓扑的拓扑图组件和模型构建组件。为了生成候选拓扑的小子集,将问题转换为分层图表示。我们针对新的表示形式开发了动态编程算法(TopoDP),以克服搜索空间大的问题。与现有方法相比,我们的方法显示出更高的准确性,速度和内存使用率。但是,使用蛮力方法无法生成这样的集合。因此,拓扑图组件通过我们的分层图中的约束K最短路径方法,利用二级结构的几何特征有效地减少了拓扑空间。模型构建组件涉及螺旋的弯曲和使用体积图骨架的环构建。使用向前-向后CCD弯曲螺旋并为环路建模。

著录项

  • 作者

    Al Nasr, Kamal H.;

  • 作者单位

    Old Dominion University.;

  • 授予单位 Old Dominion University.;
  • 学科 Biology Bioinformatics.;Computer Science.;Information Science.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 146 p.
  • 总页数 146
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 古生物学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号