...
首页> 外文期刊>Source Code for Biology Medicine >CombAlign: a code for generating a one-to-many sequence alignment from a set of pairwise structure-based sequence alignments
【24h】

CombAlign: a code for generating a one-to-many sequence alignment from a set of pairwise structure-based sequence alignments

机译:CombAlign:用于从一组基于成对结构的序列比对中生成一对多序列比对的代码

获取原文
           

摘要

Background In order to better define regions of similarity among related protein structures, it is useful to identify the residue-residue correspondences among proteins. Few codes exist for constructing a one-to-many multiple sequence alignment derived from a set of structure or sequence alignments, and a need was evident for creating such a tool for combining pairwise structure alignments that would allow for insertion of gaps in the reference structure. Results This report describes a new Python code, CombAlign, which takes as input a set of pairwise sequence alignments (which may be structure based) and generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA). The use and utility of CombAlign was demonstrated by generating gapped MSSAs using sets of pairwise structure-based sequence alignments between structure models of the matrix protein (VP40) and pre-small/secreted glycoprotein (sGP) of Reston Ebolavirus and the corresponding proteins of several other filoviruses. The gapped MSSAs revealed structure-based residue-residue correspondences, which enabled identification of structurally similar versus differing regions in the Reston proteins compared to each of the other corresponding proteins. Conclusions CombAlign is a new Python code that generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA) given a set of pairwise sequence alignments (which may be structure based). CombAlign has utility in assisting the user in distinguishing structurally conserved versus divergent regions on a reference protein structure relative to other closely related proteins. CombAlign was developed in Python 2.6, and the source code is available for download from the GitHub code repository.
机译:背景技术为了更好地定义相关蛋白质结构之间的相似性区域,鉴定蛋白质之间的残基-残基对应关系是有用的。很少有用于构建从一组结构或序列比对中得出的一对多序列比对的代码,显然需要创建这种工具来组合成对的结构比对,以允许在参考结构中插入缺口。结果此报告描述了一个新的Python代码CombAlign,它将一组成对的序列比对(可能是基于结构的)作为输入,并生成一对多,有间隙,基于多个结构或序列的序列比对(MSSA) )。通过使用雷斯顿埃博拉病毒的基质蛋白(VP40)和预小/分泌糖蛋白(sGP)的结构模型与几种相应蛋白的结构模型之间的成对基于结构的序列对,证明了CombAlign的用途和效用其他丝状病毒。空缺的MSSA揭示了基于结构的残基-残基对应关系,与其他相应的蛋白相比,该结构能够识别Reston蛋白中结构相似或不同的区域。结论CombAlign是一种新的Python代码,在给定一组成对的序列比对(可能是基于结构)的情况下,该代码会生成一对多,有间隙,基于多结构或序列的序列比对(MSSA)。 CombAlign具有实用性,可帮助用户区分参考蛋白结构上相对于其他紧密相关蛋白的结构保守区与发散区。 CombAlign是使用Python 2.6开发的,其源代码可从GitHub代码存储库下载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号