...
首页> 外文期刊>Biology Direct >De-DUFing the DUFs: Deciphering distant evolutionary relationships of Domains of Unknown Function using sensitive homology detection methods
【24h】

De-DUFing the DUFs: Deciphering distant evolutionary relationships of Domains of Unknown Function using sensitive homology detection methods

机译:DUFS DUFS:使用敏感同源性检测方法解密未知功能的域的远端进化关系

获取原文
           

摘要

Background In the post-genomic era where sequences are being determined at a rapid rate, we are highly reliant on computational methods for their tentative biochemical characterization. The Pfam database currently contains 3,786 families corresponding to “Domains of Unknown Function” (DUF) or “Uncharacterized Protein Family” (UPF), of which 3,087 families have no reported three-dimensional structure, constituting almost one-fourth of the known protein families in search for both structure and function. Results We applied a ‘computational structural genomics’ approach using five state-of-the-art remote similarity detection methods to detect the relationship between uncharacterized DUFs and domain families of known structures. The association with a structural domain family could serve as a start point in elucidating the function of a DUF. Amongst these five methods, searches in SCOP-NrichD database have been applied for the first time. Predictions were classified into high, medium and low- confidence based on the consensus of results from various approaches and also annotated with enzyme and Gene ontology terms. 614 uncharacterized DUFs could be associated with a known structural domain, of which high confidence predictions, involving at least four methods, were made for 54 families. These structure-function relationships for the 614 DUF families can be accessed on-line at http://proline.biochem.iisc.ernet.in/RHD_DUFS/ . For potential enzymes in this set, we assessed their compatibility with the associated fold and performed detailed structural and functional annotation by examining alignments and extent of conservation of functional residues. Detailed discussion is provided for interesting assignments for DUF3050, DUF1636, DUF1572, DUF2092 and DUF659. Conclusions This study provides insights into the structure and potential function for nearly 20?% of the DUFs. Use of different computational approaches enables us to reliably recognize distant relationships, especially when they converge to a common assignment because the methods are often complementary. We observe that while pointers to the structural domain can offer the right clues to the function of a protein, recognition of its precise functional role is still ‘non-trivial’ with many DUF domains conserving only some of the critical residues. It is not clear whether these are functional vestiges or instances involving alternate substrates and interacting partners. Reviewers This article was reviewed by Drs Eugene Koonin, Frank Eisenhaber and Srikrishna Subramanian.
机译:背景技术在基因组时代以快速率确定序列,我们高度依赖于其暂定生化表征的计算方法。 PFAM数据库目前包含3,786个家庭,对应于“未知功能”(DUF)或“无表征蛋白质家庭”(UPF)的家族,其中3,087个家庭没有报告的三维结构,构成了已知的蛋白质家族的几乎四分之一在寻找结构和功能。结果我们使用五个最先进的远程相似性检测方法应用了“计算结构基因组学”方法,以检测无表称DUFS和已知结构的域家族之间的关系。与结构域家族的关联可以作为阐明DUF功能的开始点。在这五种方法中,第一次应用了SCOP-NRICHD数据库中的搜索。基于各种方法的结果共识,并以酶和基因本体论术语共识,预测分为高,中等和低信心。对于54个家庭来说,614无表达DUFS可能与已知的结构领域有关,其中涉及至少四种方法的高置信度预测。这些结构函数关系对于614 DUF系列可以在线访问http://proline.biochem.iisc.ernet.in/rhd_dufs/。对于该组中的潜在酶,我们通过检查功能残留物的守恒的对准和程度来评估它们与相关折叠的兼容性,并通过检查恒定的守恒的持续程度进行详细的结构和功能注释。提供详细讨论,用于DUF3050,DUF1636,DUF1572,DUF2092和DUF659的有趣分配。结论本研究提供了近20次DUFS的结构和潜在功能的见解。使用不同的计算方法使我们能够可靠地识别远程关系,特别是当它们收敛到常见的分配时,因为这些方法通常是互补的。我们观察到,虽然指向结构领域的指针可以向蛋白质的功能提供正确的线索,但识别其精确的功能作用仍然是“非琐碎”,许多DUF结构域只节省一些关键残留物。目前尚不清楚这些是涉及替代基板和互动伴侣的功能性痕迹或实例。审核人员本文由Drs Eugene Koonin,Frank Eisenhaber和Srikrishna Subramanian审核。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号