...
首页> 外文期刊>BMC Bioinformatics >IPRStats: visualization of the functional potential of an InterProScan run
【24h】

IPRStats: visualization of the functional potential of an InterProScan run

机译:IPRStats:可视化InterProScan运行的功能潜力

获取原文
           

摘要

BackgroundInterPro is a collection of protein signatures for the classification and automated annotation of proteins. Interproscan is a software tool that scans protein sequences against Interpro member databases using a variety of profile-based, hidden markov model and positional specific score matrix methods. It not only combines a set of analysis tools, but also performs data look-up from various sources, as well as some redundancy removal. Interproscan is robust and scalable, able to perform on any machine from a netbook to a large cluster. However, when performing whole-genome or metagenome analysis, there is a need for a fast statistical visualization of the results to have good initial grasp on the functional potential of the sequences in the analyzed data set. This is especially important when analyzing and comparing metagenomic or metaproteomic data-sets.ResultsIPRStats is a tool for the visualization of Interproscan results. Interproscan results are parsed from the Interproscan XML or EBIXML file into an SQLite or MySQL database. The results for each signature database scan are read and displayed as pie-charts or bar charts as summary statistics. A table is also provided, where each entry is a signature (e.g. a Pfam entry) accompanied by one or more Gene Ontology terms, if Interproscan was run using the Gene Ontology option.ConclusionsWe present an platform-independent, open source licensed tool that is useful for Interproscan users who wish to view the summary of their results in a rapid and concise fashion.
机译:BackgroundInterPro是蛋白质签名的集合,用于蛋白质的分类和自动注释。 Interproscan是一种软件工具,可以使用各种基于配置文件的隐藏马尔可夫模型和位置特定分数矩阵方法,针对Interpro成员数据库扫描蛋白质序列。它不仅结合了一组分析工具,而且还可以从各种来源执行数据查找以及一些冗余删除。 Interproscan强大且可扩展,能够在从上网本到大型群集的任何计算机上执行。但是,在进行全基因组或元基因组分析时,需要对结果进行快速统计可视化,以便对所分析数据集中的序列的功能潜力有良好的初步了解。在分析和比较宏基因组或元蛋白质组数据集时,这一点尤其重要。ResultsIPRStats是Interproscan结果可视化的工具。 Interproscan结果从Interproscan XML或EBIXML文件解析到SQLite或MySQL数据库中。读取每个特征库扫描的结果,并以饼图或条形图的形式显示为摘要统计信息。还提供了一个表格,如果Interproscan是使用Gene Ontology选项运行的,则每个条目都是一个签名(例如Pfam条目),并带有一个或多个Gene Ontology术语。结论我们提供了一个独立于平台的开源许可工具,该工具是对于希望快速简洁地查看结果摘要的Interproscan用户有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号