...
首页> 外文期刊>Current gene therapy >Deep Forest-based Prediction of Protein Subcellular Localization
【24h】

Deep Forest-based Prediction of Protein Subcellular Localization

机译:基于深度森林的蛋白质亚细胞定位预测

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Knowledge of the correct protein subcellular localization is necessary for understandingthe function of a protein and revealing the mechanism of many human diseases due to proteinsubcellular mislocalization, which is required before approaching gene therapy to treat a disease. In addition,it is well-known that the gene therapy is an effective way to overcome disease by targeting a genetherapy product to a specific subcellular compartment. Deep neural networks to predict protein functionhave become increasingly popular due to large increases in the available genomics data due to itsstrong superiority in the non-linear classification ability. However, they still have some drawbackssuch as too many hyper-parameters and sufficient amount of labeled data.Results: We present a deep forest-based protein location algorithm relying on sequence information.The prediction model uses a random forest network with a multi-layered structure to identify thesubcellular regions of protein. The model was trained and tested on a latest UniProt releases proteindataset, and we demonstrate that our deep forest predict the subcellular location of proteins given onlythe protein sequence with high accuracy, outperforming the current state-of-art algorithms.Meanwhile, unlike the deep neural networks, it has a significantly smaller number of parameters andis much easier to train.
机译:动机:了解正确的蛋白质亚细胞定位是理解蛋白质的功能并揭示由于蛋白质细胞错误级化引起的许多人类疾病的机制,这是在接近基因治疗疾病之前需要的许多人类疾病的机制。此外,众所周知,基因治疗是通过将基因治疗产品靶向特定的亚细胞室来克服疾病的有效方法。由于在非线性分类能力中的优势,由于可用基因组学数据的大幅增加,深神经网络以预测蛋白质功能越来越受欢迎。然而,它们仍然有一些缺点是过多的超参数和足够量的标记数据。结果:我们呈现了一个依赖于序列信息的深林的蛋白质位置算法。预测模型使用具有多层的随机林网结构鉴定蛋白质的蛋白细胞区域。该模型在最新的Uniprot释放ProteindataSet上培训并测试,我们证明我们的深森林预测蛋白质的亚细胞位置仅为蛋白质序列具有高精度,优于当前的最先进的算法。同样,与深神经无关网络,它具有明显较少数量的参数,更容易训练。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号