首页> 外文会议>Annual International Conference of the IEEE Engineering in Medicine and Biology Society >Prediction of Influenza A Virus Infections in Humans using an Artificial Neural Network Learning Approach
【24h】

Prediction of Influenza A Virus Infections in Humans using an Artificial Neural Network Learning Approach

机译:利用人工神经网络学习方法预测人类人类病毒感染

获取原文

摘要

The Influenza type A virus can be considered as one of the most severe viruses that can infect multiple species with often fatal consequences to the hosts. The Haemagglutinin (HA) gene of the virus has the potential to be a target for antiviral drug development realised through accurate identification of its sub-types and possible the targeted hosts. In this paper, to accurately predict if an Influenza type A virus has the capability to infect human hosts, by using only the HA gene, is therefore developed and tested. The predictive model follows three main steps; (i) decoding the protein sequences into numerical signals using EIIP amino acid scale, (ii) analysing these sequences by using Discrete Fourier Transform (DFT) and extracting DFT-based features, (iii) using a predictive model, based on Artificial Neural Networks and using the features generated by DFT. In this analysis, from the Influenza Research Database, 30724, 18236 and 8157 HA protein sequences were collected for Human, Avian and Swine respectively. Given this set of the proteins, the proposed method yielded 97.36% (± 0.04%), 97.26% (± 0.26%), 0.978 (± 0.004), 0.963 (± 0.005) and 0.945 (± 0.005) for the training accuracy validation accuracy, precision, recall and Mathews Correlation Coefficient (MCC) respectively, based on a 10-fold cross-validation. The classification model generated by using one of the largest dataset, if not the largest, yields promising results that could lead to early detection of such species and help develop precautionary measurements for possible human infections.
机译:流感型病毒可以被认为是最严重的病毒之一,可以感染多种物种,通常对宿主致命的后果。病毒的血凝素(HA)基因具有通过精确识别其子类型和可能的宿主来实现抗病毒药物开发的潜力。在本文中,为了准确预测流感型病毒,通过仅使用HA基因具有感染人宿主的能力,因此开发和测试。预测模型遵循三个主要步骤; (i)使用EiIP氨基酸标度将蛋白质序列解码为数值信号,(ii)通过使用离散的傅里叶变换(DFT)分析这些序列并使用预测模型提取基于DFT的特征(III),基于人工神经网络并使用DFT生成的功能。在该分析中,来自流感研究数据库,分别为人,禽和猪收集30724,18236和8157 HA蛋白序列。考虑到这套蛋白质,所提出的方法得到97.36%(±0.04%),97.26%(±0.26%),0.978(±0.26%),0.963(±0.005)和0.945(±0.005),用于训练精度验证精度基于10倍交叉验证,分别,精度,回忆和Mathews相关系数(MCC)。通过使用最大数据集之一产生的分类模型,如果不是最大的,则产生有希望的结果,这可能导致早期检测此类物种,并有助于为可能的人类感染产生预防性测量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号