首页> 外文会议>International Electronics Symposium >Performance Evaluation of Classifiers for Predicting Infection Cases of Dengue Virus Based on Clinical Diagnosis Criteria
【24h】

Performance Evaluation of Classifiers for Predicting Infection Cases of Dengue Virus Based on Clinical Diagnosis Criteria

机译:基于临床诊断标准的登革病毒感染病例分类器性能评估

获取原文

摘要

Dengue fever caused by dengue virus infection is a severe health threat that can lead to death. In the medical and health field, to classify data, data mining exploitation and classification methods have an essential role in predicting disease. Two main criteria are crucial to diagnosing dengue virus infection, namely the criteria clinical diagnosis and laboratory diagnosis. Dengue infection based on clinical signs and symptoms, as well as laboratory examinations, is made in three clinical diagnosis criteria, which consist of dengue fever (DF), dengue hemorrhagic fever (DHF), and dengue shock syndrome (DSS). This study was conducted with the primary objective to test and evaluate eight different classification algorithms to find the best algorithm in terms of efficiency and effectiveness. Classification algorithm used to predict dengue virus infection cases into three classes of DF, DHF, and DSS based on the performance of accuracy, precision, and recall. The classification algorithm used in this comparison were Neural Networks (NN), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Decision Tree, Random Forest, Naïve Bayes, AdaBoost, and Logistic Regression. The dataset called DBDDKK was collected from the Division of Disease Prevention and Control in the Semarang City Health Office, Central Java, Indonesia. Impute missing values, selection relevant feature, and normalize feature conducted in the preprocessing stage resulted in 14,019 records with 16 attributes for each record. Then the data were split into 70% for training data and 30% for testing data. Cross-validation with the number of folds 10 is applied to validate the accuracy during the dataset training process. The result of the comparison shows that the NN algorithm has the best accuracy that was over other algorithms.
机译:登革热病毒感染引起的登革热是严重的健康威胁,可能导致死亡。在医学和卫生领域,为了对数据进行分类,数据挖掘的开发和分类方法在预测疾病方面具有至关重要的作用。两个主要标准对于诊断登革热病毒感染至关重要,即临床诊断和实验室诊断标准。基于临床体征和症状以及实验室检查的登革热感染是根据三种临床诊断标准进行的,包括登革热(DF),登革出血热(DHF)和登革热休克综合征(DSS)。进行这项研究的主要目的是测试和评估八种不同的分类算法,以找到效率和效果最佳的算法。用于根据准确性,准确性和召回性能将登革热病毒感染病例预测为DF,DHF和DSS三类的分类算法。在此比较中使用的分类算法是神经网络(NN),支持向量机(SVM),K最近邻(KNN),决策树,随机森林,朴素贝叶斯,AdaBoost和Logistic回归。名为DBDDKK的数据集是从印度尼西亚中爪哇省三宝垄市卫生局疾病预防与控制部门收集的。在预处理阶段执行归因缺失值,选择相关特征和归一化特征,结果得到14019条记录,每条记录具有16个属性。然后,将数据分为训练数据的70%和测试数据的30%。倍数为10的交叉验证适用于在数据集训练过程中验证准确性。比较结果表明,NN算法具有优于其他算法的最佳精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号