首页> 外文期刊>Kuwait Journal of Science >Estimation of risk factors associated with colorectal cancer: an application of knowledge discovery in databases
【24h】

Estimation of risk factors associated with colorectal cancer: an application of knowledge discovery in databases

机译:估计与大肠癌相关的危险因素:知识发现在数据库中的应用

获取原文
           

摘要

Colorectal cancer is one of the first reasons for death due to cancer in the world.The goal of this study is to predict important risk factors of colorectal cancer (CRC)by knowledge discovery in databases (KDD) methods. This study comprised aretrospective CRC data of patients who had been diagnosed with colorectal cancer. Theselected records between 1 January 2010 and 1 March 2014 were collected randomlyfrom Turgut Ozal Medical Centre databases. The study included 160 individuals: 80patients admitted to Department of Oncology and diagnosed with CRC, and 80 controlsubjects with non-CRC categorization. The groups were matched for age and gender.We mined retrospective CRC data from large integrated health systems with electronichealth records. Specific demographical and clinical variables including calcium,hemoglobin, white blood cells, platelets, potassium, sodium, glucose, creatinine andtotal bilirubin were used in multilayer perceptron (MLP) artificial neural networks(ANN) modeling. In this study, patient and control groups consist of 160 individuals.In each group, 45 of these (56.3%) are male, and 35 (43.7%) are women. Mean ageof CRC patients and control groups is 58.6?±13.0. While the accuracy was 71.31%in training dataset (n=122), the accuracy was 81.82% in testing dataset. Area undercurve (AUC) values of training and testing datasets were 0.73 and 0.81, respectively.The suggested MLP ANN model identified significant factors of calcium, creatinine,potassium, platelets, sodium, hemoglobin and total bilirubin. Taken together, thesuggested MLP ANN model might be used for the estimation of risk factors associatedwith CRC as an application of medical KDD.
机译:结直肠癌是世界范围内因癌症死亡的首要原因之一。本研究的目的是通过数据库中的知识发现(KDD)方法预测结直肠癌(CRC)的重要危险因素。这项研究包括已被诊断患有大肠癌的患者的回顾性CRC数据。从Turgut Ozal Medical Center数据库中随机收集2010年1月1日至2014年3月1日之间的选定记录。该研究包括160名患者:80名入院肿瘤科并被确诊为CRC的患者,以及80名非CRC分类的对照组。根据年龄和性别对这些组进行匹配。我们从具有电子病历的大型综合卫生系统中提取了回顾性CRC数据。在多层感知器(MLP)人工神经网络(ANN)建模中使用了特定的人口统计学和临床​​变量,包括钙,血红蛋白,白细胞,血小板,钾,钠,葡萄糖,肌酐和总胆红素。在这项研究中,患者和对照组由160个人组成,每组中男性为45(56.3%),女性为35(43.7%)。 CRC患者和对照组的平均年龄为58.6±13.0。训练数据集中的准确性为71.31%(n = 122),而测试数据集中的准确性为81.82%。训练和测试数据集的曲线下面积(AUC)值分别为0.73和0.81。建议的MLP ANN模型确定了钙,肌酐,钾,血小板,钠,血红蛋白和总胆红素的重要因素。两者合计,建议的MLP ANN模型可用于估计与CRC相关的危险因素,作为医学KDD的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号