...
首页> 外文期刊>International Journal of Artificial Intelligence and Knowledge Discovery >Optimized Classification Method for Prognosis of Breast Cancer Using RapidMiner
【24h】

Optimized Classification Method for Prognosis of Breast Cancer Using RapidMiner

机译:RapidMiner优化的乳腺癌预后分类方法

获取原文
           

摘要

The quality of the data is one of the most important factors influencing the performance of any classification or clustering algorithm. The attributes defining the feature space of a given data set can often be inadequate, which make it difficult to discover interesting knowledge or desired output. An electrical impedance measurement in samples of freshly excised tissue from the breast is one of the techniques to discriminate Carcinoma, Fibro-adenoma, Mastopathy, Glandular, Connective and Adipose (cellular transformation). In this context, the state-of-the-art software RapidMiner already provides easy to use interfaces for developing and evaluating classification and validation and Machine Learning applications. C.Y.V.Watanabe reported output at very high accuracy and prove the value of validation using SACMiner. The aim of this work is to implement a RapidMiner extension that contains the most applicable classification algorithms to enable non-experts to easily apply them. Secondly an evaluation of the implemented algorithms was carried out in an attempt to show the relative strength and weakness of the algorithms as Watanabe’s work. Simultaneously attempt was made to use evolutionary parameter optimization for selection of optimal values of model parameters for enhancement of model accuracy. Efforts were also made to identify the attributes that influences the accuracy of the result using weighting attributes. The performance of the implemented and the proposed algorithms were evaluated on real world data sets from the UCI machine learning repository.
机译:数据质量是影响任何分类或聚类算法性能的最重要因素之一。定义给定数据集的特征空间的属性通常可能不足,这使得难以发现有趣的知识或所需的输出。测量从乳房中新鲜切除的组织样本中的电阻抗是区分癌,纤维腺瘤,乳腺疾病,腺体,结缔组织和脂肪(细胞转化)的技术之一。在这种情况下,最先进的软件RapidMiner已经提供了易于使用的界面,用于开发和评估分类和验证以及机器学习应用程序。渡边C.Y.V. Watanabe以非常高精度报告了输出,并证明了使用SACMiner进行验证的价值。这项工作的目的是实现一个RapidMiner扩展,该扩展包含最适用的分类算法,以使非专家可以轻松地应用它们。其次,对已实现的算法进行了评估,以试图展示出渡边工作中算法的相对优势和劣势。同时尝试使用进化参数优化来选择模型参数的最佳值以增强模型精度。还努力使用权重属性来识别影响结果准确性的属性。在UCI机器学习存储库中的真实数据集上评估了已实现算法和拟议算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号