首页> 外文会议>2012 2nd IEEE International Conference on Parallel Distributed and Grid Computing. >Undiagnosed samples aided rough set feature selection for medical data
【24h】

Undiagnosed samples aided rough set feature selection for medical data

机译:未诊断的样品有助于选择医疗数据的粗糙集特征

获取原文
获取原文并翻译 | 示例

摘要

Medical data often consists of a large number of disease markers. For medical data analysis, some disease markers are not helpful and sometimes even have negative effects. Therefore, applying feature selection is necessary as it can remove those unimportant disease markers. Among many feature selection methods, rough set based feature selection (RSFS) has been widely used. Unlike other methods, RSFS is completely data-driven. It does not require any other information like probability distributions. Traditional RSFS methods extract the information only from the diagnosed samples. Therefore, they usually require a large number of diagnosed samples to achieve the good feature selection performance. However, in many real medical applications, diagnosed samples are limited, yet the number of undiagnosed samples is large. Motivated by semi-supervised learning methodology, in this paper, we propose a novel RSFS method which can learn from both diagnosed and undiagnosed samples. This method is called undiagnosed samples aided rough set feature selection (USA-RSFS). Its main benefit is to reduce the requirement on diagnosed samples by the help of undiagnosed ones. Finally, the promising performance of USA-RSFS is validated through a set of experiments on medical datasets.
机译:医学数据通常包含大量疾病标记。对于医学数据分析,某些疾病标志物无济于事,甚至有负面影响。因此,应用特征选择是必要的,因为它可以删除那些不重要的疾病标记。在许多特征选择方法中,基于粗糙集的特征选择(RSFS)已被广泛使用。与其他方法不同,RSFS完全由数据驱动。它不需要任何其他信息,例如概率分布。传统的RSFS方法仅从诊断的样本中提取信息。因此,它们通常需要大量的诊断样本才能实现良好的特征选择性能。然而,在许多实际医学应用中,诊断样本有限,但未诊断样本的数量却很大。基于半监督学习方法,本文提出了一种新颖的RSFS方法,该方法可以从已诊断和未诊断的样本中学习。这种方法称为未诊断样本辅助粗集特征选择(USA-RSFS)。它的主要好处是借助未诊断的样品减少对诊断样品的需求。最后,通过在医学数据集上进行的一组实验验证了USA-RSFS的有前途的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号