首页> 外文会议>Pattern recognition and image analysis >Unsupervised Joint Feature Discretization and Selection
【24h】

Unsupervised Joint Feature Discretization and Selection

机译:无监督联合特征离散化和选择

获取原文
获取原文并翻译 | 示例

摘要

In many applications, we deal with high dimensional datasets with different types of data. For instance, in text classification and information retrieval problems, we have large collections of documents. Each text is usually represented by a bag-of-words or similar representation, with a large number of features (terms). Many of these features may be irrelevant (or even detrimental) for the learning tasks. This excessive number of features carries the problem of memory usage in order to represent and deal with these collections, clearly showing the need for adequate techniques for feature representation, reduction, and selection, to both improve the classification accuracy and the memory requirements. In this paper, we propose a combined unsupervised feature discretization and feature selection technique. The experimental results on standard datasets show the efficiency of the proposed techniques as well as improvement over previous similar techniques.
机译:在许多应用程序中,我们处理具有不同类型数据的高维数据集。例如,在文本分类和信息检索问题中,我们收集了大量文档。每个文本通常用单词袋或类似的表示形式表示,并带有大量的功能(术语)。这些功能中的许多功能对于学习任务可能无关紧要(甚至有害)。过多的特征会带来内存使用问题,以表示和处理这些集合,从而清楚地表明需要有足够的技术来表示特征,进行缩减和选择,以同时提高分类准确性和存储要求。在本文中,我们提出了一种组合的无监督特征离散化和特征选择技术。在标准数据集上的实验结果表明了所提出技术的效率以及对以前类似技术的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号