首页> 外文会议>IEEE International Conference on Data Mining Workshops >A Novel Pre-Classification Based kNN Algorithm
【24h】

A Novel Pre-Classification Based kNN Algorithm

机译:一种基于预分类的新型kNN算法

获取原文

摘要

kNN (k nearest neighbors) is widely adopted because of its simplicity. However, its shortcomings can not be neglected, especially its time complexity. Consequently a great amount of approaches emerged in large numbers in decades to cope with this issue with a tradeoff in performance of the classification. In this paper, a novel improved kNN algorithm is proposed with a better performance than traditional kNN when its time complexity is meanwhile reduced. In the proposed algorithm, a pre-classification which cost little time is to be conducted before proposed kNN algorithm. Then the training set can be divided into several parts according to the classification probability with some thresholds. After that the parts with probability nearer to 1 or 0 are selected to be training sets. The accuracy rate and the area under the ROC curve (the receiver operating characteristic curve) of the proposed algorithm is calculated and compared with basic kNN algorithm in the experiments. The experiment results show that not only the pre-classification based kNN algorithm greatly reduced the time cost, but it also performs better than the original kNN algorithm in accuracy and AUC (the area under the ROC curve).
机译:由于其简单性,kNN(k最近邻居)被广泛采用。但是,它的缺点不能忽略,特别是它的时间复杂度。因此,几十年来涌现了大量方法,以权衡分类性能来解决此问题。本文提出了一种新的改进的kNN算法,该算法在降低时间复杂度的同时,具有比传统kNN更好的性能。在提出的算法中,在提出的kNN算法之前要进行花费很少时间的预分类。然后,根据具有一定阈值的分类概率,可以将训练集分为几个部分。之后,选择概率接近于1或0的部分作为训练集。计算了所提算法的准确率和ROC曲线下面积(接收机工作特性曲线),并与实验中的基本kNN算法进行了比较。实验结果表明,不仅基于预分类的kNN算法大大减少了时间成本,而且在准确性和AUC(ROC曲线下的面积)方面也比原始kNN算法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号