首页> 外文期刊>International Journal of Distributed Sensor Networks >A novel multi-label classification algorithm based on K-nearest neighbor and random walk
【24h】

A novel multi-label classification algorithm based on K-nearest neighbor and random walk

机译:一种基于K-Colly Exbank和随机步行的新型多标签分类算法

获取原文
           

摘要

The multi-label classification problem occurs in many real-world tasks where an object is naturally associated with multiple labels, that is, concepts. The integration of the random walk approach in the multi-label classification methods attracts many researchers’ sight. One challenge of using the random walk-based multi-label classification algorithms is to construct a random walk graph for the multi-label classification algorithms, which may lead to poor classification quality and high algorithm complexity. In this article, we propose a novel multi-label classification algorithm based on the random walk graph and the K -nearest neighbor algorithm (named MLRWKNN). This method constructs the vertices set of a random walk graph for the K -nearest neighbor training samples of certain test data and the edge set of correlations among labels of the training samples, thus considerably reducing the overhead of time and space. The proposed method improves the similarity measurement by differentiating and integrating the discrete and continuous features, which reflect the relationships between instances more accurately. A label predicted method is devised to reduce the subjectivity of the traditional threshold method. The experimental results with four metrics demonstrate that the proposed method outperforms the seven state-of-the-art multi-label classification algorithms in contrast and makes a significant improvement for multi-label classification.
机译:在许多实际任务中发生多标签分类问题,其中一个对象自然与多个标签相关联,即概念。在多标签分类方法中随机步行方法的集成吸引了许多研究人员的视线。使用随机散步的多标签分类算法的一个挑战是为多标签分类算法构造随机步行图,这可能导致分类质量差和高算法复杂性。在本文中,我们提出了一种基于随机步行图的新型多标签分类算法和 k-nearest邻居算法(名为MLRWKN)。该方法构造用于训练样本标签之间的某些测试数据的 K -Nearest邻居训练样本的随机步行图的顶点集,从而大大减少了时间和空间的开销。所提出的方法通过区分和整合离散和连续特征来改善相似度测量,这更准确地反映了实例之间的关系。设计了标签预测方法,以降低传统阈值方法的主观性。具有四个度量的实验结果表明,所提出的方法相比之下,七种最先进的多标签分类算法,对多标签分类进行了重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号