首页> 外文期刊>E3S Web of Conferences >Analytical method for selection an informative set of features with limited resources in the pattern recognition problem
【24h】

Analytical method for selection an informative set of features with limited resources in the pattern recognition problem

机译:选择具有在模式识别问题中有限资源的信息集的分析方法

获取原文
           

摘要

Feature selection is one of the most important issues in Data Mining and Pattern Recognition. Correctly selected features or a set of features in the final report determines the success of further work, in particular, the solution of the classification and forecasting problem. This work is devoted to the development and study of an analytical method for determining informative attribute sets (IAS) taking into account the resource for criteria based on the use of the scattering measure of classified objects. The areas of existence of the solution are determined. Statements and properties are proved for the Fisher type informativeness criterion, using which the proposed analytical method for determining IAS guarantees the optimality of results in the sense of maximizing the selected functional. The relevance of choosing this type of informativeness criterion is substantiated. The universality of the method with respect to the type of features is shown. An algorithm for implementing this method is presented. In addition, the paper discussed the dynamics of the growth of information in the world, problems associated with big data, as well as problems and tasks of data preprocessing. The relevance of reducing the dimension of the attribute space for the implementation of data processing and visualization without unnecessary difficulties is substantiated. The disadvantages of existing methods and algorithms for choosing an informative set of attributes are shown.
机译:特征选择是数据挖掘和模式识别中最重要的问题之一。在最终报告中正确选择的功能或一组特征决定了进一步工作的成功,特别是对分类和预测问题的解决方案。这项工作致力于开发和研究,用于确定基于使用分类对象的散射量度的标准的资源来确定信息的分析方法(IAS)。确定了解决方案的存在区域。陈述和属性被证明为Fisher型信息性标准,使用该方法的确定方法,用于确定IAS的最佳结果,以最大化所选功能的最佳结果。选择这种类型的信息性标准的相关性得到了证实。示出了关于特征类型的方法的普遍性。提出了一种实现该方法的算法。此外,本文讨论了世界上信息增长的动态,与大数据相关的问题,以及数据预处理的问题和任务。证实了在没有不必要的困难的情况下减少属性空间的维度的相关性的相关性。示出了用于选择信息集的现有方法和算法的缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号