首页> 外文会议>Annual Conference of the Society of Instrument and Control Engineers of Japan >Detection of Weak Relevant Variables using Random Forests
【24h】

Detection of Weak Relevant Variables using Random Forests

机译:使用随机森林检测弱相关变量

获取原文

摘要

In general, the purpose of a machine learning technique is to obtain an approximation of a function based on a set of training data. To obtain a good approximation of the function, the input variables irrelevant to the output should be removed in advance. The process used to remove the irrelevant input variables is known as feature selection. The existing feature selection methods generally select input variables that make it possible to approximate the function more accurately. As a consequence, these methods often fail to detect input variables that weakly affect the output, especially when the training data given are insufficient. In some practical applications of machine learning techniques, however, all of the input variables that actually affect the output must be detected. In this study, we propose a new feature selection method to overcome this drawback of the existing methods. Our method evaluates the relevance of a certain input variable by comparing it with a random variable. We show, through numerical experiments, that the proposed method is capable of detecting even input variables that weakly affect the output.
机译:通常,机器学习技术的目的是基于一组训练数据来获得函数的近似值。为了获得良好的函数逼近,应事先删除与输出无关的输入变量。删除不相关的输入变量的过程称为功能选择。现有的特征选择方法通常选择输入变量,从而可以更准确地近似函数。结果,这些方法通常无法检测到对输出有微弱影响的输入变量,尤其是在给定的训练数据不足时。但是,在机器学习技术的某些实际应用中,必须检测到实际上影响输出的所有输入变量。在这项研究中,我们提出了一种新的特征选择方法,以克服现有方法的这一缺点。我们的方法通过将某个输入变量与随机变量进行比较来评估其相关性。我们通过数值实验表明,所提出的方法能够检测甚至对输出产生微弱影响的输入变量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号