首页> 外文会议>International Conference on Combinatorial Optimization and Applications >TNT: An Effective Method for Finding Correlations Between Two Continuous Variables
【24h】

TNT: An Effective Method for Finding Correlations Between Two Continuous Variables

机译:TNT:一种有效的方法,用于在连续变量之间找到相关性

获取原文

摘要

Determining whether two continuous variables are relevant, either linearly or non-linearly correlated, is a fundamental problem in data science. To test whether two continuous variables have a linear correlation is simple and has a much complete solution, but to judge whether they are in nonlinear correlation is far more difficult. Here, we propose a novel method, Tight Nearest-neighbor prediction correlation Test (TNT), to determine whether two continuous variables are nonlinearly correlated. TNT first use the values of one variable to construct a tight neighborhood structure to predict the value of the other variable and then use the sum of squared errors to measure how well the prediction is. A permutation test based on the sum of squared errors is employed to determine whether two continuous variables are relevant. To evaluate the performance of TNT, we performed extensive simulations comparing with seven existing methods. The results on both simulation and real data demonstrate that TNT is an efficient method to test nonlinear correlations, particularly for some nonlinear correlation which existing methods cannot solve, such as "ring".
机译:确定两个连续变量是否与线性或非线性相关的连续变量是相关的,是数据科学中的一个基本问题。为了测试两个连续变量是否具有线性相关性,具有很大的解决方案,而是判断它们是否处于非线性相关性是更困难的。这里,我们提出了一种新颖的方法,紧密最近邻预测相关试验(TNT),以确定两个连续变量是否是非线性相关的。 TNT首先使用一个变量的值来构建紧密的邻域结构来预测其他变量的值,然后使用平方误差的总和来测量预测的程度。基于平方误差之和的置换测试用于确定两个连续变量是否相关。为了评估TNT的性能,我们与七种现有方法进行了广泛的模拟。仿真和实际数据的结果表明TNT是测试非线性相关性的有效方法,特别是对于现有方法无法解的一些非线性相关性,例如“环”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号