...
首页> 外文期刊>Signal processing >Hyperparameter selection for group-sparse regression: A probabilistic approach
【24h】

Hyperparameter selection for group-sparse regression: A probabilistic approach

机译:组稀疏回归的超参数选择:一种概率方法

获取原文
获取原文并翻译 | 示例
           

摘要

This work analyzes the effects on support recovery for different choices of the hyper- or regularization parameter in LASSO-like sparse and group-sparse regression problems. The hyperparameter implicitly selects the model order of the solution, and is typically set using cross-validation (CV). This may be computationally prohibitive for large-scale problems, and also often overestimates the model order, as CV optimizes for prediction error rather than support recovery. In this work, we propose a probabilistic approach to select the hyperparameter, by quantifying the type I error (false positive rate) using extreme value analysis. From Monte Carlo simulations, one may draw inference on the upper tail of the distribution of the spurious parameter estimates, and the regularization level may be selected for a specified false positive rate. By solving the e group-LASSO problem, the choice of hyperparameter becomes independent of the noise variance. Furthermore, the effects on the false positive rate caused by collinearity in the dictionary is discussed, including ways of circumventing them. The proposed method is compared to other hyperparameter-selection methods in terms of support recovery, false positive rate, false negative rate, and computational complexity. Simulated data illustrate how the proposed method outperforms CV and comparable methods in both computational complexity and support recovery. (C) 2018 Elsevier B.V. All rights reserved.
机译:这项工作分析了在类似LASSO的稀疏和群体稀疏回归问题中,选择不同的超化或正则化参数对支持恢复的影响。超参数隐式选择解决方案的模型顺序,通常使用交叉验证(CV)进行设置。由于CV针对预测误差而非支持恢复进行了优化,因此这可能在计算上无法解决大规模问题,并且经常高估模型阶数。在这项工作中,我们提出了一种通过使用极值分析对I型错误(误报率)进行量化来选择超参数的概率方法。从蒙特卡洛模拟中,可以推断出虚假参数估计值分布的上尾,并且可以为指定的误报率选择正则化级别。通过解决e组LASSO问题,超参数的选择变得独立于噪声方差。此外,讨论了字典中共线性对误报率的影响,包括规避它们的方法。在支持恢复,误报率,误报率和计算复杂度方面,将所提方法与其他超参数选择方法进行了比较。仿真数据说明了该方法在计算复杂度和支持恢复方面均优于CV和同类方法。 (C)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号