...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >On the distance concentration awareness of certain data reduction techniques
【24h】

On the distance concentration awareness of certain data reduction techniques

机译:关于某些数据约简技术的距离集中意识

获取原文
获取原文并翻译 | 示例
           

摘要

We make a first investigation into a recently raised concern about the suitability of existing data analysis techniques when faced with the counter-intuitive properties of high dimensional data spaces, such as the phenomenon of distance concentration. Under the structural assumption of a generic linear model with a latent variable and an additive unstructured noise, we find that dimension reduction that explicitly guards against distance concentration recovers the well-known techniques of Fisher's linear discriminant analysis, Fisher's discriminant ratio and a variant of projection pursuit. Extrapolation to regression uncovers a close link to sure independence screening, which is a recently proposed technique for variable selection in ultra-high dimensional feature spaces. Hence, these techniques may be seen as distance concentration aware, despite they have not been explicitly designed to have this property. Throughout our analysis, other than the dependency structure implied by the mentioned linear model, we make no assumptions about the distributions of the variables involved.
机译:当面对高维数据空间的反直觉特性(如距离集中现象)时,我们对现有数据分析技术的适用性提出了新的关注。在具有潜在变量和加性非结构噪声的通用线性模型的结构假设下,我们发现明确防止距离集中的降维恢复了费舍尔线性判别分析,费舍尔判别率和投影变体的著名技术。追求。对回归的外推揭示了与确定独立性筛选的紧密联系,这是最近提出的用于在超高维特征空间中进行变量选择的技术。因此,尽管这些技术尚未明确设计为具有此属性,但可以将它们视为具有距离集中性。在整个分析过程中,除了上述线性模型所隐含的依赖关系结构外,我们不对涉及的变量的分布进行任何假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号