...
首页> 外文期刊>Computational statistics & data analysis >Feature screening for ultrahigh dimensional categorical data with covariates missing at random
【24h】

Feature screening for ultrahigh dimensional categorical data with covariates missing at random

机译:具有随机缺少的协调因子的超高尺寸分类数据的特征筛选

获取原文
获取原文并翻译 | 示例
           

摘要

Most existing feature screening methods assume that data are fully observed. It is quite a challenge to develop screening methods for incomplete data since the traditional missing data analysis techniques cannot be directly applied to ultrahigh dimensional case. A two-step model-free feature screening procedure for ultrahigh dimensional categorical data when some covariate values are missing at random is developed. For each covariate with missing data, the first step screens out the variables in the unspecified propensity function. In the second step, screening statistics such as the adjusted Pearson Chi-Square statistics can be calculated by leveraging the variables obtained in the first step and the special structure of categorical data. Sure screening properties are established for the proposed method. Finite sample performance is investigated by simulation studies and a real data example. (C) 2019 Elsevier B.V. All rights reserved.
机译:大多数现有特征筛选方法假设完全观察到数据。 由于传统缺失的数据分析技术不能直接应用于超高尺寸案例,开发不完整数据的筛选方法是非常挑战的。 开发了一个随机缺少某些协变量的超高尺寸分类数据的两步模型特征筛选程序。 对于具有缺失数据的每个协变量,第一步将在未指定的倾向函数中屏蔽变量。 在第二步中,可以通过利用第一步中获得的变量和分类数据的特殊结构来计算诸如调整后的Pearson Chi-Square统计等筛选统计数据。 确保为提出的方法建立筛选属性。 通过模拟研究和实际数据示例研究了有限的样品性能。 (c)2019年Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号