首页> 外国专利> SAMPLE ANALYSIS METHOD, APPARATUS, ELECTRONIC DEVICE, AND MEDIUM BASED ON MISSING DATA

SAMPLE ANALYSIS METHOD, APPARATUS, ELECTRONIC DEVICE, AND MEDIUM BASED ON MISSING DATA

机译:基于缺失数据的示例分析方法,装置,电子设备和介质

摘要

Provided are a sample analysis method, apparatus, electronic device, and medium based on missing data, relating to big data technology, said method comprising: obtaining a missing data set and a corresponding label value, calculating the saturation of each missing data dimension in the missing data set, selecting missing data dimensions whose saturation is greater than a preset saturation, and generating a feature dimension list (S1); calculating a coefficient of correlation between each missing data dimension in the feature dimension list and the label value, and selecting missing data dimensions whose correlation coefficient is greater than a preset correlation coefficient to obtain a target missing-data dimension set (S2); modeling the target missing-data dimension set using a preset data missing insensitivity algorithm to generate a missing data insensitive model (S3); using the missing data insensitivity model to perform data analysis on a sample data set to be analyzed to obtain an analysis result (S4). In addition, the invention also relates to blockchain technology; the selected missing-data dimension can be stored in the blockchain. The method can solve the problem of inaccurate sample analysis caused by missing data.
机译:提供了基于缺失数据的样本分析方法,装置,电子设备和介质,与大数据技术有关,所述方法包括:获取缺少的数据集和相应的标签值,计算每个缺失数据维度的饱和度缺少数据集,选择缺失的数据维度,其饱和度大于预设饱和度,并生成特征维列表(S1);计算特征维列表中的每个缺失数据维度与标签值之间的相关系数,以及选择相关系数大于预设相关系数以获得目标缺失数据维度集的缺失数据维度(S2);使用预设数据缺少的不敏感性算法建模目标缺失数据尺寸集以生成缺失的数据不敏感模型(S3);使用缺失的数据不敏感模型在要分析的样本数据集上执行数据分析以获得分析结果(S4)。此外,本发明还涉及区块技术;所选缺失数据维度可以存储在区块链中。该方法可以解决缺失数据引起的样本分析的不准确性问题。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号