Missing Categorical Data Imputation for FCM Clusterings of Mixed Incomplete Data

机译：缺少混合不完整数据的FCM群集的分类数据归档

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Data mining is related to human congnitive ability, and one of popular method is fuzzy clustering. The focus of fuzzy c-means (FCM) clustering method is normally used on numerical data. However, most data existing in databases are both categorical and numerical. To date, clustering methods have been developed to analyze only complete data. Although we, sometimes, encounter data sets that contain one or more missing feature values (incomplete data) in data intensive classification systems, traditional clustering methods cannot be used for such data. Thus, we study this theme and discuss clustering methods that can handle mixed numerical and categorical incomplete data. In this paper, we propose some algorithms that use the missing categorical data imputation method and distances between numerical data that contain missing values. Finally, we show through a real data experiment that our proposed method is more effective than without imputation, when missing ratio becomes higher.

机译：数据挖掘与人类突出能力相关，流行方法之一是模糊聚类。模糊C-Means（FCM）聚类方法的焦点通常用于数值数据。但是，数据库中存在的大多数数据都是分类和数值。迄今为止，已经开发了群集方法来分析完整数据。虽然我们有时，遇到包含一个或多个缺失特征值（不完整数据）的数据集，但是在数据密集型分类系统中，传统的聚类方法不能用于此类数据。因此，我们研究了这个主题并讨论了可以处理混合数值和分类不完整数据的聚类方法。在本文中，我们提出了一些使用缺失的分类数据载旋方法和包含缺失值的数字数据之间的距离的算法。最后，我们通过真实的数据实验表明，当缺失的比率变高时，我们所提出的方法比毫无归发的毫无效益。

著录项

来源
《International Conference on Advanced Cognitive Technologies and Applications》|2014年||共5页
会议地点
作者
Takashi Furukawa; Shin-ichi Ohnishi; Takahiro Yamanoi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Clustering; Incomplete data; Mixed data; FCM;

机译：聚类;不完整的数据;混合数据;FCM;

相似文献

外文文献
中文文献
专利

1. Working with Missing Data: Imputation of Nonresponse Items in Categorical Survey Data with a Non-Monotone Missing Pattern [J] . Machelle D.Wilson, KerstinLueck Journal of applied mathematics . 2014,第2期

机译：处理缺失数据：在具有非单调缺失模式的分类调查数据中插补未答复项
2. Pre-processing of incomplete spectrum sensing data in spectrum sensing data falsification attacks detection: a missing data imputation approach [J] . Junnan Yao, Jianjun Cao, Qibin Zheng, Communications, IET . 2016,第11期

机译：频谱感测数据伪造攻击检测中不完整的频谱感测数据的预处理：缺失的数据插补方法
3. Missing Categorical Data Imputation and Individual Observation Level Imputation [J] . Zimmermann Pavel, Mazouch Petr, Hulíková Tesárková Klára Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis . 2014,第6期

机译：分类数据归因缺失和个人观察水平归因
4. Missing Categorical Data Imputation for FCM Clusterings of Mixed Incomplete Data [C] . Takashi Furukawa, Shin-ichi Ohnishi, Takahiro Yamanoi International Conference on Advanced Cognitive Technologies and Applications . 2014

机译：缺少混合不完整数据的FCM群集的分类数据归档
5. Handling Incomplete High-Dimensional Multivariate Longitudinal Data with Mixed Data Types by Multiple Imputation Using a Longitudinal Factor Analysis Model. [D] . Lu, Xiang. 2016

机译：使用纵向因素分析模型通过多重插补处理具有混合数据类型的不完整的高维多元纵向数据。
6. How handling missing data may impact conclusions: A comparison of six different imputation methods for categorical questionnaire data [O] . Marianne Riksheim Stavseth, Thomas Clausen, Jo Røislien 2019

机译：处理缺失的数据可能如何影响结论：对分类问卷数据的六种不同估算方法的比较
7. Working with Missing Data: Imputation of Nonresponse Items in Categorical Survey Data with a Non-Monotone Missing Pattern [O] . Machelle D. Wilson, Kerstin Lueck 2014

机译：使用缺失数据：非单调缺失模式的分类调查数据中的非响应项目的归纳

Missing Categorical Data Imputation for FCM Clusterings of Mixed Incomplete Data

摘要

著录项

相似文献

相关主题

期刊订阅