首页> 外文期刊>Computational statistics & data analysis >Factorial k-means analysis for two-way data
【24h】

Factorial k-means analysis for two-way data

机译:双向数据的阶乘k均值分析

获取原文
获取原文并翻译 | 示例
           

摘要

A discrete clustering model together with a continuous factorial one are fitted simultaneously to two-way data, with the aim of identifying the best partition of the objects, described by the best orthogonal linear combinations of the variables (factors) according to the least-squares criterion. This methodology named for its features factorial k-means analysis has a very wide range of applications since it fulfills a double objective: data reduction and synthesis, simultaneously in the direction of objects and variables; variable selection in cluster analysis, identifying variables that most contribute to determine the classification of the objects. The least-squares fitting problem proposed here is mathematically formalized as a quadratic constrained minimization problem with mixed variables. An iterative alternating least-squares algorithm based on two main steps is proposed to solve the quadratic constrained problem. Starting from the cluster centroids, the subspace projection is found that leads to the smallest distances between object points and centroids. Updating the centroids, the partition is detected assigning objects to the closest centroids. At each step the algorithm decreases the least-squares criterion, thus converging to an optimal solution. Two data sets are analyzed to show the features of the factorial k-means model. The proposed technique has a fast algorithm that allows researchers to use it also with large data sets.
机译:同时将离散聚类模型与连续阶乘模型同时拟合到双向数据,目的是确定对象的最佳划分,并根据最小二乘变量(因子)的最佳正交线性组合进行描述标准。这种以因子k均值分析为特征的方法具有广泛的应用范围,因为它实现了双重目标:在对象和变量的方向上同时进行数据精简和合成;聚类分析中的变量选择,确定最有助于确定对象分类的变量。这里提出的最小二乘拟合问题在数学上形式化为带有混合变量的二次约束最小化问题。为了解决二次约束问题,提出了一种基于两个主要步骤的迭代交替最小二乘算法。从聚类质心开始,发现子空间投影导致物体点和质心之间的最小距离。更新质心时,会检测到将对象分配给最接近的质心的分区。在每一步中,算法都会减小最小二乘准则,从而收敛到最优解。分析了两个数据集以显示阶乘k均值模型的特征。所提出的技术具有一种快速算法,允许研究人员也将其用于大型数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号