A novel clustering approach and prediction of optimal number of clusters: global optimum search with enhanced positioning

Meng Piao Tan; James R. Broach; Christodoulos A. Floudas

Abstract

Cluster analysis of genome-wide expression data from DNA microarray hybridization studies is a useful tool for identifying biologically relevant gene groupings (DeRisi et al. 1997; Weiler et al. 1997). It is hence important to apply a rigorous yet intuitive clustering algorithm to uncover these genomic relationships. In this study, we describe a novel clustering algorithm framework based on a variant of the Generalized Benders Decomposition, denoted as the Global Optimum Search (Floudas et al. 1989; Floudas 1995), which includes a procedure to determine the optimal number of clusters to be used. The approach involves a pre-clustering of data points to define an initial number of clusters, followed by the iterative solution of a Linear Programming problem (the primal problem) and a Mixed-Integer Linear Programming problem (the master problem), both derived from a Mixed-Integer Nonlinear Programming problem formulation. Badly placed data points are removed to form new clusters, which ensures tight groupings amongst the data points and increments the number of clusters until the optimum number is reached. We apply the proposed clustering algorithm to experimental DNA microarray data centered on the Ras signaling pathway in the yeast Saccharomyces cerevisiae and compare the results to those obtained with some commonly used clustering algorithms. Our algorithm compares favorably against these algorithms in terms of intra-cluster similarity and inter-cluster dissimilarity, often considered two key tenets of clustering. Furthermore, our algorithm can predict the optimal number of clusters, and the biological coherence of the predicted clusters is analyzed through gene ontology.
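The two comparison criteria named in the abstract, intra-cluster similarity and inter-cluster dissimilarity, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the exact metrics, so the sketch assumes squared Euclidean distances to cluster centroids, and the function names (`intra_cluster_error`, `inter_cluster_error`) and the toy data are hypothetical.

```python
from math import fsum


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(points)
    return [fsum(p[k] for p in points) / n for k in range(len(points[0]))]


def sq_dist(a, b):
    """Squared Euclidean distance (an assumed metric, not taken from the paper)."""
    return fsum((x - y) ** 2 for x, y in zip(a, b))


def intra_cluster_error(clusters):
    """Sum of squared distances from each point to its own cluster centroid.

    Lower values mean tighter clusters.
    """
    total = []
    for cluster in clusters:
        c = centroid(cluster)
        total.extend(sq_dist(p, c) for p in cluster)
    return fsum(total)


def inter_cluster_error(clusters):
    """Sum of squared distances between every pair of cluster centroids.

    Higher values mean better separated clusters.
    """
    cents = [centroid(c) for c in clusters]
    return fsum(
        sq_dist(cents[i], cents[j])
        for i in range(len(cents))
        for j in range(i + 1, len(cents))
    )


# Toy data: four points forming two obvious groups along the x-axis.
good = [[[0.0, 0.0], [0.0, 2.0]], [[10.0, 0.0], [10.0, 2.0]]]
# The same four points, grouped badly (each cluster spans both groups).
bad = [[[0.0, 0.0], [10.0, 0.0]], [[0.0, 2.0], [10.0, 2.0]]]

print(intra_cluster_error(good), inter_cluster_error(good))
print(intra_cluster_error(bad), inter_cluster_error(bad))
```

Under these assumptions, the sensible grouping scores a lower intra-cluster error and a higher inter-cluster error than the bad grouping, which is the direction in which the abstract says the proposed algorithm compares favorably.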

Bibliographic details

Source
Journal of Global Optimization, 2007, No. 3, pp. 323-346 (24 pages)
Authors
Meng Piao Tan; James R. Broach; Christodoulos A. Floudas
Affiliation
Department of Chemical Engineering, Princeton University, Princeton, NJ 08544, USA
Original format
PDF
Text language
English
Chinese Library Classification
Applied mathematics
Keywords
clustering; microarray data; optimization
