Study on frequent term set-based hierarchical clustering algorithm

机译：基于频繁的基于术语的分层聚类算法研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper, we present a text-clustering algorithm of Frequent Term Set-based Clustering (FTSC), which uses frequent term sets for texts clustering. This algorithm can reduce the dimensionality of the text data efficiently, thus it can improve accurate rate and running speed of the clustering algorithm. The results of clustering texts by the FTSC algorithm cannot reflect the overlap of texts' classes. Based on the FTSC algorithm, its improved algorithm—Frequent Term Set-based Hierarchical Clustering algorithm (FTSHC) is given. This algorithm can determine the overlap of texts' classes by the overlap of frequent words sets, and provide an understandable description of the discovered clusters by the frequent terms sets. The experiment results prove that FTSC and FTSHC algorithms are more efficient than K-Means algorithm in the performance of clustering.

机译：本文，我们介绍了一种基于常规集合的群集（FTSC）的文本聚类算法，它使用频繁的术语集聚类。该算法可以有效地降低文本数据的维度，从而可以提高聚类算法的准确速率和运行速度。 FTSC算法的聚类文本结果不能反映文本类的重叠。基于FTSC算法，给出了其改进的基于算法的基于算法的分层聚类算法（FTSHC）。该算法可以通过频繁单词集的重叠确定文本类的重叠，并通过频繁的术语集提供发现的集群的可理解描述。实验结果证明了FTSC和FTSHC算法比群集性能更有效地比K-Means算法更有效。

著录项

来源
《International Conference on Fuzzy Systems and Knowledge Discovery》|2011年||共5页
会议地点
作者
Wang Huiying; Liu Xiangwei;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
FTSC; Frequent Term; Text Clustering;

机译：FTSC;频繁的术语;文本群集;

相似文献

外文文献
中文文献
专利

1. A Rough Set-Based Hierarchical Clustering Algorithm for Categorical Data [J] . Chaoxue Wang Zhurong Wang, Du-wu Cui, Duo Chen International Journal of Information Technology . 2006,第03期

机译：基于粗糙集的分类数据层次聚类算法
2. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [J] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, Computational and mathematical methods in medicine . 2020,第1期

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法，最小生成树和分层聚类的三种混合方法的比较
3. How frequently do clusters occur in hierarchical clustering analysis? A graph theoretical approach to studying ties in proximity [J] . Wilmer Leal, Eugenio J. Llanos, Guillermo Restrepo, Journal of Cheminformatics . 2016,第1期

机译：聚类在层次聚类分析中出现的频率如何？研究邻近关系的图论方法
4. Study on frequent term set-based hierarchical clustering algorithm [C] . Wang Huiying, Liu Xiangwei 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery . 2011

机译：基于频繁项集的层次聚类算法研究
5. Aspect-based opinion mining of product reviews in microblogs using most relevant frequent clusters of terms. [D] . Ejieh, Chukwuma. 2016

机译：使用最相关的频繁术语集群在微博中基于方面的产品评论意见挖掘。
6. Gene-Set Local Hierarchical Clustering (GSLHC)—A Gene Set-Based Approach for Characterizing Bioactive Compounds in Terms of Biological Functional Groups [O] . Feng-Hsiang Chung, Zhen-Hua Jin, Tzu-Ting Hsu, -1

机译：基因组局部层次聚类（GSLHC）-一种基于基因组的方法根据生物学功能组表征生物活性化合物
7. Gene-Set Local Hierarchical Clustering (GSLHC)--A Gene Set-Based Approach for Characterizing Bioactive Compounds in Terms of Biological Functional Groups. [O] . Feng-Hsiang Chung, Zhen-Hua Jin, Tzu-Ting Hsu, 2015

机译：基因集局部分层聚类（GsLHC） - 基于基因集的生物活性化合物生物功能基团表征方法。

Study on frequent term set-based hierarchical clustering algorithm

摘要

著录项

相似文献

相关主题

期刊订阅