首页> 外文会议>Computer and Information Technology and Workshops (ICCIT), 2008 11th International Conference on; Khulna,Bangladesh >Web document clustering approach using wordnet lexical categories and fuzzy clustering
【24h】

Web document clustering approach using wordnet lexical categories and fuzzy clustering

机译:使用Wordnet词汇类别和模糊聚类的Web文档聚类方法

获取原文
获取原文并翻译 | 示例

摘要

Web mining is defined as applying data mining techniques to the content, structure, and usage of Web resources. The three areas of Web mining are commonly distinguished: content mining, structure mining, and usage mining. In all these areas, a wide range of general data mining techniques, in particular association rule discovery, clustering, classification, and sequence mining, are employed and developed further to reflect the specific structures of Web resources and the specific questions posed in Web mining. In this paper, we introduced a web document clustering approach that uses WordNet lexical categories and fuzzy c-means algorithm to improve the performance of clustering problem for web document. Experiments show that Fuzzy c-means algorithm achieves great performance optimization with comparison with the recent algorithms for document clustering.
机译:Web挖掘被定义为将数据挖掘技术应用于Web资源的内容,结构和使用。 Web挖掘的三个领域通常是不同的:内容挖掘,结构挖掘和用法挖掘。在所有这些领域中,采用并开发了广泛的常规数据挖掘技术,尤其是关联规则发现,聚类,分类和序列挖掘,以反映Web资源的特定结构和Web挖掘中提出的特定问题。在本文中,我们介绍了一种使用WordNet词汇类别和模糊c均值算法的Web文档聚类方法,以提高Web文档聚类问题的性能。实验表明,与最近的文档聚类算法相比,Fuzzy c-means算法在性能上实现了极大的优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号