首页> 外文会议>International Conference on Semantic Web and Web Services >Modifying Online Text Clustering Algorithm using Inverted Index based Operation
【24h】

Modifying Online Text Clustering Algorithm using Inverted Index based Operation

机译:使用基于索引的操作修改在线文本聚类算法

获取原文

摘要

This article concerns the experiments where two strategies of encoding documents are compared with each other for text clustering. We used two test beds for these experiments: NewsPage.com and 20NewsGroups. In order to compute an operation on string vectors, inverted indices of words are built from a corpus as the basis for doing that. In these experiments, single pass algorithm is adopted as the approaches to text clustering. The goal of these experiments is to observe whether modified versions of the two approaches are comparable to their traditional versions, when we use the inverted indices as the basis for performing the operation on string vectors, instead of a restricted sized similarity matrix.
机译:本文涉及实验,其中对文本聚类相互比较了两种编码文件的策略。我们使用了两个实验的两张测试床:NewsPage.com和20新款。为了计算字符串向量的操作,从语料库构建了倒置的单词指标作为执行此操作的基础。在这些实验中,采用单通算法作为文本群集的方法。这些实验的目标是遵守两种方法的修改版本是否与其传统版本相当,当我们使用反转索引作为对字符串向量执行操作的基础,而不是限制大小的相似矩阵。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号