首页> 外文期刊>Information Processing & Management >Clustered SVD strategies in latent semantic indexing
【24h】

Clustered SVD strategies in latent semantic indexing

机译:潜在语义索引中的聚类SVD策略

获取原文
获取原文并翻译 | 示例
       

摘要

The text retrieval method using latent semantic indexing (LSI) technique with truncated singular value decomposition (SVD) has been intensively studied in recent years. The SVD reduces the noise contained in the original representation of the term-document matrix and improves the information retrieval accuracy. Recent studies indicate that SVD is mostly useful for small homogeneous data collections. For large inhomogeneous datasets, the performance of the SVD based text retrieval technique may deteriorate. We propose to partition a large inhomogeneous dataset into several smaller ones with clustered structure, on which we apply the truncated SVD. Our experimental results show that the clustered SVD strategies may enhance the retrieval accuracy and reduce the computing and storage costs.
机译:近年来,人们对使用潜在语义索引(LSI)技术和截断奇异值分解(SVD)的文本检索方法进行了深入研究。 SVD减少了术语文档矩阵原始表示中包含的噪声,并提高了信息检索的准确性。最近的研究表明,SVD对于小型同类数据收集最有用。对于大型不均匀数据集,基于SVD的文本检索技术的性能可能会下降。我们建议将一个较大的不均匀数据集划分为几个具有聚类结构的较小数据集,然后在该数据集上应用截短的SVD。我们的实验结果表明,聚簇SVD策略可以提高检索精度,并降低计算和存储成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号