首页> 外文会议>IEEE/ACIS International Conference on Computer and Information Science >Text similarity algorithm based on semantic vector space model
【24h】

Text similarity algorithm based on semantic vector space model

机译:基于语义向量空间模型的文本相似度算法

获取原文

摘要

In this paper, a text similarity computation method named VSM-Cilin which is based on semantic vector space model is proposed in the background of radio station. VSM-Cilin improved the traditional VSM in the following areas. First, consider the semantic relations between words. Second, use semantic resources to reduce dimension. Third, use inverted index to filter out candidate document set. Forth, take the weight of the feature item into consideration when compute the similarity. The experiments show that the accuracy of VSM-Cilin is significantly improved compared with the traditional vector space model and the method of bidirectional mapping based on HITIR-Lab Tongyici Cilin.
机译:在广播电台的背景下,提出了一种基于语义向量空间模型的文本相似度计算方法VSM-Cilin。 VSM-Cilin在以下方面改进了传统的VSM。首先,考虑单词之间的语义关系。其次,使用语义资源来减少维度。第三,使用倒排索引过滤掉候选文档集。第四,计算相似度时要考虑特征项的权重。实验表明,与传统的矢量空间模型和基于HITIR-Lab Tongyici Cilin的双向映射方法相比,VSM-Cilin的精度有了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号