首页> 外文会议>SLSP 2013 >Experiments with Semantic Similarity Measures Based on LDA and LSA
【24h】

Experiments with Semantic Similarity Measures Based on LDA and LSA

机译:基于LDA和LSA的语义相似度量实验

获取原文

摘要

We present in this paper experiments with several semantic similarity measures based on the unsupervised method Latent Dirichlet Allocation. For comparison purposes, we also report experimental results using an algebraic method, Latent Semantic Analysis. The proposed semantic similarity methods were evaluated using one dataset that includes student answers from conversational intelligent tutoring systems and a standard paraphrase dataset, the Microsoft Research Paraphrase corpus. Results indicate that the method based on word representations as topic vectors outperforms methods based on distributions over topics and words. The proposed evaluation methods can also be regarded as an extrinsic method for evaluating topic coherence or selecting the number of topics in LDA models, i.e. a task-based evaluation of topic coherence and selection of number of topics in LDA.
机译:我们在本文中展示了几种基于无监督方法潜在Dirichlet分配的语义相似度量的实验。为了比较目的,我们还使用代数方法报告实验结果,潜在语义分析。使用一个数据集进行评估所提出的语义相似性方法,该数据集包括从对话智能辅导系统和标准释义数据集,Microsoft Research rase语料库中的学生答案。结果表明,基于Word表示的方法作为主题向量超出了基于主题和单词的分布的方法。所提出的评估方法也可以被视为用于评估主题一致性的外在方法,或者选择LDA模型中的主题数量,即,基于任务的主题协调性的评估和LDA中的主题数量的选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号