首页> 外文会议>International conference on asian language processing >Towards improving the performance of Vector Space Model for Chinese Frequently Asked Question Answering
【24h】

Towards improving the performance of Vector Space Model for Chinese Frequently Asked Question Answering

机译:旨在提高向量空间模型在中文常见问答中的性能

获取原文

摘要

This paper presents a method which improves the performance of Vector Space Model (VSM) when applying it to Chinese Frequently Asked Questions (FAQ). This method combines unigram and bigram models in determining the similarity of document vectors. The performance is further improved by applying shallow lexical semantics and the document length information. Experiments showed that the proposed methods outperform baselines (segmentation and bigram) across different datasets which include FAQs from restricted domains and open domains.
机译:本文提出了一种将向量空间模型(VSM)应用于中文常见问题(FAQ)时的性能提高方法。该方法结合了unigram模型和bigram模型来确定文档向量的相似性。通过应用浅层词汇语义和文档长度信息,可以进一步提高性能。实验表明,所提出的方法在包括受限域和开放域常见问题解答在内的不同数据集上均优于基线(分段和二元组)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号