首页> 外文期刊>ACM transactions on Asian language information processing >Response Selection and Automatic Message-Response Expansion in Retrieval-Based QA Systems using Semantic Dependency Pair Model
【24h】

Response Selection and Automatic Message-Response Expansion in Retrieval-Based QA Systems using Semantic Dependency Pair Model

机译:基于检索依赖模型的基于检索的QA系统中的响应选择和自动消息响应扩展

获取原文
获取原文并翻译 | 示例
       

摘要

This article presents an approach to response selection and message-response (MR) database expansion from the unstructured data on the psychological consultation websites for a retrieval-based question answering (QA) system in a constrained domain for emotional support and comforting. First, we manually construct an initial MR database based on the articles collected from the psychological consultation websites. The Chinese Knowledge and Information Processing probabilistic context-free grammar is adopted to obtain the semantic dependency graphs (SDGs) of all the messages and responses in the initial MR database. For each sentence in the MR database, all the semantic dependencies, each composed of two words and their semantic relation, are extracted from the SDG of the sentence to form a semantic dependency set. Finally, a matrix with the element representing the correlation between the semantic dependencies of the messages and their corresponding responses is constructed as a semantic dependency pair model (SDPM) for response selection. Moreover, as the number of MR pairs in the psychological consultation websites is increasing day by day, the MR database in the QA system should be expanded to meet the needs of the users. For MR database expansion, the unstructured data from the message board are automatically collected. For the collected data, the supervised latent Dirichlet allocation is adopted for event detection and then the event-based delta Bayesian Information Criterion is used for message and response article segmentation. Each extracted message segment is then fed to the constructed retrieval-based QA system to find the best matched response segment and the matching score is also estimated to verify if the new MR pair is suitable to be included in the expanded MR database. Fivefold cross validation was employed to evaluate the performance of the proposed retrieval-based QA system over the expanded MR database based on SDPM. Compared to the vector space model-based method, the Okapi BM25 model, and the deep learning-based sequence-to-sequence with attention model, the proposed approach achieved a more favorable performance according to a statistical significance test. The retrieval accuracy based on MR expansion was also evaluated and a satisfactory result was obtained confirming the effectiveness of the expanded MR database. In addition, the user's satisfaction score of the proposed system was evaluated using the Cronbach's alpha value and the satisfaction score of the proposed SDPM was higher than those of the methods for comparison.
机译:本文提出了一种方法,用于从心理咨询网站上的非结构化数据扩展响应选择和消息响应(MR)数据库,以在情感支持和安慰的受限域中使用基于检索的问题解答(QA)系统。首先,我们根据从心理咨询网站收集的文章手动构建初始MR数据库。采用中文知识和信息处理概率无上下文语法来获取初始MR数据库中所有消息和响应的语义依赖图(SDG)。对于MR数据库中的每个句子,从句子的SDG中提取所有语义依赖关系,每个语义依赖关系由两个单词组成,并包含它们的语义关系,以形成语义依赖关系集。最后,将具有表示消息的语义依赖性及其相应响应之间的相关性的元素的矩阵构造为用于响应选择的语义依赖性对模型(SDPM)。此外,随着心理咨询网站中MR对的数量日益增加,应扩展QA系统中的MR数据库以满足用户的需求。对于MR数据库扩展,会自动收集来自留言板的非结构化数据。对于收集的数据,采用监督的潜在Dirichlet分配进行事件检测,然后将基于事件的德尔塔贝叶斯信息准则用于消息和响应文章的细分。然后将每个提取的消息段输入到构建的基于检索的QA系统中,以找到最佳匹配的响应段,并且还估计匹配分数,以验证新MR对是否适合包含在扩展MR数据库中。五重交叉验证用于评估基于SDPM的扩展MR数据库上基于检索的QA系统的性能。与基于向量空间模型的方法,Okapi BM25模型和基于深度学习的序列到注意力模型相比,该方法通过统计显着性检验获得了更好的性能。还评估了基于MR扩展的检索精度,并获得了令人满意的结果,证实了扩展MR数据库的有效性。此外,使用Cronbach的alpha值评估了建议系统的用户满意度得分,并且建议SDPM的满意度得分高于比较方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号