首页> 外文期刊>Pattern recognition letters >Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems
【24h】

Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems

机译:在对话系统中仅使用域内语句进行神经语句嵌入以进行域外语句检测

获取原文
获取原文并翻译 | 示例
       

摘要

To ensure satisfactory user experience, dialog systems must be able to determine whether an input sentence is in-domain (ID) or out-of-domain (OOD). We assume that only ID sentences are available as training data because collecting enough OOD sentences in an unbiased way is a laborious and time-consuming job. This paper proposes a novel neural sentence embedding method that represents sentences in a low dimensional continuous vector space that emphasizes aspects that distinguish ID cases from OOD cases. We first used a large set of unlabeled text to pre-train word representations that are used to initialize neural sentence embedding. Then we used domain-category analysis as an auxiliary task to train neural sentence embedding for OOD sentence detection. After the sentence representations were learned, we used them to train an autoencoder aimed at OOD sentence detection. We evaluated our method by experimentally comparing it to the state-of-the-art methods in an eight-domain dialog system; our proposed method achieved the highest accuracy in all tests. (C) 2017 Elsevier B.V. All rights reserved.
机译:为了确保令人满意的用户体验,对话系统必须能够确定输入句子是域内(ID)还是域外(OOD)。我们假设只有ID语句可用作训练数据,因为以无偏见的方式收集足够的OOD语句是一项费时费力的工作。本文提出了一种新颖的神经语句嵌入方法,该方法在低维连续向量空间中表示语句,该方法着重于区分ID案件与OOD案件的方面。我们首先使用大量未标记的文本来预训练用于初始化神经语句嵌入的单词表示。然后我们使用领域类别分析作为辅助任务来训练用于OOD句子检测的神经句子嵌入。在学习了句子表示之后,我们使用它们来训练针对OOD句子检测的自动编码器。我们通过与八域对话系统中的最新方法进行实验比较来评估我们的方法。我们提出的方法在所有测试中均达到了最高的准确性。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号