首页> 外文会议>Conference on empirical methods in natural language processing >A Teacher-Student Framework for Maintainable Dialog Manager
【24h】

A Teacher-Student Framework for Maintainable Dialog Manager

机译:可维护对话框管理器的师生框架

获取原文

摘要

Reinforcement learning (RL) is an attractive solution for task-oriented dialog systems. However, extending RL-based systems to handle new intents and slots requires a system redesign. The high maintenance cost makes it difficult to apply RL methods to practical systems on a large scale. To address this issue, we propose a practical teacher-student framework to extend RL-based dialog systems without retraining from scratch. Specifically, the "student" is an extended dialog manager based on a new ontology, and the "teacher" is existing resources used for guiding the learning process of the "student". By specifying constraints held in the new dialog manager, we transfer knowledge of the "teacher" to the "student" without additional resources. Experiments show that the performance of the extended system is comparable to the system trained from scratch. More importantly, the proposed framework makes no assumption about the unsupported intents and slots, which makes it possible to improve RL-based systems incrementally.
机译:强化学习(RL)是面向任务的对话系统的一种有吸引力的解决方案。但是,扩展基于RL的系统以处理新的意图和位置需要对系统进行重新设计。高昂的维护成本使得难以将RL方法大规模应用于实际系统。为了解决这个问题,我们提出了一个实用的师生框架来扩展基于RL的对话系统,而无需从头开始进行再培训。具体地,“学生”是基于新本体的扩展对话管理器,“老师”是用于指导“学生”的学习过程的现有资源。通过指定新对话框管理器中保留的约束,我们无需其他资源即可将“老师”的知识转移到“学生”。实验表明,扩展系统的性能可与从头开始训练的系统相媲美。更重要的是,提出的框架不对不支持的意图和插槽进行任何假设,这使得有可能逐步改进基于RL的系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号