首页> 外文会议>International Conference on Communication, Circuits, and Systems >Reinforcement Learning for Improving Coherence of Multi-turn Responses in Deep Learning-Based Chatbots
【24h】

Reinforcement Learning for Improving Coherence of Multi-turn Responses in Deep Learning-Based Chatbots

机译:加强学习,提高基于深入学习的Chatbots中的多转响应的一致性

获取原文

摘要

Chatbots are still far behind in their ability to hold meaningful conversations. The objective of the work is to implement and improve the multi-turn responses of deep learning-based chatbots. Multi-turn response is the ability of a chatbot to give coherent and sensible responses in successive turns. Firstly, sequence to sequence (Seq2Seq) model was built, and its responses were analyzed by varying training parameters. Secondly, the reinforcement learning (RL) method using the Seq2Seq model was implemented, and it is demonstrated that this improves coherence in multi-turn conversations. The RL model performed better than the Seq2Seq model in terms of BiLingual Evaluation Understudy (BLEU) score with a score of 0.3334 compared to 0.2336 of the Seq2Seq model. Average conversation length was found to increase with RL with 3.75 turns compared to 3.05 turns with Seq2Seq.
机译:聊天仍然远远落后于拥有有意义的对话。 该工作的目的是实施和改进基于深度学习的聊天禁令的多转响应。 多转响应是聊天响应在连续转弯中提供连贯性和明智的反应的能力。 首先,建立了序列(SEQ2SEQ)模型的顺序,通过不同的训练参数分析其响应。 其次,实现了使用SEQ2SEQ模型的增强学习(RL)方法,并证明这改善了多转谈话中的一致性。 与双语评估升值(BLEU)评分的SEQ2SEQ模型比SEQ2SEQ模型的0.2366相比,RL模型比SEQ2SEQ模型更好。 发现平均会话长度与RL增加3.75匝,而SEQ2Seq与3.05匝数相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号