Reinforcement Learning for Improving Coherence of Multi-turn Responses in Deep Learning-Based Chatbots

机译：加强学习，提高基于深入学习的Chatbots中的多转响应的一致性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Chatbots are still far behind in their ability to hold meaningful conversations. The objective of the work is to implement and improve the multi-turn responses of deep learning-based chatbots. Multi-turn response is the ability of a chatbot to give coherent and sensible responses in successive turns. Firstly, sequence to sequence (Seq2Seq) model was built, and its responses were analyzed by varying training parameters. Secondly, the reinforcement learning (RL) method using the Seq2Seq model was implemented, and it is demonstrated that this improves coherence in multi-turn conversations. The RL model performed better than the Seq2Seq model in terms of BiLingual Evaluation Understudy (BLEU) score with a score of 0.3334 compared to 0.2336 of the Seq2Seq model. Average conversation length was found to increase with RL with 3.75 turns compared to 3.05 turns with Seq2Seq.

机译：聊天仍然远远落后于拥有有意义的对话。该工作的目的是实施和改进基于深度学习的聊天禁令的多转响应。多转响应是聊天响应在连续转弯中提供连贯性和明智的反应的能力。首先，建立了序列（SEQ2SEQ）模型的顺序，通过不同的训练参数分析其响应。其次，实现了使用SEQ2SEQ模型的增强学习（RL）方法，并证明这改善了多转谈话中的一致性。与双语评估升值（BLEU）评分的SEQ2SEQ模型比SEQ2SEQ模型的0.2366相比，RL模型比SEQ2SEQ模型更好。发现平均会话长度与RL增加3.75匝，而SEQ2Seq与3.05匝数相比。

著录项

来源
《International Conference on Communication, Circuits, and Systems》|2020年|273-279|共7页
会议地点
作者
D. G. Suhaas Kiran; Swapneel; Safal Deepak Pansare; B. N. Krupa;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Chatbot; Seq2seq; Reinforcement learning; Multi-turn responses; Dialogue systems; BLEU score;

机译：聊天;SEQ2SEQ;加强学习;多转响应;对话系统;bleu得分;

相似文献

外文文献
中文文献
专利

1. Learning bi-utterance for multi-turn response selection in retrieval-based chatbots [J] . Shuliang Wang, Dapeng Li, Jing Geng, International Journal of Advanced Robotic Systems . 2019,第2期

机译：在基于检索的聊天机器人中学习双话语以进行多回合响应选择
2. A deep learning-based multi-turn conversation modeling for diagnostic Q&A document recommendation [J] . Zhan Yang, Wei Xu, Runyu Chen Information Processing & Management . 2021,第3期

机译：基于深度学习的诊断Q＆A文档推荐的多转对谈话建模
3. Ensemble-based deep reinforcement learning for chatbots [J] . Cuayahuitl Heriberto, Lee Donghyeon, Ryu Seonghan, Neurocomputing . 2019,第Nova13期

机译：基于集成的聊天机器人深度强化学习
4. Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network [C] . Xiangyang Zhou, Lu Li, Daxiang Dong, Annual meeting of the Association for Computational Linguistics . 2018

机译：具有深度注意力匹配网络的聊天机器人的多回合响应选择
5. Deep Reinforcement Learning-Based Portfolio Management [D] . Kanwar, Nitin. 2019

机译：基于深度加强学习的投资组合管理
6. Deep Reinforcement Learning-Based Task Scheduling in IoT Edge Computing [O] . Shuran Sheng, Peng Chen, Zhimin Chen, 2021

机译：基于深度加强学习的IOT Edge Computing任务调度
7. Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network [O] . Xiangyang Zhou, Lu Li, Daxiang Dong, 2018

机译：深度关注网络的聊天响应选择多转响应选择

Reinforcement Learning for Improving Coherence of Multi-turn Responses in Deep Learning-Based Chatbots

摘要

著录项

相似文献

相关主题

期刊订阅