首页> 外文会议>IEEE International Conference on Semantic Computing >Active Learning Approach for Intent Classification in Portuguese Language Conversations
【24h】

Active Learning Approach for Intent Classification in Portuguese Language Conversations

机译:葡萄牙语对话中意图分类的积极学习方法

获取原文

摘要

Intent classification from conversations is a challenging task especially if messages are collected from real data since messages can contain several grammatical errors, outliers, language slang, and a lot of categories. Most of the intent classification methods use supervised learning approaches and do not consider the lack of labeled data, because supervised learning approach requires a large amount of labeled samples during its training process. In this article, we propose to reduce the sample scale to be labeled and maintain a desired classification effectiveness. Our method is based on active learning to minimizes amount of labeled data required and a convolutional neural network that obtains effective vector representations from BERT to perform accurate classification of messages. Experimental results on a large Brazilian Portuguese corpus suggest that the proposed method can achieve improvements with more than half of the training data, and accurate results with less than half of data in small dataset like as ATIS.
机译:谈话的意图分类是一个具有挑战性的任务,特别是如果消息可以包含几个语法错误,异常值,语言俚语和大量类别,则从真实数据收集消息。大多数意图分类方法使用监督学习方法,并不考虑缺乏标记数据,因为监督学习方法需要大量标记的样本在其培训过程中。在本文中,我们建议减少要标记的样本量表并保持所需的分类效果。我们的方法基于主动学习,可最大限度地减少所需的标记数据量和卷积神经网络,其获得来自BERT的有效矢量表示来执行准确的消息分类。大型巴西葡萄牙语法上的实验结果表明,该方法可以通过超过一半的训练数据实现改进,并且准确的结果在小型数据集中的小于DataSet中的一半。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号