Active Learning Approach for Intent Classification in Portuguese Language Conversations



Abstract

Intent classification from conversations is a challenging task, especially when messages are collected from real data, since such messages can contain grammatical errors, outliers, slang, and a large number of categories. Most intent classification methods use supervised learning, which requires a large number of labeled samples during training, and they do not account for the scarcity of labeled data. In this article, we propose to reduce the number of samples that must be labeled while maintaining a desired classification effectiveness. Our method combines active learning, to minimize the amount of labeled data required, with a convolutional neural network that obtains effective vector representations from BERT to classify messages accurately. Experimental results on a large Brazilian Portuguese corpus suggest that the proposed method can achieve improvements with more than half of the training data, and accurate results with less than half of the data on a small dataset such as ATIS.
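The abstract does not state which query strategy, batch size, or stopping rule the authors use. As a rough illustration of the general idea only (pool-based active learning), the Python sketch below assumes least-confidence sampling and swaps the paper's convolutional network over BERT representations for a toy nearest-centroid classifier on synthetic 2-D points. All function names and parameters are hypothetical and are not taken from the paper.

```python
import math
import random


def fit_centroids(points, labels):
    """Toy classifier (assumption): one mean vector (centroid) per class."""
    sums, counts = {}, {}
    for x, y in zip(points, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict_proba(centroids, x):
    """Softmax over negative squared distances to each class centroid."""
    scores = {y: -sum((a - b) ** 2 for a, b in zip(x, c))
              for y, c in centroids.items()}
    top = max(scores.values())
    exps = {y: math.exp(s - top) for y, s in scores.items()}
    total = sum(exps.values())
    return {y: e / total for y, e in exps.items()}


def least_confident(centroids, pool, k):
    """Positions in `pool` of the k points with the lowest top-class probability."""
    confidence = [max(predict_proba(centroids, x).values()) for x in pool]
    return sorted(range(len(pool)), key=lambda i: confidence[i])[:k]


def active_learning(points, oracle, seed_idx, budget, batch_size):
    """Query labels from `oracle` until `budget` points are labeled."""
    labels = {i: oracle(i) for i in seed_idx}
    unlabeled = [i for i in range(len(points)) if i not in labels]
    while len(labels) < budget and unlabeled:
        idx = list(labels)
        model = fit_centroids([points[i] for i in idx], [labels[i] for i in idx])
        k = min(batch_size, budget - len(labels), len(unlabeled))
        picks = least_confident(model, [points[i] for i in unlabeled], k)
        chosen = {unlabeled[p] for p in picks}
        for i in chosen:
            labels[i] = oracle(i)  # the only place where labeling cost is paid
        unlabeled = [i for i in unlabeled if i not in chosen]
    idx = list(labels)
    model = fit_centroids([points[i] for i in idx], [labels[i] for i in idx])
    return model, sorted(labels)


# Synthetic stand-in for message embeddings: two well-separated 2-D blobs.
rng = random.Random(0)
points = [[rng.gauss(-3, 1), rng.gauss(0, 1)] for _ in range(50)]
points += [[rng.gauss(3, 1), rng.gauss(0, 1)] for _ in range(50)]
truth = [0] * 50 + [1] * 50

model, labeled = active_learning(points, lambda i: truth[i],
                                 seed_idx=[0, 50], budget=20, batch_size=3)
predictions = [max(p, key=p.get) for p in (predict_proba(model, x) for x in points)]
accuracy = sum(p == t for p, t in zip(predictions, truth)) / len(truth)
print(f"labeled {len(labeled)} of {len(points)} points, accuracy {accuracy:.2f}")
```

In a real setting the `oracle` would be a human annotator, and the classifier and features would be whatever model is being trained; only the loop structure (train, score the unlabeled pool, query the least certain items, retrain) carries over.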


Bibliographic details

Source
IEEE International Conference on Semantic Computing | 2021 | pp. 227-232 | 6 pages
Authors
Jeanfranco D. Farfan-Escobedo; Kelly Lopes; Julio C. Dos Reis;
Original format: PDF
Keywords
Training; Supervised learning; Bit error rate; Training data; Computer architecture; Data models; Task analysis;


Similar documents


1. Entropy query by bagging-based active learning approach in the extreme learning machine framework for hyperspectral image classification [J]. Pradhan Monoj K., Minz Sonajharia, Shrivastava Vimal K. Current Science: A Fortnightly Journal of Research. 2020, No. 6
2. Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach [J]. Wei-Hung Weng, Kavishwar B. Wagholikar, Alexa T. McCray. BMC Medical Informatics and Decision Making. 2017, No. 1
3. An online reversed French Sign Language dictionary based on a learning approach for signs classification [J]. Zbakh Mohammed, Haddad Zehira, Krahe Jaime Lopez. Pattern Recognition Letters. 2015, No. DECa1PTa1
4. Intent Classification based on Deep Learning Language Model in Turkish Dialog Systems [C]. Eyup Halit Yilmaz, Cagri Toraman. Signal Processing and Communications Applications Conference. 2021
5. Investments in Communities of Learners and Speakers: How African American students of Portuguese negotiate ethno-racialized, gendered, and social-classed identities in second language learning [D]. Anya, Obianuju Chinyelu. 2011
6. Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach [O]. Wei-Hung Weng, Kavishwar B. Wagholikar, Alexa T. McCray. 2017
7. Multi-Spectral Image Classification Based on an Object-Based Active Learning Approach [O]. Tengfei Su, Shengwei Zhang, Tingxi Liu. 2020
