Annual Conference of the International Speech Communication Association

A New Pre-training Method for Training Deep Learning Models with Application to Spoken Language Understanding


Abstract

We propose a simple and efficient approach for pre-training deep learning models, with application to slot-filling tasks in spoken language understanding. The proposed approach leverages unlabeled data to train the models and is generic enough to work with any deep learning model. In this study, we consider the CNN2CRF architecture, which combines a Convolutional Neural Network (CNN) with a Conditional Random Field (CRF) as the top layer, since it has shown great potential for learning useful representations for supervised sequence-learning tasks. With this architecture, the proposed pre-training approach learns feature representations from both labeled and unlabeled data at the CNN layer, covering features that would not be observed in limited labeled data. At the CRF layer, the unlabeled data contributes predicted word classes as latent sequence labels, which are used together with the labeled sequences. Latent labeled sequences, in principle, have a regularization effect on the labeled sequences, yielding a better-generalized model. This allows the network to learn representations that are useful not only for slot tagging on labeled data but also for learning dependencies both within and between latent clusters of unseen words. The proposed pre-training method with the CNN2CRF architecture achieves significant gains over the strongest semi-supervised baseline.
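
To make the described architecture concrete, below is a minimal sketch (not the authors' code) of a CNN2CRF slot tagger in PyTorch: a one-dimensional convolution over word embeddings produces per-token emission scores, and a linear-chain CRF on top scores whole tag sequences. The CRF layer here comes from the third-party pytorch-crf package, all hyperparameters (embedding size, hidden width, kernel size) are illustrative assumptions, and the pseudo-labeling step at the end is only one plausible reading of how predicted word classes could serve as the latent sequence labels the abstract mentions.

```python
import torch
import torch.nn as nn
from torchcrf import CRF  # third-party: pip install pytorch-crf


class CNN2CRF(nn.Module):
    def __init__(self, vocab_size, num_tags, emb_dim=100, hidden=128, kernel=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        # Same-length 1-D convolution: one feature vector per token.
        self.conv = nn.Conv1d(emb_dim, hidden, kernel, padding=kernel // 2)
        self.proj = nn.Linear(hidden, num_tags)  # per-token emission scores
        self.crf = CRF(num_tags, batch_first=True)

    def emissions(self, tokens):
        x = self.embed(tokens).transpose(1, 2)        # (B, emb, T)
        h = torch.relu(self.conv(x)).transpose(1, 2)  # (B, T, hidden)
        return self.proj(h)                           # (B, T, num_tags)

    def loss(self, tokens, tags, mask):
        # Negative log-likelihood of the tag sequence under the CRF.
        return -self.crf(self.emissions(tokens), tags, mask=mask)

    def decode(self, tokens, mask):
        # Viterbi decoding; for unlabeled data these predictions stand in
        # for the latent sequence labels described in the abstract.
        return self.crf.decode(self.emissions(tokens), mask=mask)


model = CNN2CRF(vocab_size=1000, num_tags=10)
tokens = torch.randint(1, 1000, (2, 7))               # labeled batch
tags = torch.randint(0, 10, (2, 7))                   # gold slot tags
mask = torch.ones(2, 7, dtype=torch.bool)

# Pre-training sketch: decode an unlabeled batch to obtain latent tag
# sequences, then train on labeled and latent-labeled data together.
unlabeled = torch.randint(1, 1000, (2, 7))
latent_tags = torch.tensor(model.decode(unlabeled, mask))
total_loss = model.loss(tokens, tags, mask) + model.loss(unlabeled, latent_tags, mask)
total_loss.backward()
```

In this reading, the shared CNN encoder is what carries features learned from unlabeled text into the supervised slot-tagging task, while the CRF transition parameters absorb the regularization effect of the latent label sequences.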
