首页> 外文会议>Conference of Open Innovations Association >SPARTA: Speaker Profiling for ARabic TAlk
【24h】

SPARTA: Speaker Profiling for ARabic TAlk

机译:Sparta:用于阿拉伯语谈话的扬声器分析

获取原文

摘要

This paper proposes a novel approach to an automatic estimation of three speaker traits from Arabic speech: gender, emotion, and dialect. After showing promising results on different text classification tasks, the multi-task learning (MTL) approach is used in this paper for Arabic speech classification tasks. The dataset was assembled from six publicly available datasets. First, The datasets were edited and thoroughly divided into train, development, and test sets (open to the public), and a benchmark was set for each task and dataset throughout the paper. Then, three different networks were explored: Long Short Term Memory (LSTM), Convolutional Neural Network (CNN), and Fully-Connected Neural Network (FCNN) on five different types of features: two raw features (MFCC and MEL) and three pre-trained vectors (i-vectors, d-vectors, and x-vectors). LSTM and CNN networks were implemented using raw features: MFCC and MEL, wher FCNN was explored on the pre-trained vectors while varying the hyper-parameters of these networks to obtain the best results for each dataset and task. MTL was evaluated against the single task learning (STL) approach for the three tasks and six datasets, in which the MTL and pre-trained vectors almost constantly outperformed STL. All the data and pre-trained models used in this paper are available and can be acquired by the public.
机译:本文提出了一种新的方法来从阿拉伯语演讲3个扬声器特性自动估计:性别,情绪和方言。显示在不同的文本分类任务有希望的结果后,多任务学习(MTL)方法在本文阿拉伯语语音分类任务使用。该数据集从6个可公开获得的数据集组装。首先,该数据集被编辑和彻底分为火车,开发和测试集(向公众开放),和一个基准定为每一个任务和数据集在整个纸。然后,三种不同的网络进行了探讨:长短期记忆(LSTM),卷积神经网络(CNN)和5种不同类型的特征完全连接的神经网络(FCNN):两个原始特征(MFCC和MEL)和三个预-trained矢量(I-载体,d的载体,和x矢量)。 LSTM和CNN网络被使用原始特征实现:MFCC和MEL,wher FCNN对所预先训练矢量探索同时改变这些网络的超参数,以获得每个数据集和任务的最佳结果。 MTL是针对三个任务和六个数据集,其中MTL和预先训练矢量几乎一直跑赢STL单任务学习(STL)的方法进行评估。所有的数据和本文采用预先训练型号可供选择,并且可以通过公开收购。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号