首页> 外文会议>International Conference on Signal Processing and Communication Systems >Automatic Classification of Indian Languages into Tonal and Non-tonal Categories Using Cascade Convolutional Neural Network (CNN)-Long Short-Term Memory (LSTM) Recurrent Neural Networks
【24h】

Automatic Classification of Indian Languages into Tonal and Non-tonal Categories Using Cascade Convolutional Neural Network (CNN)-Long Short-Term Memory (LSTM) Recurrent Neural Networks

机译:使用级联卷积神经网络(CNN)-长短期记忆(LSTM)递归神经网络将印度语言自动分类为音调和非音调类别

获取原文

摘要

This work aims to develop an automatic tonal and non-tonal language classification system of Indian languages using cascade Convolutional Neural Network (CNN)-Long Short-Term Memory (LSTM) Recurrent neural networks (RNNs). Motivated by their success in modelling sequences, this study proposes LSTM-RNNs cascaded with CNN in context of tonal and non-tonal language classification. Here RNNs show its effectiveness to exploit temporal dependencies in acoustic data. This paper also proposes the use of pitch chroma spectrogram coefficients to address this classification task. The proposed feature is then combined with log-mel spectrogram coefficients to enhance the system performance. The system has been tested for NIT Silchar language database (NITS-LD) which is developed for 9 Indian languages using All India radio broadcast news. And it reports accuracies of 82.30% for 10s and 81.16% for 3s data. Performance of the proposed system is also analyzed on Oregon Graduate Institute Multi-Language Telephone-based Speech (OGI-MLTS) database. It shows accuracies of 77.2% and 74.95% for 10s and 3s data respectively.
机译:这项工作旨在使用级联卷积神经网络(CNN)-长短期记忆(LSTM)递归神经网络(RNN)开发印度语言的自动调性和非调性语言分类系统。鉴于其在建模序列中的成功,本研究提出了在色调和非调性语言分类的背景下将LSTM-RNN与CNN级联的方法。在这里,RNN展示了利用声学数据中的时间依赖性的有效性。本文还提出使用音高色度频谱图系数来解决此分类任务。然后,将拟议的功能与对数-梅尔频谱图系数结合使用,以增强系统性能。该系统已经过NIT Silchar语言数据库(NITS-LD)的测试,该数据库使用All India广播新闻针对9种印度语言开发。并且它报告的10s精度为82.30%,3s数据精度为81.16%。俄勒冈大学研究生院基于多语言电话的语音(OGI-MLTS)数据库上也分析了所提出系统的性能。它显示10s和3s数据的准确度分别为77.2%和74.95%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号