Cascade convolutional neural network-long short-term memory recurrent neural networks for automatic tonal and nontonal preclassification-based Indian language identification

China Bhanja Chuya; Laskar Mohammad A.; Laskar Rabul H.

首页> 外文期刊>Expert Systems >Cascade convolutional neural network-long short-term memory recurrent neural networks for automatic tonal and nontonal preclassification-based Indian language identification

【24h】

Cascade convolutional neural network-long short-term memory recurrent neural networks for automatic tonal and nontonal preclassification-based Indian language identification

机译：级联卷积神经网络长短期内存经常性神经网络，用于自动色调和非统计学预分配的印度语言识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This work presents an automatic tonalontonal preclassification-based Indian language identification (LID) system. Languages are firstly classified into tonal and nontonal categories, and then, individual languages are identified from the languages of the respective categories. This work proposes the use of pitch Chroma and formant features for this task, and also investigates how Mel-frequency Cepstral Coefficients (MFCCs) complement these features. It further explores block processing (BP), pitch synchronous analysis (PSA)- and glottal closure regions (GCRs)-based approaches for feature extraction, using syllables as basic units. Cascade convolutional neural network (CNN)-long short-term memory (LSTM) model using syllable-level features has been developed. National Institute of Technology Silchar language database (NITS-LD) and OGI-Multilingual Telephone Speech Corpus (OGI-MLTS) have been used for experimental validation. The proposed system based on the score combination of Cascade CNN-LSTM models of Chroma (extracted from BP method), first two formants and MFCCs (both extracted from GCR method) reports the highest accuracies. In the preclassification stage, the observed accuracies are 91%, 87.3%, and 85.1% for NITS-LD, for 30 s, 10 s, and 3 s test data respectively. For OGI-MLTS database, the respective accuracies are 86.7%, 83.1%, and 80.6%. That amounts to absolute improvements of 11.6%, 12.3%, and 13.9% for NITS-LD, and 12.5%, 11.9%, and 12.6% for OGI-MLTS database with respect to that of the baseline system. The proposed preclassification-based LID system shows improvements of 7.3%, 6.4%, and 7.4% for NITS-LD and 6.1%, 6.7%, and 7.2% for OGI-MLTS database over the baseline system for the three respective test data conditions.

机译：这项工作介绍了一种自动色调/非晶预分配的印度语言识别（盖子）系统。语言首先被分类为色调和非州类别，然后，单个语言是从各个类别的语言中识别的。这项工作提出了对该任务的音高色度和中原特征的使用，并研究了敏料谱系谱系数（MFCCS）如何补充这些功能。进一步探讨了使用音节作为基本单元的基于特征提取的基于特征提取的块处理（BP），俯仰同步分析（PSA）和最小的闭合区域（GCR）的方法。已经开发了使用音节级别特征的级联卷积神经网络（CNN）-Long短期内存（LSTM）模型。美国国家技术研究所Silchar语言数据库（NITS-LD）和OGI多语言电话语音语料库（OGI-MLTS）已被用于实验验证。基于Cromade CNN-LSTM模型的谱（从BP方法中提取的CNN-LSTM模型的得分组合，前两种塑料和MFCC（两者从GCR方法中提取）报告了最高的精度。在预分散阶段，分别观察到的精度为91％，87.3％和85.1％，分别为30 s，10 s和3 s测试数据。对于Ogi-MLTS数据库，各自的精度为86.7％，83.1％和80.6％。对于NITS-LD的绝对改善量为11.6％，12.3％和13.9％，对于基线系统的ogi-MLTS数据库的12.5％，11.9％和12.6％。基于预读数的基于Preclasification的盖系统显示出在三个相应的测试数据条件下的基线系统中的终点和6.3％，6.4％和7.4％，6.1％，6.7％和7.2％。

著录项

来源
《Expert Systems》 |2020年第5期|e12544.1-e12544.21|共21页
作者
China Bhanja Chuya; Laskar Mohammad A.; Laskar Rabul H.;
展开▼
作者单位

Natl Inst Technol Silchar Dept Elect & Commun Engn Silchar 788010 Assam India;

Natl Inst Technol Silchar Dept Elect & Commun Engn Silchar 788010 Assam India;

Natl Inst Technol Silchar Dept Elect & Commun Engn Silchar 788010 Assam India;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Cascade CNN-LSTM; Chroma; database; formants; PSA and GCR; syllables; tonal and nontonal languages;

机译：cascade cnn-lstm;色度;数据库;素质;psa和gcr;音节;音调和圆周语言;

相似文献

外文文献
中文文献
专利

1. A serialized classification method for pulmonary nodules based on lightweight cascaded convolutional neural network-long short-term memory [J] . Ni Zihao, Peng Yanjun International journal of imaging systems and technology . 2020,第4期

机译：基于轻质级联卷积神经网络长短短期记忆的肺结核序列化分类方法
2. Modelling multi-level prosody and spectral features using deep neural network for an automatic tonal and non-tonal pre-classification-based Indian language identification system [J] . Bhanja Chuya China, Laskar Mohammad Azharuddin, Laskar Rabul Hussain Language Resources and Evaluation . 2021,第3期

机译：基于自动色调和非音调预分类的印度语言识别系统建模多级韵律和光谱特征
3. Sentiment analysis of tweets using a unified convolutional neural network-long short-term memory network model [J] . Umer Muhammad, Ashraf Imran, Mehmood Arif, Computational Intelligence . 2021,第1期

机译：使用统一卷积神经网络长短期内存网络模型的推文的情感分析
4. Automatic Classification of Indian Languages into Tonal and Non-tonal Categories Using Cascade Convolutional Neural Network (CNN)-Long Short-Term Memory (LSTM) Recurrent Neural Networks [C] . Chuya China, Dipjyoti Bisharad, Rabul Hussain Laskar International Conference on Signal Processing and Communication Systems . 2018

机译：使用级联卷积神经网络（CNN）-长短期记忆（LSTM）递归神经网络将印度语言自动分类为音调和非音调类别
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D] . Hassan, Abdalraouf. 2018

机译：基于卷积神经网络和递归神经网络的深度神经语言文本分类模型
6. Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks [O] . Ruben Zazo, Alicia Lozano-Diez, Javier Gonzalez-Dominguez, 2011

机译：使用长短期记忆（LSTM）递归神经网络的短话语语言识别
7. Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks. [O] . Ruben Zazo, Alicia Lozano-Diez, Javier Gonzalez-Dominguez, 2016

机译：利用长短时记忆（LsTm）递归神经网络识别短语中的语言。

Cascade convolutional neural network-long short-term memory recurrent neural networks for automatic tonal and nontonal preclassification-based Indian language identification

摘要

著录项

相似文献

相关主题

期刊订阅