...
首页> 外文期刊>Journal of voice: official journal of the Voice Foundation >Intra- and Inter-database Study for Arabic, English, and German Databases: Do Conventional Speech Features Detect Voice Pathology?
【24h】

Intra- and Inter-database Study for Arabic, English, and German Databases: Do Conventional Speech Features Detect Voice Pathology?

机译:阿拉伯语,英语和德语数据库的内部和数据库内的研究:进行传统语音功能检测语音病理学吗?

获取原文
获取原文并翻译 | 示例
           

摘要

A large population around the world has voice complications. Various approaches for subjective and objective evaluations have been suggested in the literature. The subjective approach strongly depends on the experience and area of expertise of a clinician, and human error cannot be neglected. On the other hand, the objective or automatic approach is noninvasive. Automatic developed systems can provide complementary information that may be helpful for a clinician in the early screening of a voice disorder. At the same time, automatic systems can be deployed in remote areas where a general practitioner can use them and may refer the patient to a specialist to avoid complications that may be life threatening. Many automatic systems for disorder detection have been developed by applying different types of conventional speech features such as the linear prediction coefficients, linear prediction cepstral coefficients, and Mel-frequency cepstral coefficients (MFCCs). This study aims to ascertain whether conventional speech features detect voice pathology reliably, and whether they can be correlated with voice quality. To investigate this, an automatic detection system based on MFCC was developed, and three different voice disorder databases were used in this study. The experimental results suggest that the accuracy of the MFCC-based system varies from database to database. The detection rate for the intra-database ranges from 72% to 95%, and that for the inter-database is from 47% to 82%. The results conclude that conventional speech features are not correlated with voice, and hence are not reliable in pathology detection.
机译:世界各地的大群人具有语音并发症。在文献中提出了各种主观和客观评估方法。主观方法强烈取决于临床医生的经验和领域,人类错误不能容忽视。另一方面,目标或自动方法是非侵入性的。自动开发系统可以提供对临床医生的互补信息,在语音障碍的早期筛查中。与此同时,可以部署在一般从业者可以使用它们的偏远地区部署自动系统,并且可以将患者推荐给专家以避免可能是危及生命的并发症。通过应用不同类型的常规语音特征,诸如线性预测系数,线性预测谱系齐数和熔融频率谱系数(MFCCs)的不同类型的传统语音特征开发了许多用于无序检测的自动系统。本研究旨在确定常规语音特征是否可靠地检测语音病理学,以及它们是否可以与语音质量相关联。为了调查这一点,开发了一种基于MFCC的自动检测系统,本研究中使用了三种不同的语音障碍数据库。实验结果表明,基于MFCC的系统的准确性因数据库而异。数据库内部范围的检出速率为72%至95%,而且为数据库的数据库为47%至82%。结果得出结论,传统语音特征与语音不相关,因此在病理检测中不可靠。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号