首页> 外文会议>International Conference on Image Analysis and Recognition >Contributions to a Quantitative Unsupervised Processing and Analysis of Tongue in Ultrasound Images
【24h】

Contributions to a Quantitative Unsupervised Processing and Analysis of Tongue in Ultrasound Images

机译:对超声图像中舌的定量无监督处理和分析的贡献

获取原文

摘要

Speech production studies and the knowledge they bring forward are of paramount importance to advance a wide range of areas including Phonetics, speech therapy, synthesis and interaction. Several technologies have been considered to study static and dynamic features of the articulators and speech motor control, such as electromagnetic articulography (EMA), real-time magnetic resonance (RTMRI) and ultrasound (US) imaging. While the latest advances in RTMRI provide a wealth of data of the full vocal tract, it is an expensive resource that requires specialized facilities. In this sense, US is a more affordable alternative for several contexts, enabling the acquisition of larger datasets, but demands adequate computational approaches for processing and analysis. While the literature is prolific in proposing methods for tongue segmentation from US, the noisy nature of the images and the specificities of the equipment often dictate a poor performance on novel datasets, a matter that needs to be assessed, before large data acquisition, to devise suitable acquisition and processing methods. In the scope of a line of research studying speech changes with age, this work describes the first results of an automatic tongue segmentation method from US, along with a characterization of the main challenges posed by the image data. Even though improvements are still needed, particularly to ensure temporal coherence, at its current stage, this method can already provide the required data for an automatic analysis of maximum tongue height, a relevant parameter to assess speech changes on vowel production.
机译:语音产生研究及其带来的知识对于推进包括语音,语音治疗,合成和交互作用在内的广泛领域至关重要。已经考虑了多种技术来研究咬合架的静态和动态特征以及语音运动控制,例如电磁关节造影(EMA),实时磁共振(RTMRI)和超声(US)成像。尽管RTMRI的最新进展提供了整个声道的大量数据,但它是一种昂贵的资源,需要专门的设施。从这个意义上讲,美国是在某些情况下更实惠的替代方案,可以获取更大的数据集,但需要适当的计算方法来进行处理和分析。尽管文献中大量提出了从美国进行舌头分割的方法,但图像的嘈杂本质和设备的特殊性通常表明在新颖的数据集上性能较差,需要在大数据采集之前进行评估,以设计出一个问题。合适的采集和处理方法。在研究语音随年龄变化的研究范围内,这项工作描述了美国自动舌头分割方法的最初结果,以及图像数据所面临的主要挑战的特征。即使在当前阶段仍需要改进,尤其是确保时间连贯性,该方法仍可以提供所需的数据,以自动分析最大舌高,这是评估元音产生时语音变化的相关参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号