首页> 外文会议>Audio Engineering Society convention >A voice classification system for younger children with applications to content navigation
【24h】

A voice classification system for younger children with applications to content navigation

机译:幼儿语音分类系统及其在内容导航中的应用

获取原文

摘要

A speech classification system is proposed which has applications for accessibility of content for younger children. To allow a young child to access online content (where typical interfaces such as search engines or hierarchical navigation would be inappropriate) we propose a voice classification system trained to recognise a range of sounds and vocabulary typical of younger children. As an example we design a system for classifying animal noises. Acoustic features are extracted from a corpus of animal noises made by a class of young children. A Support Vector Machine is trained to classify the sounds into one of 12 corresponding animals. We investigate the precision and recall of the classifier for various classification parameters. We investigate an appropriate choice of features to extract from the audio and compare the performance when using mean Mel-frequency Cepstral Coefficients (MFCC), a single-Gaussian model fitted to the MFCCs as well as a range of temporal features. To investigate the real-world applicability of the system we pay particular attention to the difference between training a generic classifier from a collected corpus of examples and one trained to a particular voice.
机译:提出了一种语音分类系统,该系统具有用于年幼儿童的内容可访问性的应用。为了使幼儿能够访问在线内容(在这种情况下,不适合使用诸如搜索引擎或分层导航之类的典型界面),我们提出了一种语音分类系统,该系统经过训练可以识别幼儿的各种声音和词汇。作为示例,我们设计了一个用于对动物噪音进行分类的系统。声音特征是从一类幼儿发出的动物声音语料库中提取的。支持向量机经过训练可以将声音分类为12种相应动物中的一种。我们研究了各种分类参数的分类器的精度和召回率。我们从音频调查的特征提取适当的选择和使用的平均梅尔频率倒谱系数(MFCC),装到的MFCC以及一系列的时空特征单高斯模型时比较性能。为了研究该系统在现实世界中的适用性,我们特别注意了从收集的示例语料库训练通用分类器与训练到特定语音的方法之间的区别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号