首页> 外文期刊>IEE Proceedings. Part K, Vision, image and signal processing >Efficient text analyser with prosody generator-driven approach for Mandarin text-to-speech
【24h】

Efficient text analyser with prosody generator-driven approach for Mandarin text-to-speech

机译:高效的文本分析器,采用韵律发生器驱动的普通话语音合成方法

获取原文
获取原文并翻译 | 示例
           

摘要

A new approach for an efficient text analyser is proposed. The prosody generator-driven method is employed to design an efficient text analyser for Mandarin text-to-speech. More simple structure of text analysis, more suitable classification of linguistic features and more efficient contribution of linguistic features to the prosody generator can be achieved. Three heuristic and theoretical methods are used to analyse and examine the capability of each linguistic feature. First, the contribution of each linguistic feature to the prosody generator is examined experimentally. Secondly, the cross-influence of each linguistic feature on the prosody generator is analysed. Thirdly, the problem of over- and under- classification of the linguistic features is inspected. Finally, these three analytic results are referenced to design an efficient text analyser. In total 35 243 Chinese characters are employed to examine the performance of our text analyser. Only 79ms CPU time on a P4-1.4G PC is needed for word segmentation and POS tagging. Correction rates of 97.5 and 93.2percent are achieved for the word segmentation and POS tagging, respectively. This confirms that the performance of our text analyser is very good. Moreover, a Mandarin text-to-speech system is implemented to inspect the performance of the text analysis and the contribution to the prosody generator. More natural and fluent speech is obtained under the lower computation. The MOS of prosody of the synthesised and original speech are 4.2 and 4.8, respectively, which is reasonably good.
机译:提出了一种有效的文本分析器的新方法。采用韵律发生器驱动的方法来设计用于普通话到语音的高效文本分析器。可以实现更简单的文本分析结构,更合适的语言特征分类以及更有效的语言特征对韵律生成器的贡献。三种启发式和理论方法用于分析和检查每种语言功能的能力。首先,通过实验检查每种语言特征对韵律生成器的贡献。其次,分析了每种语言特征对韵律产生器的交叉影响。第三,检查语言特征的过分和分类不足的问题。最后,参考这三个分析结果来设计有效的文本分析器。总共使用了35243个汉字来检查我们的文本分析器的性能。在P4-1.4G PC上,仅需79毫秒的CPU时间即可进行分词和POS标记。分词和POS标记的正确率分别为97.5%和93.2%。这证实了我们的文本分析器的性能非常好。此外,还实施了普通话文本转语音系统,以检查文本分析的性能以及对韵律生成器的作用。在较低的计算量下可以获得更自然流畅的语音。合成语音和原始语音的韵律MOS分别为4.2和4.8,这是相当不错的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号