Efficient text analyser with prosody generator-driven approach for Mandarin text-to-speech

C.-Y. Yeh; S.-H. Hwang

首页> 外文期刊>IEE Proceedings. Part K, Vision, image and signal processing >Efficient text analyser with prosody generator-driven approach for Mandarin text-to-speech

【24h】

Efficient text analyser with prosody generator-driven approach for Mandarin text-to-speech

机译：高效的文本分析器，采用韵律发生器驱动的普通话语音合成方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A new approach for an efficient text analyser is proposed. The prosody generator-driven method is employed to design an efficient text analyser for Mandarin text-to-speech. More simple structure of text analysis, more suitable classification of linguistic features and more efficient contribution of linguistic features to the prosody generator can be achieved. Three heuristic and theoretical methods are used to analyse and examine the capability of each linguistic feature. First, the contribution of each linguistic feature to the prosody generator is examined experimentally. Secondly, the cross-influence of each linguistic feature on the prosody generator is analysed. Thirdly, the problem of over- and under- classification of the linguistic features is inspected. Finally, these three analytic results are referenced to design an efficient text analyser. In total 35 243 Chinese characters are employed to examine the performance of our text analyser. Only 79ms CPU time on a P4-1.4G PC is needed for word segmentation and POS tagging. Correction rates of 97.5 and 93.2percent are achieved for the word segmentation and POS tagging, respectively. This confirms that the performance of our text analyser is very good. Moreover, a Mandarin text-to-speech system is implemented to inspect the performance of the text analysis and the contribution to the prosody generator. More natural and fluent speech is obtained under the lower computation. The MOS of prosody of the synthesised and original speech are 4.2 and 4.8, respectively, which is reasonably good.

机译：提出了一种有效的文本分析器的新方法。采用韵律发生器驱动的方法来设计用于普通话到语音的高效文本分析器。可以实现更简单的文本分析结构，更合适的语言特征分类以及更有效的语言特征对韵律生成器的贡献。三种启发式和理论方法用于分析和检查每种语言功能的能力。首先，通过实验检查每种语言特征对韵律生成器的贡献。其次，分析了每种语言特征对韵律产生器的交叉影响。第三，检查语言特征的过分和分类不足的问题。最后，参考这三个分析结果来设计有效的文本分析器。总共使用了35243个汉字来检查我们的文本分析器的性能。在P4-1.4G PC上，仅需79毫秒的CPU时间即可进行分词和POS标记。分词和POS标记的正确率分别为97.5％和93.2％。这证实了我们的文本分析器的性能非常好。此外，还实施了普通话文本转语音系统，以检查文本分析的性能以及对韵律生成器的作用。在较低的计算量下可以获得更自然流畅的语音。合成语音和原始语音的韵律MOS分别为4.2和4.8，这是相当不错的。

著录项

来源
《IEE Proceedings. Part K, Vision, image and signal processing》 |2005年第6期|p.793-799|共7页
作者
C.-Y. Yeh; S.-H. Hwang;
展开▼
作者单位

Department of Electrical Engineering, National Taipei University of Technology, Taipei, Taiwan, Republic of China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计量学;
关键词

相似文献

外文文献
中文文献
专利

1. High-Quality Prosody Generation in Mandarin Text-to-Speech System [J] . Qing Guo, Jie Zhang, Nobuyuki Katae, Fujitsu Scientific & Technical Journal . 2010,第1期

机译：普通话语音合成系统中的高质量韵律生成
2. A novel prosody adaptation method for Mandarin concatenation-based text-to-speech system [J] . Jian Yu, Jianhua Tao Acoustical science and technology . 2009,第1期

机译：基于普通话串联的语篇转换系统的新韵律调适方法
3. A novel prosody adaptation method for Mandarin concatenation-based text-to-speech system [J] . Jian Yu, Jianhua Tao Acoustical science and technology . 2009,第1期

机译：基于普通话串联的语篇转换系统的新韵律调适方法
4. AN EFFICIENT TEXT ANALYZER WITH PROSODY GENERATOR-DRIVEN APPROACH FOR MANDARIN TEXT-TO-SPEECH [C] . IEEE IEEE International Conference on Acoustics, Speech, and Signal Processing . 2003

机译：一个有效的文本分析仪，具有普罗旺达文本到语音的韵律发电机驱动方法
5. Building a prosodically sensitive diphone database for a Korean text-to-speech synthesis system. [D] . Yoon, Kyuchul. 2005

机译：为韩国文字转语音合成系统建立一个对韵律敏感的diphone数据库。
6. The First Vietnamese FOSD-Tacotron-2-based Text-to-Speech Model Dataset [O] . Duc Chung Tran 2020

机译：基于越南FOSD-Tacotron-2的文本到语音模型数据集
7. A novel prosody adaptation method for Mandarin concatenation-based text-to-speech system [O] . Jian Yu, Jianhua Tao 2009

机译：基于普通话级联的文本到语音系统的一种新型韵律适应方法

Efficient text analyser with prosody generator-driven approach for Mandarin text-to-speech

摘要

著录项

相似文献

相关主题

期刊订阅