首页> 中文期刊> 《计算机工程与应用》 >用决策树指导TBL进行多音字消歧

用决策树指导TBL进行多音字消歧

         

摘要

多音字消歧是普通话语音合成系统中字音转换模块的核心问题.选择了常见易错的 33 个多音字和 24 个多音词作为研究对象,构建了一个平均每个多音字(词) 5 000 句的语料率,并且提出了-种结合决策树和基于转换的错误驱动的学习(Transformation-Based error-driven Learning,TBL)的混合算法.该方法根据决策树的指导,自动生成 TBL 算法的模板,避免了手工总结模板这一费时费力的过程.实验结果表明,该方法生成的模板与手工模板性能相当,其平均准确率迭90.36%,明显优于决策树.%Polyphone disambiguation is the core issue of the grapheme-to-phoneme conversion in Mandarin Text-To-Speeeh (TTS) system. This paper selects 33 key polyphones and 24 key polyphonic words which are most ambiguous and frequently used as study objects, and builds a polyphone corpus of 5 000 sentences per polyphone on average. Furthermore, a hybrid algorithm called Tree-Guided Transformation-Based Learning(TGTBL),which combines decision tree with Transformation-Based error-driven Learning(TBL),is proposed to resolve the polyphonic ambiguity. It automatically generates TBL templates,thereby avoiding manually summarizing templates, which is time-consuming and laborious in conventional TBL.Results of comparative experiments show that, for the task of polyphone disambiguation, templates automatically generated by decision tree achieve comparable performance to manually summarized templates,and the average precision of TGTBL reaches 90.36%,significantly higher than that of decision tree.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号