Large vocabulary mandarin Chinese continuous speech recognition has been a difficult problem in speech recognition area because of several reasons. First, it is a tone language. There are five lexical tones that are important in distinguishing the confusable words in mandarin. So the modeling of tones plays an important role in mandarin speech recognition. Second, the variation of tones in spontaneous mandarin speech would have some effects on the performance. Third, the co-articulation is inevitable in spontaneous mandarin speech recognition. In this paper, a large vocabulary mandarin Chinese continuous system based on tonal triphone was constructed. The experimental results shows that a good performance in acoustic level has been achieved while poor performance in word level.
展开▼