Research on speech synthesis, and particularly on prosodic generation, is still the biggest challenge in Text-to-Speech Synthesis. In the MBROLA project, database voices (i.e., the DSP part of a TTS system) for Arabic, Brazilian, Portuguese, Breton, British English, Dutch, French, German, Romanian, Spanish,... are available. Several complete TTS Systems for Indo-European languages were developed around this project but not yet for the Arabic language. The quality of a text to speech synthesis depends on the naturalness, on the intelligibility of the speech generated and the specific characteristics to the produced voice. These characteristics depend on the techniques and the methods of synthesis, but also on the care taken to linguistic and prosodic modeling. Several works underlines the fact that linguistic structures maintain the close links with the prosodic achievements. Concerning the Arab language, the models are based on the syllabic structure of the words, the stressing, the concept of markers intonates (interrogative mainly) and very little on syntactic information. The object of this paper is to describe modelling and the integration of phonological, morpho-lexical and syntactic knowledge necessary to the development of a complete Arabic Text To Speech System starting from diacritized Arab texts.
展开▼