Speech translation and dialogue systems must accept conversational speech. In this paper, we discuss acoustic and linguistic characteristics based on results of speech recognition experimetns using speech from human-to-human and human-to-machine conversations. Conversational speech inputs to machines consist of frozen expressions such as greetings and yeso statements, and informative individual expressions like numerical data such as dates and telephone numbers. The former has a lower perplexity and acoustic characteristics close to spontaneous speech. The latter has a higher perplexity and acoustic characteristics close to read speech. Each utterance or each itner-pausal unit can be classified into the former or the latter. This new knowledge will help future research on speech translation and dialgoue systems.
展开▼