IEEE Workshop on Spoken Language Technology

Semantic language models for Automatic Speech Recognition

Abstract

We are interested in the problem of semantics-aware training of language models (LMs) for Automatic Speech Recognition (ASR). Traditional language modeling research has ignored semantic constraints and focused on limited-size word histories. Semantic structures may provide information that captures lexically realized long-range dependencies as well as the linguistic scene of a speech utterance. In this paper, we present a novel semantic LM (SELM) based on the theory of frame semantics. Frame semantics analyzes the meaning of words by considering their role in the semantic frames in which they occur and by considering their syntactic properties. We show that by integrating semantic frames and target words into recurrent neural network LMs we can gain significant improvements in perplexity and word error rate. We have evaluated the semantic LM against publicly available ASR baselines on the Wall Street Journal (WSJ) corpus. SELMs achieve 50% and 64% relative reductions in perplexity compared to n-gram models by using frames and target words, respectively. In addition, SELMs achieve 12% and 7% relative improvements in word error rate on the Nov'92 and Nov'93 test sets with respect to the baseline tri-gram LM.
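The abstract describes conditioning a recurrent neural network LM on semantic frames in addition to words. The sketch below is an assumption of how such an input scheme could look, not the paper's implementation: each token is represented by the concatenation of a word embedding and an embedding of the semantic frame it evokes, a vanilla RNN consumes this joint input, and sentence perplexity is computed from the next-word distributions. All names, dimensions, and the toy frame inventory are illustrative.

```python
import numpy as np

# Hypothetical sketch of a frame-conditioned RNN language model.
# Parameters are randomly initialised (untrained); the point is the
# input representation, not the numbers it produces.
rng = np.random.default_rng(0)

vocab = ["<s>", "the", "bank", "approved", "the_loan", "</s>"]
frames = ["NONE", "Institution", "Grant_permission", "Commerce"]  # toy inventory
word_idx = {w: i for i, w in enumerate(vocab)}
frame_idx = {f: i for i, f in enumerate(frames)}

d_word, d_frame, d_hidden = 8, 4, 16
V = len(vocab)

E_word = rng.normal(0, 0.1, (V, d_word))            # word embeddings
E_frame = rng.normal(0, 0.1, (len(frames), d_frame))  # frame embeddings
W_xh = rng.normal(0, 0.1, (d_word + d_frame, d_hidden))
W_hh = rng.normal(0, 0.1, (d_hidden, d_hidden))
W_hy = rng.normal(0, 0.1, (d_hidden, V))

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def sentence_perplexity(words, frame_tags):
    """Score a (word, frame)-tagged sentence with the RNN LM."""
    h = np.zeros(d_hidden)
    log_prob = 0.0
    for t in range(len(words) - 1):
        # Joint input: word embedding concatenated with frame embedding.
        x = np.concatenate([E_word[word_idx[words[t]]],
                            E_frame[frame_idx[frame_tags[t]]]])
        h = np.tanh(x @ W_xh + h @ W_hh)   # recurrent state update
        p = softmax(h @ W_hy)              # next-word distribution
        log_prob += np.log(p[word_idx[words[t + 1]]])
    return np.exp(-log_prob / (len(words) - 1))

words = ["<s>", "the", "bank", "approved", "the_loan", "</s>"]
tags = ["NONE", "NONE", "Institution", "Grant_permission", "Commerce", "NONE"]
ppl = sentence_perplexity(words, tags)
```

Because the frame tag disambiguates word senses (e.g. "bank" as Institution), the model can, after training, assign different continuations to the same surface word in different semantic scenes, which is the kind of long-range, lexically realized constraint the abstract refers to.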
