Journal: International Journal of Innovative Computing and Applications

An empirical study of statistical language models: n-gram language models vs. neural network language models



Abstract

Statistical language models are an important module in many successful applications such as speech recognition and machine translation, and n-gram models are essentially the state of the art. However, due to data sparsity, the modelled language cannot be completely represented by an n-gram language model: if new words appear at recognition or translation time, a smoothing method is needed to redistribute probability mass to unseen events. Recently, neural networks have been used to model language by projecting words onto a continuous space and performing the probability estimation in that space. In this experimental work, we compare the behaviour of the most popular smoothing methods for statistical n-gram language models with neural network language models in different situations and with different parameters. The language models are trained on two corpora of French and English texts. Good empirical results are obtained by the recurrent neural network language models.
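To illustrate the smoothing problem the abstract describes, the following is a minimal sketch of a bigram language model with add-one (Laplace) smoothing, one of the simplest of the smoothing methods compared in such studies. All function names and the toy corpus are illustrative, not from the paper.

```python
from collections import Counter

def train_bigram_counts(tokens):
    """Count unigrams and bigrams in a token sequence."""
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    return unigrams, bigrams

def bigram_prob(w_prev, w, unigrams, bigrams, vocab_size):
    """P(w | w_prev) with add-one (Laplace) smoothing, so bigrams
    never seen in training still receive non-zero probability."""
    return (bigrams[(w_prev, w)] + 1) / (unigrams[w_prev] + vocab_size)

tokens = "the cat sat on the mat".split()
unigrams, bigrams = train_bigram_counts(tokens)
V = len(unigrams)  # vocabulary size of the toy corpus: 5

p_seen = bigram_prob("the", "cat", unigrams, bigrams, V)    # (1+1)/(2+5)
p_unseen = bigram_prob("the", "dog", unigrams, bigrams, V)  # (0+1)/(2+5)
```

Without the added pseudo-counts, `p_unseen` would be zero and any sentence containing an unseen bigram would get zero probability; smoothing trades a little probability mass from seen events to avoid this. Neural network language models sidestep the issue differently, by sharing statistical strength between similar words through their continuous representations.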

