首页> 外文会议>International Joint Conference on Natural Language Processing >Long Distance Dependency in Language Modeling: An Empirical Study
【24h】

Long Distance Dependency in Language Modeling: An Empirical Study

机译:语言建模的长距离依赖性:实证研究

获取原文

摘要

This paper presents an extensive empirical study on two language modeling techniques, linguistically-motivated word skipping and predictive clustering, both of which are used in capturing long distance word dependencies that are beyond the scope of a word trigram model. We compare the techniques to others that were proposed previously for the same purpose. We evaluate the resulting models on the task of Japanese Kana-Kanji conversion. We show that the two techniques, while simple, outperform existing methods studied in this paper, and lead to language models that perform significantly better than a word trigram model. We also investigate how factors such as training corpus size and genre affect the performance of the models.
机译:本文介绍了两个语言建模技术的大量实证研究,语言 - 激励的单词跳过和预测聚类,这两者都用于捕获超出单词三字节模型范围的长距离字依赖性。我们将技术与以前以同一目的提出的其他方式进行比较。我们评估了日本Kana-Kanji转换任务的结果模型。我们展示了这两种技术,而在本文中研究的现有方法简单,始于现有的方法,并导致语言模型,这些模型比单词三字谜模型显着更好。我们还调查培训语料库大小和流派等因素如何影响模型的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号