【24h】

Optimal Selection of Different Order N-Grams

机译:不同阶数N克的最佳选择

获取原文
获取原文并翻译 | 示例

摘要

We subjectively introduce different assumptions about the mutual information among words for separate usage of different N-grams regardless of the fact that some assumptions cannot be true. An approach is proposed to evaluate different order N-grams based on the concept of confidence of assumption or assumption probability. The assumption probabilities are estimated on the basis of discriminative estimation criterion. To reduce the redundant information among different order N-grams, we bring up with a method to select M-gram elements from different order N-grams with reference to the assumption probabilities. We conduct experiments on the platform of conversion from Chinese pinyin to Chinese character. At first, we emplov Newton Gradient method to estimate the assumption probabilities and then lest the optimally selected language model. The experimental results show that the memory capacity of language model can be remarkably lowered with little loss of accuracy.
机译:我们主观地引入了关于单词之间相互信息的不同假设,以用于不同N-gram的单独使用,而不管某些假设不能成立的事实。提出了一种基于假设或假设概率的置信度概念评估不同阶N-gram的方法。假设概率是根据判别估计标准进行估计的。为了减少不同阶N-gram之间的冗余信息,我们提出了一种基于假设概率从不同阶N-gram中选择M-gram元素的方法。我们在从汉语拼音转换为汉字的平台上进行实验。首先,我们采用牛顿梯度方法来估计假设概率,然后再选择最佳选择的语言模型。实验结果表明,语言模型的存储容量可以显着降低,而准确性损失很小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号