Conference: Workshop on Chinese Lexical Semantics

High Order N-gram Model Construction and Application Based on Natural Annotation

Abstract

Language models based on n-grams play an important role in NLP tasks. In this paper, we propose two language models based on language boundaries to address the challenge of very large language data: an intra-sentence boundary model and an inter-sentence boundary model. We developed a training tool on the Hadoop platform using MapReduce programming, and used a prefix tree to compress and store the model. We applied our model to boundary identification in syntactic parsing and achieved good results.
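
The abstract does not give implementation details, so the following is only a minimal single-process sketch of the two ideas it names: counting n-grams in a map/reduce style and storing the counts in a prefix tree. Everything here is an assumption made for illustration and is not taken from the paper: the function names `map_ngrams` and `reduce_counts`, the `NgramTrie` class, the boundary markers `<s>` and `</s>`, and the toy corpus. A real Hadoop job would distribute the map and reduce steps across machines; here they run as plain Python functions.

```python
from collections import Counter

# Hypothetical sentence-boundary markers (an assumption, not from the paper).
BOS, EOS = "<s>", "</s>"


def map_ngrams(sentence, n):
    """Map step: emit (ngram, 1) for every n-gram in one tokenized sentence."""
    tokens = [BOS] + list(sentence) + [EOS]
    for i in range(len(tokens) - n + 1):
        yield tuple(tokens[i:i + n]), 1


def reduce_counts(pairs):
    """Reduce step: sum the emitted counts for each n-gram key."""
    counts = Counter()
    for ngram, c in pairs:
        counts[ngram] += c
    return counts


class TrieNode:
    __slots__ = ("children", "count")

    def __init__(self):
        self.children = {}  # token -> TrieNode
        self.count = 0      # count of the n-gram ending at this node


class NgramTrie:
    """Prefix tree in which n-grams sharing a prefix share nodes."""

    def __init__(self):
        self.root = TrieNode()

    def insert(self, ngram, count):
        node = self.root
        for tok in ngram:
            node = node.children.setdefault(tok, TrieNode())
        node.count += count

    def count(self, ngram):
        node = self.root
        for tok in ngram:
            node = node.children.get(tok)
            if node is None:
                return 0
        return node.count

    def num_nodes(self):
        """Number of nodes below the root (iterative traversal)."""
        total, stack = 0, [self.root]
        while stack:
            node = stack.pop()
            total += len(node.children)
            stack.extend(node.children.values())
        return total


# Toy corpus (illustrative only).
corpus = [["a", "b", "c"], ["a", "b", "d"]]
pairs = [pair for sent in corpus for pair in map_ngrams(sent, 3)]
counts = reduce_counts(pairs)

trie = NgramTrie()
for ngram, c in counts.items():
    trie.insert(ngram, c)

print(trie.count(("<s>", "a", "b")))  # trigram seen in both sentences
print(trie.num_nodes())
```

The saving from the prefix tree comes from shared prefixes: the trigrams `a b c` and `a b d` reuse the nodes for `a` and `b`, so the tree stores fewer tokens than a flat table of all n-grams would.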