High Order N-gram Model Construction and Application Based on Natural Annotation

Abstract

Language models based on n-grams play an important role in NLP tasks. To address the challenge of very large language data, this paper proposes two language models based on linguistic boundaries: an intra-sentence boundary model and an inter-sentence boundary model. We developed a training tool on the Hadoop platform using MapReduce programming, and used a prefix tree to compress and store the model. We applied the model to boundary identification in syntactic parsing and obtained good results.
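The abstract combines two ideas, MapReduce-style n-gram counting and prefix-tree storage, that can be illustrated with a minimal sketch. This is not the authors' Hadoop tool: it is a single-process Python stand-in, and the names `BOUNDARY`, `map_ngrams`, `reduce_counts`, `NgramTrie`, and `build_trie` are hypothetical. It also assumes a boundary is marked by a single token at each end of a sentence, which the abstract does not specify.

```python
from collections import Counter

BOUNDARY = "<b>"  # hypothetical boundary marker; the paper's marking scheme is not given


def map_ngrams(sentence, n):
    """Map step: emit (n-gram, 1) pairs for one tokenized sentence."""
    tokens = [BOUNDARY] + list(sentence) + [BOUNDARY]
    for i in range(len(tokens) - n + 1):
        yield tuple(tokens[i:i + n]), 1


def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each distinct n-gram."""
    counts = Counter()
    for ngram, c in pairs:
        counts[ngram] += c
    return counts


class TrieNode:
    __slots__ = ("children", "count")

    def __init__(self):
        self.children = {}  # token -> TrieNode
        self.count = 0      # frequency of the n-gram ending at this node


class NgramTrie:
    """Prefix tree: n-grams that share a prefix share the same path."""

    def __init__(self):
        self.root = TrieNode()

    def insert(self, ngram, count):
        node = self.root
        for token in ngram:
            node = node.children.setdefault(token, TrieNode())
        node.count += count

    def count(self, ngram):
        node = self.root
        for token in ngram:
            node = node.children.get(token)
            if node is None:
                return 0  # unseen n-gram
        return node.count

    def node_count(self):
        """Number of stored tokens (trie nodes, excluding the root)."""
        total, stack = 0, [self.root]
        while stack:
            node = stack.pop()
            total += 1
            stack.extend(node.children.values())
        return total - 1


def build_trie(sentences, n):
    """Count n-grams over all sentences, then load them into a trie."""
    pairs = (pair for s in sentences for pair in map_ngrams(s, n))
    trie = NgramTrie()
    for ngram, c in reduce_counts(pairs).items():
        trie.insert(ngram, c)
    return trie


sentences = [["we", "train", "models"], ["we", "train", "parsers"]]
trie = build_trie(sentences, 2)
print(trie.count(("we", "train")), trie.node_count())
```

In this toy corpus the six distinct bigrams would take 12 token slots in a flat table, while the trie stores 11 nodes because the two bigrams beginning with "train" share one node. With higher orders and larger corpora, shared prefixes become far more common, which is the usual motivation for prefix-tree storage.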

Bibliographic record

Source
Workshop on Chinese Lexical Semantics | 2019 | xviii, 861 p. | 8 pages
Authors
Qibo Wang; Gaoqi Rao; Endong Xun
Original format: PDF
Chinese Library Classification: Modern vocabulary
Keywords
Boundary language model; N-gram; Prefix tree; Boundary recognition
