首页> 外文会议>International conference on text, speech and dialogue >Ship-LemmaTagger: Building an NLP Toolkit for a Peruvian Native Language
【24h】

Ship-LemmaTagger: Building an NLP Toolkit for a Peruvian Native Language

机译:Ship-LemmaTagger:为秘鲁本地语言构建NLP工具包

获取原文

摘要

Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that axe the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation.
机译:自然语言处理通过计算机程序来处理文本的理解和生成。在此领域中使用了许多不同的功能,但是其中有些功能会减少其余功能的支持。这些方法与语言形态的核心处理(例如lemmatization)和词性标签的自动识别有关。因此,本文描述了针对一种新语言的基本NLP工具包的实现,着重介绍了前面提到的功能,并在为此场合构建的自己的语料库中对其进行了测试。获得的结果超出了预期的结果,可用于更复杂的任务,例如机器翻译。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号