Automatic keywords extraction from the domain texts: Implementation of the algorithm based on the MapReduce model

机译：从领域文本中自动提取关键字：基于MapReduce模型的算法的实现

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic keywords extraction is used in almost all the tasks related to natural language processing, such as annotation, indexing, classification, machine translation, knowledge extraction, etc. A large number of effective methods and approaches were developed to solve this problem, and the most simple and robust ones of them are based on the statistics of words. In this paper we describe a statistical method based on Chi-square test. The traditional algorithm implementing this method is an inefficient and time-consuming one. The aim of the paper is to develop the algorithm of this method based on distributed computing model. So we describe the implementation of the algorithm based on the MapReduce model of distributed computing and present the results of experiments showing the benefits of distributed computing.

机译：关键字自动提取几乎用于与自然语言处理有关的所有任务，例如注释，索引，分类，机器翻译，知识提取等。开发了许多有效的方法和方法来解决此问题，其中大多数其中简单而强大的功能是基于单词的统计信息。在本文中，我们描述了一种基于卡方检验的统计方法。实现该方法的传统算法是一种低效且耗时的算法。本文的目的是开发基于分布式计算模型的该方法的算法。因此，我们描述了基于MapReduce分布式计算模型的算法的实现，并给出了表明分布式计算的好处的实验结果。

著录项

来源
《2013 International Conference on Current Trends in Information Technology》|2013年|186-189|共4页
会议地点 Dubai(AE)
作者
Nugumanova Aliya; Novosselov Artem; Baiburin Yerzhan; Karimov Alexey;
展开▼
作者单位

Department of Information Technologies, Eastern Kazakhstan State Technical University, Ust-Kamenogorsk, Kazakhstan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
chi-square test; keywords extraction; mapreduce; natural language processing;

机译：卡方检验;关键词提取; Mapreduce;自然语言处理;;

相似文献

外文文献
中文文献
专利

1. Language-independent extractive automatic text summarization based on automatic keyword extraction [J] . Angel Hernandez-Castaneda, Rene Arnulfo Garcia-Hernandez, Yulia Ledeneva, Computer speech and language . 2022,第Jana期

机译：基于自动关键字提取的语言独立的提取自动文本摘要
2. Automatic extraction of keywords from scientific text:application to the knowledge domain of protein families [J] . Miguel A.Andrade... Bioinformatics . 1998,第7期

机译：从科学文本中自动提取关键词：在蛋白质家族知识领域的应用
3. An intelligent approach towards automatic shape modelling and object extraction from satellite images using cellular automata-based algorithms [J] . Arun P. V., Katiyar S. K. GIScience & remote sensing . 2013,第3期

机译：使用基于细胞自动机的算法进行卫星图像自动形状建模和对象提取的智能方法
4. Automatic keywords extraction from the domain texts: Implementation of the algorithm based on the MapReduce model [C] . Nugumanova Aliya, Novosselov Artem, Baiburin Yerzhan, International Conference on Current Trends in Information Technology . 2013

机译：自动关键字从域文本提取：基于MapReduce模型的算法实现
5. Identifying the gist of conversational text: Automatic keyword extraction and summarization. [D] . Liu, Fei. 2011

机译：识别对话文本的要点：自动关键词提取和汇总。
6. The Fractal Patterns of Words in a Text: A Method for Automatic Keyword Extraction [O] . Elham Najafi, Amir H. Darooneh -1

机译：文本中词的分形模式：一种自动关键词提取方法
7. Extraction-Based Text Categorization: Generating Domain-Specific Role Relationships Automatically [O] . Ellen Riloff, Jeffrey Lorenzen 1998

机译：基于提取的文本分类：自动生成特定于域的角色关系

Automatic keywords extraction from the domain texts: Implementation of the algorithm based on the MapReduce model

摘要

著录项

相似文献

相关主题

期刊订阅