首页> 外文会议>International Conference on Intelligent Systems Design and Applications >SAID: A new stemmer algorithm to indexing unstructured Document
【24h】

SAID: A new stemmer algorithm to indexing unstructured Document

机译:说:一种新的终结文件索引非结构化文件的算法

获取原文

摘要

In this work, we propose a new stemmer algorithm to indexing unstructured Document. It can detect the most relevant words in an unstructured document. This algorithm is based on two main modules: the first module ensures the processing of compound words and the second allows the detection of the endings of the words that have not been taken into consideration by the approaches presented in literature. The proposed algorithm allows the detection and removal of suffixes and enriches the basis of suffixes by eliminating the suffixes of compound words. We have experienced our algorithm on a standard basis of terms and the results show the remarkable effectiveness of our algorithm compared to others presented in related works.
机译:在这项工作中,我们向索引非结构化文档提出了一种新的Sefalmer算法。它可以检测非结构化文件中最相关的单词。该算法基于两个主模块:第一模块确保了复合词的处理,第二模块允许检测文献中呈现的方法未被考虑的单词的结尾。所提出的算法允许通过消除复合词的后缀来检测和移除后缀并富集后缀的基础。我们在标准的术语中经历了算法,结果表明了与相关工程中提出的其他人相比的算法的显着效益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号