【24h】

STEMMING FOR TERM CONFLATION IN MALAY TEXTS

机译:马来文中的术语冲突

获取原文
获取原文并翻译 | 示例

摘要

Stemming is an important process to improve the performance of an information retrieval. It reduces the variant word forms to common forms. Various algorithms for stemming have been developed for the English and foreign languages. Due to its essence and popularity, research for stemming Malay words has been extended for retrieval of Malay documents. The existing Malay stemming algorithm is studied and new algorithm is proposed to improve the performance of the stemming process especially for a specific domain. The modified algorithm utilizes important morphological aspects of Malay language. Experiments on the ability of the stemmer were done and the results indicating major improvement in terms of less errors and greater rate of correctness are shown.
机译:词干是提高信息检索性能的重要过程。它将变体词形式简化为普通形式。已经针对英语和外语开发了各种词干提取算法。由于其本质和受欢迎程度,词干马来词的研究已扩展到检索马来语文档。研究了现有的马来语词干提取算法,并提出了新的算法来提高词干提取过程的性能,特别是针对特定领域。改进的算法利用了马来语的重要形态。进行了茎杆能力的实验,结果表明,在减少错误和提高正确率方面有重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号