首页> 外文会议>Conference on Information and Knowledge Technology >A content-based method for Persian real-word spell checking
【24h】

A content-based method for Persian real-word spell checking

机译:一种基于内容的波斯实词拼写检查方法

获取原文

摘要

Here, a content-based method for real-word spell checking in Persian language is presented. In this method real-word mistakes are classified in 5 categories and are resolved using a content-based procedure. Each word which may cause any real-word error is listed in a candidate set in a same entry with its similar words (potential mistakes in a single entry). In next step, a content-word list is constructed based on adjacent frequent N-grams for each word in confusion set. Evaluations indicate that proposed method not only provides promising performance and acceptable precision, but also outperforms a similar existing system from precision and recall points of view.
机译:在此,提出了一种基于内容的波斯语实词拼写检查方法。在这种方法中,将实词错误分为5类,并使用基于内容的过程来解决。可能导致任何实词错误的每个单词都以相同的单词(在单个条目中可能存在的错误)列在同一条目的候选集中。在下一步中,基于混淆集中每个单词的相邻频繁N-gram构建内容单词列表。评估表明,所提出的方法不仅提供了有希望的性能和可接受的精度,而且从精度和召回率的角度来看,其性能优于类似的现有系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号