首页> 外文会议>International conference on neural information processing >Enhancing the Mongolian Historical Document Recognition System with Multiple Knowledge-Based Strategies
【24h】

Enhancing the Mongolian Historical Document Recognition System with Multiple Knowledge-Based Strategies

机译:运用多种知识策略增强蒙古历史文献识别系统

获取原文

摘要

This paper describes recent work on integrating multiple strategies to improve the performance of the Mongolian historical document recognition system which utilize the segmentation-based scheme. We analyze the reasons why the recognition errors happened. On such basis, we propose three strategies according to the knowledge of the glyph characteristics of Mongolian and integrate them into glyph-unit recognition. The strategies are recognizing the under-segmented and over-segmented fragments (RUOF), glyph-unit grouping (GG) and incorporating the baseline information (IBI). The first strategy helps in correcting the segmentation error and the remaining two strategies further improve the classifiers accuracies. The experiment on the historical Mongolian Kanjur demonstrates that utilizing these strategies could effectively increase the accuracy of word recognition.
机译:本文介绍了利用基于分段的方案整合多种策略以提高蒙古历史文献识别系统性能的最新工作。我们分析了识别错误发生的原因。在此基础上,根据对蒙古人字形特征的认识,提出了三种策略,并将其整合到字形单元识别中。这些策略是识别分段不足和分段过度的片段(RUOF),字形单元分组(GG)并合并基线信息(IBI)。第一种策略有助于纠正分割误差,而其余两种策略则进一步提高了分类器的准确性。在历史悠久的蒙古语Kanjur上进行的实验表明,利用这些策略可以有效地提高单词识别的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号