Conference: 2011 IEEE Conference on Open Systems

An extended method for recognition of broken typewritten characters special reference to tamil script


Abstract

Preparing clean and clear images for recognition engines is often taken for granted as a trivial task that requires little attention. Most existing OCR systems are designed to correctly identify finely printed documents in all scripts. A standard machine-printed OCR system fails, however, when it is tested on documents with distorted characters. This paper presents an approach to overcome the difficulties posed by such distorted typewritten documents, especially those with broken characters. As a first step, each character is isolated using character position location and character localization and is enclosed in a matrix, which is analyzed and repaired in a later part of the study. A shape and line tracing method is then used to recognize the distorted, broken characters, and the result is fine-tuned using lexical knowledge.
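The abstract does not give the details of the isolation step, so the following is only a minimal sketch of one common way to locate characters in a binarized text line and enclose each one in a matrix. It assumes a column projection over a list-of-lists image of 0/1 pixels. The function name `isolate_characters` and the `max_break` parameter are hypothetical and do not come from the paper. Blank runs no wider than `max_break` columns are treated as breaks inside one character rather than as gaps between characters.

```python
def isolate_characters(image, max_break=1):
    """Split a binary text-line image (rows of 0/1) into per-character matrices.

    Assumption (not from the paper): characters are separated by blank
    columns, and a blank run of at most `max_break` columns is a break
    inside a single character, so the pieces stay together.
    """
    if not image:
        return []
    width = len(image[0])
    # Columns that contain at least one ink pixel.
    ink_columns = [x for x in range(width) if any(row[x] for row in image)]

    # Group ink columns into [left, right] spans, bridging small breaks.
    spans = []
    for x in ink_columns:
        if spans and x - spans[-1][1] - 1 <= max_break:
            spans[-1][1] = x
        else:
            spans.append([x, x])

    characters = []
    for left, right in spans:
        # Tighten the box vertically to the rows that hold ink in this span.
        ink_rows = [y for y, row in enumerate(image) if any(row[left:right + 1])]
        top, bottom = ink_rows[0], ink_rows[-1]
        matrix = [row[left:right + 1] for row in image[top:bottom + 1]]
        characters.append({"box": (left, top, right, bottom), "matrix": matrix})
    return characters


# Toy line: a character broken by one blank column, then a second character.
line = [
    [1, 1, 0, 1, 0, 0, 0, 1, 1],
    [1, 0, 0, 1, 0, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0, 1, 1],
]
chars = isolate_characters(line, max_break=1)
strict = isolate_characters(line, max_break=0)
```

With `max_break=1` the one-column break is bridged and the toy line yields two character matrices. With `max_break=0` the broken character is split in two, which is the kind of failure the abstract attributes to standard OCR systems on broken characters.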
