首页> 外文期刊>IEICE Transactions on Information and Systems >Scene Character Detection and Recognition with Cooperative MultiDle-HvDothesis Framework
【24h】

Scene Character Detection and Recognition with Cooperative MultiDle-HvDothesis Framework

机译:协同MultiDle-HvDothesis框架的场景字符检测与识别

获取原文
获取原文并翻译 | 示例
           

摘要

To handle the variety of scene characters, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an Optical Character Recognition (OCR) module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant detection/recognition and pads the missing detection/recognition. The proposed framework bridges the gap between scene character detection and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the detection and recognition at the character level, which enables dealing with special scenarios such as single character, text along arbitrary orientations or text along curves. We perform experiments on the benchmark ICDAR 2011 Robust Reading Competition dataset which includes a text localization task and a word recognition task. The quantitative results demonstrate that multiple hypotheses outperform a single hypothesis, and be comparable with state-of-the-art methods in terms of recall, precision, F-measure, character recognition rate, total edit distance and word recognition rate. Moreover, two additional experiments are conducted to confirm the simplicity of parameter setting in this proposal.
机译:为了处理各种场景字符,我们提出了一个协作的多重假设框架,该框架由图像运算符集模块,光学字符识别(OCR)模块和集成模块组成。由多个参数激活的多个图像运算符会探查可疑字符区域。然后,将OCR模块应用于每个可疑区域,并返回多个具有权重值的候选项以供将来集成。在没有启发式规则的帮助下,该启发式规则对分割区域,纵横比,颜色一致性,文本行方向等施加了约束,集成模块会自动修剪多余的检测/识别并填充缺失的检测/识别。在有效利用实际OCR引擎进行结果细化的意义上,提出的框架弥补了场景字符检测和识别之间的差距。此外,所提出的方法在字符级别实现了检测和识别,从而可以处理特殊情况,例如单个字符,任意方向的文本或曲线的文本。我们对基准ICDAR 2011健壮阅读比赛数据集进行了实验,该数据集包括文本本地化任务和单词识别任务。定量结果表明,多个假设优于单个假设,并且在召回率,精度,F量度,字符识别率,总编辑距离和单词识别率方面与最新技术相当。此外,还进行了两个额外的实验以确认该建议中参数设置的简便性。

著录项

  • 来源
    《IEICE Transactions on Information and Systems》 |2013年第10期|2235-2244|共10页
  • 作者单位

    Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University, Fukuoka-shi, 819-0395 Japan;

    Department of Computer Science, National University of Singapore, Singapore,Presently, with Systems and Information Technology, Faculty of Computer Science and Information Technology, University of Malaya, Malaysia;

    Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University, Fukuoka-shi, 819-0395 Japan;

    Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University, Fukuoka-shi, 819-0395 Japan;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    cooperative multiple-hypothesis framework; scene character; OCR; integration; voting;

    机译:合作多重假设框架场景角色OCR;积分;表决;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号