首页> 外文会议>Camera-based document analysis and recognition. >NEOCR: A Configurable Dataset for Natural Image Text Recognition
【24h】

NEOCR: A Configurable Dataset for Natural Image Text Recognition

机译:NEOCR:用于自然图像文本识别的可配置数据集

获取原文
获取原文并翻译 | 示例

摘要

Recently growing attention has been paid to recognizing text in natural images. Natural image text OCR is far more complex than OCR in scanned documents. Text in real world environments appears in arbitrary colors, font sizes and font types, often affected by perspective distortion, lighting effects, textures or occlusion. Currently there are no datasets publicly available which cover all aspects of natural image OCR. We propose a comprehensive well-a.nnot.uted configurable dataset for optical character recognition in natural images for the evaluation and comparison of approaches tackling with natural image text OCR,. Based on the rich annotations of the proposed NKOCR dataset new and more precise evaluations are now possible, which give more detailed information on where improvements are most required in natural image text OCR.
机译:最近,人们越来越重视识别自然图像中的文本。在扫描的文档中,自然图像文本OCR比OCR复杂得多。现实环境中的文本以任意颜色,字体大小和字体类型显示,通常会受到透视变形,灯光效果,纹理或遮挡的影响。当前没有公开可用的数据集涵盖自然图像OCR的所有方面。我们为自然图像中的光学字符识别提出了一个完善的,可配置的数据集,用于评估和比较处理自然图像文本OCR的方法。基于提议的NKOCR数据集的丰富注释,现在可以进行新的更精确的评估,从而提供有关自然图像文本OCR中最需要改进的地方的更详细信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号