【24h】

Issue Based OCR Error Prediction in Video Streams

机译:视频流中基于问题的OCR错误预测

获取原文

摘要

This paper increases the reliability of Optical Character Recognition (OCR) systems in natural scene by proposing a novel Image Quality Assessment (IQA) system. We propose to increase reliability based on the principle that OCR accuracy is a function of the quality of the input image. Detected text boxes are analyzed regarding their OCR score and different quality issues, such as blur, light and reflection effects. The novelty of our approach is to model IQA as a classification task, where one class represents high quality elements and each of the other classes represent a specific quality issue. We demonstrate how this methodology allows the training of IQA systems for complex quality metrics, even when no data labeled with the desired metric is available. Furthermore, a single IQA system outputs the quality score as well as the quality issues for a given image. We built on publicly available databases to generate 60k text boxes for each class and obtain 97,1% classification accuracy on a test set of 24k images. We conclude that the learnt quality metric is a valid indicator of common OCR errors by evaluating on the ICDAR 2003 Robust Word Recognition dataset.
机译:通过提出一种新颖的图像质量评估(IQA)系统,本文提高了自然场景中光学字符识别(OCR)系统的可靠性。我们建议基于OCR精度取决于输入图像质量的原理来提高可靠性。分析检测到的文本框的OCR分数和不同的质量问题,例如模糊,光和反射效果。我们方法的新颖之处在于将IQA建模为分类任务,其中一个类别代表高质量的元素,而其他各个类别则代表一个特定的质量问题。我们演示了即使没有可用所需指标标记的数据,该方法也可如何培训IQA系统以获取复杂的质量指标。此外,单个IQA系统输出给定图像的质量得分以及质量问题。我们建立在可公开获取的数据库的基础上,为每个类别生成60k文本框,并在24k图像测试集上获得97.1%的分类精度。我们得出结论,通过对ICDAR 2003健壮单词识别数据集进行评估,学习的质量指标是常见OCR错误的有效指标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号