首页> 外文会议>Camera-based document analysis and recognition. >The IUPR Dataset of Camera-Captured Document Images
【24h】

The IUPR Dataset of Camera-Captured Document Images

机译:相机捕获的文档图像的IUPR数据集

获取原文
获取原文并翻译 | 示例

摘要

Major challenges in camera-base document analysis are dealing with uneven shadows, high degree of curl ami perspective distortions. In CBDAR 2007. we introduced the first dataset (I)FKI-I) of camera-captured document images in conjunction with a page dewarping contest. One of the main limitations of this dataset is that it contains images only from technical books with simple layouts and moderate curl/skew. Moreover, it does not contain information about camera's specifications and settings, imaging environment, and document contents. This kind of information would be more helpful for understanding the results of the experimental evaluation of camera-based document image processing (binarization. page segmentation, dewarping. etc.). In this paper, we introduce a new dataset (the IUPR dataset) of camera-captured document images. As compared to the previous dataset. the new dataset contains images from different varieties of technical and non-technical books with more challenging problems, like different types of layouts, large variety of curl, wide range of perspective distortions, and high to low resolutions. Additionally, the document images in the new dataset are provided with detailed information about thickness of books, imaging environment and camera's viewing angle and its internal settings. The new dataset, will help research community to develop robust camera-captured document processing algorithms in order to solve the challenging problems in the dataset and to compare different methods on a common ground.
机译:基于相机的文档分析中的主要挑战是处理不均匀的阴影,高度的卷曲和透视变形。在CBDAR 2007中,我们与页面变形竞赛一起引入了第一个相机捕获的文档图像数据集(I)FKI-I。该数据集的主要限制之一是它仅包含来自技术书籍的图像,这些图像具有简单的布局和适度的卷曲/偏斜。而且,它不包含有关相机规格和设置,成像环境和文档内容的信息。这种信息将有助于理解基于相机的文档图像处理(二进制化,页面分割,变形等)的实验评估结果。在本文中,我们介绍了一个由相机捕获的文档图像的新数据集(IUPR数据集)。与之前的数据集相比。新的数据集包含来自各种技术和非技术书籍的图像,这些图像具有更具挑战性的问题,例如不同类型的布局,各种卷曲,广泛的透视变形以及从高到低的分辨率。此外,新数据集中的文档图像还提供有关书籍厚度,成像环境和相机视角及其内部设置的详细信息。新的数据集将帮助研究社区开发强大的相机捕获的文档处理算法,以解决数据集中的难题,并在一个共同的基础上比较不同的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号