首页> 外文会议>International Conference on Pattern Recognition Workshops >Subjective Assessments of Legibility in Ancient Manuscript Images - The SALAMI Dataset
【24h】

Subjective Assessments of Legibility in Ancient Manuscript Images - The SALAMI Dataset

机译:古代稿件图像中易读性的主观评估 - 萨拉米亚数据集

获取原文

摘要

The research field concerned with the digital restoration of degraded written heritage lacks a quantitative metric for evaluating its results, which prevents the comparison of relevant methods on large datasets. Thus, we introduce a novel dataset of Subjective Assessments of Legibility in Ancient Manuscript Images (SALAMI) to serve as a ground truth for the development of quantitative evaluation metrics in the field of digital text restoration. This dataset consists of 250 images of 50 manuscript regions with corresponding spatial maps of mean legibility and uncertainty, which are based on a study conducted with 20 experts of philology and paleography. As this study is the first of its kind, the validity and reliability of its design and the results obtained are motivated statistically: we report a high intra- and inter-rater agreement and show that the bulk of variation in the scores is introduced by the image regions observed and not by controlled or uncontrolled properties of participants and test environments, thus concluding that the legibility scores measured are valid attributes of the underlying images.
机译:关联与退化的书面遗产数字恢复有关的研究领域缺乏评估其结果的定量指标,这可以防止在大型数据集上的相关方法的比较。因此,我们介绍了古代稿件图像(萨拉米人)中易读性易读性的主观评估的新型数据集,以作为发展数字文本恢复领域的定量评估指标的基础事实。该数据集由250个稿区的250张图像组成,具有相应的平均易读性和不确定性的空间地图,该地图是基于与20个理论和古文古专业专业专业专业专业的研究。由于本研究首先,其设计的有效性和可靠性以及所获得的结果在统计上进行了激励:我们报告了一个高中的内部协议,并显示了分数的大部分变化观察到的图像区域,而不是由参与者和测试环境的受控或不受控制的属性,从而得出的结论是测量的可怜分数是底层图像的有效属性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号