Partitioning of the degradation space for OCR training

机译：划分退化空间以进行OCR训练

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Generally speaking optical character recognition algorithms tend to perform better when presented with homogeneous data. This paper studies a method that is designed to increase the homogeneity of training data, based on an understanding of the types of degradations that occur during the printing and scanning process, and how these degradations affect the homogeneity of the data. While it has been shown that dividing the degradation space by edge spread improves recognition accuracy over dividing the degradation space by threshold or point spread function width alone, the challenge is in deciding how many partitions and at what value of edge spread the divisions should be made. Clustering of different types of character features, fonts, sizes, resolutions and noise levels shows that edge spread is indeed shown to be a strong indicator of the homogeneity of character data clusters.

机译：一般而言，光学字符识别算法在呈现同质数据时往往会表现更好。本文基于对打印和扫描过程中发生的降级类型以及这些降级如何影响数据均一性的理解，研究了一种旨在提高训练数据同质性的方法。虽然已经表明，通过边缘扩展来划分退化空间比仅通过阈值或点扩展函数宽度来划分退化空间能够提高识别精度，但挑战在于确定应进行多少划分以及以边缘扩展的值进行划分。不同类型的字符特征，字体，大小，分辨率和噪声水平的聚类表明，边缘扩展确实显示为字符数据聚类同质性的有力指标。

著录项

来源
《Document Recognition and Retrieval XIII; Electronic Imaging Science and Technology》|2006年|P.606705.1-606705.8|共8页
会议地点 San JoseCA(US)
作者
Elisa H. Barney Smith; Tim ersen;
展开▼
作者单位

Boise State University Boise, ID, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类模式识别与装置;
关键词
character degradations; OCR training; character homogeneity; clustering;

机译：性格退化； OCR训练；字符同质性；聚类;

相似文献

外文文献
中文文献
专利

1. The Rates of Protein Synthesis and Degradation Account for the Differential Response of Neurons to Spaced and Massed Training Protocols [J] . Faisal Naqib, Carole A. Farah, Christopher C. Pack, PLoS Computational Biology . 2011,第12期

机译：蛋白质合成和降解的速率解释了神经元对间隔和大规模训练方案的差异反应。
2. Partitioning of feature space by iterative classification for degraded document image binarisation [J] . Valizadeh M., Kabir E. Image Processing, IET . 2012,第6期

机译：通过迭代分类对特征空间进行分区，以实现降级的文档图像二值化
3. Binarization of degraded document image based on feature space partitioning and classification [J] . Morteza Valizadeh, Ehsanollah Kabir International Journal on Document Analysis and Recognition . 2012,第1期

机译：基于特征空间划分和分类的退化文档图像二值化
4. Partitioning of the degradation space for OCR training [C] . Elisa H. Barney Smith, Tim Andersen Conference on Document Recognition and Retrieval . 2006

机译：对OCR训练的降级空间分区
5. Probabilistic methods for searching OCR-degraded Arabic text. [D] . Darwish, Kareem M. 2003

机译：用于搜索OCR降级的阿拉伯文本的概率方法。
6. The Rates of Protein Synthesis and Degradation Account for the Differential Response of Neurons to Spaced and Massed Training Protocols [O] . Faisal Naqib, Carole A. Farah, Christopher C. Pack, 2011

机译：蛋白质合成和降解的速率解释了神经元对间隔和大量训练方案的差异反应。
7. Partitioning of the Degradation Space for OCR Training [O] . Barney Smith Elisa H., Andersen Tim 2006

机译：划分OCR培训的降级空间
8. Fighting in a Contested Space Environment: Training Marines for Operations with Degraded or Denied Space-Enabled Capabilities. [R] . Garcia, D. M. 2015

机译：在有争议的太空环境中进行战斗：为具有降级或被拒绝的空间能力的作战训练海军陆战队员。

Partitioning of the degradation space for OCR training

摘要

著录项

相似文献

相关主题

期刊订阅