首页> 外文学位 >A segmentation-free approach to text recognition with application to Arabic text.
【24h】

A segmentation-free approach to text recognition with application to Arabic text.

机译:一种无分段的文本识别方法,适用于阿拉伯文本。

获取原文
获取原文并翻译 | 示例

摘要

As the recognition rates and speeds of optical character recognition (OCR) systems steadily improve, the problem of OCR--and subsequently research interest--is shifting from recognizing: isolated, high-quality characters to reading cursive scripts and degraded documents. In recognizing such texts, a major undertaking is segmenting cursive words into characters and isolating merged characters. In OCR systems that recognize cursive text, the segmentation subsystem becomes the pivotal stage in the system to which a sizable portion of processing is devoted and a considerable share of recognition errors is attributed. The most notable feature of Arabic writing is its cursiveness. It also poses the most difficult problem for recognition algorithms.; In this work, we describe the design and implementation of a system that is automatically trainable and that recognizes noisy and cursive words. To recognize a word, the system does not segment it into symbols (character shapes) in advance; rather, it recognizes the input word by detecting a set of "shape primitives" on the word. It then matches the regions of the word (represented by the detected primitives) to a set of symbol models. A spatial arrangement of symbol models that are matched to regions of the word, then, becomes the description of the recognized word. Since the number of potential arrangements of all symbol models is large, the system imposes a set of word structure and spatial consistency. It searches the space comprised of the arrangements that satisfy the constraints and tries to maximize the a posteriori probability of the symbol-models' arrangement.; Large-scale experimentation with the system on isolated characters reveals that it has a recognition rate of 99.7% for synthetically degraded symbols and 94.1% for scanned symbols. Experimentation on isolated words reveals that the system has a recognition rate of 99.4% for noise-free words, 95.6% for synthetically degraded words, and 73% for scanned words.; The main theoretical contribution of this work is in laying the foundation for a segmentation-free approach for Arabic word recognition. Recognition is based on maximizing the probability of the word given the detected primitives. The system is designed to minimize training effort and is extensible as training determines the symbols the system recognizes.
机译:随着光学字符识别(OCR)系统的识别率和速度稳步提高,OCR(以及随后的研究兴趣)的问题已从识别:隔离的高质量字符转变为阅读草书和退化的文档。在识别此类文本时,一项主要任务是将草书单词分割成多个字符并隔离合并的字符。在识别草书文本的OCR系统中,分段子系统成为系统中的关键阶段,在该阶段中,相当大的一部分处理被投入到了系统中,并且很大一部分识别错误被归因于此。阿拉伯文字的最显着特征是其草书性。对于识别算法来说,这也是最困难的问题。在这项工作中,我们描述了系统的设计和实现,该系统是可自动训练的并且可以识别嘈杂和草率的单词。为了识别单词,系统不会预先将其分割为符号(字符形状);相反,它通过检测单词上的一组“形状图元”来识别输入单词。然后将单词的区域(由检测到的图元表示)与一组符号模型进行匹配。然后,与单词的区域匹配的符号模型的空间排列成为对已识别单词的描述。由于所有符号模型的潜在排列数量很大,因此系统强加了一组单词结构和空间一致性。它搜索由满足约束的排列组成的空间,并尝试使符号模型排列的后验概率最大化。在孤立字符上对该系统进行的大规模实验表明,该系统对合成降级符号的识别率为99.7%,对扫描符号的识别率为94.1%。对孤立单词的实验表明,该系统对无噪声单词的识别率为99.4%,对于合成降级单词的识别率为95.6%,对于扫描单词的识别率为73%。这项工作的主要理论贡献在于为阿拉伯单词识别的无分段方法奠定了基础。识别基于给定检测到的原语的单词的最大概率。该系统旨在最大程度地减少培训工作,并且在培训确定系统可识别的符号时可扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号