首页> 外文学位 >Toward a robust and accurate vision system which recognizes character images.
【24h】

Toward a robust and accurate vision system which recognizes character images.

机译:迈向识别字符图像的强大而准确的视觉系统。

获取原文
获取原文并翻译 | 示例

摘要

Substantial research effort have been devoted to the development of automatic character recognition, which has led to the blooming of office automation industry.; Recent developments have concentrated on solving problems due to seriously degraded images. To deal with these problems, algorithms of character recognition technique at various functional levels have to be redesigned or enhanced. In this thesis, we target at developing new character recognition techniques for preprocessing, feature representation, recognition and pose estimation, so as to handle degraded character images.; For image preprocessing, a novel algorithm for character region extraction based on grayscale mathematical morphology is proposed. It detects features that may be occurred rarely in other regions, except the character regions. Under complex scene and noisy environment, character regions in 36 out of 40 natural scene pictures are correctly exacted. Based on the rough set theory, these extracted character images can be further manipulated to generate skeletons of good quality in one pass of scan, which are useful for syntactic and structural recognition.; In order to represent characters in consideration of perspective distortion due to different viewpoints, an approach using Fourier-coefficients based descriptors is presented. The approach is proved to be viewpoint invariant and more efficient in implementation as compared with other techniques. In addition, a wavelet-transformed based semi-local descriptors are proposed to describe the boundary of degraded character images. These descriptors are better than other global descriptors such as Fourier descriptors and moments as they can include both global and local information on the shape.; For recognition, a new method of similarity measure for Fuzzy-attributed graphs (FAGs) is proposed which provides a more robust tool to compare degraded character images from their reference patterns. Experimental results show that an accuracy rate around 93 percent can be achieved by our approach for handwritten numerals.; Finally, a direct solution of pose information including viewing distance, rotation angle, slant and tilt angles for known characters is proposed. It provides a more accurate recognition or simpler segmentation by feedbacking these information to adjust the position and orientation of the camera. It has been demonstrated that this method is accurate, efficient and robust to high frequency noises and even small occlusion. (Abstract shortened by UMI.)
机译:大量的研究工作致力于自动字符识别的发展,这导致了办公自动化行业的蓬勃发展。最近的发展集中在解决由于图像严重劣化而引起的问题。为了解决这些问题,必须重新设计或增强各种功能级别的字符识别技术的算法。本文旨在开发用于预处理,特征表示,识别和姿势估计的新字符识别技术,以处理退化的字符图像。对于图像预处理,提出了一种基于灰度数学形态学的字符区域提取新算法。它可以检测在字符区域以外的其他区域很少出现的特征。在复杂的场景和嘈杂的环境下,正确地提取了40张自然场景图片中36张的字符区域。基于粗糙集理论,可以进一步处理这些提取的字符图像,以一次扫描生成高质量的骨架,这对于语法和结构识别很有用。为了考虑由于不同视点引起的视点失真来表示字符,提出了一种使用基于傅立叶系数的描述符的方法。与其他技术相比,该方法被证明是视点不变的,并且在实现上效率更高。另外,提出了一种基于小波变换的半局部描述符,以描述退化字符图像的边界。这些描述符比其他全局描述符(例如傅立叶描述符和矩)更好,因为它们可以同时包含形状的全局和局部信息。为了进行识别,提出了一种用于模糊属性图(FAG)的相似度度量的新方法,该方法提供了一种更强大的工具来比较退化字符图像与参考图形之间的差异。实验结果表明,通过我们的手写数字方法,可以达到93%左右的准确率。最后,提出了姿态信息的直接解决方案,包括已知字符的视角距离,旋转角度,倾斜角度和倾斜角度。通过反馈这些信息以调整相机的位置和方向,它可以提供更准确的识别或更简单的分割。已经证明,该方法对于高频噪声甚至很小的遮挡是准确,有效和鲁棒的。 (摘要由UMI缩短。)

著录项

  • 作者

    Man, Gary Man Tat.;

  • 作者单位

    Hong Kong Polytechnic University (People's Republic of China).;

  • 授予单位 Hong Kong Polytechnic University (People's Republic of China).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1998
  • 页码 216 p.
  • 总页数 216
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号