Journal: International Journal of Engineering and Technology

Robust Text Extraction for Automated Processing of Multi-Lingual Personal Identity Documents

Abstract

Text extraction is the task of separating textual content from a non-textual background, such as an image. It plays an important role in recovering valuable information from images. Variation in text size, font, orientation, alignment, and contrast makes the task challenging. Existing text extraction methods focus on certain regions of interest, and characteristics such as noise, blur, distortion, and font variation make extraction difficult. This paper proposes a technique to extract textual characters from scanned images of personal identity documents. Current procedures track user records manually, which is inefficient and demands considerable time and human resources. The proposed methodology digitizes personal identity documents and eliminates much of the manual work involved in existing data entry and verification procedures. The proposed method has been tested extensively on large datasets of varying sizes and image qualities. The results indicate high accuracy in extracting important textual features from the document images.
