首页> 外文会议>Advance Computing Conference,IACC,2009 IEEE International >Retrieval Of Information In Document Image Databases Using Partial Word Image Matching Technique
【24h】

Retrieval Of Information In Document Image Databases Using Partial Word Image Matching Technique

机译:使用部分单词图像匹配技术检索文档图像数据库中的信息

获取原文

摘要

With the popularity and importance of document images as an information source, information retrieval in document image databases has become a challenge. In this paper, an approach with the capability of matching partial word images to address two issues in document image retrieval: word spotting and similarity measurement between documents has been proposed. Initially, each word image is represented by a primitive string. Then, an inexact string matching technique is utilized to measure the similarity between the string generated of the query word with the word string generated from the document. Based on the similarity, we can find out how a word image is relevant to the other and, can be decided whether one is a portion of the other. In order to deal with various character fonts, a primitive string which is tolerant to serif and font differences to represent a word image has been used. Using this technique of inexact string matching, our method is able to successfully handle the problem of heavily touching characters. From the experimental results on a variety of document image databases it is confirmed that the proposed approach is feasible, valid, and efficient in document image retrieval.
机译:随着文档图像作为信息源的普及和重要性,文档图像数据库中的信息检索已成为挑战。在本文中,提出了一种方法,其具有匹配部分字图像来解决文档图像检索中的两个问题:已经提出了文档之间的单词斑点和相似性测量。最初,每个单词图像由基本字符串表示。然后,利用不精确的字符串匹配技术来测量与从文档生成的单词字符串生成的字符串之间的相似度。基于相似性,我们可以了解单词图像如何与另一个是如何相关的,并且可以决定一个是另一个的一部分。为了处理各种字符字体,已经使用了对Serif和字体差异的原始字符串已经使用以表示单词图像。使用这种不精确的字符串匹配技术,我们的方法能够成功处理大量触摸字符的问题。从实验结果来自各种文档图像数据库,确认所提出的方法是可行,有效,有效的文档图像检索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号