首页> 外文会议>International Workshop on Graphics Recognition(GREC 2005); 20050825-26; Hong Kong(CN) >Extraction of Index Components Based on Contents Analysis of Journal's Scanned Cover Page
【24h】

Extraction of Index Components Based on Contents Analysis of Journal's Scanned Cover Page

机译:基于期刊扫描封面内容分析的索引成分提取

获取原文
获取原文并翻译 | 示例

摘要

In this paper, a method for automatically indexing the contents to reduce the effort that used to be required for input paper information and constructing index is sought. Various contents formats for journals, which have different features from those for general documents, are described. The principal elements that we want to represent are titles, authors, and pages for each paper. Thus, the three principal elements are modeled according to the order of their arrangement, and then their features are generalized. The content analysis system is then implemented based on the suggested modeling method. The content analysis system, implemented for verifying the suggested method, gets its input in the form containing more than 300 dpi gray scale image and analyze structural features of the contents. It classifies titles, authors and pages using efficient projection method. The definition of each item is classified according to regions, and then is extracted automatically as index information. It also helps to recognize characters region by region. The experimental result is obtained by applying to some of the suggested 6 models, and the system shows 97.3% success rate for various journals.
机译:在本文中,寻求一种用于自动索引内容以减少过去用于输入纸张信息和构造索引的工作量的方法。描述了日记的各种内容格式,这些格式具有与一般文档不同的功能。我们要代表的主要元素是每篇论文的标题,作者和页面。因此,根据三个主要元素的排列顺序对其建模,然后对其特征进行概括。然后基于建议的建模方法实施内容分析系统。为验证所建议的方法而实施的内容分析系统以包含300 dpi以上的灰度图像的形式获取其输入,并分析内容的结构特征。它使用有效的投影方法对标题,作者和页面进行分类。根据区域对每个项目的定义进行分类,然后自动将其提取为索引信息。它还有助于按区域识别字符。通过将所建议的6个模型中的某些模型应用,可以获得实验结果,并且该系统显示各种期刊的成功率为97.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号