USING THE STRUCTURAL CONTENT OF DOCUMENTS TO AUTOMATICALLY GENERATE QUALITY METADATA

机译：使用文档的结构内容自动生成质量元数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Giving search engines access to high quality document metadata is crucial for efficient document retrieval efforts on the Internet and on corporate Intranets. Presence of such metadata is currently sparsely present. This paper presents how the structural content of document files can be used for Automatic Metadata Generation (AMG) efforts, basing efforts directly on the documents' content (code) and enabling effective usage of combinations of AMG algorithms for additional harvesting and extraction efforts. This enables usage of AMG efforts to generate high quality metadata in terms of syntax, semantics and pragmatics, from non-homogenous data sources in terms of visual characteristics and language of their intellectual content.

机译：给出搜索引擎访问高质量文档元数据对于互联网和企业内网上的高效文件检索工作至关重要。目前稀疏存在这种元数据。本文介绍了文档文件的结构内容如何用于自动元数据生成（AMG）努力，直接基于文档内容（代码）的工作，并实现了AMG算法组合的有效使用，以便进行额外的收获和提取努力。这使得可以使用AMG努力在其智力内容的视觉特征和语言方面，从非同质数据源的语法，语义和语用来实现高质量元数据。

著录项

来源
《International Conference on Web Information Systems and Technologies》|2009年||共10页
会议地点
作者
Lars Fredrik Hoimyr Edvardsen; Ingeborg Torvik Solvberg; Trond Aalberg; Hallvard Trastteberg;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Automatic Metadata Generation; Extraction; Metadata Quality; Word; PowerPoint; PDF; OpenXML;

机译：自动元数据生成;提取;元数据质量;单词;powerpoint;pdf;OPENXML;

相似文献

外文文献
中文文献
专利

1. A Learning Quality Metadata approach: Automatic quality assessment of virtual training from metadata [J] . Daniel Pons, Jose Ramon Hilera, Luis Fernandez, Computer standards & interfaces . 2016,第Mara期

机译：学习质量元数据方法：根据元数据对虚拟培训进行自动质量评估
2. Automatic Extraction of Apparent Semantic Structure from Text Contents of a Structural Calculation Document [J] . Bong-Geun Kim, Sang II Park, Hyo-Jin Kim, Journal of Computing in Civil Engineering . 2010,第3期

机译：从结构计算文档的文本内容中自动提取表观语义结构
3. Generating metadata from web documents: a systematic approach [J] . Hsiang-Yuan Hsueh, Chun-Nan Chen, Kun-Fu Huang Human-centric Computing and Information Sciences . 2013,第1期

机译：从Web文档生成元数据：系统的方法
4. USING THE STRUCTURAL CONTENT OF DOCUMENTS TO AUTOMATICALLY GENERATE QUALITY METADATA [C] . Lars Fredrik Hoimyr Edvardsen, Ingeborg Torvik Solvberg, Trond Aalberg, International Conference on Web Information Systems and Technologies . 2009

机译：使用文档的结构内容自动生成质量元数据
5. Data mining revision controlled document history metadata for automatic classification. [D] . Maass, Dustin. 2013

机译：数据挖掘修订版本控制的文档历史记录元数据，用于自动分类。
6. Using XML Metadata to Enable the Automatic Generation and Processing of HTML Forms from XML Documents [O] . Anil K. Dubey, Henry C. Chueh 2001

机译：使用XML元数据启用从XML文档自动生成和处理HTML表单的功能
7. Using the structural content of documents to automatically generate quality metadata [O] . Edvardsen Lars Fredrik Høimyr 2013

机译：使用文档的结构内容自动生成高质量的元数据

USING THE STRUCTURAL CONTENT OF DOCUMENTS TO AUTOMATICALLY GENERATE QUALITY METADATA

摘要

著录项

相似文献

相关主题

期刊订阅