首页> 外文会议>IEEE Congress on Information Science and Technology >Towards Semantic ETL for integration of textual scientific documents in a Big Data environment: a theoretical approach
【24h】

Towards Semantic ETL for integration of textual scientific documents in a Big Data environment: a theoretical approach

机译:在大数据环境中集成文本科学文本的语义ETL:一种理论方法

获取原文

摘要

Every day, new scientific documents (textual data) and new domain ontologies are published. It requires putting evolving information systems in order to exploit these massive amounts of textual data and ontologies. Hence, the need for data integration is to collect and combine them so we can provide a unified view, which analyze and visualize them for the different analytical needs of scientific researchers. In this sense, we find the Extract Transform Load (ETL) tool. It is one of the most popular approaches for data integration (DI). However, it does not take into consideration semantic data. This requirement gives birth to the Semantic ETL or ETL based on ontologies, which extends traditional ETL. Mostly known as an issue of interest to most researchers, but the latter found it difficult because of the complexity and the processing of Big Data that are characterized by 5V (Volume, Variety, Velocity, Veracity, Value). Succinctly, this paper attempts to discuss two questions such as: (1) What is the capacity of the existing Semantic ETL to manage Big Data and are they relevant in the present? (2) What is our proposal for a Semantic ETL for Big Data to overcome previous works?
机译:每天,新的科学文档(文本数据)和新域Inalologies都发布。它需要放置不断发展的信息系统,以利用这些大量的文本数据和本体。因此,对数据集成的需求是收集和组合它们,因此我们可以提供统一的视图,该视图为不同的科学研究人员的分析需求分析和可视化。从这个意义上讲,我们发现提取物变换负载(ETL)工具。它是数据集成(DI)最受欢迎的方法之一。但是,它没有考虑语义数据。此要求基于本体研究的语义ETL或ETL生育,该Ontologies延伸了传统ETL。大多数被称为对大多数研究人员感兴趣的问题,但后者发现它难以困难,因为具有5V(体积,品种,速度,准确性,价值)的大数据的复杂性和加工。简洁地,本文试图讨论两个问题,例如:(1)现有语义ETL管理大数据的容量是多少,以及它们在现在相关? (2)我们对大数据的语义ETL的提案是什么,以克服以前的作品?

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号