Journal of software

Architecting an Enterprise Data Lake, A Covid19 Case Study

Abstract

Data is growing at an enormous rate every day. Traditionally, data has resided in silos across an organization, making it difficult to form a complete picture for data-driven business decision making. A data lake addresses the growth in data by providing “schema on read”, better integration, and cheaper storage. It also solves the data silo problem by providing a central platform for a variety of data housing needs. However, implementing a data lake is challenging because the implementation must address additional needs such as metadata management, data discovery, data governance, data lifecycle management, security, and centralized access control mechanisms. This paper provides a comprehensive data lake architecture that addresses these challenges. We also conducted and documented experiments with publicly available datasets about COVID19 to validate the design and applicability of the proposed architecture for business analytics purposes.