首页> 外文会议>IEEE International Conference on Big Data >An infrastructure and application of computational archival science to enrich and integrate big digital archival data: Using Taiwan Indigenous Peoples Open Research Data (TIPD) as an example
【24h】

An infrastructure and application of computational archival science to enrich and integrate big digital archival data: Using Taiwan Indigenous Peoples Open Research Data (TIPD) as an example

机译:丰富和整合大数字档案数据的计算档案科学的基础设施和应用:以台湾原住民开放研究数据(TIPD)为例

获取原文

摘要

This paper highlights research on constructing a big archival data called Taiwan Indigenous Peoples Open Research Data (TIPD, see https://osf.io/e4rvz/) based on contemporary census and household registration data sets in 2013-2017 (see http://TIPD.sinica.edu.tw). TIPD utilizes record linkage, geocoding, and high-performance in-memory computing technology to construct various dimensions of Taiwan Indigenous Peoples (TIPs) demographics and developments. Embedded in collecting, cleaning, cleansing, processing, exploring, and enriching individual digital records are archival computational science and data science. TIPD consists of three categories of archival open data: (1) categorical data, (2) household structure and characteristics data, and (3) population dynamics data, including cross-sectional time-series categorical data, longitudinally linked population dynamics data, life tables, household statistics, micro genealogy data, marriage practice and ethnic identity data, internal migration data, geocoded data, etc. TIPD big archival data not only help unveil contemporary TIPs demographics and various developments, but also help overcome research barriers and unleash creativity for TIPs studies.
机译:本文重点介绍了基于2013-2017年的人口普查和户籍数据集构建名为台湾原住民开放研究数据(TIPD,请参阅https://osf.io/e4rvz/)的大档案数据的研究(请参阅http:/ /TIPD.sinica.edu.tw)。 TIPD利用记录链接,地理编码和高性能内存计算技术来构建台湾原住民(TIP)人口统计和发展的各个维度。档案计算科学和数据科学嵌入在收集,清洁,清洗,处理,探索和丰富单个数字记录中。 TIPD由三类档案开放数据组成:(1)分类数据;(2)家庭结构和特征数据;(3)人口动态数据,包括横截面时间序列分类数据,纵向链接的人口动态数据,生活表格,家庭统计数据,微观家谱数据,婚姻习俗和种族身份数据,内部移民数据,地理编码数据等。TIPD大档案数据不仅有助于揭示当代TIP人口统计数据和各种发展趋势,而且还有助于克服研究障碍并释放创造力技巧研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号