首页> 外文会议>International Electronics Symposium >Data Analytics of Human Development Index(HDI) with Features Descriptive and Predictive Mining
【24h】

Data Analytics of Human Development Index(HDI) with Features Descriptive and Predictive Mining

机译:具有描述性和预测性挖掘功能的人类发展指数(HDI)数据分析

获取原文

摘要

The value of the Human Development Index (HDI) in Indonesia is increasing every year. Indonesia has many provinces and districts/cities, it makes the government need more time to analyze data. This research purpose a new method to analyze the data of HDI with a descriptive and predictive mining method. There are two main results of this research. First, a segmentation of HDI data into four segments, there are low, medium, high, and very high. Second, a prediction of HDI data. Before analyzing data, the system does data preprocessing to repair the missing data (cleaning) and normalization (transformation) to convert data into a smaller range(from 0 to 1). To get a segmentation result use the descriptive mining method, in this method, there are two steps, the first system does grouping and labeling data based on the value of HDI indicators(life expectancy, expected years of schooling, mean years of schooling and income per capita) use Hierarchical Clustering Centroid Linkage Method. Second, the system does the interpretation process based on the distance between centroid every cluster and ground(0,0). To get a prediction result use the predictive mining method, this process uses a Weighted Moving Average(WMA) with the last three years of HDI data. The result of this research, the variance accuracy value of the descriptive mining method is 0,203, and the Mean Absolute Percentage Error(MAPE) value of the predictive mining method is 0,27%.
机译:印度尼西亚的人类发展指数(HDI)的价值每年都在增加。印度尼西亚有许多省和地区/城市,这使得政府需要更多时间来分析数据。这项研究的目的是一种使用描述性和预测性挖掘方法来分析HDI数据的新方法。这项研究有两个主要结果。首先,将HDI数据分为四个部分,分别为低,中,高和非常高。其次,对HDI数据的预测。在分析数据之前,系统会进行数据预处理以修复丢失的数据(清理)和规范化(转换),以将数据转换为较小的范围(从0到1)。要使用描述性挖掘方法获得细分结果,该方法分为两个步骤,第一个系统根据HDI指标的值(预期寿命,预期受教育年限,平均受教育年限和收入)对数据进行分组和标记人均)使用层次聚类质心链接方法。其次,系统根据每个聚类的质心与地面之间的距离(0,0)进行解释过程。为了使用预测挖掘方法获得预测结果,此过程使用了加权移动平均(WMA)和最近三年的HDI数据。研究结果表明,描述性挖掘方法的方差精度值为0,203,预测性挖掘方法的平均绝对百分比误差(MAPE)值为0.27%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号