首页> 外文会议>International Conference on ICT for Smart Society >Automatic Multi-label Classification for GDP Economic-phenomenon News
【24h】

Automatic Multi-label Classification for GDP Economic-phenomenon News

机译:自动多标签对GDP经济现象新闻的分类

获取原文

摘要

GDP is a measure of a country's economy. One of the variables used in the process of compiling GDP is the news analysis of economic phenomena. Briefly, the GDP figures are getting better if they are in line with the economic phenomena that occur. Analysis of this economic phenomenon can also be a supporting component of GDP publicity. This study aims to investigate the best classification model for economic phenomenon news to speed up the analysis process. We use a multi-label classification method because each news item has one or more categories. The label in this study corresponds to the GDP by Industry which refers to the Indonesian Standard Industrial Classification (KBLI). We use the Problem Transformation approach in combination with several single label classification algorithms. Label Power-set method combined with Linear SVC showed a better result among others, it reaches 75% for F-Measure and 0.021 for Hamming Loss in 5-fold cross-validation. However, the model performance increase to 84% in testing mode.
机译:GDP是​​一个国家经济的衡量标准。编制GDP过程中使用的变量之一是经济现象的新闻分析。简而言之,如果他们符合发生的经济现象,GDP数字会变得更好。对这种经济现象的分析也可以是GDP宣传的支持组成部分。本研究旨在调查经济现象新闻的最佳分类模型,以加快分析过程。我们使用多标签分类方法,因为每个新闻项目都有一个或多个类别。本研究中的标签对应于工业的GDP,其指印度尼西亚标准工业分类(KBLI)。我们使用问题转换方法结合多个单一标签分类算法。标签功率集方法与线性SVC相结合,在其他方面显示出更好的结果,对于5倍交叉验证,汉明损失为0.021的75%。但是,测试模式下的模型性能增加到84%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号