...
首页> 外文期刊>Decision support systems >An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models
【24h】

An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models

机译:基于决策表,树和规则的预测模型的可理解性的经验评估

获取原文
获取原文并翻译 | 示例
           

摘要

An important objective of data mining is the development of predictive models. Based on a number of observations, a model is constructed that allows the analysts to provide classifications or predictions for new observations. Currently, most research focuses on improving the accuracy or precision of these models and comparatively little research has been undertaken to increase their comprehensibility to the analyst or end user. This is mainly due to the subjective nature of "comprehensibility", which depends on many factors outside the model, such as the user's experience and his/her prior knowledge. Despite this influence of the observer, some representation formats are generally considered to be more easily interpretable than others. In this paper, an empirical study is presented which investigates the suitability of a number of alternative representation formats for classification when interpretability is a key requirement. The formats under consideration are decision tables, (binary) decision trees, prepositional rules, and oblique rules. An end-user experiment was designed to test the accuracy, response time, and answer confidence for a set of problem solving tasks involving the former representations. Analysis of the results reveals that decision tables perform significantly better on all three criteria, while post-test voting also reveals a clear preference of users for decision tables in terms of ease of use.
机译:数据挖掘的重要目标是开发预测模型。基于大量观察,构建了一个模型,该模型允许分析人员为新观察提供分类或预测。当前,大多数研究都集中在提高这些模型的准确性或精确性上,而为提高分析人员或最终用户的理解性而进行的研究相对较少。这主要是由于“可理解性”的主观性质,它取决于模型之外的许多因素,例如用户的经验和他/她的先验知识。尽管观察者有这种影响,但通常认为某些表示格式比其他格式更易于解释。在本文中,进行了一项实证研究,调查了当可解释性是关键要求时,多种替代表示形式对分类的适用性。正在考虑的格式是决策表,(二进制)决策树,介词规则和倾斜规则。最终用户实验旨在测试一组涉及前一种表示形式的问题解决任务的准确性,响应时间和回答置信度。对结果的分析表明,决策表在所有三个条件下的表现都明显更好,而测试后投票也表明,在易用性方面,用户明显偏爱决策表。

著录项

  • 来源
    《Decision support systems》 |2011年第1期|p.141-154|共14页
  • 作者单位

    Department of Decision Sciences and Information Management, Katholieke Universiteit Leuven, Naamsestraat 69, B-3000 Leuven, Belgium;

    Department of Decision Sciences and Information Management, Katholieke Universiteit Leuven, Naamsestraat 69, B-3000 Leuven, Belgium;

    School of Management, University of Southampton, Southampton, SO17 IB], United Kingdom;

    Department of Decision Sciences and Information Management, Katholieke Universiteit Leuven, Naamsestraat 69, B-3000 Leuven, Belgium;

    Department of Decision Sciences and Information Management, Katholieke Universiteit Leuven, Naamsestraat 69, B-3000 Leuven, Belgium,School of Management, University of Southampton, Southampton, SO17 IB], United Kingdom,Vlerick Leuven Gent Management School, Leuven, Belgium;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    data mining classification knowledge representation comprehensibility decision tables;

    机译:数据挖掘分类知识表示可理解性决策表;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号