首页> 外文期刊>Neuroscience and Biomedical Engineering >Incremental Rules Induction based on Rule Layers and its Application to Clinical Datasets
【24h】

Incremental Rules Induction based on Rule Layers and its Application to Clinical Datasets

机译:基于规则层的增量规则诱导及其在临床数据集中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Background: Various kinds of rule induction methods have been proposed, such as induction from decision trees, decision lists, and the AQ family. Several symbolic inductive learning methods have been proposed, such as the induction of decision trees [1, 2, 3], and the AQ family[4, 5, 6]. These methods and many variants initially introduced in the 1980s and 1990s are useful for finding frequent patterns from databases. However, conventional rule mining methods apply to a given dataset when the data has been fixed in the first run, but these methods must run fromscratch every time new data appears. Since the computational complexity is n2, a repeated run would limit the applicability of these methods in the era of “Big Data”. To solve this problem, incremental learning methods have been introduced. However, most of the methods have severalproblems: First, they do not perform worse than conventional rule learning methods. Secondly, those methods do not generate probabilistic rules. Third, computational complexity is heavier than conventional complexity. Methods: By using a framework of the rough set rule induction model,the authors first investigate the theoretical aspects of updates of statistical indices with additional examples used for rule selection criteria. The authors have found four possibilities for the update of indices, which in turn lead to two new rule selection criteria. If the statisticalindices of a rule satisfy the first selection condition, the rule can be used even if an additional example does not support the classification of the rules. If the statistical indices of a rule satisfy the second pair of inequalities, the rule may be removed from the list of classificationrules in the above case, or the rule may be included in the list if an additional example supports the classification. These rules belong to subrule layers. Based on rough set theory, we develop a new rule induction method, called PRIMEROSE-INC5 (Probabilistic Rule Induction Method based onRough Sets for Incremental Learning Methods), which induces probabilistic rules incrementally. Results: The system was evaluated based on the following two medical datasets, which were previously used for evaluation on conventional rule induction methods. One dataset was on the differentialdiagnosis of headaches, which consists of 1477 examples with 10 disease classes and 20 attributes. The other dataset was on meningitis, which consists of 198 examples with 3 classes and 25 attributes. The system was compared with other conventional rule induction methods by using repeated10-fold crossvalidation (repeated times: 100), whose experimental results showed that the proposed system outperformed the previously introduced methods.
机译:背景:提出了各种规则感应方法,例如决策树,决定列表和AQ系列的诱导。已经提出了几种符号感应学习方法,例如决策树[1,2,3]和AQ系列[4,5,6]的诱导。这些方法和最初在20世纪80年代和1990年引入的许多变体对于从数据库中找到频繁的模式是有用的。但是,当数据在第一次运行时已修复数据时,传统的规则挖掘方法适用于给定的数据集,但每次出现新数据时,这些方法必须运行runscratch。由于计算复杂性是N2,重复的运行将限制这些方法在&#8220的时代的适用性;大数据”为了解决这个问题,已经介绍了增量学习方法。但是,大多数方法都有几个问题:首先,它们不会比传统规则学习方法更糟糕。其次,这些方法不会产生概率规则。第三,计算复杂性比传统复杂性更重。方法:通过使用粗糙集规则感应模型的框架,作者首先要调查统计指标更新的理论方面,其中包含用于规则选择标准的附加例子。作者已经发现了四种可能性的指数的可能性,这反过来导致两个新的规则选择标准。如果规则的统计表满足第一个选择条件,即使其他示例不支持规则的分类,也可以使用规则。如果规则的统计指标满足第二对不等式,则可以从上述情况中的分类列表中删除规则,或者如果另一个例子支持分类,则可以在列表中包括规则。这些规则属于子脉层。基于粗糙集理论,我们开发了一种名为Primerose-Inc5的新规则诱导方法(基于概率的基于增量学习方法的概率诱导方法),从而逐渐引起概率规则。结果:基于以下两个医学数据集进行评估,以前用于评估常规规则诱导方法。一个数据集是头痛的差异诊断,其中包含1477个疾病课程和20个属性的例子。其他数据集是脑膜炎,由198个例子组成,其中3个类和25个属性。将系统与其他常规规则诱导方法进行比较,使用重复的10倍交叉透过(重复时间:100),其实验结果表明,所提出的系统优于先前引入的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号