【24h】

Towards Data Mining Without Information on Knowledge Structure

机译:寻求没有知识结构信息的数据挖掘

获取原文
获取原文并翻译 | 示例

摘要

Most knowledge discovery processes are biased since some part of the knowledge structure must be given before extraction. We propose a framework that avoids this bias by supporting all major model structures e.g. clustering, sequences, etc., as well as specifications of data and DM (Data Mining) algorithms, in the same language. A unification operation is provided to match automatically the data to the relevant DM algorithms in order to extract models and their related structure. The MDL principle is used to evaluate and rank models. This evaluation is based on the covering relation that links the data to the models. The notion of schema, related to the category theory, is the key concept of our approach. Intuitively, a schema is an algebraic specification enhanced by the union of types, and the concepts of list and relation. An example based on network alarm mining illustrates the process.
机译:大多数知识发现过程都是有偏见的,因为必须在提取之前给出知识结构的某些部分。我们提出了一个框架,该框架通过支持所有主要的模型结构来避免这种偏差。群集,序列等,以及数据和DM(数据挖掘)算法的规范,都使用相同的语言。提供了一个统一操作,可自动将数据与相关的DM算法匹配,以提取模型及其相关结构。 MDL原理用于评估模型和对模型进行排名。该评估基于将数据链接到模型的覆盖关系。与类别理论相关的图式概念是我们方法的关键概念。直观上,模式是通过类型的结合以及列表和关系的概念而增强的代数规范。一个基于网络警报挖掘的示例说明了该过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号