首页> 外文学位 >Design recovery and data mining: A methodology that identifies data-cohesive subsystems based on mining association rules.
【24h】

Design recovery and data mining: A methodology that identifies data-cohesive subsystems based on mining association rules.

机译:设计恢复和数据挖掘:一种基于挖掘关联规则识别数据内聚子系统的方法。

获取原文
获取原文并翻译 | 示例

摘要

Software maintenance is both a technical and an economic concern for organizations. Large software systems are difficult to maintain due to their intrinsic complexity, and their maintenance consumes between 50% and 90% of the cost of their complete life-cycle. An essential step in maintenance is reverse engineering, which focuses on understanding the system. This system understanding is critical to avoid the generation of undesired side effects during maintenance. The objective of this research is to investigate the potential of applying data mining to reverse engineering. This research was motivated by the following: (1) data mining can process large volumes of information, (2) data mining can elicit meaningful information without previous knowledge of the domain, (3) data mining can extract novel non-trivial relationships from a data set, and (4) data mining is automatable. These data mining features are used to help address the problem of understanding large legacy systems.; This research produced a general method to apply data mining to reverse engineering, and a methodology for design recovery, called Identification of Subsystems based on Associations (ISA). ISA uses mined association rules from a database view of the subject system to guide a clustering process that produces a data-cohesive hierarchical subsystem decomposition of the system. ISA promotes object-oriented principles because each identified subsystem consists of a set of data repositories and the code (i.e., programs) that manipulates them. ISA is an automatic multi-step process, which uses the source code of the subject system and multiple parameters as its input. ISA includes two representation models (i.e., text-based and graphic-based representation models) to present the resulting subsystem decomposition.; The automated environment RE-ISA implements the ISA methodology. RE-ISA was used to produce the subsystem decomposition of real-word software systems. Results show that ISA can automatically produce data-cohesive subsystem decompositions without previous knowledge of the subject system, and that ISA always generates the same results if the same parameters are utilized.; This research provides evidence that data mining is a beneficial tool for reverse engineering and provides the foundation for defining methodologies that combine data mining and software maintenance.
机译:对于组织而言,软件维护既是技术问题,也是经济问题。大型软件系统由于其固有的复杂性而难以维护,并且其维护消耗了整个生命周期成本的50%至90%。维护的重要步骤是逆向工程,它着重于对系统的理解。对系统的这种理解对于避免在维护过程中产生不良副作用至关重要。这项研究的目的是研究将数据挖掘应用于逆向工程的潜力。这项研究的动机如下:(1)数据挖掘可以处理大量信息,(2)数据挖掘可以在没有域知识的情况下获取有意义的信息,(3)数据挖掘可以从网络中提取新颖的平凡关系。数据集,并且(4)数据挖掘是自动化的。这些数据挖掘功能用于帮助解决理解大型遗留系统的问题。这项研究产生了一种将数据挖掘应用于逆向工程的通用方法,以及一种用于设计恢复的方法,称为基于关联的子系统识别(ISA)。 ISA使用从主题系统的数据库视图中提取的关联规则来指导群集过程,该群集过程会导致系统的数据聚合层次子系统分解。 ISA提倡面向对象的原理,因为每个已识别的子系统都由一组数据存储库和对其进行操作的代码(即程序)组成。 ISA是一个自动的多步骤过程,它使用主题系统的源代码和多个参数作为输入。 ISA包括两个表示模型(即,基于文本和基于图形的表示模型),以表示所产生的子系统分解。自动化环境RE-ISA实现了ISA方法。 RE-ISA用于产生实词软件系统的子系统分解。结果表明,ISA可以自动生成数据聚合子系统分解,而无需事先了解主题系统,并且如果使用相同的参数,ISA总是会产生相同的结果。这项研究提供了证据,证明数据挖掘是进行逆向工程的有益工具,并为定义结合了数据挖掘和软件维护的方法论提供了基础。

著录项

  • 作者

    Montes de Oca, Carlos.;

  • 作者单位

    Louisiana State University and Agricultural & Mechanical College.;

  • 授予单位 Louisiana State University and Agricultural & Mechanical College.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1999
  • 页码 168 p.
  • 总页数 168
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号