首页> 外文会议>World Conference on Information Systems and Technologies >An Approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages
【24h】

An Approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages

机译:键入编程语言源代码(KNESK)的知识提取方法

获取原文

摘要

Knowledge extraction is the discovery of knowledge from structured and/or unstructured sources. This knowledge can be used to build or enrich a domain ontology. Source code is rarely used. But implementation platforms evolve faster than business logic and these evolutions are usually integrated directly into source code without updating the conceptual model. In this paper, we present a generic approach for knowledge extraction from source code of typed programming languages using Hidden Markov Models. This approach consist of the definition of the HMM so that it can be used to extract any type of knowledge from the source code. The method is experimented on EPICAM and GeoServer developed in Java and on MapServer developed in C/C++. Structural evaluation shows that source code contains a structure that permit to build a domain ontology and functional evaluation shows that source code contains more knowledge than those contained in both databases and meta-models.
机译:知识提取是从结构化和/或非结构化来源发现知识。此知识可用于构建或丰富域本体。源代码很少使用。但实现平台的发展速度快于业务逻辑,并且这些演变通常直接集成到源代码中,而无需更新概念模型。在本文中,我们使用隐马尔可夫模型提出了一种从类型化编程语言的源代码中提取的通用方法。这种方法包括HMM的定义,以便它可以用于从源代码中提取任何类型的知识。该方法在Java和C / C ++中开发的Java和MapServer上开发的Epicam和GeoServer上进行了实验。结构评估表明,源代码包含允许构建域本体和功能评估的结构,表明源代码包含比包含在数据库和元模型中包含的结构的知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号