首页> 外文会议>International Conference on Information Reuse and Integration for Data Science >Combining Data Mining Techniques for Evolutionary Analysis of Programming Languages
【24h】

Combining Data Mining Techniques for Evolutionary Analysis of Programming Languages

机译:组合数据挖掘技术进行编程语言的进化分析

获取原文

摘要

Programming languages have been evolving gradually in response to changes in the programming industry. Many factors have been driving this evolution: for instance, improving language expressiveness, fixing bugs, and introducing new language features. However, modifying programming languages is a challenging process. One of the main difficulties is to gauge the perception of developers regarding the language over time. Thus, we set out to develop a framework aimed at evaluating the evolution of programming languages based on their technical documentation and the community's feedback from online discussions. Essentially, our framework is comprised of three main components: (1) Topic Modeling, which aims to extract the main semantic topics from the language aspects; (2) Sentiment Analysis, whose objective is to evaluate the perception of developers with respect to each identified topic; and (3) Data Visualization, which presents a visual metaphor that summarizes the information obtained in previous steps. To evaluate our proof-of-concept implementation of the framework, we carried out an evolutionary analysis of the Python programming language. According to our results, our framework was able to identify several changes made to the language as well as the programmers' perceptions regarding those changes: for instance, we found that the use of iterators over traditional repetition structures (i.e., count-based repetition) was initially received negatively by the community, but the outlook of developers on this new feature has matured enough for it to be considered beneficial to the programming language.
机译:编程语言一直在不断变化,以响应编程行业的变化。许多因素一直在推动这种演变:例如,提高语言表达性,修复错误,并引入新的语言功能。但是,修改编程语言是一个具有挑战性的过程。其中一个主要困难是衡量随着时间的推移对语言的对开发人员的看法。因此,我们旨在制定一个旨在根据他们的技术文件和社区从在线讨论的反馈评估编程语言演变的框架。基本上,我们的框架由三个主要组成部分组成:(1)主题建模,旨在从语言方面提取主要语义主题; (2)情绪分析,其目标是评估开发商关于每个已识别的主题的看法; (3)数据可视化,这提出了一种概括了先前步骤中获得的信息的视觉隐喻。为了评估我们框架的概念证明,我们对Python编程语言进行了进化分析。根据我们的结果,我们的框架能够识别对语言的几个变化以及程序员对这些变化的看法:例如,我们发现使用迭代器在传统的重复结构上(即基于计数的重复)最初受到社区负面影响,但开发人员对这一新功能的前景已经成熟,因为它被认为是有利于编程语言的。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号