Style-Analyzer: Fixing Code Style Inconsistencies with Interpretable Unsupervised Algorithms

机译：STYLE-Analyzer：修复代码风格与可解释的无监督算法不一致

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Source code reviews are manual, time-consuming, and expensive. Human involvement should be focused on analyzing the most relevant aspects of the program, such as logic and maintainability, rather than amending style, syntax, or formatting defects. Some tools with linting capabilities can format code automatically and report various stylistic violations for supported programming languages. They are based on rules written by domain experts, hence, their configuration is often tedious, and it is impractical for the given set of rules to cover all possible corner cases. Some machine learning-based solutions exist, but they remain uninterpretable black boxes. This paper introduces style-analyzer, a new open source tool to automatically fix code formatting violations using the decision tree forest model which adapts to each codebase and is fully unsupervised. style-analyzer is built on top of our novel assisted code review framework, Lookout. It accurately mines the formatting style of each analyzed Git repository and expresses the found format patterns with compact human-readable rules. style-analyzer can then suggest style inconsistency fixes in the form of code review comments. We evaluate the output quality and practical relevance of style-analyzer by demonstrating that it can reproduce the original style with high precision, measured on 19 popular JavaScript projects, and by showing that it yields promising results in fixing real style mistakes. style-analyzer includes a web application to visualize how the rules are triggered. We release style-analyzer as a reusable and extendable open source software package on GitHub for the benefit of the community.

机译：源代码审查是手动，耗时和昂贵的。人类参与应专注于分析程序的最相关方面，例如逻辑和可维护性，而不是修改样式，语法或格式化缺陷。有些工具具有Linting功能可以自动格式化代码，并为支持的编程语言报告各种风格违规。它们基于由域专家编写的规则，因此，它们的配置通常是乏味的，并且给定的一组规则是不切实际的，以涵盖所有可能的角落案例。一些基于机器的基于机器的解决方案存在，但它们保持不可诠释的黑匣子。本文介绍了STYLE-Analyzer，一个新的开源工具，用于使用判定为每个码布的决策树林模型自动修复代码格式违规，并完全无监督。 STYLE-Analyzer建于我们的小说辅助代码审查框架之上，了解。它准确地挖掘每个分析的GIT存储库的格式样式，并表达了具有紧凑的人类可读规则的找到的格式模式。然后，样式分析仪可以以代码审查评论的形式建议样式不一致修复。我们通过展示它可以通过高精度再现原始风格来评估风格分析仪的输出质量和实际相关性，以19个流行的Javacript项目测量，并通过表明它产生了有希望的结果来解决真正的风格错误。 STYLE-Analyzer包括Web应用程序以可视化触发规则的方式。我们将样式分析仪释放为Github的可重用和可扩展的开源软件包，以便为社区的利益。

著录项

来源
《IEEE/ACM International Conference on Mining Software Repositories》|2019年|xxxiv 606 p. :|共11页
会议地点
作者
Vadim Markovtsev; Waren Long; Hugo Mougard; Konstantin Slavnov; Egor Bulychev;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类安全保密;
关键词
data mining; decision trees; program debugging; program diagnostics; public domain software; software maintenance; software packages; source code (software); unsupervised learning;

机译：数据挖掘;决策树;程序调试;程序诊断;公共域软件;软件维护;软件包;源代码（软件）;无监督学习;

相似文献

外文文献
中文文献
专利

1. Combining supervised and unsupervised machine learning algorithms to predict the learners’ learning styles [J] . Ouafae EL AISSAOUI, Yasser EL ALAMI EL MADANI, Lahcen OUGHDIR, Procedia Computer Science . 2019,第5期

机译：结合监督和无监督的机器学习算法预测学习者的学习方式
2. How does code style inconsistency affect pull request integration? An exploratory study on 117 GitHub projects [J] . Zou Weiqin, Xuan Jifeng, Xie Xiaoyuan, Empirical Software Engineering . 2019,第6期

机译：代码样式不一致如何影响请求请求集成？
3. Inconsistency Detection in Software Component Source Code using Ant Colony Optimization and Neural Network Algorithm [J] . Amit Verma, Srishti Gupta, Iqbaldeep Kaur Indian Journal of Science and Technology . 2016,第40期

机译：基于蚁群算法和神经网络算法的软件组件源代码不一致性检测
4. Style-Analyzer: Fixing Code Style Inconsistencies with Interpretable Unsupervised Algorithms [C] . Vadim Markovtsev, Waren Long, Hugo Mougard, IEEE/ACM International Conference on Mining Software Repositories . 2019

机译：样式分析器：使用可解释的无监督算法修复代码样式不一致
5. Constructions, Analyses and Decoding Algorithms of LDPC codes and Error Control Codes for Flash Coding. [D] . Huang, Qin. 2011

机译：用于闪存编码的LDPC码和差错控制码的构造，分析和解码算法。
6. Inconsistency Calibrating Algorithms for Large Scale Piezoresistive Electronic Skin [O] . Jinhua Ye, Zhengkang Lin, Jinyan You, 2020

机译：大规模压阻电子皮肤的不一致校准算法
7. Unsupervised Detection of Annotation Inconsistencies Using Apriori Algorithm [O] . Václav Novák, Magda Razímová 2010

机译：基于apriori算法的无监督检测注释不一致性

Style-Analyzer: Fixing Code Style Inconsistencies with Interpretable Unsupervised Algorithms

摘要

著录项

相似文献

相关主题

期刊订阅