An Exploratory Study of Log Placement Recommendation in an Enterprise System

机译：企业系统日志放置推荐的探索性研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Logging is a development practice that plays an important role in the operations and monitoring of complex systems. Developers place log statements in the source code and use log data to understand how the system behaves in production. Unfortunately, anticipating where to log during development is challenging. Previous studies show the feasibility of leveraging machine learning to recommend log placement despite the data imbalance since logging is a fraction of the overall code base. However, it remains unknown how those techniques apply to an industry setting, and little is known about the effect of imbalanced data and sampling techniques. In this paper, we study the log placement problem in the code base of Adyen, a large-scale payment company. We analyze 34,526 Java files and 309,527 methods that sum up +2M SLOC. We systematically measure the effectiveness of five models based on code metrics, explore the effect of sampling techniques, understand which features models consider to be relevant for the prediction, and evaluate whether we can exploit 388,086 methods from 29 Apache projects to learn where to log in an industry setting. Our best performing model achieves 79% of balanced accuracy, 81% of precision, 60% of recall. While sampling techniques improve recall, they penalize precision at a prohibitive cost. Experiments with open-source data yield under-performing models over Adyen’s test set; nevertheless, they are useful due to their low rate of false positives. Our supporting scripts and tools are available to the community.

机译：日志记录是一个开发实践，在复杂系统的运营和监控中起着重要作用。开发人员在源代码中将日志语句放置在源代码中，并使用日志数据来了解系统在生产中的行为方式。不幸的是，预测在开发期间登录的地方都具有挑战性。以前的研究表明，尽管日志记录是整个代码基础的一小部分，但是尽管数据不平衡，但耗尽机器学习建议的可行性。然而，它仍然不知道这些技术如何适用于行业环境，并且关于不平衡数据和采样技术的影响很少。在本文中，我们研究了一家大型支付公司Adyen代码库的日志放置问题。我们分析34,526个Java文件和309,527个方法，总结+ 2M Sloc。我们系统地测量基于代码指标的五种模型的有效性，探讨采样技术的效果，了解模型考虑与预测相关的功能，并评估我们是否可以从29个Apache项目中利用388,086种方法来学习登录的位置一个行业环境。我们最好的表演模式达到了79％的均衡准确度，精度的81％，召回了60％。虽然采样技术改善了召回，但他们以禁止的成本惩罚精度。在Adyen的测试集上进行开源数据产量的实验;然而，由于它们的误报率低，它们是有用的。我们的支持脚本和工具可供社区使用。

著录项

来源
《IEEE/ACM International Conference on Mining Software Repositories》|2021年|143-154|共12页
会议地点
作者
Jeanderson Cândido; Jan Haesen; Maurício Aniche; Arie van Deursen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Industries; Measurement; Training; Biological system modeling; Training data; Machine learning; Production;

机译：行业;测量;培训;生物系统建模;培训数据;机器学习;生产;

相似文献

外文文献
中文文献
专利

1. Using Interpretive Qualitative Case Studies for Exploratory Research in Doctoral Studies: A Case of Information Systems Research in Small and Medium Enterprises [J] . Shana R. Ponelis International Journal of Doctoral Studies . 2015,第8期

机译：利用解释性定性案例研究进行博士论文探索性研究：以中小企业信息系统研究为例
2. How can performance measurement systems empower managers? An exploratory study in state-owned enterprises [J] . Martyna Swiatczak, Michele Morner, Nadine Finkbeiner International journal of public sector management . 2015,第4a5期

机译：绩效评估系统如何赋予管理者权力？国有企业探索性研究
3. An exploratory study of the relevance of trans-national global information systems to small and medium enterprises: evidence from Egypt [J] . Khaled Samaha, Adam Baki International journal of management and decision making . 2009,第1a2期

机译：跨国全球信息系统与中小企业相关性的探索性研究：来自埃及的证据
4. Inhibiting factors for adopting enterprise systems in networks of small and medium-sized enterprises - an exploratory case study [C] . Markus Schafermeyer, Christoph Rosenkranz Americas conference on information systems;AMCIS 2008 . 2008

机译：中小企业网络中采用企业系统的阻碍因素-探索性案例研究
5. An exploratory study investigating the organizational and technical impacts of applying disciplined system development processes (CMMI(TM)) in small to medium sized enterprises. [D] . Miluk, Gene. 2006

机译：一项探索性研究，研究在中小型企业中应用规范的系统开发流程（CMMI（TM））的组织和技术影响。
6. A retrospective study to validate an intraoperative robotic classification system for assessing the accuracy of kirschner wire (K-wire) placements with postoperative computed tomography classification system for assessing the accuracy of pedicle screw placements [O] . Tai-Hsin Tsai, Dong-Syuan Wu, Yu-Feng Su, -1

机译：一项回顾性研究旨在验证术中机器人分类系统以评估柯克纳克丝（K-wire）放置的准确性并采用术后计算机体层摄影术分类系统评估椎弓根螺钉放置的准确性
7. An Exploratory Study of Log Placement Recommendation in an Enterprise System [O] . Jeanderson Candido, Jan Haesen, Mauricio Aniche, 2021

机译：企业系统中对数放置建议的探索性研究

An Exploratory Study of Log Placement Recommendation in an Enterprise System

摘要

著录项

相似文献

相关主题

期刊订阅