首页> 外文会议>Working Conference on Mining Software Repositories >Characterization and Prediction of Issue-Related Risks in Software Projects
【24h】

Characterization and Prediction of Issue-Related Risks in Software Projects

机译:软件项目中与问题相关的风险的表征和预测

获取原文

摘要

Identifying risks relevant to a software project and planning measures to deal with them are critical to the success of the project. Current practices in risk assessment mostly rely on high-level, generic guidance or the subjective judgements of experts. In this paper, we propose a novel approach to risk assessment using historical data associated with a software project. Specifically, our approach identifies patterns of past events that caused project delays, and uses this knowledge to identify risks in the current state of the project. A set of risk factors characterizing "risky" software tasks (in the form of issues) were extracted from five open source projects: Apache, Duraspace, JBoss, Moodle, and Spring. In addition, we performed feature selection using a sparse logistic regression model to select risk factors with good discriminative power. Based on these risk factors, we built predictive models to predict if an issue will cause a project delay. Our predictive models are able to predict both the risk impact (i.e. The extend of the delay) and the likelihood of a risk occurring. The evaluation results demonstrate the effectiveness of our predictive models, achieving on average 48% -- 81% precision, 23% -- 90% recall, 29% -- 71% F-measure, and 70% -- 92% Area Under the ROC Curve. Our predictive models also have low error rates: 0.39 -- 0.75 for Macro-averaged Mean Cost-Error and 0.7 -- 1.2 for Macro-averaged Mean Absolute Error.
机译:识别与软件项目相关的风险并制定应对措施,对于项目的成功至关重要。当前的风险评估实践主要依靠高级的通用指导或专家的主观判断。在本文中,我们提出了一种使用与软件项目相关的历史数据进行风险评估的新颖方法。具体来说,我们的方法可以识别导致项目延误的过去事件的模式,并使用此知识来识别项目当前状态下的风险。从五个开源项目(Apache,Duraspace,JBoss,Moodle和Spring)中提取了一组表征“风险”软件任务(以问题的形式)的风险因素。此外,我们使用稀疏逻辑回归模型进行特征选择,以选择具有良好判别力的风险因素。基于这些风险因素,我们建立了预测模型来预测问题是否会导致项目延误。我们的预测模型能够预测风险影响(即延迟的延长)和发生风险的可能性。评估结果证明了我们的预测模型的有效性,该模型平均达到48%-81%的精度,23%-90%的召回率,29%-71​​%的F量度和70%-92%的ROC曲线。我们的预测模型的错误率也很低:宏平均平均成本误差为0.39-0.75,宏平均平均绝对误差为0.7-1.2。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号