Conference: IEEE/ACM International Conference on Mining Software Repositories

Continuous Defect Prediction: The Idea and a Related Dataset

Abstract

We present the idea behind our Continuous Defect Prediction (CDP) research and a related dataset that we created and share. The dataset currently contains more than 11 million data rows, each representing a file involved in a Continuous Integration (CI) build, and combines the results of CI builds with data we mine from software repositories. It covers 1,265 software projects, 30,022 distinct commit authors, and several software process metrics that appeared useful in earlier research on software defect prediction. In this particular dataset we use TravisTorrent as the source of CI data. TravisTorrent synthesizes commit-level information from the Travis CI server and the repositories of open-source GitHub projects. We extend this data to the file-change level and calculate software process metrics that may be used, for example, as features to predict risky software changes that could break the build if committed to a repository with CI enabled.
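
To make the file-change-level idea more concrete, the following is a minimal sketch of how process metrics might be derived from rows that pair a changed file with the outcome of its CI build. The `FileChange` layout, the function `add_process_metrics`, and the three metric names are assumptions made for illustration only; they are not the dataset's actual schema or the authors' metric definitions.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class FileChange:
    """One file touched by one CI build (hypothetical layout, not the dataset's schema)."""
    project: str
    build_number: int
    author: str
    path: str
    build_failed: bool


def add_process_metrics(rows):
    """Return one feature dict per row, using only history that precedes the row."""
    prior_changes = defaultdict(int)    # (project, path) -> number of earlier changes
    prior_authors = defaultdict(set)    # (project, path) -> authors of earlier changes
    prior_failures = defaultdict(int)   # (project, path) -> earlier failed builds

    features = []
    for row in sorted(rows, key=lambda r: (r.project, r.build_number)):
        key = (row.project, row.path)
        features.append({
            "path": row.path,
            "build_number": row.build_number,
            "prior_changes": prior_changes[key],
            "prior_distinct_authors": len(prior_authors[key]),
            "prior_failed_builds": prior_failures[key],
            "build_failed": row.build_failed,  # label a classifier could be trained on
        })
        # Update the history only after emitting the features,
        # so that no row can see its own build outcome.
        prior_changes[key] += 1
        prior_authors[key].add(row.author)
        prior_failures[key] += int(row.build_failed)
    return features


# Illustrative rows; the project, authors, and paths are made up.
rows = [
    FileChange("demo/app", 1, "alice", "src/core.py", False),
    FileChange("demo/app", 2, "bob", "src/core.py", True),
    FileChange("demo/app", 3, "alice", "src/core.py", False),
    FileChange("demo/app", 3, "alice", "README.md", False),
]
features = add_process_metrics(rows)
for f in features:
    print(f)
```

Each resulting dict could serve as one training example, with `build_failed` as the label. Computing the metrics strictly from earlier builds is a design choice in this sketch: it keeps the features usable at commit time, before the build result is known.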
