首页> 外文期刊>Artificial life and robotics >A method of non-bug report identification from bug report repository
【24h】

A method of non-bug report identification from bug report repository

机译:来自错误报告存储库的非错误报告标识方法

获取原文
获取原文并翻译 | 示例
           

摘要

One of the most common issues addressed by bug report studies is misclassification when identifying and then filtering non-bug reports from the bug report repository. Having to filter out unrelated reports wastes time in identifying actual bug reports, and this escalates costs as extra maintenance and effort are required to triage and fix bugs. Therefore, this issue has been seriously studied and is addressed here. To tackle this problem, this study proposes a method of automatically identifying non-bug reports in the bug report repository using classification techniques. Three points are considered here. First, the bug report features used are unigram and CamelCase, where CamelCase words are used for feature expansion. Second, five term weighting schemes are compared to determine an appropriate term weighting scheme for this task. Lastly, the support vector machine (SVM) family i.e. binary-class SVM, one class SVM based on Schoelkopf methodology and support vector data description (SVDD) are used as the main mechanisms for modeling non-bug report identifiers. After testing by recall, precision, and F1, the results demonstrate the efficiency of identifying non-bug reports in the bug report repository. Our results may be acceptable after comparing to the previous well-known studies, and the performance of non-bug report identifiers with tf-igm and modified tf-icf weighting schemes for both Scoelkopf methodology and SVDD methods yielded the best value when compared to others.
机译:错误报告研究中解决的最常见问题之一是在识别中删除错误报告存储库的非错误报告时错误分类。必须过滤掉不相关的报告浪费时间在识别实际错误报告时,这会将成本升级为额外的维护和努力来进行分类并修复错误。因此,这个问题已经认真研究并在此处解决。为了解决这个问题,本研究提出了一种使用分类技术自动识别错误报告存储库中的非错误报告的方法。这里考虑了三个点。首先,使用的错误报告功能是UNIGRAM和CamelCase,其中CamelCase单词用于功能扩展。其次,比较五个术语加权方案以确定此任务的适当术语加权方案。最后,支持向量机(SVM)系列i.e.S中类SVM,基于Schoelkopf方法的一个类SVM和支持向量数据描述(SVDD)用作建模非BUG报告标识符的主要机制。通过调用,精度和F1测试后,结果展示了在错误报告存储库中识别非错误报告的效率。在比较前面的众所周知的研究之后,我们的结果可能是可以接受的,并且对于SCOELKOPF方法和SVDD方法的TF-IgM和改进的TF-ICF加权方案的非Bug报告标识符的性能产生了最佳值与他人相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号