Conference: 2017 IEEE/ACM 39th International Conference on Software Engineering Companion

Fast and Flexible Large-Scale Clone Detection with CloneWorks


Abstract

Clone detection in very large inter-project repositories has numerous applications in software research and development. However, existing tools do not provide the flexibility researchers need to explore this emerging domain. We introduce CloneWorks, a fast and flexible clone detector for large-scale clone detection experiments. CloneWorks gives the user full control over the representation of the source code before clone detection, including easy plug-in of custom source transformation, normalization and filtering logic. The user can then perform targeted clone detection for any type or kind of clone of interest. CloneWorks uses our fast and scalable partitioned partial indexes approach, which can handle any input size on an average workstation by partitioning the input. CloneWorks can detect Type-3 clones in an input as large as 250 million lines of code in just four hours on an average workstation, with good recall and precision as measured by our BigCloneBench.
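
The idea of indexing one input partition at a time can be illustrated with a small sketch. This is not CloneWorks's implementation: the tokenizer, the token-bag similarity measure, the threshold, and the names `tokenize`, `similarity`, and `detect_clones` are all assumptions made for illustration. The sketch only shows how bounding the index to one partition keeps the index size independent of total input size, while still comparing every block against every other.

```python
import re
from collections import Counter, defaultdict


def tokenize(source):
    """Hypothetical normalization step: lowercase, then split into
    identifiers, numbers, and single punctuation characters."""
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", source.lower())


def similarity(bag_a, bag_b):
    """Assumed measure: shared tokens (as multisets) over the larger bag size."""
    overlap = sum((bag_a & bag_b).values())
    return overlap / max(sum(bag_a.values()), sum(bag_b.values()), 1)


def detect_clones(blocks, threshold=0.7, partition_size=2):
    """Return the set of (id, id) pairs whose similarity meets the threshold.

    Only one partition of the input is indexed at any time, so the index
    never holds more than `partition_size` blocks.
    """
    bags = {bid: Counter(tokenize(src)) for bid, src in blocks.items()}
    ids = sorted(bags)
    clones = set()
    for start in range(0, len(ids), partition_size):
        partition = ids[start:start + partition_size]
        # Inverted index (token -> block ids) over this partition only.
        index = defaultdict(set)
        for bid in partition:
            for token in bags[bid]:
                index[token].add(bid)
        # Query every block against the current partition's index.
        for qid in ids:
            candidates = set()
            for token in bags[qid]:
                candidates |= index.get(token, set())
            for cid in candidates:
                if cid != qid and similarity(bags[qid], bags[cid]) >= threshold:
                    clones.add(tuple(sorted((qid, cid))))
    return clones


if __name__ == "__main__":
    blocks = {
        "a": "int sum(int a, int b) { return a + b; }",
        "b": "int sum(int a, int b) { int r = a + b; return r; }",
        "c": "void log(String m) { System.out.println(m); }",
    }
    print(detect_clones(blocks))
```

In this sketch the custom transformation and normalization logic the abstract mentions would correspond to swapping out `tokenize`, and the reported pairs do not depend on `partition_size`; only the peak size of the index does.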
