首页> 外文期刊>International journal of information system modeling and design >Predicting OSS Development Success: A Data Mining Approach
【24h】

Predicting OSS Development Success: A Data Mining Approach

机译:预测OSS开发成功:一种数据挖掘方法

获取原文
获取原文并翻译 | 示例
           

摘要

Open Source Software (OSS) has reached new levels of sophistication and acceptance by users and commercial software vendors. This research creates tests and validates a model for predicting successful development of OSS projects. Widely available archival data was used for OSS projects from Sourceforge.net. The data is analyzed with multiple Data Mining techniques. Initially three competing models are created using Logistic Regression, Decision Trees and Neural Networks. These models are compared for precision and are refined in several phases. Text Mining is used to create new variables that improve the predictive power of the models. The final model is chosen based on best fit to separate training and validation data sets and the ability to explain the relationship among variables. Model robustness is determined by testing it on a new dataset extracted from the SF repository. The results indicate that end-user involvement, project age, functionality, usage, project management techniques, project type and team communication methods have a significant impact on the development of OSS projects.
机译:开源软件(OSS)已达到用户和商业软件供应商成熟度和接受度的新水平。这项研究创建了测试并验证了预测OSS项目成功开发的模型。 Sourceforge.net的OSS项目使用了广泛可用的档案数据。使用多种数据挖掘技术分析数据。最初,使用Logistic回归,决策树和神经网络创建了三个竞争模型。比较这些模型的精度,并在多个阶段进行完善。文本挖掘用于创建新变量,以提高模型的预测能力。最终模型的选择基于最佳拟合以分离训练和验证数据集,并具有解释变量之间关系的能力。通过在从SF信息库中提取的新数据集上对其进行测试,可以确定模型的健壮性。结果表明,最终用户的参与程度,项目年龄,功能,使用情况,项目管理技术,项目类型和团队沟通方法对OSS项目的开发具有重大影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号