首页> 外文期刊>JMIR Medical Informatics >Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research
【24h】

Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research

机译:临床数据挖掘器:具有集成的数据预处理和机器学习库的电子病例报告表系统,支持临床诊断模型研究

获取原文
           

摘要

Background Using machine-learning techniques, clinical diagnostic model research extracts diagnostic models from patient data. Traditionally, patient data are often collected using electronic Case Report Form (eCRF) systems, while mathematical software is used for analyzing these data using machine-learning techniques. Due to the lack of integration between eCRF systems and mathematical software, extracting diagnostic models is a complex, error-prone process. Moreover, due to the complexity of this process, it is usually only performed once, after a predetermined number of data points have been collected, without insight into the predictive performance of the resulting models. Objective The objective of the study of Clinical Data Miner (CDM) software framework is to offer an eCRF system with integrated data preprocessing and machine-learning libraries, improving efficiency of the clinical diagnostic model research workflow, and to enable optimization of patient inclusion numbers through study performance monitoring. Methods The CDM software framework was developed using a test-driven development (TDD) approach, to ensure high software quality. Architecturally, CDM’s design is split over a number of modules, to ensure future extendability. Results The TDD approach has enabled us to deliver high software quality. CDM’s eCRF Web interface is in active use by the studies of the International Endometrial Tumor Analysis consortium, with over 4000 enrolled patients, and more studies planned. Additionally, a derived user interface has been used in six separate interrater agreement studies. CDM's integrated data preprocessing and machine-learning libraries simplify some otherwise manual and error-prone steps in the clinical diagnostic model research workflow. Furthermore, CDM's libraries provide study coordinators with a method to monitor a study's predictive performance as patient inclusions increase. Conclusions To our knowledge, CDM is the only eCRF system integrating data preprocessing and machine-learning libraries. This integration improves the efficiency of the clinical diagnostic model research workflow. Moreover, by simplifying the generation of learning curves, CDM enables study coordinators to assess more accurately when data collection can be terminated, resulting in better models or lower patient recruitment costs.
机译:背景技术临床诊断模型研究使用机器学习技术从患者数据中提取诊断模型。传统上,通常使用电子病例报告表(eCRF)系统收集患者数据,而数学软件则使用机器学习技术来分析这些数据。由于eCRF系统与数学软件之间缺乏集成,因此提取诊断模型是一个复杂且容易出错的过程。此外,由于该过程的复杂性,通常在收集到预定数量的数据点后仅执行一次,而不会深入了解所得模型的预测性能。目的研究临床数据挖掘器(CDM)软件框架的目的是提供一个具有集成的数据预处理和机器学习库的eCRF系统,提高临床诊断模型研究工作流程的效率,并通过以下方式优化患者收录人数:学习绩效监控。方法CDM软件框架是使用测试驱动开发(TDD)方法开发的,以确保高质量的软件。在结构上,CDM的设计分为多个模块,以确保将来的可扩展性。结果TDD方法使我们能够提供高质量的软件。国际子宫内膜肿瘤分析协会的研究正在积极使用CDM的eCRF Web界面,该研究已招募了4000多名患者,并计划进行更多研究。此外,派生的用户界面已用于六个单独的跨界协议研究中。 CDM的集成数据预处理和机器学习库简化了临床诊断模型研究工作流程中的其他手动和容易出错的步骤。此外,CDM的库为研究协调员提供了一种随着患者内含物增加而监视研究的预测性能的方法。结论据我们所知,CDM是唯一集成了数据预处理和机器学习库的eCRF系统。这种集成提高了临床诊断模型研究工作流程的效率。此外,通过简化学习曲线的生成,CDM使研究协调员可以更准确地评估何时可以终止数据收集,从而获得更好的模型或降低患者招募成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号