首页> 外文会议>International Conference on Artificial Intelligence in Medicine >A Cautionary Tale on Using Covid-19 Data for Machine Learning
【24h】

A Cautionary Tale on Using Covid-19 Data for Machine Learning

机译:使用Covid-19数据进行机器学习的警示故事

获取原文

摘要

Introduction: Good quality and real-time epidemiological COVID-19 data are paramount to fight this pandemic through statistical/machine-learning based decision-making support mechanisms. Aims: Evaluate the resources available and used to gather COVID-19 epidemiological data by Portuguese health authorities from the onset of the pandemic until December 2020. The analysis laid on two main topics: (a) work processes at the Public Health Unit (PHU) level and (b) registry forms for epidemiological reporting and control procedures. Recommendations on requirements to overcome problems related to data integration and interoperability in order to build robust decision-making support mechanisms will also be produced. Methods: For topic (a), we revised the Portuguese Directorate-General of Health (DGS) guidelines for data treatment. For topic (b), we analysed the forms used during first and second waves, while comparing them with DGS metadata provided to researchers. Results: On topic (a), we detected the use of two complementary and non-interoperable systems. Further, the workflow does not seem to promote data quality and facilitates the occurrence of communication problems between health professionals. On topic (b), we found 27 deleted questions, 6 new questions, 1 displaced question, and 1 text modification between the 2 form versions. Discussion: Both the workflow and data gathering methods are not the best suited for the generation of good quality data. They do not effectively support Public Health Professionals (PHP) nor provide the elements for posterior data analysis. The use of data by decision-making support mechanisms demands a careful planning of the data used to depict reality, and this condition is not met by the currently used forms.
机译:介绍:优质和实时流行病学Covid-19数据通过统计/机器学习的决策支持机制至关重要。目的:评估可获得的资源,并用来从大流行发作到2020年12月的葡萄牙卫生当局收集Covid-19流行病学数据。分析奠定了两个主要主题:(a)公共卫生单位(PHU)的工作流程流行病学报告和控制程序的级别和(b)注册表表格。还将制作关于克服与数据集成和互操作性相关的问题的要求的建议,以便构建强大的决策支持机制。方法:对于主题(a),我们修订了葡萄牙理事会的健康理事会(DGS)的数据处理指南。对于主题(b),我们分析了在第一和第二波期间使用的表格,同时将其与DGS元数据进行比较,提供给研究人员。结果:主题(a),我们检测了使用两个互补和不可互操作的系统。此外,工作流程似乎并不促进数据质量,并促进卫生专业人员之间的沟通问题的发生。主题(b),我们发现27个删除问题,6个新问题,1个流离失所问题,以及2个形式版本之间的文本修改。讨论:工作流程和数据收集方法都不是最适合生成良好质量数据的。他们没有有效地支持公共卫生专业人员(PHP),也不提供后部数据分析的要素。通过决策支持机制使用数据要求仔细规划用于描述现实的数据,并且目前使用的表格不满足这种情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号