【24h】

Learning from Electronic Health Records:From Temporal Abstractions to Time Series Interpretability

机译:从电子病历中学习:从时间抽象到时间序列可解释性

获取原文

摘要

The first part of the talk will focus on data mining methods for learning from Electronic Health Records (EHRs), which are typically perceived as big and complex patient data sources. On them, scientists strive to perform predictions on patients' progress, to understand and predict response to therapy, to detect adverse drug effects, and many other learning tasks. Medical researchers are also interested in learning from cohorts of population-based studies and of experiments. Learning tasks include the identification of disease predictors that can lead to new diagnostic tests and the acquisition of insights on interventions. The talk will elaborate on data sources, methods, and case studies in medical mining. The second part of the talk will tackle the issue of interpretability and explainability of opaque machine learning models, with focus on time series classification. Time series classification has received great attention over the past decade with a wide range of methods focusing on predictive performance by exploiting various types of temporal features. Nonetheless, little emphasis has been placed on interpretability and explainability. This talk will formulate the novel problem of explainable time series tweaking, where, given a time series and an opaque classifier that provides a particular classification decision for the time series, the objective is to find the minimum number of changes to be performed to the given time series so that the classifier changes its decision to another class. Moreover, it will be shown that the problem is NP-hard. Two instantiations of the problem will be presented. The classifier under investigation will be the random shapelet forest classifier. Moreover, two algorithmic solutions for the two problem instantiations will be presented along with simple optimizations, as well as a baseline solution using the nearest neighbor classifier.
机译:演讲的第一部分将重点讨论从电子病历(EHR)中学习的数据挖掘方法,这些方法通常被认为是庞大而复杂的患者数据源。在他们身上,科学家致力于对患者的病情进行预测,了解和预测对治疗的反应,发现药物不良作用以及许多其他学习任务。医学研究人员也有兴趣从基于人群的研究和实验中学习。学习任务包括识别可能导致新的诊断测试的疾病预测因子以及对干预措施的见识。演讲将详细阐述医学采矿中的数据来源,方法和案例研究。演讲的第二部分将解决不透明机器学习模型的可解释性和可解释性问题,重点是时间序列分类。在过去的十年中,时间序列分类已受到广泛关注,其通过利用各种类型的时间特征来关注预测性能的方法广泛。尽管如此,很少强调可解释性和可解释性。该演讲将阐述可解释的时间序列调整的新问题,其中,给定时间序列和为时间序列提供特定分类决策的不透明分类器,目标是找到要对给定时间执行的更改的最小数量时间序列,以便分类器将其决策更改为另一个类。而且,将表明该问题是NP困难的。将提出该问题的两个实例。研究中的分类器将是随机小波森林分类器。此外,将为两个问题实例化提供两种算法解决方案以及简单的优化方法,以及使用最近邻分类器的基准解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号