IEEE/CVF Conference on Computer Vision and Pattern Recognition

Multimodal Explanations: Justifying Decisions and Pointing to the Evidence


Abstract

Deep models that are both effective and explainable are desirable in many settings; prior explainable models have been unimodal, offering either image-based visualization of attention weights or text-based generation of post-hoc justifications. We propose a multimodal approach to explanation, and argue that the two modalities provide complementary explanatory strengths. We collect two new datasets to define and evaluate this task, and propose a novel model which can provide joint textual rationale generation and attention visualization. Our datasets define visual and textual justifications of a classification decision for activity recognition tasks (ACT-X) and for visual question answering tasks (VQA-X). We quantitatively show that training with the textual explanations not only yields better textual justification models, but also better localizes the evidence that supports the decision. We also qualitatively show cases where visual explanation is more insightful than textual explanation, and vice versa, supporting our thesis that multimodal explanation models offer significant benefits over unimodal approaches.
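The "pointing to the evidence" half of the approach rests on soft attention over image regions, where the normalized attention weights double as the visual explanation. This is a minimal sketch of that idea, not the authors' implementation; the dot-product scoring, the region count, and all names here are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D score vector."""
    e = np.exp(x - x.max())
    return e / e.sum()

def attend(regions, query):
    """Soft attention over K image-region features.

    regions: (K, D) array of region features.
    query:   (D,) embedding of the question/activity context.
    Returns (weights, pooled): weights over regions, which can be
    rendered on the image as the visual explanation, and the
    attention-pooled feature used for the classification decision.
    """
    scores = regions @ query      # (K,) relevance of each region
    weights = softmax(scores)     # normalized attention map
    pooled = weights @ regions    # (D,) evidence-weighted summary
    return weights, pooled

rng = np.random.default_rng(0)
regions = rng.normal(size=(6, 8))  # 6 hypothetical image regions
query = rng.normal(size=8)
weights, pooled = attend(regions, query)
```

In a full model of the kind the abstract describes, the pooled feature would feed both the classifier and a text decoder that generates the rationale, so the textual and visual explanations are trained jointly.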
