'I've Seen Things You People Wouldn't Believe': Hallucinating Entities in Guess What?!

Abstract

Natural language generation systems have made important progress in recent years, but they have been shown to generate tokens that are unrelated to the source input. This problem affects computational models in many NLP tasks, and it is particularly problematic in multimodal systems. In this work, we assess the rate of object hallucination in multimodal conversational agents playing the GuessWhat?! referential game. Better visual processing has been shown to mitigate this issue in image captioning; hence, we adapt the best available visual processing models to the GuessWhat?! task and propose two new models to play the Questioner agent. We show that the new models generate fewer hallucinations than other well-known models in the literature. Moreover, their hallucinations are less severe (they affect task accuracy less) and are more human-like. We also analyse where in the dialogue hallucinations tend to occur: they are less frequent in earlier turns, cause a cascade of further hallucinations, and are often preceded by negative answers, which have been shown to be harder to ground.

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics; International Joint Conference on Natural Language Processing, 2021, pp. 101-111 (11 pages)
Authors
Alberto Testoni; Raffaella Bernardi
Original format: PDF

