【24h】

'I've Seen Things You People Wouldn't Believe': Hallucinating Entities in Guess What?!

机译:“我已经看到了你的东西不会相信':猜测的幻觉实体是什么?!

获取原文

摘要

Natural language generation systems have witnessed important progress in the last years, but they are shown to generate tokens that are unrelated to the source input. This problem affects computational models in many NLP tasks, and it is particularly unpleasant in multi-modal systems. In this work, we assess the rate of object hallucination in multimodal conversational agents playing the GuessWhat?! referential game. Better visual processing has been shown to mitigate this issue in image cap-tioning; hence, we adapt to the GuessWhat?! task the best visual processing models at disposal, and propose two new models to play the Questioner agent. We show that (he new models generate few hallucinations compared to other renowned models available in the literature. Moreover, their hallucinations are less severe (affect task-accuracy less) and are more human-like. We also analyse where hallucinations tend to occur more often through the dialogue: hallucinations are less frequent in earlier turns, cause a cascade hallucination effect, and are often preceded by negative answers, which have been shown to be harder to ground.
机译:自然语言生成系统在过去几年中见证了重要进展,但它们被证明生成与源输入无关的令牌。此问题会影响许多NLP任务中的计算模型,并且在多模态系统中特别令人不愉快。在这项工作中,我们评估了在猜测的多式联合会话代理中的对象幻觉速度?!参照比赛。已经显示出更好的视觉处理来减轻图像缩写中的这个问题;因此,我们适应猜测?!任务以处理最佳的视觉处理模型,并提出两个新型号来播放提问者。我们表明,与文献中可用的其他着名模型相比,他的新模型产生了一些幻觉。此外,他们的幻觉不太严重(影响任务 - 准确性较少)并且更为人性化。我们还分析了幻觉往往发生的幻觉通常通过对话:幻觉在早期的转弯时不太频繁,导致级联幻觉效果,并且通常在负面答案之前,这已被证明是更难的地面。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号