首页> 外文会议>International Joint Conference on Neural Networks >A Cross-Modal Guiding and Fusion Method for Multi-Modal RSVP-based Image Retrieval
【24h】

A Cross-Modal Guiding and Fusion Method for Multi-Modal RSVP-based Image Retrieval

机译:一种基于多模态RSVP的跨模态引导融合图像检索方法

获取原文

摘要

Rapid Serial Visual Presentation (RSVP) is an important paradigm in Brain-Computer Interface (BCI). It can be used in speller, image retrieval, anomaly detection, etc. RSVP paradigm uses a small number of target pictures in a high speed presented picture sequence to induce specific event-related potential (ERP) components. However, the application of RSVP based BCI is challenged by the accuracy of ERP detection. Thus, the goal of this study is to introduce other related modalities to the traditional EEG-based BCI to make robust predictions and improve the detection performance. First, we introduce the eye movement modality into the RSVP-based BCI and collect a multimodality RSVP-based dataset simultaneously during the image retrieval task. Second, we design a simple but efficient CNN-based network with two modality fusion modules to fully utilize the multi-modality data in two stages. In the feature extraction stage, we propose a Cross-modality-Guided Feature Calibration (cm-GFC) module to enable the EEG modality feature to modify the eye movement modality feature, and the aim is to make eye movement modality features and EEG modality features are more complementary. In the feature fusion stage, we propose a Dynamic Gated Fusion (DGF) module, which applies modality-specific gates to retain the complementary information of the two modalities and reduce redundant information from the two modalities. To evaluate our method, we conduct extensive experiments on the dataset with EEG and eye movement data are from 20 subjects. The proposed method achieves a high balanced accuracy of 87.83 ± 2.31% of classification, which outperforms a series of single modality and multi-modality approaches.
机译:快速串行视觉呈现(RSVP)是脑机接口(BCI)中的一种重要模式。它可以用于拼写、图像检索、异常检测等。RSVP范式使用高速呈现的图片序列中的少量目标图片来诱导特定的事件相关电位(ERP)成分。然而,基于RSVP的脑机接口的应用受到ERP检测准确性的挑战。因此,本研究的目的是在传统的基于脑电信号的脑机接口中引入其他相关模式,以做出稳健的预测并提高检测性能。首先,我们将眼动模式引入基于RSVP的BCI,并在图像检索任务中同时收集基于RSVP的多模态数据集。其次,我们设计了一个简单而高效的基于CNN的网络,其中包含两个模态融合模块,以在两个阶段充分利用多模态数据。在特征提取阶段,我们提出了一个跨模态引导特征校正(cm-GFC)模块,使脑电模态特征能够修改眼动模态特征,目的是使眼动模态特征和脑电模态特征更加互补。在特征融合阶段,我们提出了一种动态门控融合(DGF)模块,该模块应用特定于模式的门来保留两种模式的互补信息,并减少两种模式的冗余信息。为了评估我们的方法,我们使用20名受试者的脑电图和眼动数据对数据集进行了广泛的实验。该方法实现了87.83±2.31%的分类精度,优于一系列单模态和多模态方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号