A Cross-Modal Guiding and Fusion Method for Multi-Modal RSVP-based Image Retrieval

Abstract

Rapid Serial Visual Presentation (RSVP) is an important paradigm in Brain-Computer Interfaces (BCIs). It is used in spellers, image retrieval, anomaly detection, and other applications. The RSVP paradigm embeds a small number of target pictures in a picture sequence presented at high speed to induce specific event-related potential (ERP) components. However, the application of RSVP-based BCIs is limited by the accuracy of ERP detection. The goal of this study is therefore to add other related modalities to the traditional EEG-based BCI in order to make predictions more robust and improve detection performance. First, we introduce the eye movement modality into the RSVP-based BCI and collect a multi-modality RSVP dataset in which both modalities are recorded simultaneously during an image retrieval task. Second, we design a simple but efficient CNN-based network with two modality fusion modules that exploit the multi-modality data in two stages. In the feature extraction stage, we propose a Cross-modality-Guided Feature Calibration (cm-GFC) module, which lets the EEG modality features modify the eye movement modality features so that the features of the two modalities become more complementary. In the feature fusion stage, we propose a Dynamic Gated Fusion (DGF) module, which applies modality-specific gates to retain the complementary information of the two modalities and reduce the redundant information between them. To evaluate our method, we conduct extensive experiments on the dataset, which contains EEG and eye movement data from 20 subjects. The proposed method achieves a balanced classification accuracy of 87.83 ± 2.31%, outperforming a series of single-modality and multi-modality approaches.
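The abstract reports balanced accuracy rather than plain accuracy, which matters because RSVP sequences contain only a small number of targets. The sketch below uses the standard definition of balanced accuracy for two classes, the mean of the per-class recalls. It is an illustration only: the function name `balanced_accuracy` and the toy labels are hypothetical and are not taken from the paper, whose evaluation code is not shown here.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall for binary labels (1 = target, 0 = non-target)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    recalls = []
    for cls in (0, 1):
        # Predictions made for the samples whose true label is `cls`.
        preds = [p for t, p in zip(y_true, y_pred) if t == cls]
        if not preds:
            raise ValueError(f"class {cls} has no samples in y_true")
        recalls.append(sum(p == cls for p in preds) / len(preds))
    return sum(recalls) / len(recalls)


# Hypothetical toy sequence: 2 targets among 10 images.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

# A classifier that always answers "non-target" is right on 8 of 10 images
# (plain accuracy 0.8), but its balanced accuracy is only 0.5.
y_all_nontarget = [0] * 10
print(balanced_accuracy(y_true, y_all_nontarget))  # 0.5

# A classifier that finds one of the two targets and raises one false alarm.
y_model = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
print(balanced_accuracy(y_true, y_model))  # 0.6875
```

Because each class contributes equally to the score, a detector cannot look good simply by ignoring the rare target class.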

Bibliographic details

Source
International Joint Conference on Neural Networks | 2021 | pp. 1-7 | 7 pages
Authors
Jiayu Mao; Shuang Qiu; Dan Li; Wei Wei; Huiguang He
Original format: PDF
Keywords
Visualization; Fuses; Image retrieval; Neural networks; Logic gates; Feature extraction; Electroencephalography
