IEEE MultiMedia

A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking



Abstract

With the recent resurgence of neural networks and the proliferation of massive amounts of unlabeled multimodal data, recommendation systems and multimodal retrieval systems based on continuous representation spaces and deep learning methods are attracting great interest. Multimodal representations are typically obtained with autoencoders that reconstruct multimodal data. In this article, we describe an alternative method for performing high-level multimodal fusion that leverages crossmodal translation by means of symmetrical encoders cast into a bidirectional deep neural network (BiDNN). Using the lessons learned from multimodal retrieval, we present a BiDNN-based system that performs video hyperlinking and recommends interesting video segments to a viewer. Results established on TRECVID's 2016 video hyperlinking benchmarking initiative show that our method obtained the best score, thus defining the state of the art.
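The core idea in the abstract — crossmodal translation with symmetrical encoders whose central weights are tied, and fusion by concatenating the resulting hidden representations — can be illustrated with a toy, untrained numpy sketch. All dimensions, weight names, and the two-modality setup below are illustrative assumptions, not the authors' actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions: two modalities and a shared hidden space.
D_TEXT, D_VISUAL, D_HIDDEN = 8, 6, 4

# Each modality gets its own input projection; the central projection is
# shared (tied) between the two translation directions, which is what
# makes the network bidirectional/symmetric in the BiDNN sense.
W_text_in = rng.normal(size=(D_TEXT, D_HIDDEN)) * 0.1
W_vis_in = rng.normal(size=(D_VISUAL, D_HIDDEN)) * 0.1
W_shared = rng.normal(size=(D_HIDDEN, D_HIDDEN)) * 0.1  # tied central layer

def embed_text(x_text):
    # text branch: input projection, then the shared central layer
    return np.tanh(np.tanh(x_text @ W_text_in) @ W_shared)

def embed_visual(x_vis):
    # visual branch reuses the transpose of the same central weights
    return np.tanh(np.tanh(x_vis @ W_vis_in) @ W_shared.T)

def fuse(x_text, x_vis):
    # multimodal embedding: concatenation of both central representations
    return np.concatenate([embed_text(x_text), embed_visual(x_vis)])

emb = fuse(rng.normal(size=D_TEXT), rng.normal(size=D_VISUAL))
print(emb.shape)  # (8,)
```

In the actual system the weights would be trained with a crossmodal reconstruction objective (translate each modality into the other through the shared layer); the sketch only shows the symmetric, weight-tied forward pass and the concatenation-based fusion.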

