International Joint Conference on Web Intelligence and Intelligent Agent Technology

Supervised Scene Boundary Detection with Relational and Sequential Information



Abstract

This paper proposes a novel scene boundary detector that accommodates the differing definitions of a scene that arise from different target services or tasks. In the proposed method, the information in shots is divided into two groups: relational and sequential. Relational information is extracted by multi-layered convolutional neural networks that merge and embed similarity vectors computed from visual and audio features. Sequential information, which captures particular patterns across consecutive shots, is handled by dual recurrent neural networks. The different definitions of scenes are reflected in the proposed method through supervised parameter estimation combined with a sampling method. Because scene boundaries are rarely observed in video content, the class distribution is heavily skewed. The sampling method expands the set of boundary instances by generating reverse-order shot sequences, while it reduces the number of non-boundary shots through variance-preserving shot filtering. A focal loss is then adopted for the training process to obtain better parameters from the imbalanced dataset. The proposed method is evaluated on three datasets constructed from real-world movies. The experiments empirically show that different definitions of a scene boundary affect the performance of scene boundary detection, and that the proposed deep neural networks, which exploit both relational and sequential information, can handle diverse scene definitions. Through supervised learning, the proposed method adapts to the definition bias of each dataset. As a result, it demonstrates its effectiveness in handling different types of information and accommodating alternative scene definitions, achieving state-of-the-art performance on two benchmark datasets.
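The imbalance-handling steps in the abstract (oversampling boundary instances and training with a focal loss) can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the function names `focal_loss` and `reverse_order_augment` are hypothetical, and the standard binary focal loss of Lin et al. (2017) is assumed, since the abstract does not give the exact formulation.

```python
import numpy as np

def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Binary focal loss: -alpha_t * (1 - p_t)^gamma * log(p_t).
    Down-weights easy, well-classified shots so that the rare
    boundary shots dominate the gradient.
    p: predicted probability of the boundary class, y: label in {0, 1}."""
    p = np.clip(p, 1e-7, 1 - 1e-7)           # avoid log(0)
    p_t = np.where(y == 1, p, 1 - p)          # probability of the true class
    alpha_t = np.where(y == 1, alpha, 1 - alpha)
    return -alpha_t * (1 - p_t) ** gamma * np.log(p_t)

def reverse_order_augment(shot_feats):
    """Hypothetical sketch of the reverse-order oversampling idea:
    a boundary instance played back-to-front is still a boundary,
    so reversing the shot order yields an extra positive example."""
    return shot_feats[::-1]
```

With `gamma = 0` and `alpha = 1` the focal loss reduces to ordinary cross-entropy; increasing `gamma` shrinks the loss on confident, easy non-boundary shots far more than on misclassified boundary shots.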
