OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection

Sheng Zhang; Yuliang Liu; Lianwen Jin; Zhongrong Wei; Chunhua Shen

Abstract

Scene text detection methods have achieved significant progress. However, the stack-omnidirectional text dilemma, under-segmentation of very close text words, and over-segmentation of arbitrary-shape long text lines are still the main challenges. Motivated by these problems, we propose a two-stage method called the omnidirectional pyramid mask proposal text detector (OPMP). OPMP removes the anchor mechanism, which requires heuristic non-maximum suppression. Instead, it uses an effective pyramid lengthwise and sidewise residual sequence modeling method to produce arbitrary-shape proposals. To accurately extract the features of text shape, OPMP enhances the backbone layers with a multiple arbitrary-shape fitting mechanism. Finally, a multi-grain text classification module is proposed, which robustly reclassifies each text region. Comprehensive ablation studies demonstrate the effectiveness of each proposed component. In addition, experiments on various benchmarks, including ICDAR2015, MLT, MSRA-TD500, CTW1500, and Total-Text, show that our method outperforms previous state-of-the-art methods.
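
For readers unfamiliar with the post-processing step that the abstract says OPMP removes, the following is a minimal sketch of greedy non-maximum suppression as it is commonly used in anchor-based detectors. It is not taken from the paper and does not describe OPMP itself. The names `iou` and `greedy_nms`, the axis-aligned `(x1, y1, x2, y2)` box format, and the default threshold of 0.5 are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# Hypothetical box format for this sketch: axis-aligned (x1, y1, x2, y2).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes: Sequence[Box],
               scores: Sequence[float],
               iou_threshold: float = 0.5) -> List[int]:
    """Return indices of the boxes kept, in descending score order.

    A box is kept only if its IoU with every already-kept box
    does not exceed iou_threshold (the hand-tuned heuristic).
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


# Two heavily overlapping boxes and one separate box.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept_default = greedy_nms(boxes, scores)        # threshold 0.5
kept_loose = greedy_nms(boxes, scores, 0.7)     # threshold 0.7
```

In this toy input the first two boxes overlap with an IoU of about 0.68, so the second is suppressed at a threshold of 0.5 but survives at 0.7. That sensitivity to a hand-picked threshold is one reason the step is described as heuristic, and it is plausibly problematic when genuinely distinct text instances lie very close together.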


Bibliographic record

Source
Multimedia, IEEE Transactions on | 2021, Issue 1 | pp. 454-467 | 14 pages
Authors
Sheng Zhang; Yuliang Liu; Lianwen Jin; Zhongrong Wei; Chunhua Shen
Author affiliations

School of Electronic and Information Engineering, South China University of Technology, Guangzhou, China;

The University of Adelaide, Adelaide, SA, Australia;

School of Electronic and Information Engineering, South China University of Technology, Guangzhou, China;

School of Electronic and Information Engineering, South China University of Technology, Guangzhou, China;

School of Computer Science, The University of Adelaide, Adelaide, SA, Australia;

Original format: PDF
Language: English
Keywords
Proposals; Feature extraction; Image segmentation; Detectors; Shape; Robustness; Benchmark testing;


