Journal: IEEE Transactions on Multimedia

OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection


Abstract

Scene text detection methods have achieved significant progress. However, the stack-omnidirectional text dilemma, under-segmentation of very close text words, and over-segmentation of arbitrary-shape long text lines remain major challenges. Motivated by these problems, we propose a two-stage method called the omnidirectional pyramid mask proposal text detector (OPMP). OPMP removes the anchor mechanism, which requires heuristic non-maximum suppression. Instead, it uses an effective pyramid lengthwise and sidewise residual sequence modeling method to produce arbitrary-shape proposals. To accurately extract the features of text shape, OPMP enhances the backbone layers with a multiple arbitrary-shape fitting mechanism. Finally, a multi-grain text classification module is proposed, which robustly reclassifies each text region. Comprehensive ablation studies demonstrate the effectiveness of each proposed component. In addition, experiments on various benchmarks, including ICDAR2015, MLT, MSRA-TD500, CTW1500, and Total-Text, show that our method outperforms previous state-of-the-art methods.
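For readers unfamiliar with the heuristic step the abstract says OPMP avoids, the following is a minimal sketch of conventional greedy non-maximum suppression over axis-aligned boxes. It is not part of OPMP and is not taken from the paper; the names `iou` and `greedy_nms` and the default threshold of 0.5 are assumptions chosen for illustration.

```python
from typing import List, Tuple

# Hypothetical box type for this sketch: (x1, y1, x2, y2), axis-aligned.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes: List[Box], scores: List[float],
               iou_threshold: float = 0.5) -> List[int]:
    """Return indices of boxes kept by greedy NMS, highest score first.

    A box is dropped when its IoU with an already kept box exceeds
    iou_threshold (an assumed, hand-tuned value).
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    scores = [0.9, 0.8, 0.7]
    # The second box overlaps the first heavily and is suppressed.
    print(greedy_nms(boxes, scores))
```

The threshold is the heuristic part: set it too low and two genuinely distinct but closely spaced words can be merged into one detection, set it too high and duplicates survive. That sensitivity is one plausible reading of why an anchor-free proposal stage is attractive for dense text.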
