首页> 外文会议>Annual conference on Neural Information Processing Systems >Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition
【24h】

Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

机译:端到端手写段落识别的联合行分割和转录

获取原文

摘要

Offline handwriting recognition systems require cropped text line images for both training and recognition. On the one hand, the annotation of position and transcript at line level is costly to obtain. On the other hand, automatic line segmentation algorithms are prone to errors, compromising the subsequent recognition. In this paper, we propose a modification of the popular and efficient Multi-Dimensional Long Short-Term Memory Recurrent Neural Networks (MDLSTM-RNNs) to enable end-to-end processing of handwritten paragraphs. More particularly, we replace the collapse layer transforming the two-dimensional representation into a sequence of predictions by a recurrent version which can select one line at a time. In the proposed model, a neural network performs a kind of implicit line segmentation by computing attention weights on the image representation. The experiments on paragraphs of Rimes and LAM databases yield results that are competitive with those of networks trained at line level, and constitute a significant step towards end-to-end transcription of full documents.
机译:离线手写识别系统需要裁剪的文本行图像以进行训练和识别。一方面,在行级别获得位置和成绩单的注释是昂贵的。另一方面,自动线段分割算法容易出错,从而损害了后续的识别能力。在本文中,我们提出了对流行且有效的多维长短期记忆循环神经网络(MDLSTM-RNN)的修改,以实现手写段落的端到端处理。更具体地说,我们用可以一次选择一行的循环版本替换将二维表示转换为预测序列的崩溃层。在提出的模型中,神经网络通过计算图像表示的注意力权重来执行一种隐式线分割。在Rimes和LAM数据库的段落上进行的实验所产生的结果与在行级训练的网络的结果具有竞争力,并且是朝着完整文档的端到端转录迈出的重要一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号