首页> 外国专利> A Text Line Recognition Method in Chinese Scenes Based on Residual Convolutional and Recurrent Neural Networks

A Text Line Recognition Method in Chinese Scenes Based on Residual Convolutional and Recurrent Neural Networks

机译：基于残差卷积和递归神经网络的中文场景文本行识别方法

页面导航

摘要
著录项
相似文献

摘要

#$%^&*AU2020101229A420200806.pdf#####Abstract of Description The invention discloses a text line recognition method in Chinese scenes based on residual convolutional and recurrent neural networks, which comprises the following steps: collect text training images in Chinese scenes, perform normalization processing on the training image size, perform data augmentation processing on the training images, design residual convolutional neural network, residual recurrent neural network and CTC model, train horizontal text lines and vertical text lines and select the results with higher degree of confidence as the recognition results; the invention solves the problem of text line recognition in Chinese scenes by combining convolutional neural network and recurrent neural network, avoids the character segmentation of text lines and the error recognition caused by error segmentation, and accelerates the training of model by adding residual connection in convolutional neural network and recurrent neural network, so as to obtain a practical text recognition model in Chinese scenes. What's more, it has strong robustness and can recognize Chinese text lines with complex backgrounds, complex lighting and multiple fonts.Drawings of Description Start Collect text training images in Chinese scenes Perform normalization processing on the training image size Perform data augmentation processing on the training images Design deep neural network respectively to recognize horizontal and vertical text lines Obtain the recognition model by training a large number of training data Input the image to be recognized into the horizontal and vertical recognition models respectively, and select the results with higher degree of confidence as the recognition results End Figure 1 1/2

机译：＃$％^＆* AU2020101229A420200806.pdf #####描述摘要本发明公开了一种基于中文场景的文本行识别方法关于残积卷积和递归神经网络，其中包括请执行以下步骤：收集中文场景中的文本训练图像，进行归一化对训练图像尺寸进行处理，对训练图像进行数据增强处理训练图像，设计残差卷积神经网络，残差递归神经网络和CTC模型，训练水平文本行和垂直文本行以及选择具有较高置信度的结果作为识别结果;的本发明通过结合解决中文场景中的文本行识别问题卷积神经网络和递归神经网络，避免了特征文本行分割和由错误分割引起的错误识别，以及通过在卷积中添加残差连接来加速模型的训练神经网络和递归神经网络，以获得实用的文本中国场景中的识别模型。而且，它具有强大的鲁棒性并且可以识别具有复杂背景，复杂照明和多种字体。说明图开始收集文字训练图片中文场景执行归一化处理上训练图像尺寸执行数据扩充处理培训图片设计深度神经网络分别到识别水平和垂直文字行获得认可通过训练模型大量的培训数据输入图像为公认的水平和垂直识别模型分别选择结果更高置信度为识别结果结束图11/2

著录项

公开/公告号AU2020101229A4

专利类型
公开/公告日2020-08-06

原文格式PDF
申请/专利权人 SOUTH CHINA UNIVERSITY OF TECHNOLOGY;ZHUHAI INSTITUTE OF MODERN INDUSTRIAL INNOVATION SOUTH CHINA UNIVERSITY OF TECHNOLOGY;
展开▼

申请/专利号AU20200101229
发明设计人 JIN LIANWEN;LI ZHAOHAI;MAO HUIYUN;LUO CANJIE;ZHOU WEIYING;YANG FAN;
展开▼

申请日2020-07-02
分类号G06K9/46;G06N3/08;G06T7/143;
国家 AU
入库时间 2022-08-21 11:11:34

相似文献

专利
外文文献
中文文献