Segmentation-Free Multi-Font Printed Manchu Word Recognition Using Deep Convolutional Features and Data Augmentation

机译：使用深度卷积特征和数据增强的无分割多字体印刷满族文字识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Precise Manchu character segmentation of segmentation-based Manchu recognition methods is difficult to realize because of complex Manchu language spelling rules and existence multi Manchu fonts. To avoid the influence of incorrect segmentation, this work proposes the idea of segmentation-free recognition to recognize Manchu word instead of Manchu characters. In addition, an end-to-end 9-layer convolutional neural network is proposed to automatically extract deep hierarchy features on Manchu word image. The proposed recognition model is applied on Manchu words with 12 Manchu fonts to evaluate its ability of multi-font recognition. Deep neural network needs massive data for training, whereas Manchu language is an endangered language lacking in document data. To solve this contradiction, this work firstly builds a Manchu dataset prototype and a multi-font Manchu word testing set, and then designs a data augmentation system to generate synthetic data for training. The data augmentation system contains 7 generation methods, including character structure distortion and image quality transformation. Experiments demonstrate the proposed convolutional neural network for Manchu word recognition achieves a new state-of-the-art accuracy on multi-font printed Manchu word. For printed Manchu fonts, the highest recognition accuracy reaches 0.95; the lowest accuracy is 0.88; the average accuracy of printed Manchu fonts reaches 0.91. Experiments also demonstrate the proposed data augmentation system is an effective way to solve insufficient data problem.

机译：由于复杂的满族语言拼写规则和存在多种满族字体，难以实现基于分割的满族识别方法的精确满族字符分割。为了避免不正确的分割的影响，这项工作提出了一种无分割识别的思想，可以识别满族单词而不是满族字符。此外，提出了一种端到端的9层卷积神经网络，以自动提取满族单词图像上的深层次特征。将所提出的识别模型应用于具有12种满族字体的满族单词，以评估其多字体识别能力。深度神经网络需要大量数据来进行训练，而满族语言是文档数据中缺少的一种濒临灭绝的语言。为解决这一矛盾，本文首先建立了满族数据集原型和多字体满族单词测试集，然后设计了数据增强系统以生成用于训练的综合数据。数据增强系统包含7种生成方法，包括字符结构失真和图像质量转换。实验表明，所提出的用于满族单词识别的卷积神经网络在多字体打印满族单词上实现了新的最新精度。对于印刷的满族字体，最高的识别精度达到0.95;最低精度为0.88;印刷的满族字体的平均精度达到0.91。实验还表明，提出的数据增强系统是解决数据不足问题的有效方法。

著录项

来源
《International Congress on Image and Signal Processing, BioMedical Engineering and Informatics》|2018年|1-6|共6页
会议地点
作者
Ruirui Zheng; Min Li; Jianjun He; Jiajing Bi; Baochun Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Image segmentation; Feature extraction; Prototypes; Character recognition; Convolutional neural networks; Optical character recognition software; Distortion;

机译：图像分割;特征提取;原型;字符识别;卷积神经网络;光学字符识别软件;失真;

相似文献

外文文献
中文文献
专利

1. Automatic recognition of tunnel lining elements from GPR images using deep convolutional networks with data augmentation [J] . Qin Hui, Zhang Donghao, Tang Yu, Automation in construction . 2021,第Octa期

机译：使用具有数据增强的深卷积网络自动识别来自GPR图像的GPR图像
2. Data augmentation and directional feature maps extraction for in-air handwritten Chinese character recognition based on convolutional neural network [J] . Qu Xiwen, Wang Weiqiang, Lu Ke, Pattern recognition letters . 2018,第AUGa1期

机译：基于卷积神经网络的空中手写汉字识别数据扩充与方向特征图提取
3. Land Cover Classification based on Deep Convolutional Neural Network with Feature-based Data Augmentation [J] . Wang Bo, Huang Chengeng, Guo Yuhua, Journal of Imaging Science and Technology . 2021,第1期

机译：基于深度卷积神经网络的土地覆盖分类，具有基于特征的数据增强
4. Segmentation-Free Multi-Font Printed Manchu Word Recognition Using Deep Convolutional Features and Data Augmentation [C] . Ruirui Zheng, Min Li, Jianjun He, International Congress on Image and Signal Processing, BioMedical Engineering and Informatics . 2018

机译：使用深度卷积特征和数据增强的分割 - 无字体打印满族字识别
5. Data-Driven Material Recognition and Photorealistic Image Editing Using Deep Convolutional Neural Networks [D] . Upchurch, Paul Robert. 2018

机译：深度卷积神经网络的数据驱动材料识别和逼真的图像编辑
6. On Urinary Bladder Cancer Diagnosis: Utilization of Deep Convolutional Generative Adversarial Networks for Data Augmentation [O] . Ivan Lorencin, Sandi Baressi Šegota, Nikola Anđelić, 2021

机译：关于膀胱癌诊断：利用深卷积生成对抗网络进行数据增强
7. Comparison of convolutional neural network and bag of features for multi-font digit recognition [O] . Nasibah Husna Mohd Kadir, Sharifah Nur Syafiqah Mohd Nur Hidayah, Norasiah Mohammad, 2019

机译：多字体数字识别卷积神经网络与特征的比较

Segmentation-Free Multi-Font Printed Manchu Word Recognition Using Deep Convolutional Features and Data Augmentation

摘要

著录项

相似文献

相关主题

期刊订阅