Keyword spotting for self-training of BLSTM NN based handwriting recognition systems

Volkmar Frinken; Andreas Fischer; Markus Baumgartner; Horst Bunke

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Keyword spotting for self-training of BLSTM NN based handwriting recognition systems

【24h】

Keyword spotting for self-training of BLSTM NN based handwriting recognition systems

机译：基于LSTM CNN的手写识别系统自训练的关键字识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The automatic transcription of unconstrained continuous handwritten text requires well trained recognition systems. The semi-supervised paradigm introduces the concept of not only using labeled data but also unlabeled data in the learning process. Unlabeled data can be gathered at little or not cost. Hence it has the potential to reduce the need for labeling training data, a tedious and costly process. Given a weak initial recognizer trained on labeled data, self-training can be used to recognize unlabeled data and add words that were recognized with high confidence to the training set for re-training. This process is not trivial and requires great care as far as selecting the elements that are to be added to the training set is concerned. In this paper, we propose to use a bidirectional long short-term memory neural network handwritten recognition system for keyword spotting in order to select new elements. A set of experiments shows the high potential of self-training for bootstrapping handwriting recognition systems, both for modern and historical handwritings, and demonstrate the benefits of using keyword spotting over previously published self-training schemes.

机译：不受约束的连续手写文本的自动转录需要训练有素的识别系统。半监督范例引入了不仅在学习过程中使用标记数据，而且使用未标记数据的概念。未加标签的数据可以花很少或很少的钱收集。因此，它有可能减少对训练数据加标签的需求，这是一个乏味且昂贵的过程。给定在标签数据上受过训练的较弱的初始识别器，则可以使用自训练来识别未标记的数据，并以高置信度将被识别的单词添加到训练集中进行重新训练。对于选择要添加到训练集中的元素，此过程并非易事，需要格外小心。在本文中，我们建议使用双向长短期记忆神经网络手写识别系统进行关键字识别，以选择新元素。一组实验表明，对于现代手写笔迹和历史手写笔迹，自训练对于自举手写识别系统具有很高的潜力，并证明了使用关键字查找技术优于以前发布的自训练方案的好处。

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2014年第3期|共10页
作者
Volkmar Frinken; Andreas Fischer; Markus Baumgartner; Horst Bunke;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Document retrieval; Keyword spotting; Handwriting recognition; Neural networks; Semi-supervised learning;

机译：文档检索;关键词发现;笔迹识别;神经网络;半监督学习;

相似文献

外文文献
中文文献
专利

1. Keyword spotting for self-training of BLSTM NN based handwriting recognition systems [J] . Volkmar Frinken, Andreas Fischer, Markus Baumgartner, Pattern Recognition: The Journal of the Pattern Recognition Society . 2014,第3期

机译：基于LSTM CNN的手写识别系统自训练的关键字识别
2. Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage [J] . Neural computing & applications . 2020,第13期

机译：用于多脚本关键字在具有识别阶段的打印和手写文档中的多脚本关键字的混合HMM / BLSTM系统
3. A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge [J] . Valentin Smirnov, Dmitry Ignatov, Michael Gusev, Journal of electrical and computer engineering . 2016,第PTa2期

机译：基于大词汇量连续语音识别和语言知识的俄语关键词点播系统
4. Self-Training of BLSTM with Lexicon Verification for Handwriting Recognition [C] . Bruno Stuner, Clément Chatelain, Thierry Paquet IAPR International Conference on Document Analysis and Recognition . 2017

机译：BLSTM的自训练与词汇验证，用于手写识别
5. Novel Word Recognition and Word Spotting Systems for Offline Urdu Handwriting. [D] . Sagheer, Malik Waqas. 2010

机译：用于脱机乌尔都语手写体的新型单词识别和单词发现系统。
6. Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting [O] . Subhranil Kundu, Samir Malakar, Zong Woo Geem, 2021

机译：基于Hough的转换的角度特征用于无学习手写关键字斑点
7. Adapting BLSTM neural network based keyword spotting trained on modern data to historical documents [O] . Frinken, Volkmar, Fischer, Andreas, Bunke, Horst, 2010

机译：将基于BLSTM神经网络的，基于现代数据训练的关键词发现与历史文档相适应

Keyword spotting for self-training of BLSTM NN based handwriting recognition systems

摘要

著录项

相似文献

相关主题

期刊订阅