Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting

机译：递归神经网络语言模型在离线手写识别和词点识别中的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The recurrent neural network language model (RNNLM) is a discriminative, non-Markovian model that can capture long-span word history in natural language. It has been proved to be successful in automatic speech recognition and machine translation. In this work, we applied RNNLM to the n-best rescoring stage of the state-of-the-art BBN Byblos OCR (optical character recognition) system for handwriting recognition.1 With RNNLM scores as additional features, our system achieved significant improvement (p < 0.001), a 3.5% relative reduction on OCR word error rate, compared with a high baseline that uses n-gram language model for rescoring. We have also developed a novel method to integrate the OCR n-best RNNLM scores into the ord posterior probabilities in OCR confusion networks, which resulted in consistent observable improvements in word spotting for OCR'ed handwritten documents, as measured by both mean average precision (MAP) and detection-error tradeoff (DET) curves.

机译：递归神经网络语言模型（RNNLM）是一种可区分的非马尔可夫模型，可以捕获自然语言中的大跨度单词历史。它已被证明在自动语音识别和机器翻译中是成功的。在这项工作中，我们将RNNLM应用于最先进的BBN Byblos OCR（光学字符识别）系统进行手写识别的n个最佳记录阶段。1通过将RNNLM得分作为附加功能，我们的系统取得了显着改进（ p <0.001），与使用n-gram语言模型进行记录的较高基线相比，OCR字错误率相对降低了3.5％。我们还开发了一种新颖的方法，可以将OCR n最佳RNNLM得分整合到OCR混淆网络中的ord后验概率中，从而通过OCR手写文档的平均平均精度（ MAP）和检测错误权衡（DET）曲线。

著录项

来源
《International Conference on Frontiers in Handwriting Recognition》|2014年|134-139|共6页
会议地点
作者
Li Nan; Chen Jinying; Cao Huaigu; Zhang Bing; Natarajan Prem;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Character recognition; Handwriting recognition; Hidden Markov models; Lattices; Optical character recognition software; Recurrent neural networks; Training; information retrieval; keyword search; optical character recognition; recurrent neural networks;

机译：字符识别手写识别隐藏Markov模型格子光学字符识别软件递归神经网络培训信息检索关键字搜索光学字符识别递归神经网络;

相似文献

外文文献
中文文献
专利

1. Improving Recurrent Neural Networks for Offline Arabic Handwriting Recognition by Combining Different Language Models [J] . Jemni Sana Khamekhem, Kessentini Yousri, Kanoun Slim International Journal of Pattern Recognition and Artificial Intelligence . 2020,第12期

机译：通过组合不同的语言模型，改进反际阿拉伯语手写识别的反复性神经网络
2. Feature Set Evaluation for Offline Handwriting Recognition Systems: Application to the Recurrent Neural Network Model [J] . Youssouf Chherawala, Partha Pratim Roy, Mohamed Cheriet Cybernetics, IEEE Transactions on . 2016,第12期

机译：离线手写识别系统的特征集评估：在递归神经网络模型中的应用
3. Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition [J] . Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, IEICE transactions on information and systems . 2019,第12期

机译：潜在词递归神经网络语言模型用于自动语音识别
4. A context-sensitive-chunk BPTT approach to training deep LSTM/BLSTM recurrent neural networks for offline handwriting recognition [C] . Kai Chen, Zhi-Jie Yan, Qiang Huo International Conference on Document Analysis and Recognition . 2015

机译：上下文敏感块BPTT方法训练深度LSTM / BLSTM递归神经网络以进行离线手写识别
5. Novel Word Recognition and Word Spotting Systems for Offline Urdu Handwriting. [D] . Sagheer, Malik Waqas. 2010

机译：用于脱机乌尔都语手写体的新型单词识别和单词发现系统。
6. Convolutional and recurrent neural network for human activity recognition: Application on American sign language [O] . Vincent Hernandez, Tomoya Suzuki, Gentiane Venture 2020

机译：卷积和递归神经网络用于人类活动识别：在美国手语上的应用
7. Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition [O] . Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, 2019

机译：潜在的自动语音识别复发性神经网络语言模型

Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting

摘要

著录项

相似文献

相关主题

期刊订阅