首页> 美国卫生研究院文献>SpringerPlus >Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks
【2h】

Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks

机译:基于多维长短期记忆神经网络的隐式分割的Urdu Nastaliq文本识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The recognition of Arabic script and its derivatives such as Urdu, Persian, Pashto etc. is a difficult task due to complexity of this script. Particularly, Urdu text recognition is more difficult due to its Nasta’liq writing style. Nasta’liq writing style inherits complex calligraphic nature, which presents major issues to recognition of Urdu text owing to diagonality in writing, high cursiveness, context sensitivity and overlapping of characters. Therefore, the work done for recognition of Arabic script cannot be directly applied to Urdu recognition. We present Multi-dimensional Long Short Term Memory (MDLSTM) Recurrent Neural Networks with an output layer designed for sequence labeling for recognition of printed Urdu text-lines written in the Nasta’liq writing style. Experiments show that MDLSTM attained a recognition accuracy of 98% for the unconstrained Urdu Nasta’liq printed text, which significantly outperforms the state-of-the-art techniques.
机译:由于该脚本的复杂性,识别阿拉伯文字及其派生词(如乌尔都语,波斯语,普什图语等)是一项艰巨的任务。特别是,由于其Nasta'liq书写风格,乌尔都语文本识别更加困难。 Nasta'liq的书写风格继承了复杂的书法性质,由于书写的对角线,高度的草率,上下文敏感度和字符重叠,这给识别Urdu文本带来了主要问题。因此,为识别阿拉伯文字所做的工作不能直接应用于乌尔都语识别。我们提供了多维长期短期记忆(MDLSTM)递归神经网络,其输出层设计用于序列标签,以识别以Nasta'liq书写方式书写的乌尔都语文字行。实验表明,对于不受约束的Urdu Nasta'liq印刷文本,MDLSTM达到了98%的识别准确度,这大大超过了最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号