首页> 外国专利> ARABIC OPTICAL CHARACTER RECOGNITION METHOD USING HIDDEN MARKOV MODELS AND DECISION TREES

ARABIC OPTICAL CHARACTER RECOGNITION METHOD USING HIDDEN MARKOV MODELS AND DECISION TREES

机译:隐马尔可夫模型和决策树的阿拉伯语光学字符识别方法

摘要

Disclosed is an Arabic optical character recognition method using Hidden Markov Models and decision trees, comprising: receiving an input image containing Arabic text, removing all diacritics from the input image by detecting a bounding box of each diacritic and comparing coordinates thereof to those of a bounding box of a text body, segmenting the input image into four layers, and conducting feature extraction on the segmented four layers, inputting results of feature extraction into a Hidden Markov Model thereby generating HMM models for representing each Arabic character, conducting iterative training of the HMM models until an overall likelihood criterion is satisfied, and inputting results of iterative training into a decision tree thereby predicting locations and the classes of the diacritics and producing final recognition results. The invention is capable of facilitating simple recognition of Arabic by utilizing writing feature thereof, and meanwhile featuring comparatively high recognition precision.
机译:公开了一种使用隐马尔可夫模型和决策树的阿拉伯光学字符识别方法,包括:接收包含阿拉伯文本的输入图像,通过检测每个变音符号的边界框并将其坐标与边界坐标相比较,从输入图像中去除所有变音符号。文本框,将输入图像分割成四层,然后在分割的四层上进行特征提取,将特征提取的结果输入到隐马尔可夫模型中,从而生成用于表示每个阿拉伯字符的HMM模型,并对HMM进行迭代训练进行建模,直到满足总体似然准则为止,然后将迭代训练的结果输入决策树,从而预测变音符号的位置和类别,并产生最终的识别结果。通过利用本发明的书写特征,本发明能够促进对阿拉伯语的简单识别,同时具有相对较高的识别精度。

著录项

  • 公开/公告号US2017017854A1

    专利类型

  • 公开/公告日2017-01-19

    原文格式PDF

  • 申请/专利权人 HUAZHONG UNIVERSITY OF SCIENCE AND TECHNOLOGY;

    申请/专利号US201514844713

  • 发明设计人 MOHAMMED LUTF;XINGE YOU;

    申请日2015-09-03

  • 分类号G06K9/18;G06K9/46;G06T7/00;G06K9/62;

  • 国家 US

  • 入库时间 2022-08-21 13:49:00

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号