首页> 外国专利> SYSTEM AND METHODS FOR ARABIC TEXT RECOGNITION BASED ON EFFECTIVE ARABIC TEXT FEATURE EXTRACTION

SYSTEM AND METHODS FOR ARABIC TEXT RECOGNITION BASED ON EFFECTIVE ARABIC TEXT FEATURE EXTRACTION

机译:基于有效阿拉伯文本特征提取的阿拉伯文本识别系统和方法

摘要

A method for automatically recognizing Arabic text includes digitizing a line of Arabic characters to form a two-dimensional array of pixels each associated with a pixel value, wherein the pixel value is expressed in a binary number, dividing the line of the Arabic characters into a plurality of line images, defining a plurality of cells in one of the plurality of line images, wherein each of the plurality of cells comprises a group of adjacent pixels, serializing pixel values of pixels in each of the plurality of cells in one of the plurality of line images to form a binary cell number, forming a text feature vector according to binary cell numbers obtained from the plurality of cells in one of the plurality of line images, and feeding the text feature vector into a Hidden Markov Model to recognize the line of Arabic characters.
机译:一种自动识别阿拉伯文本的方法,包括数字化阿拉伯字符的行以形成像素的二维阵列,每个像素与一个像素值相关联,其中像素值以二进制数表示,将阿拉伯字符的行划分为一个多个线图像,在多个线图像中的一个中定义多个单元,其中,多个单元中的每个单元包括一组相邻的像素,将多个单元中的一个单元中的每个单元中的像素的像素值序列化线图像形成二进制单元号,根据从多个线图像之一中的多个单元获得的二进制单元号形成文本特征向量,并将文本特征向量馈入隐马尔可夫模型以识别线阿拉伯字符。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号