ARABIC OPTICAL CHARACTER RECOGNITION METHOD USING HIDDEN MARKOV MODELS AND DECISION TREES
Abstract
Disclosed is an Arabic optical character recognition method using Hidden Markov Models (HMMs) and decision trees. The method comprises: receiving an input image containing Arabic text; removing all diacritics from the input image by detecting the bounding box of each diacritic and comparing its coordinates with those of the bounding box of the text body; segmenting the input image into four layers and performing feature extraction on those layers; feeding the extracted features into a Hidden Markov Model to generate an HMM representing each Arabic character; iteratively training the HMMs until an overall likelihood criterion is satisfied; and feeding the training results into a decision tree that predicts the locations and classes of the diacritics and produces the final recognition results. By exploiting the writing characteristics of Arabic, the invention simplifies recognition of Arabic text while achieving comparatively high recognition accuracy.
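The diacritic-removal step can be illustrated with a minimal sketch. The abstract states only that each diacritic's bounding-box coordinates are compared with those of the text body; the specific rule used here (a component counts as a diacritic when it lies wholly above or wholly below the body box) is an assumption made for illustration, as are the names `Box`, `is_diacritic`, and `split_diacritics`. It is not the patented procedure.

```python
from typing import List, NamedTuple, Tuple


class Box(NamedTuple):
    """Axis-aligned bounding box in image coordinates (y grows downward)."""
    left: int
    top: int
    right: int
    bottom: int


def is_diacritic(component: Box, body: Box) -> bool:
    # Assumed rule (not taken from the source): a component is treated as a
    # diacritic when it sits entirely above or entirely below the text body.
    return component.bottom <= body.top or component.top >= body.bottom


def split_diacritics(components: List[Box], body: Box) -> Tuple[List[Box], List[Box]]:
    """Separate components into (kept text-body parts, removed diacritics)."""
    kept: List[Box] = []
    removed: List[Box] = []
    for component in components:
        if is_diacritic(component, body):
            removed.append(component)
        else:
            kept.append(component)
    return kept, removed


# Hypothetical example: one letter stroke, one dot above, one dot below.
body = Box(left=0, top=20, right=100, bottom=40)
components = [
    Box(10, 18, 30, 42),  # overlaps the body band, so it is kept
    Box(15, 5, 20, 10),   # wholly above the body
    Box(50, 45, 55, 50),  # wholly below the body
]
kept, removed = split_diacritics(components, body)
```

In this sketch the removed boxes are retained rather than discarded, since the abstract describes a later decision-tree stage that predicts the locations and classes of the diacritics.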