首页> 外文会议>International Conference on Machine Learning, Big Data and Business Intelligence >Research on Scene Chinese Character Recognition Method Based on Similar Chinese Characters
【24h】

Research on Scene Chinese Character Recognition Method Based on Similar Chinese Characters

机译:基于类似汉字的场景汉字识别方法研究

获取原文

摘要

Text recognition in natural scenes has always been a hot topic of research. At present, OCR in academia can support multiple languages and has certain versatility. However, the recognition accuracy of Chinese characters, especially those with similar shapes, is not ideal. Therefore, this paper proposes the Similar-CRNN algorithm based on the traditional CNN + RNN + CTC algorithm model from the perspective of the structure of similar characters and the semantic information of the context. Firstly, we construct a similar character library based on the similarity algorithm of Chinese characters, and conduct enhanced training for the feature differences of similar Chinese characters to improve the recognition accuracy of similar Chinese characters from the aspect of Chinese character structure. Then, after obtaining the preliminary results, add a "semantic detector" to perform three stages of error detection, candidate recall and error correction sorting after Chinese word segmentation, to correct semantically irrelevant error recognition results, and further improve the recognition accuracy rate at the semantic level of Chinese characters.
机译:在自然场景中的文本识别一直是一个热门的研究。目前,在学术界的OCR可以支持多种语言并具有某种多种功能性。然而,汉字的识别准确性,尤其是具有相似形状的人物,并不理想。因此,本文从类似字符的结构和上下文的语义信息的角度提出了基于传统CNN + RNN + CTC算法模型的类似-CLNN算法。首先,我们构建一个基于汉字的相似性算法的类似字符库,并对同类汉字的特征差异进行增强培训,以提高来自汉字结构方面的类似汉字的识别准确性。然后,在获得初步结果之后,添加“语义检测器”在中文分割后执行“语义检测器”,以执行三个错误检测,候选召回和纠错排序,以更正语义无关的错误识别结果,进一步提高识别精度率汉字的语义水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号