首页> 外文期刊>Multimedia Tools and Applications >HP_DocPres: a method for classifying printed and handwritten texts in doctor's prescription
【24h】

HP_DocPres: a method for classifying printed and handwritten texts in doctor's prescription

机译:hp_docpres:一种在医生处方中对打印和手写文本进行分类的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Optical Character Recognition (OCR) system is used to convert the document images, either printed or handwritten, into its electronic counterpart. But dealing with handwritten texts is much more challenging than printed ones due to the erratic writing style of the individuals. The problem becomes more severe when the input image is a doctor's prescription. Before feeding such an image to the OCR engine, the classification of printed and handwritten texts is a necessity as a doctor's prescription contains both handwritten and printed texts which are to be processed separately. Much work has been done in the domain of handwritten and printed text separation albeit work related to doctor's handwriting. In this paper, a method is proposed which first localizes the position of texts in a doctor's prescription, and then separates out the printed texts from the handwritten ones. Due to the unavailability of a large database, we have used some standard data (image) augmentation techniques to evaluate as well as to prove the robustness of our method. Besides, we have also designed a Graphical User Interface (GUI) so that anybody can visualize the output by providing a prescription image as input.
机译:光学字符识别(OCR)系统用于将文档图像转换为打印或手写的电子对应物。但由于个人的写作风格不稳定,处理手写文本比印刷文本更具挑战性。当输入图像是医生的处方时,问题变得更加严重。在将这样的图像馈送到OCR引擎之前,印刷和手写文本的分类是必需的,因为医生的处方包含要单独处理的手写和印刷文本。在手写和印刷文本分离领域已经完成了很多工作虽然与医生的笔迹相关的工作。在本文中,提出了一种方法,该方法首先定位在医生处方中文本的位置,然后将印刷文本与手写的文本分开。由于大型数据库的不可用,我们使用了一些标准数据(图像)增强技术来评估以及证明我们方法的稳健性。此外,我们还设计了一个图形用户界面(GUI),使得任何人都可以通过将处方图像提供作为输入来可视化输出。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号