首页> 外文期刊>ACM transactions on Asian language information processing >Classification of Printed Gujarati Characters Using Low-Level Stroke Features
【24h】

Classification of Printed Gujarati Characters Using Low-Level Stroke Features

机译:使用低级描边特征对印刷的古吉拉特语字符进行分类

获取原文
获取原文并翻译 | 示例
           

摘要

This article presents an elegant technique for extracting the low-level stroke features, such as endpoints, junction points, line elements, and curve elements, from offline printed text using a template matching approach. The proposed features are used to classify a subset of characters from Gujarati script. The database consists of approximately 16,782 samples of 42 middle-zone symbols from the Gujarati character set collected from three different sources: machine printed books, newspapers, and laser printed documents. The purpose of this division is to add variety in terms of size, font type, style, ink variation, and boundary deformation. The experiments are performed on the database using a k-nearest neighbor (kNN) classifier and results are compared with other widely used structural features, namely Chain Codes (CO, Directional Element Features (DEF), and Histogram of Oriented Gradients (HoG). The results show that the features are quite robust against the variations and give comparable performance with other existing works.
机译:本文介绍了一种优雅的技术,可以使用模板匹配方法从脱机打印的文本中提取低级笔触特征,例如端点,交点,线元素和曲线元素。拟议的功能用于对古吉拉特语脚本的字符子集进行分类。该数据库包含来自古吉拉特语字符集中的42个中间区域符号的大约16,782个样本,这些样本来自三个不同的来源:机器印刷的书籍,报纸和激光印刷的文档。该划分的目的是在大小,字体类型,样式,墨水变化和边界变形方面增加多样性。使用k最近邻(kNN)分类器在数据库上进行了实验,并将结果与​​其他广泛使用的结构特征(即链码(CO,方向元素特征(DEF)和定向梯度直方图(HoG)))进行了比较。结果表明,这些特征对于这些变化具有相当强的鲁棒性,并且可以提供与其他现有作品相当的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号