Classification of Printed Gujarati Characters Using Low-Level Stroke Features

MUKESH M. GOSWAMI; SUMAN K. MITRA

Abstract

This article presents an elegant technique for extracting low-level stroke features, such as endpoints, junction points, line elements, and curve elements, from offline printed text using a template matching approach. The proposed features are used to classify a subset of characters from the Gujarati script. The database consists of approximately 16,782 samples of 42 middle-zone symbols from the Gujarati character set, collected from three different sources: machine-printed books, newspapers, and laser-printed documents. The purpose of this division is to add variety in terms of size, font type, style, ink variation, and boundary deformation. The experiments are performed on the database using a k-nearest neighbor (kNN) classifier, and the results are compared with other widely used structural features, namely Chain Codes (CC), Directional Element Features (DEF), and Histogram of Oriented Gradients (HoG). The results show that the features are quite robust against these variations and give performance comparable to other existing works.
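
The abstract does not list the templates themselves, so the following is only a minimal sketch of how endpoints and junction points might be located on a one-pixel-wide binary skeleton by inspecting each 3x3 neighbourhood. It swaps in the common crossing-number rule (counting 0-to-1 transitions around a pixel) as a stand-in for the authors' template set; the function names and the T-shaped test image are hypothetical and are not taken from the article.

```python
# Hypothetical sketch: endpoint and junction detection on a thinned
# (one-pixel-wide) binary image. This uses the crossing-number rule as a
# stand-in for template matching; it is NOT the authors' template set.

# 8-neighbour offsets in clockwise order, starting from north.
OFFSETS = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]


def neighbours(img, r, c):
    """Return the 8 neighbours of (r, c); pixels outside the image count as 0."""
    h, w = len(img), len(img[0])
    return [
        img[r + dr][c + dc] if 0 <= r + dr < h and 0 <= c + dc < w else 0
        for dr, dc in OFFSETS
    ]


def crossing_number(nb):
    """Count 0 -> 1 transitions around the circular neighbour sequence."""
    return sum(1 for i in range(8) if nb[i] == 0 and nb[(i + 1) % 8] == 1)


def stroke_points(img):
    """Classify foreground pixels: one branch = endpoint, three or more = junction."""
    endpoints, junctions = [], []
    for r, row in enumerate(img):
        for c, value in enumerate(row):
            if not value:
                continue
            cn = crossing_number(neighbours(img, r, c))
            if cn == 1:
                endpoints.append((r, c))
            elif cn >= 3:
                junctions.append((r, c))
    return endpoints, junctions


# Hypothetical test image: a small "T" shape, already thinned.
T_SHAPE = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 1, 0],
    [0, 0, 0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
]

endpoints, junctions = stroke_points(T_SHAPE)
print("endpoints:", endpoints)  # [(1, 1), (1, 5), (3, 3)]
print("junctions:", junctions)  # [(1, 3)]
```

Counts of such points, for example per zone of the character image, could then form part of a feature vector passed to a kNN classifier. How the article actually encodes the line and curve elements is not stated in the abstract, so that step is left out here.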

Bibliographic details

Source: ACM Transactions on Asian Language Information Processing, 2016, No. 4, pp. 25.1-25.26 (26 pages)
Authors: MUKESH M. GOSWAMI; SUMAN K. MITRA
Affiliations:

Dept. of Information Technology, Faculty of Technology, Dharmsinh Desai University, College Road, Nadiad-387001, Gujarat (India);

Dhirubhai Ambani Institute of Information and Communication Technology, Near Indroda Circle, Gandhinagar-382007 Gujarat (India);

Original format: PDF
Language: English
Keywords: Characters classification; Gujarati characters; stroke features
