Synthetic Data and DAG-SVM Classifier for Segmentation-Free Manchu Word Recognition

机译：合成数据和DAG-SVM分类器用于无段满语识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

There are a few studies on Manchu recognition, and the existing methods are mainly based on segmentation on characters or strokes. Thus, their performances are strongly dependent on segmentation accuracy. In this paper, a whole word recognition method for segmentation-free Manchu word is proposed to avoid the mis-segmentation of Manchu word. Firstly, we build an initial Manchu word image dataset, and then augment it with synthetic data, which are harvested via structural distortions on Manchu word image. Secondly, the support vector machine classifier with polynomial kernel function combined with directed acyclic graph is used for classification of Manchu words from 2 to 100 classes. The experiment results show that the precise is 78% for the 100-way classification problem, even above 90% for classes less than 40. The synthetic data method proposed in this paper is an effective way to augment the training and test dataset for Manchu word recognition.

机译：关于满族识别的研究很少，现有的方法主要基于字符或笔划的分割。因此，它们的性能在很大程度上取决于分割精度。提出了一种完整的无分割满族词识别方法，避免了满族词的误分词。首先，我们建立一个初始的满族单词图像数据集，然后用合成数据对其进行扩充，这些合成数据是通过对满族单词图像进行结构变形而获得的。其次，将支持多项式核函数与有向无环图相结合的支持向量机分类器用于2〜100个满族词的分类。实验结果表明，该方法对100种分类问题的准确度为78％，对于40种分类问题的准确度甚至超过90％。本文提出的综合数据方法是一种增强满族单词训练和测试数据集的有效方法。承认。

著录项

来源
《2017 International Conference on Computing Intelligence and Information System》|2017年|46-50|共5页
会议地点 Nanjing(CN)
作者
Di Huang; Min Li; Ruirui Zheng; Shuang Xu; Jiajing Bi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Kernel; Support vector machines; Image segmentation; Character recognition; Training; Strain; Image recognition;

机译：内核;支持向量机;图像分割;字符识别;训练;应变;图像识别;;

相似文献

外文文献
中文文献
专利

1. PRINTED MANCHU CHARACTER RECOGNITION BASED ON MULTI CLASSIFIER FUSION DECISION [J] . Li M., Xu S., Zheng R. R. Journal of investigative medicine . 2015,第8Suppla期

机译：基于多分类器融合决策的印刷人物字符识别
2. Segmentation-free MRF Recognition Method in Combination with P2DBMN-MQDF for Online Handwritten Cursive Word [J] . Bilan ZHU, Arti Shivram, Srirangaraj Setlur, 電子情報通信学会技術研究報告 . 2013,第495期

机译：结合P2DBMN-MQDF的在线手写草书词无分割MRF识别方法
3. Segmentation-free MRF Recognition Method in Combination with P2DBMN-MQDF for Online Handwritten Cursive Word [J] . Bilan ZHU, Arti Shivram, Srirangaraj Setlur, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2012,第495期

机译：结合P2DBMN-MQDF的在线手写草书词无分割MRF识别方法
4. Synthetic Data and DAG-SVM Classifier for Segmentation-Free Manchu Word Recognition [C] . Di Huang, Min Li, Ruirui Zheng, International Conference on Computing Intelligence and Information System . 2017

机译：用于分割满族识别的合成数据和DAG-SVM分类器
5. A segmentation-free approach to text recognition with application to Arabic text. [D] . Al-Badr, Badr H. 1995

机译：一种无分段的文本识别方法，适用于阿拉伯文本。
6. Recognition Times for 54 Thousand Dutch Words: Data from the Dutch Crowdsourcing Project [O] . Marc Brysbaert, Emmanuel Keuleers, Paweł Mandera 2019

机译：54000个荷兰语单词的识别时间：来自荷兰众包项目的数据
7. Structural Information Implant in a Context Based Segmentation-Free HMM Handwritten Word Recognition System for Latin and Bangla Script [O] . Szilárd Vajda, Abdel Belaïd 2012

机译：基于上下文的无分段HMM拉丁文字和Bangla文字识别系统中的结构信息植入

Synthetic Data and DAG-SVM Classifier for Segmentation-Free Manchu Word Recognition

摘要

著录项

相似文献

相关主题

期刊订阅