首页> 外文会议>2017 International Conference on Computing Intelligence and Information System >Synthetic Data and DAG-SVM Classifier for Segmentation-Free Manchu Word Recognition
【24h】

Synthetic Data and DAG-SVM Classifier for Segmentation-Free Manchu Word Recognition

机译:合成数据和DAG-SVM分类器用于无段满语识别

获取原文
获取原文并翻译 | 示例

摘要

There are a few studies on Manchu recognition, and the existing methods are mainly based on segmentation on characters or strokes. Thus, their performances are strongly dependent on segmentation accuracy. In this paper, a whole word recognition method for segmentation-free Manchu word is proposed to avoid the mis-segmentation of Manchu word. Firstly, we build an initial Manchu word image dataset, and then augment it with synthetic data, which are harvested via structural distortions on Manchu word image. Secondly, the support vector machine classifier with polynomial kernel function combined with directed acyclic graph is used for classification of Manchu words from 2 to 100 classes. The experiment results show that the precise is 78% for the 100-way classification problem, even above 90% for classes less than 40. The synthetic data method proposed in this paper is an effective way to augment the training and test dataset for Manchu word recognition.
机译:关于满族识别的研究很少,现有的方法主要基于字符或笔划的分割。因此,它们的性能在很大程度上取决于分割精度。提出了一种完整的无分割满族词识别方法,避免了满族词的误分词。首先,我们建立一个初始的满族单词图像数据集,然后用合成数据对其进行扩充,这些合成数据是通过对满族单词图像进行结构变形而获得的。其次,将支持多项式核函数与有向无环图相结合的支持向量机分类器用于2〜100个满族词的分类。实验结果表明,该方法对100种分类问题的准确度为78%,对于40种分类问题的准确度甚至超过90%。本文提出的综合数据方法是一种增强满族单词训练和测试数据集的有效方法。承认。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号