首页> 外文会议>12th International Conference on Frontiers in Handwriting Recognition >SCUT-COUCH Textline_NU: An Unconstrained Online Handwritten Chinese Text Lines Dataset
【24h】

SCUT-COUCH Textline_NU: An Unconstrained Online Handwritten Chinese Text Lines Dataset

机译:SCUT-COUCH Textline_NU:无限制的在线手写中文文本行数据集

获取原文

摘要

An unconstrained online handwritten Chinese text lines dataset, SCUT-COUCH Textline_NU, a subset of SCUT-COUCH [1] [2], is built to facilitate the research of unconstrained online Chinese text recognition. Texts for hand copying are sampled from China Daily corpus with a stratified random manner. The current vision of SCUT-COUCH Textline_NU has 8,809 text lines (4,813 lines are collected by touch screen LCD and 3,996 by digital pen) and 159,866 characters in total that are written by more than 157 participants. To demonstrate that the dataset is practical, an over-segmentation, dynamic programming and semantic model based algorithm was presented for segmenting and recognizing the unconstrained online Chinese text lines. In preliminary experiments on the dataset, the proposed algorithm recognition achieves a baseline accuracy of 56.41%.
机译:建立不受约束的在线手写中文文本行数据集SCUT-COUCH Textline_NU,它是SCUT-COUCH [1] [2]的子集,以促进无约束的在线中文文本识别的研究。手工抄写的文本以分层随机方式从《中国日报》语料库中抽取。 SCUT-COUCH Textline_NU当前的愿景是拥有8,809条文本行(触摸屏LCD收集4,813行,数字笔收集3,996行),共有157,866个字符,由157多名参与者编写。为了证明该数据集的实用性,提出了一种基于超分割,动态规划和语义模型的算法,用于分割和识别不受约束的在线中文文本行。在数据集的初步实验中,提出的算法识别达到了56.41%的基线准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号