Segmentation of Mixed Chinese/English Documents Based on Chinese Radicals Recognition and Complexity Analysis in Local Segment Pattern



Abstract

Segmentation based on character recognition is one of the most popular methods for segmenting mixed Chinese/English documents. However, rejecting outliers has always been the bottleneck of this approach. This paper proposes a new method to alleviate the problem. We assign a language attribute to each segment wherever possible, and then merge or split segments according to that attribute. First, we construct a mixed OCR engine for Chinese radicals, English characters, and some English character pairs. Next, the engine is trained with English negative samples to improve its ability to reject outliers. Finally, the language of each segment is determined from the mixed OCR engine's output together with a complexity analysis of the local pattern. Test results show encouraging performance.
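
The merge step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the merge rule, so the sketch assumes a simple heuristic (adjacent segments labeled as Chinese radicals are merged while the combined width stays close to the text-line height, since Chinese characters are roughly square). The `Segment` type, the `merge_radicals` function, the `"zh"`/`"en"` labels, and the `tol` parameter are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    x: int       # left edge of the segment, in pixels
    width: int   # segment width, in pixels
    lang: str    # assumed label: "zh" (Chinese radical), "en", or "unknown"


def merge_radicals(segments: List[Segment], line_height: int,
                   tol: float = 0.2) -> List[Segment]:
    """Merge adjacent "zh" segments while the result fits a roughly square box.

    Assumption: a full Chinese character is about as wide as the line is tall,
    so a merge is allowed only if the combined width is at most
    line_height * (1 + tol).
    """
    max_width = line_height * (1 + tol)
    merged: List[Segment] = []
    for seg in segments:
        prev = merged[-1] if merged else None
        combined_width = (seg.x + seg.width - prev.x) if prev else 0
        if (prev is not None and prev.lang == "zh" and seg.lang == "zh"
                and combined_width <= max_width):
            # Extend the previous segment to cover the current one.
            merged[-1] = Segment(prev.x, combined_width, "zh")
        else:
            merged.append(seg)
    return merged


# Two radicals followed by two English letters on a 40-pixel-high line.
line = [Segment(0, 18, "zh"), Segment(20, 20, "zh"),
        Segment(44, 12, "en"), Segment(58, 10, "en")]
result = merge_radicals(line, line_height=40)
```

In this example the two radical segments are combined into one character-sized segment and the English segments are left untouched. In a real system the labels would come from the recognizer and the complexity analysis rather than being supplied by hand.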


Bibliographic details

Source: International Conference on Intelligent Computing | 2006 | 10 pages
Original format: PDF
Chinese Library Classification: Artificial intelligence theory

Similar documents

1. A method for improving the accuracy of automatic indexing of Chinese-English mixed documents [J] . Yan Zhao, Hui Shi . 中国文献情报（英文刊） . 2012, No. 4

2. Chinese character recognition is limited by overall complexity, not by number of strokes or stroke patterns [J] . On-Ting Lo, Sing-Hang Cheung . Journal of Vision . 2010, No. 7

3. Acquisition of Chinese characters among beginning Chinese readers: Effect of visual complexity and radical presence [D] . Ying Li . 2011

4. Aging and Pattern Complexity Effects on the Visual Span: Evidence from Chinese Character Recognition [O] . Fang Xie, Lin Li, Sainan Zhao . 2019

5. Character Segmentation and Recognition of Alphanumeric-mixed Documents Based on Pattern Recognition Information [O] . Yasuo Hongo . 2002
