The analysis on mistaken segmentation of Tibetan words based on statistical method

Abstract

Using the Tibetan word segmentation system IEA-TWordSeg, the authors segment 1,271 sentences in a closed set and 1,000 sentences in an open set. The test accuracy is 99.54% and 92.41%, respectively. The authors describe the types of segmentation errors and their causes, and report the proportion of each error type. The article aims to provide clues for those who intend to improve the accuracy of Tibetan word segmentation systems.
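The abstract does not say how accuracy or the error-type proportions were computed. As a rough illustration only, the sketch below assumes accuracy is the share of gold-standard words whose exact boundaries are reproduced by the system, and that each error has been hand-labeled with a type. The function names, the Latin placeholder tokens, and the error labels are all hypothetical and are not taken from the paper or from IEA-TWordSeg.

```python
from collections import Counter

def spans(words):
    """Convert a list of words into a set of (start, end) character spans."""
    out, pos = set(), 0
    for w in words:
        out.add((pos, pos + len(w)))
        pos += len(w)
    return out

def word_accuracy(gold_sents, pred_sents):
    """Share of gold words whose exact span also appears in the prediction.

    Assumed definition; the paper may measure accuracy differently.
    """
    correct = total = 0
    for gold, pred in zip(gold_sents, pred_sents):
        gold_spans = spans(gold)
        correct += len(gold_spans & spans(pred))
        total += len(gold_spans)
    return correct / total if total else 0.0

def error_proportions(error_labels):
    """Proportion of each (hand-assigned, hypothetical) error type."""
    counts = Counter(error_labels)
    n = sum(counts.values())
    return {label: c / n for label, c in counts.items()}

# Placeholder data: Latin letters stand in for Tibetan syllables.
gold = [["ab", "c", "de"]]
pred = [["ab", "cd", "e"]]
print(word_accuracy(gold, pred))
print(error_proportions(["ambiguity", "ambiguity", "variant_form", "unknown_word"]))
```

Comparing character spans rather than word strings keeps a word from being counted as correct when it happens to appear at a different position in the sentence.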

Bibliographic details

Source
International Conference on Asian Language Processing | 2014 | pp. 74-77 | 4 pages
Authors
Congjun Long; Yiyong Lan; Xiaobin Zhao
Original format
PDF
Keywords
natural language processing; statistical analysis; word processing; IEA-TWordSeg; Tibetan word segmentation system; Tibetan words; mistaken segmentation; segmentation error; statistical method; Accuracy; Educational institutions; Information processing; Labeling; Testing; Training; Writing; Tibetan word segmentation; disambiguation segmentation; the segmentation errors; variant written form

